To our teachers and our students

PREFACE


In writing this seventh edition of Biochemistry, we have balanced the
desire to present up-to-the-minute advances with the need to make
biochemistry as clear and engaging as possible for the student approaching
the subject for the first time. Instructors and students have long relied
on Biochemistry for:


* Clear writing. The language of biochemistry is
made as accessible as possible. A straightforward 
and logical organization leads the reader through 
processes and helps navigate complex pathways 
and mechanisms. 


* Single-concept illustrations. Illustrations in
this book address one point at a time so that each 
illustration clearly tells the story of a mechanism, 
pathway, or process without the distraction of 
excess detail. 


* Physiological relevance. Biochemistry is the
study of life on the smallest scale, and it has always 
been our goal to help students connect biochemistry 
to their own lives. Pathways and processes are 
presented in a physiological context so that the 
reader can see how biochemistry works in different 
parts of the body and under different environmental 
and hormonal conditions. 


* Clinical insights. Wherever appropriate, pathways
and mechanisms are applied to health and disease. 
These applications show students how biochemistry 
is relevant to them while reinforcing the concepts that 
they have just learned. (For a full list, see p. xi.) 


* Evolutionary perspective. Evolution is evident
in the structures and pathways of biochemistry and 
is woven into the narrative of the textbook. (For a 
full list, see p. x.) 


New to This Edition 


Researchers are making new discoveries in biochemistry 
every day. The seventh edition takes into account the 
discoveries that have changed how we think about the 
fundamental concepts in biochemistry and human health. 
New aspects of the book include: 


* Metabolism integrated in a new context. New
information about the role of leptins in hunger and 
satiety has greatly influenced how we think about 
obesity and the growing epidemic of diabetes. In 
this edition, we cover the integration of metabolism 
in the context of diet and obesity. 


* New chapters on gene regulation. To reflect the
rapidly growing understanding of the biochemical
aspects of eukaryotic gene regulation, we have greatly
expanded our discussion of regulation and have split
the chapter in the preceding editions into two:
Chapter 31, “The Control of Gene Expression in
Prokaryotes,” and Chapter 32, “The Control of Gene
Expression in Eukaryotes.” These chapters address
recent discoveries such as quorum sensing in
prokaryotes, induced pluripotent stem cells, and the
role of microRNAs in regulating gene expression.


* Experimental techniques updated and
clarified. We have revised Chapters 3 (“Exploring
Proteins and Proteomes”), 5 (“Exploring Genes
and Genomes”), and 6 (“Exploring Evolution and
Bioinformatics”) to give students a practical
understanding of the benefits and limitations
of the techniques that they will be using in the
laboratory. We have expanded explanations of
mass spectrometry and x-ray crystallography,
for instance, and made them even clearer for the
first-time student. We explain new techniques
such as next-generation sequencing and real-time
PCR in the context of their importance to
modern research in biochemistry. (For a full list,
see p. xii.)


Chapter 27 A schematic representation illustrates a few of the
many metabolic pathways that must be coordinated to meet the
demands of living.


Recent Advances


Some of the exciting advances and new topics 
that we present in the seventh edition include: 


* Osteogenesis imperfecta, or brittle bone disease (Chapter 2)
* Intrinsically unstructured proteins and metamorphic proteins (Chapter 2)
* Recent updates in protein-misfolding diseases (Chapter 2)
* The use of recombinant DNA technology in protein purification (Chapter 3)
* Expanded discussion of mass spectrometry and x-ray crystallography (Chapter 3)
* Next-generation sequencing methods (Chapter 5)
* Real-time PCR (Chapter 5)
* DNA microarrays (Chapter 5)
* Carbon monoxide poisoning (Chapter 7)
* Single-molecule studies of enzyme kinetics (Chapter 8)
* Myosins as a model of a catalytic strategy for ATP hydrolysis (Chapter 9)
* Glycobiology and glycomics (Chapter 11)
* Hurler disease (Chapter 11)
* Avian influenza H5N1 (Chapter 11)
* Lipid rafts (Chapter 12)
* Transferrin as an example of receptor-mediated endocytosis (Chapter 12)
* Long QT syndrome and arrhythmia caused by the inhibition of potassium channels (Chapter 13)
* Defects in the citric acid cycle and the development of cancer (Chapter 17)
* Synthesizing a more efficient rubisco (Chapter 20)
* The structure of mammalian fatty acid synthetase (Chapter 22)
* Pyrimidine salvage pathways (Chapter 25)
* Physical association of enzymes in metabolic pathways (Chapter 25)
* Phosphatidic acid phosphatase in the regulation of lipid metabolism (Chapter 26)
* The regulation of SCAP-SREBP movement in cholesterol metabolism (Chapter 26)
* Mutations in the LDL receptor (Chapter 26)
* The role of HDL in protecting against arteriosclerosis (Chapter 26)
* Aromatase inhibitors in the treatment of breast and ovarian cancer (Chapter 26)
* The role of leptin in long-term caloric homeostasis (Chapter 27)
* Obesity and diabetes (Chapter 27)
* Exercise and its effects on cellular biochemistry (Chapter 27)
* Updated detailed mechanism of helicase’s action (Chapter 28)
* Updated detailed mechanism of topoisomerase’s action (Chapter 28)
* Riboswitches (Chapter 29)
* The production of small regulatory RNAs (Chapter 29)
* Vanishing white matter disease (Chapter 30)
* Quorum sensing (Chapter 31)
* Biofilms (Chapter 31)
* Induced pluripotent stem cells (Chapter 32)
* The role of microRNAs in gene regulation (Chapter 32)
* How vaccines work (Chapter 34)
* The structure of myosin head domains (Chapter 35)


Figure 26.24 LDL receptor releases LDL in the endosomes. [After I. D. Campbell,
Biochem. Soc. Trans. 31:1107–1114, 2003, Fig 1A]

Figure 32.27 MicroRNA action.


New End-of-Chapter Problems 


Biochemistry is best learned by practicing it and, to 
help students practice biochemistry, we have increased 
the number of end-of-chapter problems by 50%. In 
addition to many traditional problems that test bio- 
chemical knowledge and the ability to use this knowl- 
edge, we have three categories of problems to address 
specific problem-solving skills. 


* Mechanism problems ask students to suggest or
elaborate a chemical mechanism. 


* Data interpretation problems ask questions about 
a set of data provided in tabulated or graphic form. 
These problems give students a sense of how 
scientific conclusions are reached. 


* Chapter integration problems require students
to use information from several chapters to reach 
a solution. These problems reinforce a student’s 
awareness of the interconnectedness of the different 
aspects of biochemistry. 


Brief solutions to these problems are presented at the 
end of the book; expanded solutions are available in the 
accompanying Student Companion. 


Visualizing Molecular Structure 


All molecular structures have been selected and ren- 
dered by Jeremy Berg and Gregory Gatto. To help stu- 
dents read and understand these structures, we include 
the following tools: 


* A molecular-model “primer” explains the different
types of protein models and examines their strengths 
and weaknesses (see appendices to Chapters 1 and 2). 


* Figure legends direct students explicitly to the key
features of a model. 


* A great variety of types of molecular structures 
are represented, including clearer renderings of 
membrane proteins. 


* For most molecular models, the PDB number at
the end of the figure legend gives the reader easy
access to the file used in generating the structure
from the Protein Data Bank Web site (www.pdb.org).
At this site, a variety of tools for visualizing
and analyzing the structure are available.


* Living figures for most molecular structures now 
appear on the Web site in Jmol to allow students 
to rotate three-dimensional molecules and view 
alternative renderings online. 


Figure 28.12 Helicase asymmetry. Notice that only four of the
subunits, those shown in blue and yellow, bind AMP-PNP. [Drawn from 
1EOK.pdb.] 


Media and Supplements 


A full package of media resources and supplements provides instructors and students 
with innovative tools to support a variety of teaching and learning approaches. 


eBook 
http://ebooks.bfwpub.com/berg7e 


This online version of the textbook combines the con- 
tents of the printed book, electronic study tools, and 
a full complement of student media specifically cre- 
ated to support the text. Problems and resources from 
the printed textbook are incorporated throughout the 
eBook, to ensure that students can easily review specific 
concepts. The eBook enables students to: 


* Access the complete book and its electronic study 
tools from any internet-connected computer by 
using a standard Web browser; 


* Navigate quickly to any section or subsection of the
book or any page number of the printed book;


* Add their own bookmarks, notes, and highlighting;


* Access all the fully integrated media resources
associated with the book;


* Review quizzes and personal notes to help prepare
for exams; and


* Search the entire eBook instantly, including the
index and spoken glossary.


Instructors teaching from the eBook can assign either 
the entire textbook or a custom version that includes 
only the chapters that correspond to their syllabi. They 
can choose to add notes to any page of the eBook and 
share these notes with their students. These notes may 
include text, Web links, animations, or photographs. 


BiochemPortal


http://courses.bfwpub.com/berg7e 


BiochemPortal is a dynamic, fully integrated learning 
environment that brings together all of our teaching 
and learning resources in one place. It features easy- 
to-use assessment tracking and grading tools that 
enable instructors to assign problems for practice, as 
homework, quizzes, or tests. A personalized calendar, 
an announcement center, and communication tools 
help instructors manage the course. In addition to 
all the resources found on the Companion Web site, 
BiochemPortal includes several other features: 


* The interactive eBook integrates the complete text
with all relevant media resources.


* Hundreds of self-graded practice problems
allow students to test their understanding of
concepts explained in the text, with immediate
feedback.


* The metabolic map helps students understand
the principles and applications of the core metabolic 
pathways. Students can work through guided 
tutorials with embedded assessment questions, or 
explore the Metabolic Map on their own using the 
dragging and zooming functionality of the map. 


* Jmol tutorials by Jeffrey Cohlberg, California State
University at Long Beach, teach students how to
create models of proteins in Jmol based on data from
the Protein Data Bank. By working through the tutorial
and answering assessment questions at the end of each
exercise, students learn to use this important database
and fully realize the relationship between structure and
function of enzymes.


* Animated techniques illustrate
laboratory techniques described in 
the text. 


* Concept tutorials walk students 
through complex ideas in enzyme 
kinetics and metabolism. 


Companion Web Site 
www.whfreeman.com/berg7e 


For Students


Living figures allow students to explore 
protein structure in 3-D. Students can zoom 
and rotate the “live” structures to get a better 
understanding of their three-dimensional nature 
and can experiment with different display styles 
(space-filling, ball-and-stick, ribbon, backbone) 
by means of a user-friendly interface. 


Concept-based tutorials by Neil D. Clarke help 
students build an intuitive understanding of some of 
the more difficult concepts covered in the textbook. 


Animated techniques help students grasp 
experimental techniques used for exploring genes 
and proteins. 


The self-assessment tool helps students evaluate 
their progress. Students can test their understanding 
by taking an online multiple-choice quiz provided 
for each chapter, as well as a general chemistry 
review. 


The glossary of key terms. 


Web links connect students with the world of 
biochemistry beyond the classroom. 


For Instructors 


All of the student resources plus: 


All illustrations and tables from the textbook, 
in jpeg and PowerPoint formats optimized for 
classroom projection. 

The Assessment Bank offers more than 1500 
questions in editable Microsoft Word format. 


Instructor's Resource DVD 
[1-4292-8411-0] 


The DVD includes all the instructor’s resources from the
Web site. 


Overhead Transparencies 
[1-4292-8412-9] 


200 full-color illustrations from the textbook, optimized 
for classroom projection.


Student Companion 
[1-4292-3115-7] 


For each chapter of the textbook, the Student Companion 
includes: 


Chapter Learning Objectives and Summary 


Self-Assessment Problems, including multiple- 
choice, short-answer, matching questions, and 
challenge problems, and their answers 


Expanded Solutions to end-of-chapter problems in 
the textbook 


Molecular Evolution 


This icon signals the start of the many discussions that highlight protein
commonalities or other molecular evolutionary insights. 


Only L amino acids make up proteins (p. 27) 
Why this set of 20 amino acids? (p. 33) 
Additional human globin genes (p. 211) 
Fetal hemoglobin (p. 213) 
Catalytic triads in hydrolytic enzymes (p. 260) 
Major classes of peptide-cleaving enzymes (p. 263) 
Zinc-based active sites in carbonic anhydrases (p. 271) 
Common catalytic core in type II restriction 
enzymes (p. 278) 
P-loop NTPase domains (p. 283) 
Conserved catalytic core in protein kinases (p. 302) 
Why might human blood types differ? (p. 335) 
Archaeal membranes (p. 350) 
Ion pumps (p. 374) 
P-type ATPases (p. 378) 
ATP-binding cassettes (p. 378) 
Sequence comparisons of Na+ and Ca2+ channels (p. 386)
Small G proteins (p. 410) 
Metabolism in the RNA world (p. 447) 
Why is glucose a prominent fuel? (p. 455) 
NAD+ binding sites in dehydrogenases (p. 469)
The major facilitator superfamily of transporters (p. 477) 
Isozymic forms of lactate dehydrogenase (p. 490) 
Evolution of glycolysis and gluconeogenesis (p. 491) 
The α-ketoglutarate dehydrogenase complex (p. 507)
Domains of succinyl CoA synthase (p. 509) 
Evolution of the citric acid cycle (p. 518) 
Mitochondria evolution (p. 527) 
Conserved structure of cytochrome c (p. 543) 
Common features of ATP synthase and G proteins (p. 550) 
Related uncoupling proteins (p. 557) 
Chloroplast evolution (p. 568) 
Evolutionary origins of photosynthesis (p. 584) 
Evolution of the C4 pathway (p. 600) 
The coordination of the Calvin cycle and the pentose 
phosphate pathway (p. 609) 
Evolution of glycogen phosphorylase (p. 627) 


Increasing sophistication of glycogen phosphorylase 
regulation (p. 628) 

The α-amylase family (p. 629)

A recurring motif in the activation of carboxyl groups (p. 645) 

Prokaryotic counterparts of the ubiquitin pathway 
and the proteasome (p. 677) 

A family of pyridoxal-dependent enzymes (p. 684) 

Evolution of the urea cycle (p. 688) 

The P-loop NTPase domain in nitrogenase (p. 708) 

Similar transaminases determine amino acid chirality (p. 713) 

Feedback inhibition (p. 724) 

Recurring steps in purine ring synthesis (p. 741) 

Ribonucleotide reductases (p. 747) 

Increase in urate levels during primate evolution (p. 754) 

The cytochrome P450 superfamily (p. 783) 

DNA polymerases (p. 821) 

Thymine and the fidelity of the genetic message (p. 841) 

Sigma factors in bacterial transcription (p. 858) 

Similarities in transcription between archaea and 
eukaryotes (p. 869) 

Evolution of spliceosome-catalyzed splicing (p. 881) 

Classes of aminoacyl-tRNA synthetases (p. 897) 

Composition of the primordial ribosome (p. 900) 

Homologous G proteins (p. 903) 

A family of proteins with common ligand-binding 
domains (p. 926) 

The independent evolution of DNA-binding sites of 
regulatory proteins (p. 927) 

Regulation by attenuator sites (p. 932) 

CpG islands (p. 946) 

Iron-response elements (p. 952) 

miRNAs in gene evolution (p. 954) 

The odorant-receptor family (p. 959) 

Photoreceptor evolution (p. 969) 

The immunoglobulin fold (p. 984) 

Relationship of actin to hexokinase and prokaryotic 
proteins (p. 1019) 


Clinical Applications 


This icon signals the start of a clinical application in the text. Additional, briefer
clinical correlations appear in the text as appropriate.


Osteogenesis imperfecta (p. 45) 

Protein-misfolding diseases (p. 55) 

Protein modification and scurvy (p. 55) 

Antigen detection with ELISA (p. 88) 

Synthetic peptides as drugs (p. 96) 

Gene therapy (p. 167) 

Functional magnetic resonance imaging (p. 197) 

Carbon monoxide poisoning (p. 213) 

Sickle-cell anemia (p. 209) 

Thalassemia (p. 210)

Aldehyde dehydrogenase deficiency (p. 232) 

Action of penicillin (p. 244) 

Protease inhibitors (p. 264) 

Carbonic anhydrase and osteoporosis (p. 266) 

Isozymes as a sign of tissue damage (p. 297) 

Emphysema (p. 306) 

Vitamin K (p. 310) 

Hemophilia (p. 311) 

Tissue-type plasminogen activator (p. 312) 

Monitoring changes in glycosylated hemoglobin (p. 325) 

Erythropoietin (p. 330) 

Hurler disease (p. 331) 

Blood groups (p. 335) 

I-cell disease (p. 336) 

Influenza virus binding (p. 339) 

Clinical applications of liposomes (p. 354) 

Aspirin and ibuprofen (p. 358) 

Digitalis and congestive heart failure (p. 377)

Multidrug resistance (p. 378) 

Long QT syndrome (p. 392) 

Signal-transduction pathways and cancer (p. 420) 

Monoclonal antibodies as anticancer drugs (p. 421) 

Protein kinase inhibitors as anticancer drugs (p. 421) 

Vitamins (p. 441) 

Lactose intolerance (p. 471) 

Galactosemia (p. 472) 

Exercise and cancer (p. 478) 

Phosphatase deficiency (p. 514) 

Defects in the citric acid cycle and the development 
of cancer (p. 515) 

Beriberi and mercury poisoning (p. 517) 

Mitochondrial diseases (p. 558) 

Hemolytic anemia (p. 609) 

Glucose 6-phosphate dehydrogenase deficiency (p. 611)

Glycogen-storage diseases (p. 634) 

Carnitine deficiency (p. 646) 

Zellweger syndrome (p. 652) 

Diabetic ketosis (p. 655) 

The use of fatty acid synthase inhibitors as 
drugs (p. 663) 

Effects of aspirin on signaling pathways (p. 665) 


Diseases resulting from defects in E3 proteins (p. 676) 
Diseases of altered ubiquitination (p. 678) 
Using proteasome inhibitors to treat tuberculosis (p. 679) 


Inherited defects of the urea cycle (hyperammonemia) (p. 688) 


Alcaptonuria, maple syrup urine disease, and 
phenylketonuria (p. 697) 

High homocysteine levels and vascular disease (p. 719) 

Inherited disorders of porphyrin metabolism (p. 730) 


Anticancer drugs that block the synthesis of thymidylate (p. 749) 


Adenosine deaminase and severe combined 
immunodeficiency (p. 752) 

Gout (p. 753) 

Lesch-Nyhan syndrome (p. 754)

Folic acid and spina bifida (p. 755) 

Second messengers derived from sphingolipids and 
diabetes (p. 765) 

Respiratory distress syndrome and Tay-Sachs 
disease (p. 765) 

Diagnostic use of blood-cholesterol levels (p. 774) 

Hypercholesterolemia and atherosclerosis (p. 776) 

Mutations in the LDL receptor (p. 777) 

The role of HDL in protecting against 
arteriosclerosis (p. 778) 

Clinical management of cholesterol levels (p. 779) 

Aromatase inhibitors in the treatment of breast 
and ovarian cancer (p. 785) 

Rickets and vitamin D (p. 786) 

Antibiotics that target DNA gyrase (p. 831) 

Blocking telomerase to treat cancer (p. 837) 

Huntington disease (p. 842) 

Defective repair of DNA and cancer (p. 842) 

Detection of carcinogens (Ames test) (p. 843) 

Antibiotic inhibitors of transcription (p. 861) 

Burkitt lymphoma and B-cell leukemia (p. 869) 

Diseases of defective RNA splicing (p. 877) 

Vanishing white matter disease (p. 908) 

Antibiotics that inhibit protein synthesis (p. 909) 

Diphtheria (p. 910) 

Ricin, a lethal protein-synthesis inhibitor (p. 911) 

Induced pluripotent stem cells (p. 944) 

Anabolic steroids (p. 948) 

Color blindness (p. 970) 

The use of capsaicin in pain management (p. 974) 

Immune-system suppressants (p. 990) 

MHC and transplantation rejection (p. 998) 

AIDS vaccine (p. 999) 

Autoimmune diseases (p. 1001) 

Immune system and cancer (p. 1001) 

Vaccines (p. 1002) 

Charcot-Marie-Tooth disease (p. 1016) 

Taxol (p. 1019) 


Tools and Techniques 


The seventh edition of Biochemistry offers three chapters that present the tools
and techniques of biochemistry: “Exploring Proteins and Proteomes” (Chapter 3),
“Exploring Genes and Genomes” (Chapter 5), and “Exploring Evolution and
Bioinformatics” (Chapter 6). Additional experimental techniques are presented
throughout the book, as appropriate.


Exploring Proteins and Proteomes (Chapter 3) 


Protein purification (p. 66) 

Differential centrifugation (p. 67) 

Salting out (p. 68) 

Dialysis (p. 69) 

Gel-filtration chromatography (p. 69) 

Ion-exchange chromatography (p. 69) 

Affinity chromatography (p. 70) 

High-pressure liquid chromatography (p. 71) 

Gel electrophoresis (p. 71) 

Isoelectric focusing (p. 73) 

Two-dimensional electrophoresis (p. 74) 

Qualitative and quantitative evaluation of protein 
purification (p. 75) 

Ultracentrifugation (p. 76) 

Edman degradation (p. 80) 

Protein sequencing (p. 82) 

Production of polyclonal antibodies (p. 86) 

Production of monoclonal antibodies (p. 86) 

Enzyme-linked immunosorbent assay (ELISA) (p. 88)

Western blotting (p. 89) 

Fluorescence microscopy (p. 89) 

Green fluorescent protein as a marker (p. 89) 

Immunoelectron microscopy (p. 91) 

MALDI-TOF mass spectrometry (p. 91) 

Tandem mass spectrometry (p. 93) 

Proteomic analysis by mass spectrometry (p. 94) 

Automated solid-phase peptide synthesis (p. 95) 

X-ray crystallography (p. 98) 

Nuclear magnetic resonance spectroscopy (p. 101) 

NOESY spectroscopy (p. 102) 


Exploring Proteins (other chapters) 

Basis of fluorescence in green fluorescent protein (p. 58) 
Using irreversible inhibitors to map the active site (p. 241) 
Enzyme studies with catalytic antibodies (p. 243) 
Single-molecule studies (p. 246) 


Exploring Genes and Genomes (Chapter 5) 


Restriction-enzyme analysis (p. 141) 

Southern and northern blotting techniques (p. 142) 
Sanger dideoxy method of DNA sequencing (p. 143) 
Solid-phase synthesis of nucleic acids (p. 144) 
Polymerase chain reaction (PCR) (p. 145) 
Recombinant DNA technology (p. 148) 


Mutagenesis techniques (p. 156)


Biochemistry: An Evolving Science


Biochemistry is the study of the chemistry of life processes. Since the dis- 
covery that biological molecules such as urea could be synthesized from 
nonliving components in 1828, scientists have explored the chemistry of life 
with great intensity. Through these investigations, many of the most funda- 
mental mysteries of how living things function at a biochemical level have 
now been solved. However, much remains to be investigated. As is often the 
case, each discovery raises at least as many new questions as it answers. 
Furthermore, we are now in an age of unprecedented opportunity for the 
application of our tremendous knowledge of biochemistry to problems in 
medicine, dentistry, agriculture, forensics, anthropology, environmental 
sciences, and many other fields. We begin our journey into biochemistry 
with one of the most startling discoveries of the past century: namely, the 
great unity of all living things at the biochemical level. 


1.1 Biochemical Unity Underlies Biological Diversity 


The biological world is magnificently diverse. The animal kingdom is rich 
with species ranging from nearly microscopic insects to elephants and 
whales. The plant kingdom includes species as small and relatively simple
as algae and as large and complex as giant sequoias. This diversity extends
further when we descend into the microscopic world. Single-celled
organisms such as protozoa, yeast, and bacteria are present with great
diversity in water, in soil, and on or within larger organisms. Some
organisms can survive and even thrive in seemingly hostile environments
such as hot springs and glaciers.

Chemistry in action. Human activities require energy. The interconversion
of different forms of energy requires large biochemical machines
comprising many thousands of atoms such as the complex shown above.
Yet, the functions of these elaborate assemblies depend on simple
chemical processes such as the protonation and deprotonation of the
carboxylic acid groups shown on the right. The photograph is of Nobel
Prize winners Peter Agre, M.D., and Carol Greider, Ph.D., who used
biochemical techniques to study the structure and function of proteins.
[Courtesy of Johns Hopkins Medicine.]

The development of the microscope revealed a key unifying feature that 
underlies this diversity. Large organisms are built up of cells, resembling, to 
some extent, single-celled microscopic organisms. The construction of ani- 
mals, plants, and microorganisms from cells suggested that these diverse 
organisms might have more in common than is apparent from their outward 
appearance. With the development of biochemistry, this suggestion has 
been tremendously supported and expanded. At the biochemical level, all 
organisms have many common features (Figure 1.1). 

As mentioned earlier, biochemistry is the study of the chemistry of life
processes. These processes entail the interplay of two different classes of
molecules: large molecules such as proteins and nucleic acids, referred to as
biological macromolecules, and low-molecular-weight molecules such as glucose
and glycerol, referred to as metabolites, that are chemically transformed in
biological processes. Members of both these classes of molecules are
common, with minor variations, to all living things. For example,
deoxyribonucleic acid (DNA) stores genetic information in all cellular
organisms. Proteins, the macromolecules that are key participants in most
biological processes, are built from the same set of 20 building blocks in
all organisms. Furthermore, proteins that play similar roles in different
organisms often have very similar three-dimensional structures (see
Figure 1.1).


Figure 1.1 Biological diversity and similarity. The shape of a key molecule in gene regulation
(the TATA-box-binding protein) is similar in three very different organisms that are separated from
one another by billions of years of evolution. [(Left) Dr. T. J. Beveridge/Visuals Unlimited; (middle)
Holt Studios/Photo Researchers; (right) Time Life Pictures/Getty Images.]

Figure 1.2 A possible time line for biochemical evolution. Selected key events are indicated.
Note that life on Earth began approximately 3.5 billion years ago, whereas human beings emerged
quite recently.


Key metabolic processes also are common to many organisms. For 
example, the set of chemical transformations that converts glucose and oxy- 
gen into carbon dioxide and water is essentially identical in simple bacteria 
such as Escherichia coli (E. coli) and human beings. Even processes that 
appear to be quite distinct often have common features at the biochemical 
level. Remarkably, the biochemical processes by which plants capture light 
energy and convert it into more-useful forms are strikingly similar to steps 
used in animals to capture energy released from the breakdown of glucose. 
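
The overall transformation of glucose and oxygen into carbon dioxide and water mentioned above can be summarized in one balanced equation. This is the standard net reaction for the complete oxidation of glucose, offered here only as a summary; the many individual steps of the pathway are not shown.

```latex
\[
\mathrm{C_6H_{12}O_6} + 6\,\mathrm{O_2} \longrightarrow 6\,\mathrm{CO_2} + 6\,\mathrm{H_2O}
\]
```

Each side carries 6 carbon, 12 hydrogen, and 18 oxygen atoms, so the equation is balanced.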

These observations overwhelmingly suggest that all living things on 
Earth have a common ancestor and that modern organisms have evolved 
from this ancestor into their present forms. Geological and biochemical 
findings support a time line for this evolutionary path (Figure 1.2). On the 
basis of their biochemical characteristics, the diverse organisms of the 
modern world can be divided into three fundamental groups called 
domains: Eukarya (eukaryotes), Bacteria, and Archaea. Domain Eukarya 
comprises all multicellular organisms, including human beings as well as 
many microscopic unicellular organisms such as yeast. The defining char- 
acteristic of eukaryotes is the presence of a well-defined nucleus within 
each cell. Unicellular organisms such as bacteria, which lack a nucleus, are 
referred to as prokaryotes. The prokaryotes were reclassified as two
separate domains in response to Carl Woese’s discovery in 1977 that certain
bacteria-like organisms are biochemically quite distinct from other
previously characterized bacterial species. These organisms, now recognized
as having diverged from bacteria early in evolution, are the archaea.
Evolutionary paths from a common ancestor to modern organisms can be
deduced on the basis of biochemical information. One such path is shown in
Figure 1.3.

Much of this book will explore the chemical reactions and the associated
biological macromolecules and metabolites that are found in biological
processes common to all organisms. The unity of life at the biochemical
level makes this approach possible. At the same time, different organisms
have specific needs, depending on the particular biological niche in which
they evolved and live. By comparing and contrasting details of particular
biochemical pathways in different organisms, we can learn how biological
challenges are solved at the biochemical level. In most cases, these
challenges are addressed by the adaptation of existing macromolecules to
new roles rather than by the evolution of entirely new ones.

Figure 1.3 The tree of life. A possible evolutionary path from a common
ancestor approximately 3.5 billion years ago at the bottom of the tree to
organisms found in the modern world at the top.

Biochemistry has been greatly enriched by our ability to examine the
three-dimensional structures of biological macromolecules in great detail.
Some of these structures are simple and elegant, whereas others are
incredibly complicated but, in any case, these structures provide an
essential framework for understanding function. We begin our exploration of
the interplay between structure and function with the genetic material, DNA.


1.2 DNA Illustrates the Interplay Between 
Form and Function 


A fundamental biochemical feature common to all cellular organisms is the 
use of DNA for the storage of genetic information. The discovery that DNA 
plays this central role was first made in studies of bacteria in the 1940s. This 
discovery was followed by the elucidation of the three-dimensional struc- 
ture of DNA in 1953, an event that set the stage for many of the advances in 
biochemistry and many other fields, extending to the present. 

The structure of DNA powerfully illustrates a basic principle common 
to all biological macromolecules: the intimate relation between structure 
and function. The remarkable properties of this chemical substance allow it 
to function as a very efficient and robust vehicle for storing information. We 
start with an examination of the covalent structure of DNA and its exten- 
sion into three dimensions. 


DNA is constructed from four building blocks 


DNA is a linear polymer made up of four different types of monomers. It has a
fixed backbone from which protrude variable substituents (Figure 1.4). The
backbone is built of repeating sugar-phosphate units. The sugars are molecules
of deoxyribose, from which DNA receives its name. Each sugar is connected to
two phosphate groups through different linkages. Moreover, each sugar is
oriented in the same way, and so each DNA strand has directionality, with one
end distinguishable from the other. Joined to each deoxyribose is one of four
possible bases: adenine (A), cytosine (C), guanine (G), and thymine (T).
Figure 1.4 Covalent structure of DNA. Each unit of the polymeric structure is
composed of a sugar (deoxyribose), a phosphate, and a variable base that
protrudes from the sugar-phosphate backbone.


These bases are connected to the sugar components in the DNA back- 
bone through the bonds shown in black in Figure 1.4. All four bases are pla- 
nar but differ significantly in other respects. Thus, each monomer of DNA 
consists of a sugar—phosphate unit and one of four bases attached to the sugar. 
These bases can be arranged in any order along a strand of DNA. 
Figure 1.5 The double helix. The double-helical structure of DNA proposed by Watson and Crick.
The sugar-phosphate backbones of the two chains are shown in red and blue, and the bases are
shown in green, purple, orange, and yellow. The two strands are antiparallel, running in opposite
directions with respect to the axis of the double helix, as indicated by the arrows.


Two single strands of DNA combine to form a double helix 


Most DNA molecules consist of not one but two strands (Figure 1.5). In 
1953, James Watson and Francis Crick deduced the arrangement of these 
strands and proposed a three-dimensional structure for DNA molecules. 
This structure is a double helix composed of two intertwined strands
arranged such that the sugar-phosphate backbone lies on the outside and
the bases on the inside. The key to this structure is that the bases form
specific base pairs (bp) held together by hydrogen bonds (Section 1.3):
adenine pairs with thymine (A-T) and guanine pairs with cytosine (G-C), as
shown in Figure 1.6. Hydrogen bonds are much weaker than covalent bonds
such as the carbon-carbon or carbon-nitrogen bonds that define the
structures of the bases themselves. Such weak bonds are crucial to biochemical
systems; they are weak enough to be reversibly broken in biochemical pro- 
cesses, yet they are strong enough, when many form simultaneously, to 
help stabilize specific structures such as the double helix. 


DNA structure explains heredity and the storage of information 


The structure proposed by Watson and Crick has two properties of central 
importance to the role of DNA as the hereditary material. First, the struc- 
ture is compatible with any sequence of bases. The base pairs have essen- 
tially the same shape (see Figure 1.6) and thus fit equally well into the center 
of the double-helical structure of any sequence. Without any constraints, 
the sequence of bases along a DNA strand can act as an efficient means of 
storing information. Indeed, the sequence of bases along DNA strands is 
how genetic information is stored. The DNA sequence determines the 
sequences of the ribonucleic acid (RNA) and protein molecules that carry 
out most of the activities within cells. 

Second, because of base-pairing, the sequence of bases along one strand
completely determines the sequence along the other strand. As Watson and
Crick so coyly wrote: “It has not escaped our notice that the specific pairing
we have postulated immediately suggests a possible copying mechanism for
the genetic material.” Thus, if the DNA double helix is separated into two
single strands, each strand can act as a template for the generation of its
partner strand through specific base-pair formation (Figure 1.7). The
three-dimensional structure of DNA beautifully illustrates the close
connection between molecular form and function.


Figure 1.6 Watson-Crick base pairs. Adenine pairs with thymine (A-T), and guanine with
cytosine (G-C). The dashed green lines represent hydrogen bonds.


Figure 1.7 DNA replication. If a DNA molecule is separated into two strands,
each strand can act as the template for the generation of its partner strand.


Figure 1.8 Formation of a double helix. When two DNA strands with appropriate,
complementary sequences are mixed, they spontaneously assemble to form a
double helix.


1.3 Concepts from Chemistry Explain the Properties of 
Biological Molecules 


We have seen how a chemical insight, into the hydrogen-bonding capabili- 
ties of the bases of DNA, led to a deep understanding of a fundamental 
biological process. To lay the groundwork for the rest of the book, we begin 
our study of biochemistry by examining selected concepts from chemistry 
and showing how these concepts apply to biological systems. The concepts 
include the types of chemical bonds; the structure of water, the solvent in 
which most biochemical processes take place; the First and Second Laws of 
Thermodynamics; and the principles of acid—base chemistry. We will use 
these concepts to examine an archetypical biochemical process—namely, 
the formation of a DNA double helix from its two component strands. The 
process is but one of many examples that could have been chosen to illus- 
trate these topics. Keep in mind that, although the specific discussion is 
about DNA and double-helix formation, the concepts considered are quite 
general and will apply to many other classes of molecules and processes that 
will be discussed in the remainder of the book. 


The double helix can form from its component strands 


The discovery that DNA from natural sources exists in a double-helical 
form with Watson—Crick base pairs suggested, but did not prove, that such 
double helices would form spontaneously outside biological systems. 
Suppose that two short strands of DNA were chemically synthesized to 
have complementary sequences so that they could, in principle, form a 
double helix with Watson—Crick base pairs. Two such sequences are 
CGATTAAT and ATTAATCG. The structures of these molecules in 
solution can be examined by a variety of techniques. In isolation, each 
sequence exists almost exclusively as a single-stranded molecule. However, 
when the two sequences are mixed, a double helix with Watson—Crick base 
pairs does form (Figure 1.8). This reaction proceeds nearly to completion. 
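
The complementarity of the two example sequences can be checked mechanically. The following is a minimal sketch, assuming both sequences are written in the same direction, so that the partner of one strand is its base-by-base complement read in reverse (the two strands of a double helix are antiparallel). The function names are hypothetical and chosen only for illustration.

```python
# Watson-Crick pairing rules from the text: A pairs with T, G pairs with C.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(strand: str) -> str:
    """Return the partner strand, read in the same direction as the input.

    Assumption: both strands are written in the same direction, so the
    partner is the complement of the input read backward.
    """
    return "".join(PAIRS[base] for base in reversed(strand))


def can_form_duplex(strand_a: str, strand_b: str) -> bool:
    """True if every base of strand_a can pair with strand_b antiparallel."""
    return reverse_complement(strand_a) == strand_b


print(reverse_complement("CGATTAAT"))           # ATTAATCG
print(can_form_duplex("CGATTAAT", "ATTAATCG"))  # True
```

Under this convention, CGATTAAT and ATTAATCG are exact partners, which is consistent with the observation that the mixture forms a double helix.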
What forces cause the two strands of DNA to bind to each other? To
analyze this binding reaction, we must consider several factors: the types 
of interactions and bonds in biochemical systems and the energetic favor- 
ability of the reaction. We must also consider the influence of the solution 
conditions—in particular, the consequences of acid—base reactions. 


Covalent and noncovalent bonds are important for the structure and 
stability of biological molecules 


Atoms interact with one another through chemical bonds. These bonds 
include the covalent bonds that define the structure of molecules as well as a
variety of noncovalent bonds that are of great importance to biochemistry. 


Covalent bonds. The strongest bonds are covalent bonds, such as the
bonds that hold the atoms together within the individual bases shown on
page 4. A covalent bond is formed by the sharing of a pair of electrons
between adjacent atoms. A typical carbon-carbon (C-C) covalent bond has
a bond length of 1.54 Å and bond energy of 355 kJ mol⁻¹ (85 kcal mol⁻¹).
Because covalent bonds are so strong, considerable energy must be expended
to break them. More than one electron pair can be shared between two
atoms to form a multiple covalent bond. For example, three of the bases in
Figure 1.6 include carbon-oxygen (C=O) double bonds. These bonds are
even stronger than C-C single bonds, with energies near 730 kJ mol⁻¹
(175 kcal mol⁻¹), and are somewhat shorter.

For some molecules, more than one pattern of covalent bonding can be 
written. For example, adenine can be written in two equivalent ways called 
resonance structures. 
These adenine structures depict alternative arrangements of single and
double bonds that are possible within the same structural framework.
Resonance structures are shown connected by a double-headed arrow.
Adenine’s true structure is a composite of its two resonance structures. The
composite structure is manifested in the bond lengths such as that for
the bond joining carbon atoms C-4 and C-5. The observed bond length
of 1.40 Å is between that expected for a C-C single bond (1.54 Å) and a
C=C double bond (1.34 Å). A molecule that can be written as several
resonance structures of approximately equal energies has greater stability
than does a molecule without multiple resonance structures.


Noncovalent bonds. Noncovalent bonds are weaker than covalent bonds 
but are crucial for biochemical processes such as the formation of a double 
helix. Four fundamental noncovalent bond types are electrostatic interactions, 
hydrogen bonds, van der Waals interactions, and hydrophobic interactions. They 
differ in geometry, strength, and specificity. Furthermore, these bonds are 
affected in vastly different ways by the presence of water. Let us consider 
the characteristics of each type: 


1. Electrostatic Interactions. A charged group on one molecule can attract
an oppositely charged group on another molecule. The energy of an
electrostatic interaction is given by Coulomb’s law:


$E = \dfrac{k q_1 q_2}{D r}$


where $E$ is the energy, $q_1$ and $q_2$ are the charges on the two atoms (in
units of the electronic charge), $r$ is the distance between the two atoms (in
angstroms), $D$ is the dielectric constant (which accounts for the effects of
the intervening medium), and $k$ is a proportionality constant ($k$ = 1389, for
energies in units of kilojoules per mole, or 332 for energies in kilocalories
per mole).


Distance and energy units

Interatomic distances and bond lengths are usually measured in angstrom (Å)
units:

1 Å = 10⁻¹⁰ m = 10⁻⁸ cm = 0.1 nm

Several energy units are in common use. One joule (J) is the amount of energy
required [...] of water 1 degree Celsius. A kilocalorie (kcal) [...]


Figure 1.9 Hydrogen bonds. Hydrogen bonds are depicted by dashed green
lines. The positions of the partial charges (δ⁺ and δ⁻) are shown.


Figure 1.10 Energy of a van der Waals interaction as two atoms approach each
other. The energy is most favorable at the van der Waals contact distance.
Owing to electron-electron repulsion, the energy rises rapidly as the distance
between the atoms becomes shorter than the contact distance.

By convention, an attractive interaction has a negative energy. The
electrostatic interaction between two ions bearing single opposite charges
separated by 3 Å in water (which has a dielectric constant of 80) has an energy
of -5.8 kJ mol⁻¹ (-1.4 kcal mol⁻¹). Note how important the dielectric constant
of the medium is. For the same ions separated by 3 Å in a nonpolar solvent
such as hexane (which has a dielectric constant of 2), the energy of this
interaction is -232 kJ mol⁻¹ (-55 kcal mol⁻¹).
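
The two energies quoted above follow directly from Coulomb's law. The short sketch below reproduces the arithmetic, using the constant k = 1389 (kJ mol⁻¹) and the dielectric constants given in the text; the function name is a hypothetical choice for illustration.

```python
def coulomb_energy(q1, q2, distance_angstrom, dielectric, k=1389.0):
    """Electrostatic energy E = k*q1*q2 / (D*r).

    q1, q2: charges in units of the electronic charge.
    distance_angstrom: separation r in angstroms.
    dielectric: dielectric constant D of the medium.
    k: 1389 gives kJ/mol (use 332 for kcal/mol), as stated in the text.
    """
    return k * q1 * q2 / (dielectric * distance_angstrom)


# Two singly charged, oppositely charged ions 3 angstroms apart.
in_water = coulomb_energy(+1, -1, 3.0, dielectric=80.0)
in_hexane = coulomb_energy(+1, -1, 3.0, dielectric=2.0)

print(f"water:  {in_water:.1f} kJ/mol")   # water:  -5.8 kJ/mol
print(f"hexane: {in_hexane:.1f} kJ/mol")  # hexane: -231.5 kJ/mol
```

The 40-fold difference between the two results is simply the ratio of the dielectric constants (80/2).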


2. Hydrogen Bonds. These interactions are fundamentally electrostatic 
interactions. Hydrogen bonds are responsible for specific base-pair for- 
mation in the DNA double helix. The hydrogen atom in a hydrogen bond 
is partially shared by two electronegative atoms such as nitrogen or oxy- 
gen. The hydrogen-bond donor is the group that includes both the atom to 
which the hydrogen atom is more tightly linked and the hydrogen atom 
itself, whereas the hydrogen-bond acceptor is the atom less tightly linked to 
the hydrogen atom (Figure 1.9). The electronegative atom to which the 
hydrogen atom is covalently bonded pulls electron density away from the 
hydrogen atom, which thus develops a partial positive charge (δ⁺). Thus,
the hydrogen atom can interact with an atom having a partial negative
charge (δ⁻) through an electrostatic interaction.

Hydrogen bonds are much weaker than covalent bonds. They have energies
ranging from 4 to 20 kJ mol⁻¹ (from 1 to 5 kcal mol⁻¹). Hydrogen
bonds are also somewhat longer than covalent bonds; their bond lengths
(measured from the hydrogen atom) range from 1.5 Å to 2.6 Å; hence, a
distance ranging from 2.4 Å to 3.5 Å separates the two nonhydrogen atoms in
a hydrogen bond. The strongest hydrogen bonds have a tendency to be
approximately straight, such that the hydrogen-bond donor, the hydrogen 
atom, and the hydrogen-bond acceptor lie along a straight line. Hydrogen- 
bonding interactions are responsible for many of the properties of water that 
make it such a special solvent, as will be described shortly. 


3. van der Waals Interactions. The basis of a van der Waals interaction is 
that the distribution of electronic charge around an atom fluctuates with 
time. At any instant, the charge distribution is not perfectly symmetric. 
This transient asymmetry in the electronic charge about an atom acts 
through electrostatic interactions to induce a complementary asymmetry 
in the electron distribution within its neighboring atoms. The atom and 
its neighbors then attract one another. This attraction increases as two 
atoms come closer to each other, until they are separated by the van der 
Waals contact distance (Figure 1.10). At distances shorter than the van der 
Waals contact distance, very strong repulsive forces become dominant 
because the outer electron clouds of the two atoms overlap. 

Energies associated with van der Waals interactions are quite small;
typical interactions contribute from 2 to 4 kJ mol⁻¹ (from 0.5 to 1 kcal
mol⁻¹) per atom pair. When the surfaces of two large molecules come
together, however, a large number of atoms are in van der Waals contact,
and the net effect, summed over many atom pairs, can be substantial.
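
The text does not give a functional form for the energy curve of a van der Waals interaction. As a rough illustration only, the sketch below uses the Lennard-Jones 12-6 potential, a common model that is an assumption here and not a formula from this chapter. The well depth (3 kJ mol⁻¹) and contact distance (3.6 Å) are illustrative values, and the function name is hypothetical.

```python
def lennard_jones(r, well_depth=3.0, contact_distance=3.6):
    """Lennard-Jones 12-6 energy in kJ/mol (assumed model, not from the text).

    r: separation between the two atoms, in angstroms.
    well_depth: depth of the energy minimum, in kJ/mol (illustrative value).
    contact_distance: separation at which the energy is most favorable,
        in angstroms (illustrative value).
    """
    ratio = contact_distance / r
    return well_depth * (ratio ** 12 - 2.0 * ratio ** 6)


for r in (3.0, 3.6, 5.0):
    print(f"r = {r:.1f} A  E = {lennard_jones(r):+.2f} kJ/mol")
```

In this model the energy is most favorable (equal to minus the well depth) at the contact distance, rises steeply to positive values at shorter separations, and approaches zero at longer separations, which matches the qualitative behavior described above.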


Properties of water. Water is the solvent in which most biochemical reac- 
tions take place, and its properties are essential to the formation of macro- 
molecular structures and the progress of chemical reactions. Two properties 
of water are especially relevant: 


1. Water is a polar molecule. The water molecule is bent, not linear, and so
the distribution of charge is asymmetric. The oxygen nucleus draws
electrons away from the two hydrogen nuclei, which leaves the region around
each hydrogen atom with a net positive charge. The water molecule is thus
an electrically polar structure.


2. Water is highly cohesive. Water molecules interact strongly with one 
another through hydrogen bonds. These interactions are apparent in the 
structure of ice (Figure 1.11). Networks of hydrogen bonds hold the struc- 
ture together; similar interactions link molecules in liquid water and account 
for the cohesion of liquid water, although, in the liquid state, approximately 
one-fourth of the hydrogen bonds present in ice are broken. The polar 
nature of water is responsible for its high dielectric constant of 80. Molecules 
in aqueous solution interact with water molecules through the formation of 
hydrogen bonds and through ionic interactions. These interactions make 
water a versatile solvent, able to readily dissolve many species, especially 
polar and charged compounds that can participate in these interactions. 


The hydrophobic effect. A final fundamental interaction called the hydro- 
phobic effect is a manifestation of the properties of water. Some molecules 
(termed nonpolar molecules) cannot participate in hydrogen bonding or ionic 
interactions. The interactions of nonpolar molecules with water molecules 
are not as favorable as are interactions between the water molecules them- 
selves. The water molecules in contact with these nonpolar molecules form 
“cages” around them, becoming more well ordered than water molecules 
free in solution. However, when two such nonpolar molecules come 
together, some of the water molecules are released, allowing them to interact 
freely with bulk water (Figure 1.12). The release of water from such cages is 
favorable for reasons to be considered shortly. The result is that nonpolar
molecules show an increased tendency to associate with one another in
water compared with other, less polar and less self-associating, solvents.
This tendency is called the hydrophobic effect and the associated
interactions are called hydrophobic interactions.


Figure 1.11 Structure of ice. Hydrogen bonds (shown as dashed green lines) are
formed between water molecules to produce a highly ordered and open structure.


Figure 1.12 The hydrophobic effect. The aggregation of nonpolar groups in
water leads to the release of water molecules, initially interacting with the
nonpolar surface, into bulk water. The release of water molecules into
solution makes the aggregation of nonpolar groups favorable.


Figure 1.13 Electrostatic interactions in DNA. Each unit within the double
helix includes a phosphate group (the phosphorus atom being shown in purple)
that bears a negative charge. The unfavorable interactions of one phosphate
with several others are shown by red lines. These repulsive interactions
oppose the formation of a double helix.


Figure 1.14 Base stacking. In the DNA double helix, adjacent base pairs are
stacked nearly on top of one another, and so many atoms in each base pair are
separated by their van der Waals contact distance. The central base pair is
shown in dark blue and the two adjacent base pairs in light blue. Several van
der Waals contacts are shown in red.


The double helix is an expression of the rules of chemistry 


Let us now see how these four noncovalent interactions work together in 
driving the association of two strands of DNA to form a double helix. First, 
each phosphate group in a DNA strand carries a negative charge. These 
negatively charged groups interact unfavorably with one another over dis- 
tances. Thus, unfavorable electrostatic interactions take place when two 
strands of DNA come together. These phosphate groups are far apart in the 
double helix with distances greater than 10 Å, but many such interactions
take place (Figure 1.13). Thus, electrostatic interactions oppose the
formation of the double helix. The strength of these repulsive electrostatic
interactions is diminished by the high dielectric constant of water and the
presence of ionic species such as Na⁺ or Mg²⁺ ions in solution. These
positively charged species interact with the phosphate groups and partly 
neutralize their negative charges. 

Second, as already noted, hydrogen bonds are important in determining 
the formation of specific base pairs in the double helix. However, in single- 
stranded DNA, the hydrogen-bond donors and acceptors are exposed to 
solution and can form hydrogen bonds with water molecules. 
When two single strands come together, these hydrogen bonds with water
are broken and new hydrogen bonds between the bases are formed. Because 
the number of hydrogen bonds broken is the same as the number formed, 
these hydrogen bonds do not contribute substantially to driving the overall 
process of double-helix formation. However, they contribute greatly to the 
specificity of binding. Suppose two bases that cannot form Watson—Crick 
base pairs are brought together. Hydrogen bonds with water must be bro- 
ken as the bases come into contact. Because the bases are not complemen- 
tary in structure, not all of these bonds can be simultaneously replaced by 
hydrogen bonds between the bases. Thus, the formation of a double helix 
between noncomplementary sequences is disfavored. 

Third, within a double helix, the base pairs are parallel and stacked 
nearly on top of one another. The typical separation between the planes of 
adjacent base pairs is 3.4 Å, and the distances between the most closely
approaching atoms are approximately 3.6 Å. This separation distance
corresponds nicely to the van der Waals contact distance (Figure 1.14). Bases
tend to stack even in single-stranded DNA molecules. However, the base 
stacking and associated van der Waals interactions are nearly optimal in a 
double-helical structure. 

Fourth, the hydrophobic effect also contributes to the favorability of 
base stacking. More-complete base stacking moves the nonpolar surfaces of 
the bases out of water into contact with each other. 

The principles of double-helix formation between two strands of DNA
apply to many other biochemical processes. Many weak interactions
contribute to the overall energetics of the process, some favorably and some
unfavorably. Furthermore, surface complementarity is a key feature: when
complementary surfaces meet, hydrogen-bond donors align with hydrogen- 
bond acceptors and nonpolar surfaces come together to maximize van der 
Waals interactions and minimize nonpolar surface area exposed to the aque- 
ous environment. The properties of water play a major role in determining 
the importance of these interactions. 


The laws of thermodynamics govern the behavior of biochemical systems 


We can look at the formation of the double helix from a different perspec- 
tive by examining the laws of thermodynamics. These laws are general 
principles that apply to all physical (and biological) processes. They are of 
great importance because they determine the conditions under which spe- 
cific processes can or cannot take place. We will consider these laws from a 
general perspective first and then apply the principles that we have devel- 
oped to the formation of the double helix. 

The laws of thermodynamics distinguish between a system and its sur- 
roundings. A system refers to the matter within a defined region of space. 
The matter in the rest of the universe is called the surroundings. The First 
Law of Thermodynamics states that the total energy of a system and its sur- 
roundings is constant. In other words, the energy content of the universe is 
constant; energy can be neither created nor destroyed. Energy can take dif- 
ferent forms, however. Heat, for example, is one form of energy. Heat is a 
manifestation of the kinetic energy associated with the random motion of 
molecules. Alternatively, energy can be present as potential energy—energy 
that will be released on the occurrence of some process. Consider, for 
example, a ball held at the top of a tower. The ball has considerable potential 
energy because, when it is released, the ball will develop kinetic energy asso- 
ciated with its motion as it falls. Within chemical systems, potential energy 
is related to the likelihood that atoms can react with one another. For 
instance, a mixture of gasoline and oxygen has a large potential energy 
because these molecules may react to form carbon dioxide and water and 
release energy as heat. The First Law requires that any energy released in 
the formation of chemical bonds must be used to break other bonds, released 
as heat, or stored in some other form. 

Another important thermodynamic concept is that of entropy, a measure 
of the degree of randomness or disorder in a system. The Second Law of 
Thermodynamics states that the total entropy of a system plus that of its sur- 
roundings always increases. For example, the release of water from nonpolar 
surfaces responsible for the hydrophobic effect is favorable because water 
molecules free in solution are more disordered than they are when they are 
associated with nonpolar surfaces. At first glance, the Second Law appears to 
contradict much common experience, particularly about biological systems. 
Many biological processes, such as the generation of a leaf from carbon diox- 
ide gas and other nutrients, clearly increase the level of order and hence 
decrease entropy. Entropy may be decreased locally in the formation of such 
ordered structures only if the entropy of other parts of the universe is increased 
by an equal or greater amount. The local decrease in entropy is often accom- 
plished by a release of heat, which increases the entropy of the surroundings. 

We can analyze this process in quantitative terms. First, consider the
system. The entropy ($S$) of the system may change in the course of a
chemical reaction by an amount $\Delta S_{\text{system}}$. If heat flows from
the system to its surroundings, then the heat content, often referred to as
the enthalpy ($H$), of the system will be reduced by an amount
$\Delta H_{\text{system}}$. To apply the Second Law, we must determine the
change in entropy of the surroundings. If heat flows from the system to the
surroundings, then the entropy of the surroundings will increase. The precise
change in the entropy of the surroundings depends on the temperature; the
change in entropy is greater when heat is added to relatively cold
surroundings than when heat is added to surroundings at high temperatures that
are already in a high degree of disorder. To be even more specific, the change
in the entropy of the surroundings will be proportional to the amount of heat
transferred from the system and inversely proportional to the temperature
($T$) of the surroundings. In biological systems, $T$ [in kelvins (K),
absolute temperature] is usually assumed to be constant. Thus, a change in the
entropy of the surroundings is given by


$\Delta S_{\text{surroundings}} = -\Delta H_{\text{system}}/T$    (1)

The total entropy change is given by the expression

$\Delta S_{\text{total}} = \Delta S_{\text{system}} + \Delta S_{\text{surroundings}}$    (2)

Substituting equation 1 into equation 2 yields

$\Delta S_{\text{total}} = \Delta S_{\text{system}} - \Delta H_{\text{system}}/T$    (3)

Multiplying by $-T$ gives

$-T\Delta S_{\text{total}} = \Delta H_{\text{system}} - T\Delta S_{\text{system}}$    (4)


The function $-T\Delta S$ has units of energy and is referred to as free energy or
Gibbs free energy, after Josiah Willard Gibbs, who developed this function
in 1878:


$\Delta G = \Delta H_{\text{system}} - T\Delta S_{\text{system}}$    (5)


The free-energy change, AG, will be used throughout this book to describe 
the energetics of biochemical reactions. The Gibbs free energy is essentially 
an accounting tool that keeps track of both the entropy of the system 
(directly) and the entropy of the surroundings (in the form of heat released 
from the system). 

Recall that the Second Law of Thermodynamics states that, for a process 
to take place, the entropy of the universe must increase. Examination of 
equation 3 shows that the total entropy will increase if and only if 


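$\Delta S_{\text{system}} > \Delta H_{\text{system}}/T$    (6)


Rearranging gives $T\Delta S_{\text{system}} > \Delta H_{\text{system}}$ or, in other words, entropy will increase
if and only if


$\Delta G = \Delta H_{\text{system}} - T\Delta S_{\text{system}} < 0$    (7)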


Thus, the free-energy change must be negative for a process to take place 
spontaneously. There is negative free-energy change when and only when the 
overall entropy of the universe is increased. Again, the free energy represents 
a single term that takes into account both the entropy of the system and the 
entropy of the surroundings. 
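
The equivalence between a negative free-energy change and an increase in total entropy can be checked numerically. The sketch below is illustrative only: the enthalpy, entropy, and temperature values are hypothetical numbers chosen for the example, not data from the text, and the function names are assumptions.

```python
import math


def total_entropy_change(dH_system, dS_system, T):
    """Equation 3: dS_total = dS_system - dH_system / T (J mol^-1 K^-1)."""
    return dS_system - dH_system / T


def gibbs_free_energy_change(dH_system, dS_system, T):
    """Equation 5: dG = dH_system - T * dS_system (J mol^-1)."""
    return dH_system - T * dS_system


# Hypothetical process: releases heat but lowers the entropy of the system.
dH = -40_000.0  # J/mol (illustrative value)
dS = -100.0     # J/(mol K) (illustrative value)
T = 298.0       # K (assumed temperature)

dS_total = total_entropy_change(dH, dS, T)
dG = gibbs_free_energy_change(dH, dS, T)

print(f"dS_total = {dS_total:.1f} J/(mol K)")  # dS_total = 34.2 J/(mol K)
print(f"dG       = {dG:.0f} J/mol")            # dG       = -10200 J/mol

# Equation 4: dG equals -T * dS_total, so dG < 0 exactly when dS_total > 0.
print(math.isclose(dG, -T * dS_total))         # True
```

In this hypothetical case the system becomes more ordered, yet the heat released raises the entropy of the surroundings enough for the total entropy to increase, and the free-energy change is negative.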


Heat is released in the formation of the double helix 


Let us see how the principles of thermodynamics apply to the formation of 
the double helix (Figure 1.15). Suppose solutions containing each of the two 
single strands are mixed. Before the double helix forms, each of the single 
strands is free to translate and rotate in solution, whereas each matched pair 
of strands in the double helix must move together. Furthermore, the free 
single strands exist in more conformations than possible when bound 
together in a double helix. Thus, the formation of a double helix from two 
single strands appears to result in an increase in order for the system, that is, 
a decrease in the entropy of the system. 


On the basis of this analysis, we expect that the double helix cannot form
without violating the Second Law of Thermodynamics unless heat is released to
increase the entropy of the surroundings. Experimentally, we can measure the
heat released by allowing the solutions containing the two single strands to
come together within a water bath, which here corresponds to the surroundings.
We then determine how much heat must be absorbed by the water bath or released
from it to maintain it at a constant temperature. This experiment reveals that
a substantial amount of heat is released: approximately 250 kJ mol⁻¹
(60 kcal mol⁻¹). This experimental result reveals that the change in enthalpy
for the process is quite large, -250 kJ mol⁻¹, consistent with our expectation
that significant heat would have to be released to the surroundings for the
process not to violate the Second Law. We see in quantitative terms how order
within a system can be increased by releasing sufficient heat to the
surroundings to ensure that the entropy of the universe increases. We will
encounter this general theme again and again throughout this book.
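
To put a number on this argument, the measured enthalpy change can be converted into an entropy gain for the surroundings by using equation 1. The sketch below assumes a temperature of 298 K, which is not stated in the text; the function name is hypothetical.

```python
def surroundings_entropy_change(dH_system, T):
    """Equation 1: dS_surroundings = -dH_system / T (J mol^-1 K^-1)."""
    return -dH_system / T


DH_HELIX = -250_000.0  # J/mol, the enthalpy change quoted in the text
T_ASSUMED = 298.0      # K, an assumed temperature (not given in the text)

dS_surr = surroundings_entropy_change(DH_HELIX, T_ASSUMED)
print(f"dS_surroundings = {dS_surr:.0f} J/(mol K)")  # dS_surroundings = 839 J/(mol K)
```

Under this assumption, helix formation is consistent with the Second Law as long as the entropy of the system falls by less than about 839 J mol⁻¹ K⁻¹.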


Acid-base reactions are central in many 
biochemical processes 


Throughout our consideration of the formation of the double helix, we have
dealt only with the noncovalent bonds that are formed or broken in this
process. Many biochemical processes entail the formation and cleavage of
covalent bonds. A particularly important class of reactions prominent in
biochemistry is acid-base reactions.

In acid and base reactions, hydrogen ions are added to molecules or removed
from them. Throughout the book, we will encounter many processes in which the
addition or removal of hydrogen atoms is crucial, such as the metabolic
processes by which carbohydrates are consumed to release energy for other
uses. Thus, a thorough understanding of the basic principles of these
reactions is essential.

A hydrogen ion, often written as H⁺, corresponds to a proton. In fact,
hydrogen ions exist in solution bound to water molecules, thus forming
what are known as hydronium ions, H₃O⁺. For simplicity, we will continue
to write H⁺, but we should keep in mind that H⁺ is shorthand for the
actual species present.

The concentration of hydrogen ions in solution is expressed as the pH. 
Specifically, the pH of a solution is defined as 


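$$\mathrm{pH} = -\log[\mathrm{H^+}]$$

where $[\mathrm{H^+}]$ is in units of molarity. Thus, pH 7.0 refers to a solution for
which $-\log[\mathrm{H^+}] = 7.0$, and so $\log[\mathrm{H^+}] = -7.0$ and
$[\mathrm{H^+}] = 10^{\log[\mathrm{H^+}]} = 10^{-7.0} = 1.0 \times 10^{-7}$ M.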


Figure 1.15 Double-helix formation and 
entropy. When solutions containing DNA 
strands with complementary sequences are 
mixed, the strands react to form double 
helices. This process results in a loss of 
entropy from the system, indicating that heat 
must be released to the surroundings to 
prevent a violation of the Second Law of 
Thermodynamics. 
[Plot for Figure 1.16: fraction of molecules in double-helical form versus pH]


Figure 1.16 DNA denaturation by the 
addition of a base. The addition of a base 
to a solution of double-helical DNA initially at 
pH 7 causes the double helix to separate into 
single strands. The process is half complete at 
slightly above pH 9. 


The pH also indirectly expresses the concentration of hydroxide ions,
$[\mathrm{OH^-}]$, in solution. To see how, we must realize that water molecules
dissociate to form $\mathrm{H^+}$ and $\mathrm{OH^-}$ ions in an equilibrium process.


$$\mathrm{H_2O} \rightleftharpoons \mathrm{H^+} + \mathrm{OH^-}$$


The equilibrium constant (K) for the dissociation of water is defined as 


$$K = [\mathrm{H^+}][\mathrm{OH^-}]/[\mathrm{H_2O}]$$


and has a value of $K = 1.8 \times 10^{-16}$. Note that an equilibrium constant does
not formally have units. Nonetheless, the value of the equilibrium constant 
given assumes that particular units are used for concentration; in this case 
and in most others, units of molarity (M) are assumed. 

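The concentration of water, $[\mathrm{H_2O}]$, in pure water is 55.5 M, and this
concentration is constant under most conditions. Thus, we can define a new
constant, $K_\mathrm{w}$:

$$K_\mathrm{w} = K[\mathrm{H_2O}] = [\mathrm{H^+}][\mathrm{OH^-}]$$

$$K[\mathrm{H_2O}] = 1.8 \times 10^{-16} \times 55.5 = 1.0 \times 10^{-14}$$

Because $K_\mathrm{w} = [\mathrm{H^+}][\mathrm{OH^-}] = 1.0 \times 10^{-14}$, we can calculate

$$[\mathrm{OH^-}] = 10^{-14}/[\mathrm{H^+}] \quad \text{and} \quad [\mathrm{H^+}] = 10^{-14}/[\mathrm{OH^-}]$$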


With these relations in hand, we can easily calculate the concentration of
hydroxide ions in an aqueous solution, given the pH. For example, at pH =
7.0, we know that $[\mathrm{H^+}] = 10^{-7}$ M and so
$[\mathrm{OH^-}] = 10^{-14}/10^{-7} = 10^{-7}$ M. In acidic solutions, the
concentration of hydrogen ions is higher than $10^{-7}$ M and, hence, the pH is
below 7. For example, in 0.1 M HCl, $[\mathrm{H^+}] = 10^{-1}$ M and so
pH = 1.0 and $[\mathrm{OH^-}] = 10^{-14}/10^{-1} = 10^{-13}$ M.
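
The relations above can be checked numerically. The following is a minimal sketch, assuming the value of the ion product of water quoted in the text; the function names are illustrative and not part of the original.

```python
import math

K_W = 1.0e-14  # ion product of water, [H+][OH-], as quoted in the text


def hydrogen_ion_conc(ph: float) -> float:
    """Return [H+] in mol/L for a given pH, from pH = -log[H+]."""
    return 10.0 ** (-ph)


def hydroxide_ion_conc(ph: float) -> float:
    """Return [OH-] in mol/L, from Kw = [H+][OH-]."""
    return K_W / hydrogen_ion_conc(ph)


def ph_from_hydrogen_ion_conc(conc: float) -> float:
    """Return the pH for a given [H+] in mol/L."""
    return -math.log10(conc)


if __name__ == "__main__":
    # The two worked cases from the text: neutral water and 0.1 M HCl.
    for ph in (7.0, 1.0):
        print(f"pH {ph}: [H+] = {hydrogen_ion_conc(ph):.1e} M, "
              f"[OH-] = {hydroxide_ion_conc(ph):.1e} M")
    print(f"0.1 M HCl has pH {ph_from_hydrogen_ion_conc(0.1):.1f}")
```

The sketch treats concentrations as ideal (no activity corrections), which matches the level of approximation used in the text.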


Acid-base reactions can disrupt the double helix


The reaction that we have been considering between two strands of DNA to 
form a double helix takes place readily at pH 7.0. Suppose that we take the 
solution containing the double-helical DNA and treat it with a solution of 
concentrated base (i.e., with a high concentration of $\mathrm{OH^-}$). As the base is
added, we monitor the pH and the fraction of DNA in double-helical form 
(Figure 1.16). When the first additions of base are made, the pH rises, but 
the concentration of the double-helical DNA does not change significantly. 
However, as the pH approaches 9, the DNA double helix begins to dissoci- 
ate into its component single strands. As the pH continues to rise from 9 to 
10, this dissociation becomes essentially complete. Why do the two strands 
dissociate? The hydroxide ions can react with bases in DNA base pairs to 
remove certain protons. The most susceptible proton is the one bound to the 
N-1 nitrogen atom in a guanine base. 


[Structural diagram: guanine (G) and the form produced when the proton on N-1 is removed]


Proton dissociation for a substance HA has an equilibrium constant defined
by the expression

$$K_\mathrm{a} = [\mathrm{H^+}][\mathrm{A^-}]/[\mathrm{HA}]$$

The susceptibility of a proton to removal by reaction with a base is described
by its p$K_\mathrm{a}$ value:

$$\mathrm{p}K_\mathrm{a} = -\log(K_\mathrm{a})$$

When the pH is equal to the p$K_\mathrm{a}$, we have

$$\mathrm{pH} = \mathrm{p}K_\mathrm{a}$$

and so

$$-\log[\mathrm{H^+}] = -\log([\mathrm{H^+}][\mathrm{A^-}]/[\mathrm{HA}])$$

and

$$[\mathrm{H^+}] = [\mathrm{H^+}][\mathrm{A^-}]/[\mathrm{HA}]$$

Dividing by $[\mathrm{H^+}]$ reveals that

$$1 = [\mathrm{A^-}]/[\mathrm{HA}]$$

and so

$$[\mathrm{A^-}] = [\mathrm{HA}]$$

Thus, when the pH equals the p$K_\mathrm{a}$, the concentration of the deprotonated
form of the group or molecule is equal to the concentration of the
protonated form; the deprotonation process is halfway to completion.

The p$K_\mathrm{a}$ for the proton on N-1 of guanine is typically 9.7. When the pH
approaches this value, the proton on N-1 is lost (see Figure 1.16). Because
this proton participates in an important hydrogen bond, its loss substan- 
tially destabilizes the DNA double helix. The DNA double helix is also 
destabilized by low pH. Below pH 5, some of the hydrogen bond acceptors 
that participate in base-pairing become protonated. In their protonated 
forms, these bases can no longer form hydrogen bonds and the double helix 
separates. Thus, acid-base reactions that remove or donate protons at
specific positions on the DNA bases can disrupt the double helix.
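
As a rough illustration of how sharply deprotonation sets in near the pKa, the sketch below computes the fraction of a group in its deprotonated form from the equilibrium expression defined above. The function name is hypothetical, and the group is treated as a single independent ionizable site, which is a simplification for a proton that sits inside a base pair.

```python
def fraction_deprotonated(ph: float, pka: float) -> float:
    """Fraction [A-]/([HA] + [A-]) for a single ionizable group.

    Follows from Ka = [H+][A-]/[HA]: the ratio [HA]/[A-] is 10**(pKa - pH).
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


if __name__ == "__main__":
    PKA_GUANINE_N1 = 9.7  # typical value quoted in the text
    for ph in (7.0, 9.0, 9.7, 10.0, 11.0):
        frac = fraction_deprotonated(ph, PKA_GUANINE_N1)
        print(f"pH {ph:4.1f}: {100 * frac:5.1f}% deprotonated")
```

At pH 7 only a fraction of a percent of such a group is deprotonated, whereas at the pKa exactly half is, consistent with the transition described for Figure 1.16.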


Buffers regulate pH in organisms and in the laboratory 


These observations about DNA reveal that a significant change in pH can
disrupt molecular structure. The same is true for many other biological
macromolecules; changes in pH can protonate or deprotonate key groups,
potentially disrupting structures and initiating harmful reactions. Thus,
systems have evolved to mitigate changes in pH in biological systems.
Solutions that resist such changes are called buffers. Specifically, when acid
is added to an unbuffered aqueous solution, the pH drops in proportion to
the amount of acid added. In contrast, when acid is added to a buffered
solution, the pH drops more gradually. Buffers also mitigate the pH increase
caused by the addition of base and changes in pH caused by dilution.

Figure 1.17 Buffer action. The addition of a strong acid, 1 M HCl, to pure
water results in an immediate drop in pH to near 2. In contrast, the addition
of the acid to a 0.1 M sodium acetate ($\mathrm{Na^+CH_3COO^-}$) solution
results in a much more gradual change in pH until the pH drops below 3.5.

Figure 1.18 Buffer protonation. When acid is added to sodium acetate, the
added hydrogen ions are used to convert acetate ion into acetic acid. Because
the proton concentration does not increase significantly, the pH remains
relatively constant until all of the acetate has been converted into acetic
acid.

Compare the result of adding a 1 M solution of the strong acid HCl drop
by drop to pure water with adding it to a solution containing 100 mM of the
buffer sodium acetate ($\mathrm{Na^+CH_3COO^-}$; Figure 1.17). The process of
gradually adding known amounts of reagent to a solution with which the
reagent reacts while monitoring the results is called a titration. For pure
water, the pH drops from 7 to close to 2 on the addition of the first few drops
of acid. However, for the sodium acetate solution, the pH first falls rapidly
from its initial value near 10, then changes more gradually until the pH
reaches 3.5, and then falls more rapidly again. Why does the pH decrease
gradually in the middle of the titration? The answer is that, when hydrogen
ions are added to this solution, they react with acetate ions to form acetic acid.
This reaction consumes some of the added hydrogen ions so that the pH does
not drop. Hydrogen ions continue reacting with acetate ions until essentially
all of the acetate ion is converted into acetic acid. After this point, added
protons remain free in solution and the pH begins to fall sharply again.
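
The shape of such a titration can be sketched numerically. The following is a minimal illustration, not the authors' calculation: it assumes the pKa of 4.75 quoted for acetic acid later in this section, ignores the volume change caused by adding the acid, and treats all concentrations as ideal. The function name `solution_ph` and its parameters are hypothetical.

```python
K_A_ACETIC = 10.0 ** -4.75  # from the pKa of acetic acid quoted in the text
K_W = 1.0e-14               # ion product of water


def solution_ph(acetate_total: float, hcl_added: float) -> float:
    """Estimate the pH of a sodium acetate solution after adding HCl.

    acetate_total: total acetate + acetic acid concentration (mol/L),
                   all of it supplied as sodium acetate.
    hcl_added:     concentration of HCl added (mol/L), dilution ignored.

    Solves the charge balance [Na+] + [H+] = [A-] + [Cl-] + [OH-]
    by bisection on pH between 0 and 14.
    """
    def charge_excess(ph: float) -> float:
        h = 10.0 ** -ph
        acetate_ion = acetate_total * K_A_ACETIC / (K_A_ACETIC + h)
        return acetate_total + h - acetate_ion - hcl_added - K_W / h

    low, high = 0.0, 14.0
    for _ in range(100):
        mid = (low + high) / 2.0
        # charge_excess falls monotonically as pH rises.
        if charge_excess(mid) > 0.0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


if __name__ == "__main__":
    for hcl in (0.0, 0.01, 0.025, 0.05, 0.075, 0.09, 0.1, 0.11, 0.2):
        buffered = solution_ph(0.1, hcl)
        unbuffered = solution_ph(0.0, hcl)
        print(f"HCl {hcl:5.3f} M: acetate solution pH {buffered:5.2f}, "
              f"pure water pH {unbuffered:5.2f}")
```

Running the loop shows the buffered solution changing pH slowly while acetate remains to absorb protons, and then dropping quickly once the added HCl exceeds the acetate available.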

We can analyze the effect of the buffer in quantitative terms. The equi- 
librium constant for the deprotonation of an acid is 


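$$K_\mathrm{a} = [\mathrm{H^+}][\mathrm{A^-}]/[\mathrm{HA}]$$
Taking logarithms of both sides yields
$$\log(K_\mathrm{a}) = \log([\mathrm{H^+}]) + \log([\mathrm{A^-}]/[\mathrm{HA}])$$
Recalling the definitions of p$K_\mathrm{a}$ and pH and rearranging gives
$$\mathrm{pH} = \mathrm{p}K_\mathrm{a} + \log([\mathrm{A^-}]/[\mathrm{HA}])$$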


This expression is referred to as the Henderson-Hasselbalch equation.

We can apply the equation to our titration of sodium acetate. The p$K_\mathrm{a}$ of
acetic acid is 4.75. We can calculate the ratio of the concentration of acetate
ion to the concentration of acetic acid as a function of pH by using the
Henderson-Hasselbalch equation, slightly rearranged.


$$[\text{Acetate ion}]/[\text{acetic acid}] = [\mathrm{A^-}]/[\mathrm{HA}] = 10^{\mathrm{pH} - \mathrm{p}K_\mathrm{a}}$$


At pH 9, this ratio is $10^{9 - 4.75} = 10^{4.25} = 17{,}800$; very little acetic acid has
been formed. At pH 4.75 (when the pH equals the p$K_\mathrm{a}$), the ratio is
$10^{4.75 - 4.75} = 10^{0} = 1$. At pH 3, the ratio is $10^{3 - 4.75} = 10^{-1.75} = 0.02$;
almost all of the acetate ion has been converted into acetic acid. We can
follow the conversion of acetate ion into acetic acid over the entire titration
(Figure 1.18). The graph shows that the region of relatively constant pH 
corresponds precisely to the region in which acetate ion is being protonated 
to form acetic acid. 
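
The three ratios quoted above follow directly from the rearranged Henderson-Hasselbalch equation. A minimal sketch, with an illustrative function name that is not part of the original:

```python
def base_to_acid_ratio(ph: float, pka: float) -> float:
    """Return [A-]/[HA] from the rearranged Henderson-Hasselbalch equation."""
    return 10.0 ** (ph - pka)


if __name__ == "__main__":
    PKA_ACETIC_ACID = 4.75  # value quoted in the text
    for ph in (9.0, 4.75, 3.0):
        ratio = base_to_acid_ratio(ph, PKA_ACETIC_ACID)
        print(f"pH {ph}: [acetate]/[acetic acid] = {ratio:.3g}")
```

The unrounded values at pH 9 and pH 3 are close to 17,800 and 0.018, which the text rounds to 17,800 and 0.02.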

From this discussion, we see that a buffer functions best close to the p$K_\mathrm{a}$
value of its acid component. Physiological pH is typically about 7.4. An
important buffer in biological systems is based on phosphoric acid ($\mathrm{H_3PO_4}$).
The acid can be deprotonated in three steps to form a phosphate ion.


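$$\mathrm{H_3PO_4} \rightleftharpoons \mathrm{H^+} + \mathrm{H_2PO_4^-} \qquad \mathrm{p}K_\mathrm{a} = 2.12$$

$$\mathrm{H_2PO_4^-} \rightleftharpoons \mathrm{H^+} + \mathrm{HPO_4^{2-}} \qquad \mathrm{p}K_\mathrm{a} = 7.21$$

$$\mathrm{HPO_4^{2-}} \rightleftharpoons \mathrm{H^+} + \mathrm{PO_4^{3-}} \qquad \mathrm{p}K_\mathrm{a} = 12.67$$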


At about pH 7.4, inorganic phosphate exists primarily as a nearly equal
mixture of $\mathrm{H_2PO_4^-}$ and $\mathrm{HPO_4^{2-}}$. Thus, phosphate solutions function as
effective buffers near pH 7.4. The concentration of inorganic phosphate in blood
is typically approximately 1 mM, providing a useful buffer against processes
that produce either acid or base. We can examine this utility in quantitative
terms with the use of the Henderson-Hasselbalch equation. What
concentration of acid must be added to change the pH of 1 mM phosphate buffer
from 7.4 to 7.3? Without buffer, this change in $[\mathrm{H^+}]$ corresponds to a change
of $(10^{-7.3} - 10^{-7.4})$ M $= (5.0 \times 10^{-8} - 4.0 \times 10^{-8})$ M $= 1.0 \times 10^{-8}$ M. Let
us now consider what happens to the buffer components. At pH 7.4,


$$[\mathrm{HPO_4^{2-}}]/[\mathrm{H_2PO_4^-}] = 10^{7.4 - 7.21} = 10^{0.19} = 1.55$$
The total concentration of phosphate, $[\mathrm{HPO_4^{2-}}] + [\mathrm{H_2PO_4^-}]$, is 1 mM. Thus,
$$[\mathrm{HPO_4^{2-}}] = (1.55/2.55) \times 1\ \text{mM} = 0.608\ \text{mM}$$
and
$$[\mathrm{H_2PO_4^-}] = (1/2.55) \times 1\ \text{mM} = 0.392\ \text{mM}$$


At pH 7.3,
$$[\mathrm{HPO_4^{2-}}]/[\mathrm{H_2PO_4^-}] = 10^{7.3 - 7.21} = 10^{0.09} = 1.23$$
and so
$$[\mathrm{HPO_4^{2-}}] = (1.23/2.23) \times 1\ \text{mM} = 0.552\ \text{mM}$$
and
$$[\mathrm{H_2PO_4^-}] = (1/2.23) \times 1\ \text{mM} = 0.448\ \text{mM}$$


Thus, (0.608 − 0.552) = 0.056 mM $\mathrm{HPO_4^{2-}}$ is converted into $\mathrm{H_2PO_4^-}$,
consuming 0.056 mM $= 5.6 \times 10^{-5}$ M $[\mathrm{H^+}]$. Thus, the buffer increases the
amount of acid required to produce a drop in pH from 7.4 to 7.3 by a factor
of $5.6 \times 10^{-5}/1.0 \times 10^{-8} = 5600$ compared with pure water.
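
The phosphate calculation above can be reproduced step by step. The sketch below is illustrative only: the function names are hypothetical, only the second ionization of phosphoric acid is considered, and no rounding is applied to intermediate values.

```python
PKA_2_PHOSPHATE = 7.21  # pKa for H2PO4- <-> H+ + HPO4(2-), from the text


def hpo4_fraction(ph: float) -> float:
    """Fraction of total phosphate present as HPO4(2-) at a given pH."""
    ratio = 10.0 ** (ph - PKA_2_PHOSPHATE)  # [HPO4(2-)]/[H2PO4-]
    return ratio / (1.0 + ratio)


def acid_absorbed_by_buffer(total_phosphate: float,
                            ph_start: float, ph_end: float) -> float:
    """Acid (mol/L) consumed converting HPO4(2-) into H2PO4-."""
    return total_phosphate * (hpo4_fraction(ph_start) - hpo4_fraction(ph_end))


def acid_needed_without_buffer(ph_start: float, ph_end: float) -> float:
    """Change in free [H+] (mol/L) for the same pH drop with no buffer."""
    return 10.0 ** -ph_end - 10.0 ** -ph_start


if __name__ == "__main__":
    buffered = acid_absorbed_by_buffer(1.0e-3, 7.4, 7.3)
    unbuffered = acid_needed_without_buffer(7.4, 7.3)
    print(f"With 1 mM phosphate: {buffered:.2e} M acid")
    print(f"Without buffer:      {unbuffered:.2e} M acid")
    print(f"Ratio:               {buffered / unbuffered:.0f}")
```

Because the text rounds each intermediate value, it arrives at a factor of 5600; carrying full precision gives a ratio of roughly 5400, the same order of magnitude.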


1.4 The Genomic Revolution Is Transforming Biochemistry 
and Medicine 


Watson and Crick’s discovery of the structure of DNA suggested the 
hypothesis that hereditary information is stored as a sequence of bases along 
long strands of DNA. This remarkable insight provided an entirely new 
way of thinking about biology. However, at the time that it was made, 
Watson and Crick’s discovery was full of potential but the practical conse- 
quences were unclear. Tremendously fundamental questions remained to 
be addressed. Is the hypothesis correct? How is the sequence information 
read and translated into action? What are the sequences of naturally occur- 
ring DNA molecules and how can such sequences be experimentally deter- 
mined? Through advances in biochemistry and related sciences, we now 
have essentially complete answers to these questions. Indeed, in the past 
decade or so, scientists have determined the complete genome sequences of 
hundreds of different organisms, including simple microorganisms, plants, 
animals of varying degrees of complexity, and human beings. Comparisons 
of these genome sequences with the use of methods introduced in Chapter 6 
have been sources of insight into many aspects of biochemistry. Because of 
these achievements, biochemistry has been transformed. In addition to its 
experimental and clinical aspects, biochemistry has now become an infor- 
mation science. 


The sequencing of the human genome is a landmark in human history 


The sequencing of the human genome was a daunting task because it
contains approximately 3 billion ($3 \times 10^9$) base pairs. For example, the
sequence


ACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTC 
AAACAGACACCATGGTGCATCTGACTCCTGAGGAGAAGT 
CTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGA. ... 


is a part of one of the genes that encodes hemoglobin, the oxygen carrier in 
our blood. This gene is found on the end of chromosome 11 among our 24
distinct chromosomes. If we were to include the complete sequence of our 
entire genome, this chapter would run to more than 500,000 pages. The 
sequencing of our genome is truly a landmark in human history. This 
sequence contains a vast amount of information, some of which we can now 
extract and interpret, but much of which we are only beginning to
understand. For example, some human diseases have been linked to par- 
ticular variations in genomic sequence. Sickle-cell anemia, discussed in 
detail in Chapter 7, is caused by a single base change of an A (noted in 
boldface type in the preceding sequence) to a T. We will encounter many 
other examples of diseases that have been linked to specific DNA 
sequence changes. 

In addition to the implications for understanding human health and 
disease, the genome sequence is a source of deep insight into other aspects 
of human biology and culture. For example, by comparing the sequences of 
different individual persons and populations, we can learn a great deal about 
human history. On the basis of such analysis, a compelling case can be made 
that the human species originated in Africa, and the occurrence and even 
the timing of important migrations of groups of human beings can be dem- 
onstrated. Finally, comparisons of the human genome with the genomes of 
other organisms are confirming the tremendous unity that exists at the level 
of biochemistry and are revealing key steps that have been taken in the 
course of evolution from relatively simple, single-celled organisms to com- 
plex, multicellular organisms such as human beings. For example, many 
genes that are key to the function of the human brain and nervous system 
have evolutionary and functional relatives that can be recognized in the 
genomes of bacteria. Because many studies that are possible in model 
organisms are difficult or unethical to conduct in human beings, these 
discoveries have many practical implications. Comparative genomics has 
become a powerful science, linking evolution and biochemistry. 


Genome sequences encode proteins and patterns of expression 


The structure of DNA revealed how information is stored in the base 
sequence along a DNA strand. But what information is stored and how is 
this information expressed? The most fundamental role of DNA is to 
encode the sequences of proteins. Like DNA, proteins are linear polymers. 
However, proteins differ from DNA in two important ways. First, proteins 
are built from 20 building blocks, called amino acids, rather than just four, 
as in DNA. The chemical complexity provided by this variety of
building blocks enables proteins to perform a wide range of functions.
Second, proteins spontaneously fold up into elaborate three-dimensional
structures, determined only by their amino acid sequences (Figure 1.19).
We have explored in depth how solutions containing two appropriate
strands of DNA come together to form a solution of double-helical
molecules. A similar spontaneous folding process gives proteins their
three-dimensional structure. A balance of hydrogen bonding, van der
Waals interactions, and hydrophobic interactions overcomes the entropy
lost in going from an unfolded ensemble of proteins to a homogeneous set
of well-folded molecules. Proteins and protein folding will be discussed
extensively in Chapter 2.

Figure 1.19 Protein folding. Proteins are linear polymers of amino acids that fold into elaborate
structures. The sequence of amino acids determines the three-dimensional structure. Thus, amino
acid sequence 1 gives rise only to a protein with the shape depicted in blue, not the shape
depicted in red.


The fundamental unit of hereditary information, the gene, is becom- 
ing increasingly difficult to precisely define as our knowledge of the 
complexities of genetics and genomics increases. The genes that are sim- 
plest to define encode the sequences of proteins. For these protein- 
encoding genes, a block of DNA bases encodes the amino acid sequence 
of a specific protein molecule. A set of three bases along the DNA strand, 
called a codon, determines the identity of one amino acid within the protein 
sequence. The relation that links the DNA sequence to the encoded pro- 
tein sequence is called the genetic code. One of the biggest surprises from 
the sequencing of the human genome is the small number of protein- 
encoding genes. Before the genome-sequencing project began, the con- 
sensus view was that the human genome would include approximately 
100,000 protein-encoding genes. The current analysis suggests that the 
actual number is between 20,000 and 25,000. We shall use an estimate of 
23,000 throughout this book. However, additional mechanisms allow 
many genes to encode more than one protein. For example, the genetic 
information in some genes is translated in more than one way to produce 
a set of proteins that differ from one another in parts of their amino acid 
sequences. In other cases, proteins are modified after they have been 
synthesized through the addition of accessory chemical groups. Through 
these indirect mechanisms, much more complexity is encoded in our 
genomes than would be expected from the number of protein-encoding 
genes alone. 
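
To make the idea of a codon concrete, the sketch below reads the hemoglobin gene fragment quoted earlier in this section three bases at a time. It is an illustration under stated assumptions rather than a general tool: the codon table is deliberately partial (only the codons that occur in this fragment, taken from the standard genetic code), reading is assumed to begin at the first ATG, and the sickle-cell change is assumed to be the A-to-T substitution in the middle of the first GAG codon, which is the well-known glutamate-to-valine replacement. All names are hypothetical.

```python
# Partial standard genetic code: only the codons present in this fragment.
CODON_TABLE = {
    "ATG": "M", "GTG": "V", "CAT": "H", "CTG": "L", "ACT": "T",
    "CCT": "P", "GAG": "E", "AAG": "K", "TCT": "S", "GCC": "A",
    "GTT": "V", "TGG": "W", "GGC": "G", "AAC": "N",
}

# The sequence fragment printed in the text, joined across its three lines.
GENE_FRAGMENT = (
    "ACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTC"
    "AAACAGACACCATGGTGCATCTGACTCCTGAGGAGAAGT"
    "CTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGA"
)


def translate(dna: str) -> str:
    """Translate from the first ATG, one complete codon at a time."""
    start = dna.find("ATG")
    codons = [dna[i:i + 3] for i in range(start, len(dna) - 2, 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)


def point_mutation(dna: str, position: int, new_base: str) -> str:
    """Return a copy of dna with the base at `position` replaced."""
    return dna[:position] + new_base + dna[position + 1:]


if __name__ == "__main__":
    start = GENE_FRAGMENT.find("ATG")
    # Middle base of the seventh codon (counting the initiating ATG as the first).
    sickle_position = start + 3 * 6 + 1
    mutant = point_mutation(GENE_FRAGMENT, sickle_position, "T")
    print("Normal:", translate(GENE_FRAGMENT))
    print("Sickle:", translate(mutant))
```

The two translated sequences differ at a single amino acid, which shows how one base change in the DNA can alter one residue in the encoded protein.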

On the basis of current knowledge, the protein-encoding regions account 
for only about 3% of the human genome. What is the function of the rest of 
the DNA? Some of it contains information that regulates the expression of 
specific genes (i.e., the production of specific proteins) in particular cell 
types and physiological conditions. Essentially every cell contains the same 
DNA genome, yet cell types differ considerably in the proteins that they 
produce. For example, hemoglobin is expressed only in precursors of red 
blood cells, even though the genes for hemoglobin are present in essentially 
every cell. Specific sets of genes are expressed in response to hormones, even 
though these genes are not expressed in the same cell in the absence of the 
hormones. The control regions that regulate such differences account for 
only a small amount of the remainder of our genomes. The truth is that we 
do not yet understand all of the function of much of the remainder of the 
DNA. Some of it appears to be “junk,” stretches of DNA that were inserted 
at some stage of evolution and have remained. In some cases, this DNA 
may, in fact, serve important functions. In others, it may serve no function 
but, because it does not cause significant harm, it has remained. 


Individuality depends on the interplay between genes and environment 


With the exception of monozygotic (“identical”) twins, each person has a 
unique sequence of DNA base pairs. How different are we from one anoth- 
er at the genomic level? An examination of variation across the genome 
reveals that, on average, each pair of individual people has a different base 
in one position per 200 bases; that is, the difference is approximately 0.5%. 
This person-to-person variation is quite substantial compared with differ- 
ences in populations. The average difference between two people within one 
ethnic group is greater than the difference between the averages of two dif- 
ferent ethnic groups. 
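
The scale of this variation is easier to appreciate as a count. The short sketch below simply combines two figures taken at face value from the text (a genome of about 3 billion base pairs and one differing position per 200 bases); the variable names are illustrative.

```python
GENOME_SIZE_BP = 3_000_000_000   # approximate human genome size from the text
BASES_PER_DIFFERENCE = 200       # one differing position per 200 bases

difference_fraction = 1 / BASES_PER_DIFFERENCE
expected_differences = GENOME_SIZE_BP // BASES_PER_DIFFERENCE

if __name__ == "__main__":
    print(f"Fraction of positions that differ: {difference_fraction:.1%}")
    print(f"Expected differing positions: {expected_differences:,}")
```

On these figures, two people would differ at roughly 15 million positions across the genome.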

The significance of much of this genetic variation is not understood. As 
noted earlier, variation in a single base within the genome can lead to a disease 
such as sickle-cell anemia. Scientists have now identified the genetic
variations associated with hundreds of diseases for which the cause can be traced
to a single gene. For other diseases and traits, we know that variation in many
different genes contributes in significant and often complex ways. Many of
the most prevalent human ailments such as heart disease are linked to
variations in many genes. Furthermore, in most cases, the presence of a particular
variation or set of variations does not inevitably result in the onset of a disease
but, instead, leads to a predisposition to the development of the disease.

In addition to these genetic differences, epigenetic factors are important.
These factors are associated with the genome but not simply represented in
the sequence of DNA. For example, the consequences of some of this genetic
variation depend, often dramatically, on whether the unusual gene sequence
is inherited from the mother or from the father. This phenomenon, known as
genetic imprinting, depends on the covalent modification of DNA,
particularly the addition of methyl groups to particular bases. Epigenetics is a very
active field of study and many novel discoveries can be expected.

Although our genetic makeup and associated epigenetic characteristics
are important factors that contribute to disease susceptibility and to other
traits, factors in a person’s environment also are significant. What are these
environmental factors? Perhaps the most obvious are chemicals that we eat
or are exposed to in some other way. The adage “you are what you eat” has
considerable validity; it applies both to substances that we ingest in
significant quantities and to those that we ingest in only trace amounts. Throughout
our study of biochemistry, we will encounter vitamins and trace elements and
their derivatives that play crucial roles in many processes. In many cases,
the roles of these chemicals were first revealed through investigation of
deficiency disorders observed in people who do not take in a sufficient quantity
of a particular vitamin or trace element. Despite the fact that the most
important vitamins and trace elements have been known for some time, new
roles for these essential dietary factors continue to be discovered.

A healthful diet requires a balance of major food groups (Figure 1.20).
In addition to providing vitamins and trace elements, food provides calories
in the form of substances that can be broken down to release energy to drive
other biochemical processes. Proteins, fats, and carbohydrates provide the
building blocks used to construct the molecules of life. Finally, it is possible
to get too much of a good thing. Human beings evolved under
circumstances in which food, particularly rich foods such as meat, was
scarce. With the development of agriculture and modern economies,
rich foods are now plentiful in parts of the world. Some of the
most prevalent diseases in the so-called developed world, such as
heart disease and diabetes, can be attributed to the large quantities
of fats and carbohydrates that are present in modern diets. We are
now developing a deeper understanding of the biochemical
consequences of these diets and the interplay between diet and genetic
factors.

Figure 1.20 Food pyramid. A healthful diet includes a balance of food groups
to supply an appropriate number of calories and an appropriate mixture of
biochemical building blocks. [Courtesy of the U. S. Department of Agriculture]
[Labeled food groups include vegetables, fruits, oils, and milk.]

Chemicals are only one important class of environmental factors.
The behaviors in which we engage also have biochemical
consequences. Through physical activity, we consume the calories
that we take in, ensuring an appropriate balance between food
intake and energy expenditure. Activities ranging from exercise to
emotional responses such as fear and love may activate specific
biochemical pathways, leading to changes in levels of gene expression,
the release of hormones, and other consequences. For example,
recent discoveries reveal that high stress levels are associated
with the shortening of telomeres, structures at the ends of
chromosomes. Furthermore, the interplay between biochemistry and
behavior is bidirectional. Just as our biochemistry is affected by
our behavior, so, too, our behavior is affected, although certainly
not completely determined, by our genetic makeup and other aspects of our
biochemistry. Genetic factors associated with a range of behavioral
characteristics have been at least tentatively identified.

Just as vitamin deficiencies and genetic diseases revealed fundamental 
principles of biochemistry and biology, investigations of variations in 
behavior and their linkage to genetic and biochemical factors are potential 
sources of great insight into mechanisms within the brain. For example, 
studies of drug addiction have revealed neural circuits and biochemical 
pathways that greatly influence aspects of behavior. Unraveling the inter- 
play between biology and behavior is one of the great challenges in modern 
science, and biochemistry is providing some of the most important concepts 
and tools for this endeavor.


APPENDIX: Visualizing Molecular Structures I: Small Molecules


The authors of a biochemistry textbook face the prob- 
lem of trying to present three-dimensional molecules in 
the two dimensions available on the printed page. The 
interplay between the three-dimensional structures of 
biomolecules and their biological functions will be dis- 
cussed extensively throughout this book. Toward this 
end, we will frequently use representations that, although 
of necessity are rendered in two dimensions, emphasize 
the three-dimensional structures of molecules. 


Stereochemical Renderings 

Most of the chemical formulas in this book are drawn to 
depict the geometric arrangement of atoms, crucial to 
chemical bonding and reactivity, as accurately as pos- 
sible. For example, the carbon atom of methane is tetra- 
hedral, with H-C-H angles of 109.5 degrees, whereas 
the carbon atom in formaldehyde has bond angles of 
120 degrees. 


[Structural formulas of methane and formaldehyde]


To illustrate the correct stereochemistry about tetra- 
hedral carbon atoms, wedges will be used to depict the 
direction of a bond into or out of the plane of the page. 
A solid wedge with the broad end away from the carbon 
atom denotes a bond coming toward the viewer out of 
the plane. A dashed wedge, with its broad end at the 
carbon atom, represents a bond going away from the 
viewer behind the plane of the page. The remaining two 
bonds are depicted as straight lines. 


Fischer Projections 

Although representative of the actual structure of a 
compound, stereochemical structures are often difficult 
to draw quickly. An alternative, less-representative
method of depicting structures with tetrahedral carbon
centers relies on the use of Fischer projections. 


[Diagram: a Fischer projection of a carbon atom bearing substituents W, X, Y,
and Z, shown alongside the equivalent stereochemical rendering]


In a Fischer projection, the bonds to the central car- 
bon are represented by horizontal and vertical lines 
from the substituent atoms to the carbon atom, which is 
assumed to be at the center of the cross. By convention, 
the horizontal bonds are assumed to project out of the 
page toward the viewer, whereas the vertical bonds are 
assumed to project behind the page away from the 
viewer. 


Molecular Models for Small Molecules 

For depicting the molecular architecture of small mol- 
ecules in more detail, two types of models will often be 
used: space filling and ball and stick. These models 
show structures at the atomic level. 


1. Space-Filling Models. The space-filling models are 
the most realistic. The size and position of an atom in a
space-filling model are determined by its bonding 
properties and van der Waals radius, or contact dis- 
tance. A van der Waals radius describes how closely 
two atoms can approach each other when they are not 
linked by a covalent bond. The colors of the model are 
set by convention. 


Carbon, black Hydrogen, white Nitrogen, blue 
Oxygen, red Sulfur, yellow Phosphorus, purple 


Space-filling models of several simple molecules are 
shown in Figure 1.21. 
2. Ball-and-Stick Models. Ball-and-stick models are
not as realistic as space-filling models, because the
atoms are depicted as spheres of radii smaller than their
van der Waals radii. However, the bonding arrangement
is easier to see because the bonds are explicitly
represented as sticks. In an illustration, the taper of a
stick, representing parallax, tells which of a pair of
bonded atoms is closer to the reader. A ball-and-stick
model reveals a complex structure more clearly than a
space-filling model does. Ball-and-stick models of
several simple molecules are shown in Figure 1.21.

Molecular models for depicting large molecules will 
be discussed in the appendix to Chapter 2. 


[Panel labels: Acetate, Formamide, Cysteine]


Figure 1.21 Molecular representations. 
Structural formulas (bottom), ball-and-stick 
models (middle), and space-filling 
representations (top) of selected molecules are 
shown. Black = carbon, red = oxygen, white = 
hydrogen, yellow = sulfur, blue = nitrogen. 


Key Terms 


biological macromolecule (p. 2) 
metabolite (p. 2) 

deoxyribonucleic acid (DNA) (p. 2) 
protein (p. 2) 

Eukarya (p. 3) 

Bacteria (p. 3) 

Archaea (p. 3) 

eukaryote (p. 3) 

prokaryote (p. 3) 




double helix (p. 5) 

covalent bond (p. 5) 

resonance structure (p. 7) 
electrostatic interaction (p. 7) 
hydrogen bond (p. 8) 

van der Waals interaction (p. 8) 
hydrophobic effect (p. 9) 
hydrophobic interaction (p. 10) 
entropy (p. 11) 




enthalpy (p. 11) 


free energy (Gibbs free energy) (p. 12)
pH (p. 13)
p$K_\mathrm{a}$ value (p. 15)
buffer (p. 15) 
amino acid (p. 18) 
genetic code (p. 19) 
predisposition (p. 20) 


Problems 


1. Donors and acceptors. Identify the hydrogen-bond donors 
and acceptors in each of the four bases on page 4. 


2. Resonance structures. The structure of an amino acid, 
tyrosine, is shown here. Draw an alternative resonance 
structure. 


[Structural formula of tyrosine]


3. It takes all types. What types of noncovalent bonds hold 
together the following solids? 


(a) Table salt (NaCl), which contains $\mathrm{Na^+}$ and $\mathrm{Cl^-}$ ions.


(b) Graphite (C), which consists of sheets of covalently 
bonded carbon atoms. 


4. Don’t break the law. Given the following values for the
changes in enthalpy ($\Delta H$) and entropy ($\Delta S$), which of the
following processes can take place at 298 K without violating
the Second Law of Thermodynamics?


(a) $\Delta H = -84$ kJ mol$^{-1}$ ($-20$ kcal mol$^{-1}$),
$\Delta S = +125$ J mol$^{-1}$ K$^{-1}$ ($+30$ cal mol$^{-1}$ K$^{-1}$)
(b) $\Delta H = -84$ kJ mol$^{-1}$ ($-20$ kcal mol$^{-1}$),
$\Delta S = -125$ J mol$^{-1}$ K$^{-1}$ ($-30$ cal mol$^{-1}$ K$^{-1}$)
(c) $\Delta H = +84$ kJ mol$^{-1}$ ($+20$ kcal mol$^{-1}$),
$\Delta S = +125$ J mol$^{-1}$ K$^{-1}$ ($+30$ cal mol$^{-1}$ K$^{-1}$)
(d) $\Delta H = +84$ kJ mol$^{-1}$ ($+20$ kcal mol$^{-1}$),
$\Delta S = -125$ J mol$^{-1}$ K$^{-1}$ ($-30$ cal mol$^{-1}$ K$^{-1}$)


5. Double-helix-formation entropy. For double-helix formation,
$\Delta G$ can be measured to be $-54$ kJ mol$^{-1}$ ($-13$ kcal
mol$^{-1}$) at pH 7.0 in 1 M NaCl at 25°C (298 K). The heat
released indicates an enthalpy change of $-251$ kJ mol$^{-1}$
($-60$ kcal mol$^{-1}$). For this process, calculate the entropy
change for the system and the entropy change for the
surroundings.


6. Find the pH. What are the pH values for the following 
solutions? 


(a) 0.1 M HCl 
(b) 0.1 M NaOH 
(c) 0.05 M HCl 
(d) 0.05 M NaOH 




7. A weak acid. What is the pH of a 0.1 M solution of acetic
acid (p$K_\mathrm{a}$ = 4.75)?


(Hint: Let $x$ be the concentration of $\mathrm{H^+}$ ions released from
acetic acid when it dissociates. The solutions to a quadratic
equation of the form $ax^2 + bx + c = 0$ are


$x = (-b \pm \sqrt{b^2 - 4ac})/2a$.)


8. Substituent effects. What is the pH of a 0.1 M solution of
chloroacetic acid ($\mathrm{ClCH_2COOH}$, p$K_\mathrm{a}$ = 2.86)?


9. Basic fact. What is the pH of a 0.1 M solution of
ethylamine, given that the p$K_\mathrm{a}$ of ethylammonium ion
($\mathrm{CH_3CH_2NH_3^+}$) is 10.70?


10. Comparison. A solution is prepared by adding 0.01 M 
acetic acid and 0.01 M ethylamine to water and adjusting 
the pH to 7.4. What is the ratio of acetate to acetic acid? 
What is the ratio of ethylamine to ethylammonium ion? 


11. Concentrate. Acetic acid is added to water until the pH 
value reaches 4.0. What is the total concentration of the 
added acetic acid? 


12. Dilution. 100 mL of a solution of hydrochloric acid 
with pH 5.0 is diluted to 1 L. What is the pH of the diluted 
solution? 


13. Buffer dilution. 100 mL of a 0.1 mM buffer solution 
made from acetic acid and sodium acetate with pH 5.0 is 
diluted to 1 L. What is the pH of the diluted solution? 


14. Find the pKa. For an acid HA, the concentrations of
HA and A⁻ are 0.075 and 0.025, respectively, at pH 6.0.
What is the pKa value for HA?


15. pH indicator. A dye that is an acid and that appears as 
different colors in its protonated and deprotonated forms 
can be used as a pH indicator. Suppose that you have a 
0.001 M solution of a dye with a pKa of 7.2. From the color,
the concentration of the protonated form is found to be 
0.0002 M. Assume that the remainder of the dye is in the 
deprotonated form. What is the pH of the solution? 


16. What’s the ratio? An acid with a pKa of 8.0 is present in
a solution with a pH of 6.0. What is the ratio of the proto- 
nated to the deprotonated form of the acid? 


17. Phosphate buffer. What is the ratio of the concentrations
of H₂PO₄⁻ and HPO₄²⁻ at (a) pH 7.0; (b) pH 7.5;
(c) pH 8.0?


18. Buffer capacity. Two solutions of sodium acetate are pre- 
pared, one with a concentration of 0.1 M and the other with 
a concentration of 0.01 M. Calculate the pH values when the 
following concentrations of HCl have been added to each of 
these solutions: 0.0025 M, 0.005 M, 0.01 M, and 0.05 M.


19. Buffer preparation. You wish to prepare a buffer consisting
of acetic acid and sodium acetate with a total acetic
acid plus acetate concentration of 250 mM and a pH of 5.0.
What concentrations of acetic acid and sodium acetate
should you use? Assuming you wish to make 2 liters of this
buffer, how many moles of acetic acid and sodium acetate
will you need? How many grams of each will you need
(molecular weights: acetic acid, 60.05 g mol⁻¹; sodium acetate,
82.03 g mol⁻¹)?


20. An alternative approach. When you go to prepare the 
buffer described in Problem 19, you discover that your 
laboratory is out of sodium acetate, but you do have sodium 
hydroxide. How much (in moles and grams) acetic acid and 
sodium hydroxide do you need to make the buffer? 


21. Another alternative. Your friend from another labo- 
ratory was out of acetic acid so he tries to prepare the 
buffer in Problem 19 by dissolving 41.02 g of sodium 
acetate in water, carefully adding 180.0 ml of 1 M HCl, 
and adding more water to reach a total volume of 2 liters. 
What is the total concentration of acetate plus acetic 
acid in the solution? Will this solution have pH 5.0? 
Will it be identical with the desired buffer? If not, how 
will it differ? 


22. Blood substitute. As noted in this chapter, blood con- 
tains a total concentration of phosphate of approximately 
1 mM and typically has a pH of 7.4. You wish to make 
100 liters of phosphate buffer with a pH of 7.4 from 
NaH₂PO₄ (molecular weight, 119.98 g mol⁻¹) and
Na₂HPO₄ (molecular weight, 141.96 g mol⁻¹). How much
of each (in grams) do you need? 


23. A potential problem. You wish to make a buffer with 
pH 7.0. You combine 0.060 grams of acetic acid and 
14.59 grams of sodium acetate and add water to yield a total 
volume of 1 liter. What is the pH? Will this be the useful 
pH 7.0 buffer you seek? 


24. Charge! Suppose two phosphate groups in DNA (each 
with a charge of −1) are separated by 12 Å. What is the
energy of the electrostatic interaction between these two 
phosphates assuming a dielectric constant of 80? Repeat 
the calculation assuming a dielectric constant of 2. 


25. Viva la différence. On average, how many base differ- 
ences are there between two human beings? 


CHAPTER 2 


Protein Composition 
and Structure 


Crystals of human insulin. Insulin is a protein hormone, crucial for maintaining 
blood sugar at appropriate levels. (Below) Chains of amino acids in a specific 
sequence (the primary structure) define a protein such as insulin. These chains 
fold into well-defined structures (the tertiary structure)—in this case, a single 
insulin molecule. Such structures assemble with other chains to form arrays such 
as the complex of six insulin molecules shown at the far right (the quaternary
structure). These arrays can often be induced to form well-defined crystals 
(photograph at left), which allows a determination of these structures in detail. 
[Photograph from Alfred Pasieka/Photo Researchers.]


2.1 Proteins Are Built from a Repertoire of 20 Amino Acids
2.2 Primary Structure: Amino Acids Are Linked by Peptide Bonds to Form
    Polypeptide Chains
2.3 Secondary Structure: Polypeptide Chains Can Fold into Regular
    Structures Such As the Alpha Helix, the Beta Sheet, and Turns and Loops
2.4 Tertiary Structure: Water-Soluble Proteins Fold into Compact
    Structures with Nonpolar Cores
2.5 Quaternary Structure: Polypeptide Chains Can Assemble into
    Multisubunit Structures
2.6 The Amino Acid Sequence of a Protein Determines Its
    Three-Dimensional Structure


Proteins are the most versatile macromolecules in living systems and serve
crucial functions in essentially all biological processes. They function as
catalysts, transport and store other molecules such as oxygen, provide
mechanical support and immune protection, generate movement, transmit nerve
impulses, and control growth and differentiation. Indeed, much of this
book will focus on understanding what proteins do and how they perform
these functions.

Several key properties enable proteins to participate in a wide range
of functions.


1. Proteins are linear polymers built of monomer units called amino acids,
which are linked end to end. The sequence of linked amino acids is called
the primary structure. Remarkably, proteins spontaneously fold up into
three-dimensional structures that are determined by the sequence of amino
acids in the protein polymer. Three-dimensional structure formed by
hydrogen bonds between amino acids near one another is called secondary
structure, whereas tertiary structure is formed by long-range interactions
between amino acids. Protein function depends directly on this
three-dimensional structure (Figure 2.1). Thus, proteins are the embodiment
of the transition from the one-dimensional world of sequences to the
three-dimensional world of molecules capable of diverse activities. Many
proteins display quaternary structure, in which the functional protein is
composed of several distinct polypeptide chains.


Figure 2.1 Structure dictates function. A protein component of the DNA
replication machinery surrounds a section of DNA double helix depicted as a
cylinder. The protein, which consists of two identical subunits (shown in
red and yellow), acts as a clamp that allows large segments of DNA to be
copied without the replication machinery dissociating from the DNA. [Drawn
from 2POL.pdb.]


Figure 2.2 A complex protein assembly. An electron micrograph of insect
flight tissue in cross section shows a hexagonal array of two kinds of
protein filaments. [Courtesy of Dr. Michael Reedy.]


2. Proteins contain a wide range of functional groups. These functional 
groups include alcohols, thiols, thioethers, carboxylic acids, carboxamides, 
and a variety of basic groups. Most of these groups are chemically reactive. 
When combined in various sequences, this array of functional groups 
accounts for the broad spectrum of protein function. For instance, their 
reactive properties are essential to the function of enzymes, the proteins that 
catalyze specific chemical reactions in biological systems (see Chapters 8 
through 10). 


3. Proteins can interact with one another and with other biological macromol- 
ecules to form complex assemblies. The proteins within these assemblies can 
act synergistically to generate capabilities that individual proteins may lack 
(Figure 2.2). Examples of these assemblies include macromolecular 
machines that replicate DNA, transmit signals within cells, and carry out 
many other essential processes. 


4. Some proteins are quite rigid, whereas others display considerable
flexibility. Rigid units can function as structural elements in the
cytoskeleton (the internal scaffolding within cells) or in connective
tissue. Proteins with some flexibility may act as hinges, springs, or levers
that are crucial to protein function, to the assembly of proteins with one
another and with other molecules into complex units, and to the transmission
of information within and between cells (Figure 2.3).


Figure 2.3 Flexibility and function. On binding iron, the protein lactoferrin
undergoes a substantial change in conformation that allows other molecules to
distinguish between the iron-free and the iron-bound forms. [Drawn from
1LFH.pdb and 1LFG.pdb.]


2.1 Proteins Are Built from a Repertoire of 20 Amino Acids 


Amino acids are the building blocks of proteins. An α-amino acid consists
of a central carbon atom, called the α carbon, linked to an amino group, a
carboxylic acid group, a hydrogen atom, and a distinctive R group. The
R group is often referred to as the side chain. With four different groups
connected to the tetrahedral α-carbon atom, α-amino acids are chiral: they
may exist in one or the other of two mirror-image forms, called the L isomer
and the D isomer (Figure 2.4).


Figure 2.4 The L and D isomers of amino acids. The letter R refers to the
side chain. The L and D isomers are mirror images of each other.


Only L amino acids are constituents of proteins. For almost all amino
acids, the L isomer has S (rather than R) absolute configuration
(Figure 2.5). What is the basis for the preference for L amino acids? The
answer is not known, but evidence shows that L amino acids are slightly
more soluble than is a racemic mixture of D and L amino acids, which tend
to form crystals. This small solubility difference could have been amplified
over time so that the L isomer became dominant in solution.

Notation for distinguishing stereoisomers

The four different substituents of an asymmetric carbon atom are assigned a
priority according to atomic number. The lowest-priority substituent, often
hydrogen, is pointed away from the viewer. The configuration about the carbon
atom is called S (from the Latin sinister, “left”) if the progression from
the highest to the lowest priority is counterclockwise. The configuration is
called R (from the Latin rectus, “right”) if the progression is clockwise.


Figure 2.5 Only L amino acids are found in proteins. Almost all L amino acids
have an S absolute configuration. The counterclockwise direction of the arrow
from highest- to lowest-priority substituents indicates that the chiral
center is of the S configuration.


Amino acids in solution at neutral pH exist predominantly as dipolar
ions (also called zwitterions). In the dipolar form, the amino group is
protonated (–NH₃⁺) and the carboxyl group is deprotonated (–COO⁻). The
ionization state of an amino acid varies with pH (Figure 2.6). In acid
solution (e.g., pH 1), the amino group is protonated (–NH₃⁺) and the
carboxyl group is not dissociated (–COOH). As the pH is raised, the
carboxylic acid is the first group to give up a proton, inasmuch as its pKa is
near 2. The dipolar form persists until the pH approaches 9, when the
protonated amino group loses a proton.


Figure 2.6 Ionization state as a function of pH. The ionization state of
amino acids is altered by a change in pH. The zwitterionic form predominates
near physiological pH.
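
The dependence of ionization state on pH can be made concrete with a short calculation. The Python sketch below is an illustration under stated assumptions: it treats an amino acid without an ionizable side chain as a simple two-step acid, uses the round values pKa = 2 for the carboxyl group and pKa = 9 for the amino group suggested by the text, and ignores the rare fully uncharged form. The function name species_fractions is hypothetical.

```python
def species_fractions(ph: float,
                      pka_carboxyl: float = 2.0,
                      pka_amino: float = 9.0) -> dict:
    """Fractions of the cationic, zwitterionic, and anionic forms at a given pH.

    The default pKa values are illustrative round numbers, not measured
    values for any particular amino acid.
    """
    h = 10.0 ** (-ph)             # H+ concentration
    k1 = 10.0 ** (-pka_carboxyl)  # -COOH  <->  -COO-  +  H+
    k2 = 10.0 ** (-pka_amino)     # -NH3+  <->  -NH2   +  H+
    denominator = h * h + k1 * h + k1 * k2
    return {
        "cation": h * h / denominator,       # +H3N-CHR-COOH
        "zwitterion": k1 * h / denominator,  # +H3N-CHR-COO-
        "anion": k1 * k2 / denominator,      # H2N-CHR-COO-
    }


for ph in (1.0, 7.0, 11.0):
    fractions = species_fractions(ph)
    print(ph, {name: round(value, 3) for name, value in fractions.items()})
```

Under these assumptions the cationic form dominates in strongly acid solution, the zwitterion dominates near neutral pH, and the anionic form takes over once the pH rises well above 9, in line with the description above.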

Twenty kinds of side chains varying in size, shape, charge, hydrogen- 
bonding capacity, hydrophobic character, and chemical reactivity are com- 
monly found in proteins. Indeed, all proteins in all species—bacterial, 
archaeal, and eukaryotic—are constructed from the same set of 20 amino 
acids with only a few exceptions. This fundamental alphabet for the con- 
struction of proteins is several billion years old. The remarkable range of 
functions mediated by proteins results from the diversity and versatility of 
these 20 building blocks. Understanding how this alphabet is used to create 
the intricate three-dimensional structures that enable proteins to carry out 
so many biological processes is an exciting area of biochemistry and one that 
we will return to in Section 2.6. 

Although there are many ways to classify amino acids, we will assort 
these molecules into four groups, on the basis of the general chemical char- 
acteristics of their R groups: 


1. Hydrophobic amino acids with nonpolar R groups 


2. Polar amino acids with neutral R groups in which the charge is not evenly
distributed


3. Positively charged amino acids with R groups that have a positive charge 
at physiological pH 


4. Negatively charged amino acids with R groups that have a negative 
charge at physiological pH 


Hydrophobic amino acids The simplest amino acid is glycine, which has 
a single hydrogen atom as its side chain. With two hydrogen atoms 
bonded to the α-carbon atom, glycine is unique in being achiral. Alanine,
the next simplest amino acid, has a methyl group (—CH3) as its side chain 
(Figure 2.7). 

Larger hydrocarbon side chains are found in valine, leucine, and isoleu- 
cine. Methionine contains a largely aliphatic side chain that includes a 
thioether (—S—) group. The side chain of isoleucine includes an additional 
chiral center; only the isomer shown in Figure 2.7 is found in proteins. The 
larger aliphatic side chains are especially hydrophobic; that is, they tend to 
cluster together rather than contact water. The three-dimensional struc- 
tures of water-soluble proteins are stabilized by this tendency of hydro- 
phobic groups to come together, which is called the hydrophobic effect 
(Chapter 1). The different sizes and shapes of these hydrocarbon side chains 
enable them to pack together to form compact structures with little empty 
space. Proline also has an aliphatic side chain, but it differs from other mem- 
bers of the set of 20 in that its side chain is bonded to both the nitrogen and 
the α-carbon atoms. Proline markedly influences protein architecture
because its ring structure makes it more conformationally restricted than 
the other amino acids. 

Two amino acids with relatively simple aromatic side chains are part of the 
fundamental repertoire. Phenylalanine, as its name indicates, contains a phe- 
nyl ring attached in place of one of the hydrogen atoms of alanine. Tryptophan 
has an indole group joined to a methylene (–CH₂–) group; the indole group
comprises two fused rings containing an NH group. Phenylalanine is purely
hydrophobic, whereas tryptophan is less so because of its NH group.


Figure 2.7 Structures of hydrophobic amino acids. For each amino acid, a
ball-and-stick model (top) shows the arrangement of atoms and bonds in space.
A stereochemically realistic formula (middle) shows the geometric arrangement
of bonds around atoms, and a Fischer projection (bottom) shows all bonds as
being perpendicular for a simplified representation (see the Appendix to
Chapter 1).


Figure 2.8 Structures of the polar amino acids. The additional chiral center
in threonine is indicated by an asterisk.


Polar amino acids Six amino acids are polar but uncharged. Three amino
acids, serine, threonine, and tyrosine, contain hydroxyl groups (—OH) attached 
to a hydrophobic side chain (Figure 2.8). Serine can be thought of as a ver- 
sion of alanine with a hydroxyl group attached, threonine resembles valine 
with a hydroxyl group in place of one of valine’s methyl groups, and tyrosine 
is a version of phenylalanine with the hydroxyl group replacing a hydrogen 
atom on the aromatic ring. The hydroxyl group makes these amino acids 
much more hydrophilic (water loving) and reactive than their hydrophobic 
analogs. Threonine, like isoleucine, contains an additional asymmetric cen- 
ter; again, only one isomer is present in proteins. 

In addition, the set includes asparagine and glutamine, uncharged deriv- 
atives of the acidic amino acids aspartate and glutamate (see Figure 2.11). 
Each of these two amino acids contains a terminal carboxamide in place of a 
carboxylic acid. The side chain of glutamine is one methylene group longer 
than that of asparagine. 

Cysteine is structurally similar to serine but contains a sulfhydryl, or thiol 
(—SH), group in place of the hydroxyl (—OH) group. The sulfhydryl 
group is much more reactive. Pairs of sulfhydryl groups may come together 
to form disulfide bonds, which are particularly important in stabilizing 
some proteins, as will be discussed shortly. 


Positively charged amino acids We turn now to amino acids with complete 
positive charges that render them highly hydrophilic. Lysine and arginine
have long side chains that terminate with groups that are positively charged
at neutral pH. Lysine is capped by a primary amino group and arginine by 
a guanidinium group. Histidine contains an imidazole group, an aromatic 
ring that also can be positively charged (Figure 2.9). 

With a pKa value near 6, the imidazole group can be uncharged or
positively charged near neutral pH, depending on its local environment 
(Figure 2.10). Histidine is often found in the active sites of enzymes, where 
the imidazole ring can bind and release protons in the course of enzymatic 
reactions. 


Negatively charged amino acids This set of amino acids contains two with 
acidic side chains: aspartic acid and glutamic acid (Figure 2.11). These amino 
acids are often called aspartate and glutamate to emphasize that, at physio- 
logical pH, their side chains usually lack a proton that is present in the acid 
form and hence are negatively charged. Nonetheless, in some proteins, 
these side chains do accept protons, and this ability is often functionally 
important. 


Figure 2.10 Histidine ionization. Histidine can bind or release protons near physiological pH. 


Figure 2.9 Positively charged amino acids lysine, arginine, and histidine.
[Structures shown: guanidinium and imidazole groups.]


Figure 2.11 Negatively charged amino acids. [Panels: aspartate (Asp, D) and
glutamate (Glu, E).]


Table 2.1 Typical pKa values of ionizable groups in proteins

Group       Acid ⇌ Base      Typical pKa*
Cysteine    –SH ⇌ –S⁻        8.3

*pKa values depend on temperature, ionic strength, and the microenvironment
of the ionizable group.


Seven of the 20 amino acids have readily ionizable side chains. These
7 amino acids are able to donate or accept protons to facilitate reactions as well
as to form ionic bonds. Table 2.1 gives equilibria and typical pKa values for
ionization of the side chains of tyrosine, cysteine, arginine, lysine, histidine,
and aspartic and glutamic acids in proteins. Two other groups in proteins
(the terminal α-amino group and the terminal α-carboxyl group) can be
ionized, and typical pKa values for these groups also are included in Table 2.1.

Amino acids are often designated by either a three-letter abbreviation or a
one-letter symbol (Table 2.2). The abbreviations for amino acids are the first
three letters of their names, except for asparagine (Asn), glutamine (Gln),
isoleucine (Ile), and tryptophan (Trp). The symbols for many amino acids are
the first letters of their names (e.g., G for glycine and L for leucine); the other
symbols have been agreed on by convention. These abbreviations and symbols
are an integral part of the vocabulary of biochemists.


Table 2.2 Abbreviations for amino acids

Amino acid                     Three-letter    One-letter
                               abbreviation    abbreviation
Alanine                        Ala             A
Arginine                       Arg             R
Asparagine                     Asn             N
Aspartic acid                  Asp             D
Cysteine                       Cys             C
Glutamine                      Gln             Q
Glutamic acid                  Glu             E
Glycine                        Gly             G
Histidine                      His             H
Isoleucine                     Ile             I
Leucine                        Leu             L
Lysine                         Lys             K
Methionine                     Met             M
Phenylalanine                  Phe             F
Proline                        Pro             P
Serine                         Ser             S
Threonine                      Thr             T
Tryptophan                     Trp             W
Tyrosine                       Tyr             Y
Valine                         Val             V
Asparagine or aspartic acid    Asx             B
Glutamine or glutamic acid     Glx             Z


Figure 2.12 Undesirable reactivity in amino acids. Some amino acids are
unsuitable for proteins because of undesirable cyclization. Homoserine can
cyclize to form a stable, five-membered ring, potentially resulting in
peptide-bond cleavage. The cyclization of serine would form a strained,
four-membered ring and is thus disfavored. X can be an amino group from a
neighboring amino acid or another potential leaving group.
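
Because the abbreviations in Table 2.2 are a fixed lookup, converting between the two notations is a mechanical step. The Python sketch below shows one way to do it for the 20 standard amino acids; the function names and the example tripeptide are hypothetical, and the ambiguity codes Asx and Glx are deliberately left out.

```python
# Three-letter abbreviations and one-letter symbols from Table 2.2
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
ONE_TO_THREE = {one: three for three, one in THREE_TO_ONE.items()}


def to_one_letter(peptide: str) -> str:
    """Convert a hyphen-separated three-letter sequence to one-letter symbols."""
    return "".join(THREE_TO_ONE[residue] for residue in peptide.split("-"))


def to_three_letter(sequence: str) -> str:
    """Convert a string of one-letter symbols to hyphen-separated abbreviations."""
    return "-".join(ONE_TO_THREE[symbol] for symbol in sequence)


# Hypothetical tripeptide used only to exercise the two functions
print(to_one_letter("Gly-Ala-Ser"))
print(to_three_letter("GAS"))
```

An unknown abbreviation raises a KeyError in this sketch, which is usually preferable to silently passing a misspelled residue through.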


How did this particular set of amino acids become the building
blocks of proteins? First, as a set, they are diverse: their structural and
chemical properties span a wide range, endowing proteins with the versatil- 
ity to assume many functional roles. Second, many of these amino acids 
were probably available from prebiotic reactions; that is, from reactions that 
took place before the origin of life. Finally, other possible amino acids may 
have simply been too reactive. For example, amino acids such as homoser- 
ine and homocysteine tend to form five-membered cyclic forms that limit 
their use in proteins; the alternative amino acids that are found in proteins— 
serine and cysteine—do not readily cyclize, because the rings in their cyclic 
forms are too small (Figure 2.12). 


2.2 Primary Structure: Amino Acids Are Linked by Peptide 
Bonds to Form Polypeptide Chains 


Proteins are linear polymers formed by linking the α-carboxyl group of
one amino acid to the α-amino group of another amino acid. This type of
linkage is called a peptide bond or an amide bond. The formation of a dipep- 
tide from two amino acids is accompanied by the loss of a water molecule 
(Figure 2.13). The equilibrium of this reaction lies on the side of hydrolysis 
rather than synthesis under most conditions. Hence, the biosynthesis of 
peptide bonds requires an input of free energy. Nonetheless, peptide bonds 
are quite stable kinetically because the rate of hydrolysis is extremely slow; 
the lifetime of a peptide bond in aqueous solution in the absence of a catalyst 
approaches 1000 years.


Figure 2.13 Peptide-bond formation. The linking of two amino acids is
accompanied by the loss of a molecule of water.


Figure 2.14 Amino acid sequences have direction. This illustration of the
pentapeptide Tyr-Gly-Gly-Phe-Leu (YGGFL) shows the sequence from the amino
terminus to the carboxyl terminus. This pentapeptide, Leu-enkephalin, is an
opioid peptide that modulates the perception of pain. The reverse
pentapeptide, Leu-Phe-Gly-Gly-Tyr (LFGGY), is a different molecule and has no
such effects.


A series of amino acids joined by peptide bonds forms a polypeptide
chain, and each amino acid unit in a polypeptide is called a residue. A
polypeptide chain has polarity because its ends are different: an α-amino group
is present at one end and an α-carboxyl group at the other. By convention,
the amino end is taken to be the beginning of a polypeptide chain, and so the
sequence of amino acids in a polypeptide chain is written starting with the
amino-terminal residue. Thus, in the pentapeptide Tyr-Gly-Gly-Phe-Leu
(YGGFL), tyrosine is the amino-terminal (N-terminal) residue and leucine
is the carboxyl-terminal (C-terminal) residue (Figure 2.14).
Leu-Phe-Gly-Gly-Tyr (LFGGY) is a different pentapeptide, with different
chemical properties.


A polypeptide chain consists of a regularly repeating part, called the
main chain or backbone, and a variable part, comprising the distinctive side
chains (Figure 2.15). The polypeptide backbone is rich in hydrogen-bonding
potential. Each residue contains a carbonyl group (C=O), which is a good
hydrogen-bond acceptor, and, with the exception of proline, an NH group,
which is a good hydrogen-bond donor. These groups interact with each
other and with functional groups from side chains to stabilize particular
structures, as will be discussed in Section 2.3.


Figure 2.15 Components of a polypeptide chain. A polypeptide chain consists
of a constant backbone (shown in black) and variable side chains (shown in
green).


Dalton

A unit of mass very nearly equal to that of a hydrogen atom. Named after
John Dalton (1766-1844), who developed the atomic theory of matter.

Kilodalton (kd)

A unit of mass equal to 1000 daltons.

Most natural polypeptide chains contain between 50 and 2000 amino
acid residues and are commonly referred to as proteins. The largest protein
known is the muscle protein titin, which consists of more than 27,000 amino
acids. Peptides made of small numbers of amino acids
are called oligopeptides or simply peptides. The mean
molecular weight of an amino acid residue is about 110 g
mol⁻¹, and so the molecular weights of most proteins are
between 5500 and 220,000 g mol⁻¹. We can also refer to
the mass of a protein, which is expressed in units of daltons;
one dalton is equal to one atomic mass unit. A
protein with a molecular weight of 50,000 g mol⁻¹ has a
mass of 50,000 daltons, or 50 kd (kilodaltons).
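
The arithmetic behind these figures is a single multiplication, sketched below in Python. The function name is hypothetical, and the result is only a rough estimate because it relies on the mean residue mass of about 110 daltons quoted above rather than on the actual composition of any protein.

```python
MEAN_RESIDUE_MASS_DALTONS = 110.0  # approximate mean residue mass from the text


def estimate_protein_mass_kd(n_residues: int) -> float:
    """Rough protein mass in kilodaltons from the number of residues."""
    return n_residues * MEAN_RESIDUE_MASS_DALTONS / 1000.0


# Lower and upper ends of the typical size range, and a titin-sized chain
for n_residues in (50, 2000, 27000):
    print(n_residues, estimate_protein_mass_kd(n_residues))
```

By this estimate the 50- and 2000-residue limits correspond to 5.5 kd and 220 kd, matching the range given above, and a chain of 27,000 residues comes to roughly 3000 kd.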


In some proteins, the linear polypeptide chain is
cross-linked. The most common cross-links are disulfide
bonds, formed by the oxidation of a pair of cysteine residues
(Figure 2.16). The resulting unit of two linked
cysteines is called cystine. Extracellular proteins often
have several disulfide bonds, whereas intracellular proteins
usually lack them. Rarely, nondisulfide cross-links
derived from other side chains are present in proteins.
For example, collagen fibers in connective tissue are
strengthened in this way, as are fibrin blood clots.


Proteins have unique amino acid sequences specified
by genes


In 1953, Frederick Sanger determined the amino acid
sequence of insulin, a protein hormone (Figure 2.17).
This work is a landmark in biochemistry because it showed
for the first time that a protein has a precisely defined amino
acid sequence consisting only of L amino acids linked by
peptide bonds. This accomplishment stimulated other
scientists to carry out sequence studies of a wide variety of proteins.
Currently, the complete amino acid sequences of more than 2,000,000
proteins are known. The striking fact is that each protein has a unique, precisely
defined amino acid sequence. The amino acid sequence of a protein is referred
to as its primary structure.


A chain (21 residues)
Gly-Ile-Val-Glu-Gln-Cys-Cys-Ala-Ser-Val-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn

B chain (30 residues)
Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg-Gly-Phe-Phe-Tyr-Thr-Pro-Lys-Ala


Figure 2.17 Amino acid sequence of bovine insulin.


A series of incisive studies in the late 1950s and early 1960s revealed that 
the amino acid sequences of proteins are determined by the nucleotide 
sequences of genes. The sequence of nucleotides in DNA specifies a com- 
plementary sequence of nucleotides in RNA, which in turn specifies the 
amino acid sequence of a protein. In particular, each of the 20 amino acids 
of the repertoire is encoded by one or more specific sequences of three 
nucleotides (Section 5.5). 
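
A quick count shows why an amino acid can have more than one three-nucleotide code word. The Python sketch below simply enumerates every possible triplet of the four RNA nucleotides; it does not assign triplets to amino acids, and the function name is hypothetical.

```python
from itertools import product

RNA_NUCLEOTIDES = "ACGU"


def all_triplets(alphabet: str = RNA_NUCLEOTIDES) -> list:
    """Return every ordered sequence of three nucleotides from the alphabet."""
    return ["".join(triplet) for triplet in product(alphabet, repeat=3)]


triplets = all_triplets()
print(len(triplets))   # number of possible three-nucleotide sequences
print(triplets[:4])    # the first few, in alphabetical order
```

Four nucleotides taken three at a time give 4 × 4 × 4 = 64 possible triplets, which is more than the 20 amino acids that need to be specified.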

Knowing amino acid sequences is important for several reasons. First, 
knowledge of the sequence of a protein is usually essential to elucidating 
its mechanism of action (e.g., the catalytic mechanism of an enzyme). In 
fact, proteins with novel properties can be generated by varying the 
sequence of known proteins. Second, amino acid sequences determine the 
three-dimensional structures of proteins. Amino acid sequence is the link 
between the genetic message in DNA and the three-dimensional struc- 
ture that performs a protein’s biological function. Analyses of relations 
between amino acid sequences and three-dimensional structures of pro- 
teins are uncovering the rules that govern the folding of polypeptide 
chains. Third, sequence determination is a component of molecular 
pathology, a rapidly growing area of medicine. Alterations in amino acid 
sequence can produce abnormal function and disease. Severe and sometimes
fatal diseases, such as sickle-cell anemia (Chapter 7) and cystic
fibrosis, can result from a change in a single amino acid within a protein.
Fourth, the sequence of a protein reveals much about its evolutionary
history (Chapter 6). Proteins resemble one another in amino acid
sequence only if they have a common ancestor. Consequently, molecular
events in evolution can be traced from amino acid sequences; molecular
paleontology is a flourishing area of research.


Figure 2.16 Cross-links. The formation of a disulfide bond from two
cysteine residues is an oxidation reaction.
[Reaction scheme: two cysteine residues ⇌ cystine + 2 H⁺ + 2 e⁻; oxidation
runs forward and reduction runs in reverse.]


Polypeptide chains are flexible yet conformationally 
restricted 


Examination of the geometry of the protein backbone reveals
several important features. First, the peptide bond is essentially
planar (Figure 2.18). Thus, for a pair of amino acids linked by
a peptide bond, six atoms lie in the same plane: the α-carbon
atom and CO group of the first amino acid and the NH group
and α-carbon atom of the second amino acid. The nature of the
chemical bonding within a peptide accounts for the bond’s
planarity. The bond resonates between a single bond and a
double bond. Because of this double-bond character, rotation
about this bond is prevented and thus the conformation of the
peptide backbone is constrained.


Figure 2.18 Peptide bonds are planar. In a pair of linked amino acids, six
atoms (Cα, C, O, N, H, and Cα) lie in a plane. Side chains are shown as
green balls.


[Peptide-bond resonance structures]


The double-bond character is also expressed in the length of the bond
between the CO and the NH groups. The C-N distance in a peptide bond
is typically 1.32 Å, which is between the values expected for a
C-N single bond (1.49 Å) and a C=N double bond (1.27 Å),
as shown in Figure 2.19. Finally, the peptide bond is uncharged,
allowing polymers of amino acids linked by peptide bonds to
form tightly packed globular structures.

Two configurations are possible for a planar peptide bond.
In the trans configuration, the two α-carbon atoms are on opposite
sides of the peptide bond. In the cis configuration, these
groups are on the same side of the peptide bond. Almost all peptide
bonds in proteins are trans. This preference for trans over cis
can be explained by the fact that steric clashes between groups
attached to the α-carbon atoms hinder the formation of the cis
form but do not arise in the trans configuration (Figure 2.20). By
far the most common cis peptide bonds are X-Pro linkages.
Such bonds show less preference for the trans configuration
because the nitrogen of proline is bonded to two tetrahedral carbon atoms,
limiting the steric differences between the trans and cis forms (Figure 2.21).

Figure 2.19 Typical bond lengths within a peptide unit.
The peptide unit is shown in the trans configuration.

Figure 2.20 Trans and cis peptide bonds. The trans form is strongly favored because of steric
clashes that arise in the cis form.

Figure 2.21 Trans and cis X-Pro bonds. The energies of these forms are similar to one another
because steric clashes arise in both forms.

In contrast with the peptide bond, the bonds between the amino group
and the α-carbon atom and between the α-carbon atom and the carbonyl
group are pure single bonds. The two adjacent rigid peptide units can rotate
about these bonds, taking on various orientations. This freedom of rotation
about two bonds of each amino acid allows proteins to fold in many different
ways. The rotations about these bonds can be specified by torsion angles
(Figure 2.22). The angle of rotation about the bond between the nitrogen
and the α-carbon atoms is called phi (φ). The angle of rotation about the
bond between the α-carbon and the carbonyl carbon atoms is called psi (ψ).
A clockwise rotation about either bond as viewed from the nitrogen atom
toward the α-carbon atom or from the carbonyl group toward the α-carbon
atom corresponds to a positive value. The φ and ψ angles determine the
path of the polypeptide chain.
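
As a rough illustration of how a torsion angle such as φ or ψ can be obtained from atomic coordinates, the following Python sketch computes the angle about the bond joining the second and third of four points. The function name `torsion_angle` and the coordinates are hypothetical, chosen only for illustration; the sign convention is assumed to match the text (a clockwise rotation viewed down the central bond is positive).

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def torsion_angle(p0, p1, p2, p3):
    """Torsion angle in degrees (-180 to +180) about the p1-p2 bond.

    For phi the four atoms would be C(i-1), N(i), CA(i), C(i);
    for psi they would be N(i), CA(i), C(i), N(i+1).
    """
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)  # normal to the plane through p0, p1, p2
    n2 = _cross(b2, b3)  # normal to the plane through p1, p2, p3
    y = _dot(b2, _cross(n1, n2))
    x = math.sqrt(_dot(b2, b2)) * _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

# Hypothetical coordinates: the central bond runs along the z axis.
cis = torsion_angle((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1))
quarter = torsion_angle((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))
trans = torsion_angle((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1))
print(cis, quarter, abs(trans))
```

In this toy geometry the eclipsed arrangement gives 0 degrees, a quarter turn gives +90 degrees, and the fully staggered arrangement gives a magnitude of 180 degrees, which is why torsion angles are usually reported on the −180 to +180 degree scale.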


Figure 2.22 Rotation about bonds in a polypeptide. The structure of each amino acid in a
polypeptide can be adjusted by rotation about two single bonds. (A) Phi (φ) is the angle of rotation
about the bond between the nitrogen and the α-carbon atoms, whereas psi (ψ) is the angle of
rotation about the bond between the α-carbon and the carbonyl carbon atoms. (B) A view down
the bond between the nitrogen and the α-carbon atoms, showing how φ is measured. (C) A view
down the bond between the α-carbon and the carbonyl carbon atoms, showing how ψ is measured.


Are all combinations of φ and ψ possible? Gopalasamudram
Ramachandran recognized that many combinations are forbidden because
of steric collisions between atoms. The allowed values can be visualized on
a two-dimensional plot called a Ramachandran diagram (Figure 2.23).
Three-quarters of the possible (φ, ψ) combinations are excluded simply by
local steric clashes. Steric exclusion, the fact that two atoms cannot be in the
same place at the same time, can be a powerful organizing principle.

Torsion angle

A measure of the rotation about a bond,
usually taken to lie between −180 and
+180 degrees. Torsion angles are sometimes
called dihedral angles.

Figure 2.23 A Ramachandran diagram showing the values of φ and ψ. Not all φ and ψ
values are possible without collisions between atoms. The most favorable regions are shown in dark
green; borderline regions are shown in light green. The structure on the right (φ = 90°, ψ = −90°)
is disfavored because of steric clashes.


The ability of biological polymers such as proteins to fold into well-defined
structures is remarkable thermodynamically. An unfolded polymer
exists as a random coil: each copy of an unfolded polymer will have a different
conformation, yielding a mixture of many possible conformations. The
favorable entropy associated with a mixture of many conformations opposes
folding and must be overcome by interactions favoring the folded form.
Thus, highly flexible polymers with a large number of possible conformations
do not fold into unique structures. The rigidity of the peptide unit and
the restricted set of allowed φ and ψ angles limit the number of structures
accessible to the unfolded form sufficiently to allow protein folding to take place.


2.3 Secondary Structure: Polypeptide Chains Can 
Fold into Regular Structures Such As the Alpha Helix, 
the Beta Sheet, and Turns and Loops 


Can a polypeptide chain fold into a regularly repeating structure? In 1951,
Linus Pauling and Robert Corey proposed two periodic structures called
the α helix (alpha helix) and the β pleated sheet (beta pleated sheet).
Subsequently, other structures such as the β turn and omega (Ω) loop were
identified. Although not periodic, these common turn or loop structures are
well defined and contribute with α helices and β sheets to form the final
protein structure. Alpha helices, β strands, and turns are formed by a regular
pattern of hydrogen bonds between the peptide N-H and C=O groups
of amino acids that are near one another in the linear sequence. Such folded
segments are called secondary structure.


The alpha helix is a coiled structure stabilized by intrachain 
hydrogen bonds 


Figure 2.24 Structure of the α helix. (A) A ribbon depiction shows the α-carbon atoms and
side chains (green). (B) A side view of a ball-and-stick version depicts the hydrogen bonds (dashed
lines) between NH and CO groups. (C) An end view shows the coiled backbone as the inside of
the helix and the side chains (green) projecting outward. (D) A space-filling view of part C shows
the tightly packed interior core of the helix.

Figure 2.25 Hydrogen-bonding scheme for an α helix. In the α helix, the CO group of residue
i forms a hydrogen bond with the NH group of residue i + 4.

Screw sense

Describes the direction in which a helical
structure rotates with respect to its axis. If,
viewed down the axis of a helix, the chain
turns in a clockwise direction, it has a right-handed
screw sense. If the turning is counterclockwise,
the screw sense is left-handed.

Figure 2.26 Ramachandran diagram for
helices. Both right- and left-handed helices
lie in regions of allowed conformations in the
Ramachandran diagram. However, essentially
all α helices in proteins are right-handed.

Figure 2.27 Schematic views of
α helices. (A) A ribbon depiction.
(B) A cylindrical depiction.

Figure 2.28 A largely α-helical
protein. Ferritin, an iron-storage protein, is
built from a bundle of α helices. [Drawn from
1AEW.pdb.]

Figure 2.29 Ramachandran diagram for
β strands. The red area shows the sterically
allowed conformations of extended, β-strand-like
structures.

In evaluating potential structures, Pauling and Corey considered which
conformations of peptides were sterically allowed and which most fully
exploited the hydrogen-bonding capacity of the backbone NH and CO
groups. The first of their proposed structures, the α helix, is a rodlike structure
(Figure 2.24). A tightly coiled backbone forms the inner part of the rod
and the side chains extend outward in a helical array. The α helix is stabilized
by hydrogen bonds between the NH and CO groups of the main chain. In
particular, the CO group of each amino acid forms a hydrogen bond with
the NH group of the amino acid that is situated four residues ahead in
the sequence (Figure 2.25). Thus, except for amino acids near the ends of an
α helix, all the main-chain CO and NH groups are hydrogen bonded. Each
residue is related to the next one by a rise, also called translation, of 1.5 Å
along the helix axis and a rotation of 100 degrees, which gives 3.6 amino acid
residues per turn of helix. Thus, amino acids spaced three and four apart
in the sequence are spatially quite close to one another in an α helix. In
contrast, amino acids spaced two apart in the sequence are situated on opposite
sides of the helix and so are unlikely to make contact. The pitch of the
α helix is the length of one complete turn along the helix axis and is equal to
the product of the rise (1.5 Å) and the number of residues per turn (3.6), or
5.4 Å. The screw sense of a helix can be right-handed (clockwise) or left-handed
(counterclockwise). The Ramachandran diagram reveals that both
the right-handed and the left-handed helices are among allowed conformations
(Figure 2.26). However, right-handed helices are energetically more favorable
because there is less steric clash between the side chains and the backbone.
Essentially all α helices found in proteins are right-handed. In schematic
representations of proteins, α helices are depicted as twisted ribbons or rods
(Figure 2.27).
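
The α-helix numbers quoted in the text (a rise of 1.5 Å and a rotation of 100 degrees per residue) can be checked with a few lines of arithmetic. The Python sketch below is only an illustration: the constant and function names are hypothetical, and the helix is treated as an ideal, perfectly regular one.

```python
RISE_PER_RESIDUE = 1.5       # angstroms along the helix axis (from the text)
ROTATION_PER_RESIDUE = 100   # degrees about the helix axis (from the text)

residues_per_turn = 360 / ROTATION_PER_RESIDUE
pitch = RISE_PER_RESIDUE * residues_per_turn

def angular_separation(spacing):
    """Smallest angle (degrees) between two residues `spacing` apart,
    as seen looking down the helix axis."""
    angle = (spacing * ROTATION_PER_RESIDUE) % 360
    return min(angle, 360 - angle)

print(f"residues per turn: {residues_per_turn:.1f}")
print(f"pitch: {pitch:.1f} angstroms")
for spacing in (2, 3, 4):
    print(f"i and i+{spacing}: {angular_separation(spacing)} degrees apart")
```

Under these assumptions, residues two apart are separated by 160 degrees (nearly opposite sides of the helix), whereas residues three and four apart are separated by only 60 and 40 degrees, consistent with the statement that they lie close to one another.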

Not all amino acids can be readily accommodated in an α helix.
Branching at the β-carbon atom, as in valine, threonine, and isoleucine,
tends to destabilize α helices because of steric clashes. Serine, aspartate, and
asparagine also tend to disrupt α helices because their side chains contain
hydrogen-bond donors or acceptors in close proximity to the main chain,
where they compete for main-chain NH and CO groups. Proline also is a
helix breaker because it lacks an NH group and because its ring structure
prevents it from assuming the φ value to fit into an α helix.

The α-helical content of proteins ranges widely, from none to almost
100%. For example, about 75% of the residues in ferritin, a protein that
helps store iron, are in α helices (Figure 2.28). Indeed, about 25% of all
soluble proteins are composed of α helices connected by loops and turns of
the polypeptide chain. Single α helices are usually less than 45 Å long.
Many proteins that span biological membranes also contain α helices.


Beta sheets are stabilized by hydrogen bonding between 
polypeptide strands 


Pauling and Corey proposed another periodic structural motif, which they
named the β pleated sheet (β because it was the second structure that they
elucidated, the α helix having been the first). The β pleated sheet (or, more
simply, the β sheet) differs markedly from the rodlike α helix. It is composed
of two or more polypeptide chains called β strands. A β strand is almost
fully extended rather than being tightly coiled as in the α helix. A range
of extended structures are sterically allowed (Figure 2.29).

Figure 2.30 Structure of a β strand. The side chains (green) are alternately above and below
the plane of the strand.

Figure 2.31 An antiparallel β sheet. Adjacent β strands run in opposite directions. Hydrogen
bonds between NH and CO groups connect each amino acid to a single amino acid on an adjacent
strand, stabilizing the structure.

Figure 2.32 A parallel β sheet. Adjacent β strands run in the same direction. Hydrogen bonds
connect each amino acid on one strand with two different amino acids on the adjacent strand.

The distance between adjacent amino acids along a β strand is approximately
3.5 Å, in contrast with a distance of 1.5 Å along an α helix. The side
chains of adjacent amino acids point in opposite directions (Figure 2.30). A
β sheet is formed by linking two or more β strands lying next to one another
through hydrogen bonds. Adjacent chains in a β sheet can run in opposite
directions (antiparallel β sheet) or in the same direction (parallel β sheet). In
the antiparallel arrangement, the NH group and the CO group of each
amino acid are respectively hydrogen bonded to the CO group and the NH
group of a partner on the adjacent chain (Figure 2.31). In the parallel
arrangement, the hydrogen-bonding scheme is slightly more complicated.
For each amino acid, the NH group is hydrogen bonded to the CO group of
one amino acid on the adjacent strand, whereas the CO group is hydrogen
bonded to the NH group on the amino acid two residues farther along the
chain (Figure 2.32). Many strands, typically 4 or 5 but as many as 10 or
more, can come together in β sheets. Such β sheets can be purely antiparallel,
purely parallel, or mixed (Figure 2.33).

In schematic representations, β strands are usually depicted by broad
arrows pointing in the direction of the carboxyl-terminal end to indicate the
type of β sheet formed (parallel or antiparallel). More structurally diverse
than α helices, β sheets can be almost flat but most adopt a somewhat
twisted shape (Figure 2.34). The β sheet is an important structural element
in many proteins. For example, fatty acid-binding proteins, important for
lipid metabolism, are built almost entirely from β sheets (Figure 2.35).

Figure 2.33 Structure of a mixed β sheet.

Figure 2.34 A schematic twisted β sheet. (A) A schematic model. (B) The schematic view
rotated by 90 degrees to illustrate the twist more clearly.

Figure 2.35 A protein rich in β
sheets. The structure of a fatty acid-binding
protein. [Drawn from 1FTP.pdb.]


Polypeptide chains can change direction by making reverse 
turns and loops 


Most proteins have compact, globular shapes owing to reversals in the
direction of their polypeptide chains. Many of these reversals are accomplished
by a common structural element called the reverse turn (also known
as the β turn or hairpin turn), illustrated in Figure 2.36. In many reverse
turns, the CO group of residue i of a polypeptide is hydrogen bonded to the
NH group of residue i + 3. This interaction stabilizes abrupt changes in
direction of the polypeptide chain. In other cases, more-elaborate structures
are responsible for chain reversals. These structures are called loops or sometimes
Ω loops (omega loops) to suggest their overall shape. Unlike α helices
and β strands, loops do not have regular, periodic structures. Nonetheless,
loop structures are often rigid and well defined (Figure 2.37). Turns and
loops invariably lie on the surfaces of proteins and thus often participate in
interactions between proteins and other molecules.

Figure 2.36 Structure of a reverse turn.
The CO group of residue i of the
polypeptide chain is hydrogen bonded to
the NH group of residue i + 3 to stabilize
the turn.

Figure 2.37 Loops on a protein
surface. A part of an antibody molecule has
surface loops (shown in red) that mediate
interactions with other molecules. [Drawn
from 7FTP.pdb.]


Fibrous proteins provide structural support for cells and tissues 


Special types of helices are present in the two proteins α-keratin and collagen.
These proteins form long fibers that serve a structural role.

α-Keratin, which is the primary component of wool, hair, and skin, consists
of two right-handed α helices intertwined to form a type of left-handed
superhelix called an α-helical coiled coil. α-Keratin is a member of a superfamily
of proteins referred to as coiled-coil proteins (Figure 2.38). In these
proteins, two or more α helices can entwine to form a very stable structure,
which can have a length of 1000 Å (100 nm, or 0.1 μm) or more. There
are approximately 60 members of this family in humans, including intermediate
filaments, proteins that contribute to the cell cytoskeleton (internal
scaffolding in a cell), and the muscle proteins myosin and tropomyosin
(Section 35.2). Members of this family are characterized by a central region
of 300 amino acids that contains imperfect repeats of a sequence of seven
amino acids called a heptad repeat.

Figure 2.38 An α-helical coiled coil. (A) Space-filling model. (B) Ribbon diagram. The two
helices wind around one another to form a superhelix. Such structures are found in many proteins,
including keratin in hair, quills, claws, and horns. [Drawn from 1C1G.pdb.]

Figure 2.39 Heptad repeats in a
coiled-coil protein. Every seventh residue in
each helix is leucine. The two helices are held
together by van der Waals interactions
primarily between the leucine residues.
[Drawn from 2ZTA.pdb.]

Figure 2.40 Amino acid sequence of a
part of a collagen chain. Every third residue
is a glycine. Proline and hydroxyproline also
are abundant.

The two helices in α-keratin are cross-linked by weak interactions such
as van der Waals forces and ionic interactions. These interactions are facilitated
by the fact that the left-handed supercoil alters the two right-handed
α helices such that there are 3.5 residues per turn instead of 3.6. Thus, the
pattern of side-chain interactions can be repeated every seven residues,
forming the heptad repeats. Two helices with such repeats are able to interact
with one another if the repeats are complementary (Figure 2.39). For
example, the repeating residues may be hydrophobic, allowing van der
Waals interactions, or have opposite charge, allowing ionic interactions. In
addition, the two helices may be linked by disulfide bonds formed by neighboring
cysteine residues. The bonding of the helices accounts for the physical
properties of wool, an example of an α-keratin. Wool is extensible and
can be stretched to nearly twice its length because the α helices stretch,
breaking the weak interactions between neighboring helices. However, the
covalent disulfide bonds resist breakage and return the fiber to its original
state once the stretching force is released. The number of disulfide bond
cross-links further defines the fiber’s properties. Hair and wool, having
fewer cross-links, are flexible. Horns, claws, and hooves, having more
cross-links, are much harder.
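
The link between 3.5 residues per turn and a seven-residue repeat can be made concrete with a short calculation. The Python sketch below is a minimal illustration, not a sequence-analysis tool: the function names and the toy sequence are hypothetical, and the sequence is invented so that leucine recurs exactly every seventh residue.

```python
def turns_per_heptad(residues_per_turn):
    """Number of helical turns spanned by seven consecutive residues."""
    return 7 / residues_per_turn

def heptad_counts(sequence, residue="L"):
    """Count how often `residue` falls in each of the seven heptad registers."""
    counts = [0] * 7
    for index, amino_acid in enumerate(sequence):
        if amino_acid == residue:
            counts[index % 7] += 1
    return counts

# In the supercoiled helix, seven residues span exactly two turns, so every
# seventh residue lies on the same face; in a free α helix it drifts.
print(turns_per_heptad(3.5))            # 2.0
print(round(turns_per_heptad(3.6), 3))  # 1.944

toy_helix = "KQSLEDA" * 4               # hypothetical sequence, leucine every 7th
counts = heptad_counts(toy_helix)
register = counts.index(max(counts))
print(register, counts[register])       # register 3 holds all four leucines
```

Because seven residues correspond to a whole number of turns only at 3.5 residues per turn, the leucine side chains in such a toy helix would line up as a stripe along one face, the arrangement that lets two helices pack against each other.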

A different type of helix is present in collagen, the most abundant protein
of mammals. Collagen is the main fibrous component of skin, bone,
tendon, cartilage, and teeth. This extracellular protein is a rod-shaped molecule,
about 3000 Å long and only 15 Å in diameter. It contains three helical
polypeptide chains, each nearly 1000 residues long. Glycine appears at
every third residue in the amino acid sequence, and the sequence glycine-proline-hydroxyproline
recurs frequently (Figure 2.40). Hydroxyproline is
a derivative of proline that has a hydroxyl group in place of one of the hydrogen
atoms on the pyrrolidine ring.

The collagen helix has properties different from those of the α helix.
Hydrogen bonds within a strand are absent. Instead, the helix is stabilized 
by steric repulsion of the pyrrolidine rings of the proline and hydroxyproline 
residues (Figure 2.41). The pyrrolidine rings keep out of each other’s way 
when the polypeptide chain assumes its helical form, which has about 
three residues per turn. Three strands wind around one another to form a 
superhelical cable that is stabilized by hydrogen bonds between strands. 
The hydrogen bonds form between the peptide NH groups of glycine 
residues and the CO groups of residues on the other chains. The hydroxyl 
groups of hydroxyproline residues also participate in hydrogen bonding, 
and the absence of the hydroxyl groups results in the disease scurvy 
(Section 27.6). 

The inside of the triple-stranded helical cable is very crowded and 
accounts for the requirement that glycine be present at every third position 
on each strand (Figure 2.42A). The only residue that can fit in an interior 
position is glycine. The amino acid residue on either side of glycine is located 
on the outside of the cable, where there is room for the bulky rings of proline 
and hydroxyproline residues (Figure 2.42B). 
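
The every-third-residue glycine rule lends itself to a simple check. The Python sketch below is an assumption-laden illustration: the function names are hypothetical, the sequence fragments are invented, and the letter "O" is used here as an assumed one-letter stand-in for hydroxyproline.

```python
def glycine_register(sequence):
    """Return the register (0, 1, or 2) in which every residue is glycine,
    or None if no register satisfies the rule."""
    for register in range(3):
        if all(residue == "G" for residue in sequence[register::3]):
            return register
    return None

def glycine_violations(sequence, register=0):
    """List (index, residue) pairs where the expected glycine is replaced."""
    return [(i, sequence[i])
            for i in range(register, len(sequence), 3)
            if sequence[i] != "G"]

# Hypothetical fragments; "O" stands for hydroxyproline in this sketch.
normal = "GPOGPOGAOGPK"
mutant = "GPOGPOSAOGPK"   # glycine at index 6 replaced by serine

print(glycine_register(normal))     # 0
print(glycine_register(mutant))     # None
print(glycine_violations(mutant))   # [(6, 'S')]
```

A replacement flagged this way corresponds to a bulkier residue being forced into the crowded interior of the triple helix, where only glycine fits.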


Figure 2.41 Conformation of a single strand of a collagen triple helix. 


Figure 2.42 Structure of the protein collagen. (A) Space-filling
model of collagen. Each strand is shown in a different
color. (B) Cross section of a model of collagen. Each strand is
hydrogen bonded to the other two strands. The α-carbon atom
of a glycine residue is identified by the letter G. Every third
residue must be glycine because there is no space in the center
of the helix. Notice that the pyrrolidine rings are on the outside.


The importance of the positioning of glycine inside the triple helix
is illustrated in the disorder osteogenesis imperfecta, also known as 
brittle bone disease. In this condition, which can vary from mild to very severe, 
other amino acids replace the internal glycine residue. This replacement 
leads to a delayed and improper folding of collagen, and the accumulation 
of defective collagen results in cell death. The most serious symptom is 
severe bone fragility. Defective collagen in the eyes causes the whites of the 
eyes to have a blue tint (blue sclera). 


2.4 Tertiary Structure: Water-Soluble Proteins Fold into 
Compact Structures with Nonpolar Cores 


Let us now examine how amino acids are grouped together in a complete 
protein. X-ray crystallographic and nuclear magnetic resonance (NMR) 
studies (Section 3.6) have revealed the detailed three-dimensional struc- 
tures of thousands of proteins. We begin here with an examination of 
myoglobin, the first protein to be seen in atomic detail. 

Myoglobin, the oxygen carrier in muscle, is a single polypeptide chain of
153 amino acids (see Chapter 7). The capacity of myoglobin to bind oxygen
depends on the presence of heme, a nonpolypeptide prosthetic (helper) group
consisting of protoporphyrin IX and a central iron atom. Myoglobin is an
extremely compact molecule. Its overall dimensions are 45 × 35 × 25 Å, an
order of magnitude less than if it were fully stretched out (Figure 2.43).
About 70% of the main chain is folded into eight α helices, and much of the
rest of the chain forms turns and loops between helices.
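
The "order of magnitude" comparison can be checked with the per-residue distances given in Section 2.3. The Python sketch below is a back-of-the-envelope estimate only; it assumes that a fully stretched chain advances about 3.5 Å per residue (the β-strand value) and that the eight helices are of equal length, neither of which is stated in the text. All names are hypothetical.

```python
RESIDUES = 153                     # myoglobin chain length (from the text)
EXTENDED_RISE = 3.5                # angstroms per residue, β-strand value (assumed for a stretched chain)
HELIX_RISE = 1.5                   # angstroms per residue in an α helix
FOLDED_DIMENSIONS = (45, 35, 25)   # angstroms (from the text)

extended_length = RESIDUES * EXTENDED_RISE
ratio = extended_length / max(FOLDED_DIMENSIONS)

helical_residues = 0.70 * RESIDUES           # about 70% of the main chain
mean_helix_length = helical_residues / 8 * HELIX_RISE

print(f"stretched length: about {extended_length:.0f} angstroms")
print(f"stretched / longest folded dimension: about {ratio:.0f}x")
print(f"mean helix length: about {mean_helix_length:.0f} angstroms")
```

Under these assumptions the stretched chain would be roughly 536 Å long, about twelve times the longest folded dimension, and an average helix would be about 20 Å long, comfortably below the 45 Å span of the molecule.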

The folding of the main chain of myoglobin, like that of most other pro- 
teins, is complex and devoid of symmetry. The overall course of the poly- 
peptide chain of a protein is referred to as its tertiary structure. A unifying 
principle emerges from the distribution of side chains. The striking fact is 
that the interior consists almost entirely of nonpolar residues such as leucine, 
valine, methionine, and phenylalanine (Figure 2.44). Charged residues such 
as aspartate, glutamate, lysine, and arginine are absent from the inside of 
myoglobin. The only polar residues inside are two histidine residues, which 
play critical roles in binding iron and oxygen. The outside of myoglobin, on 
the other hand, consists of both polar and nonpolar residues. The space- 
filling model shows that there is very little empty space inside. 

Figure 2.43 Three-dimensional structure of myoglobin. (A) A ribbon diagram shows
that the protein consists largely of α helices. (B) A space-filling model in the same orientation
shows how tightly packed the folded protein is. Notice that the heme group is nestled into a
crevice in the compact protein with only an edge exposed. One helix is blue to allow comparison
of the two structural depictions. [Drawn from 1A6N.pdb.]

Figure 2.44 Distribution of amino
acids in myoglobin. (A) A space-filling
model of myoglobin with hydrophobic amino
acids shown in yellow, charged amino acids
shown in blue, and others shown in white.
Notice that the surface of the molecule has
many charged amino acids, as well as some
hydrophobic amino acids. (B) In this cross-sectional
view, notice that mostly hydrophobic
amino acids are found on the inside of the
structure, whereas the charged amino acids
are found on the protein surface. [Drawn
from 1MBD.pdb.]

This contrasting distribution of polar and nonpolar residues reveals a
key facet of protein architecture. In an aqueous environment, protein folding
is driven by the strong tendency of hydrophobic residues to be excluded
from water. Recall that a system is more thermodynamically stable when
hydrophobic groups are clustered rather than extended into the aqueous
surroundings (Chapter 1). The polypeptide chain therefore folds so that its
hydrophobic side chains are buried and its polar, charged chains are on the
surface. Many α helices and β strands are amphipathic; that is, the α helix
or β strand has a hydrophobic face, which points into the protein interior,
and a more polar face, which points into solution. The fate of the main
chain accompanying the hydrophobic side chains is important, too. An
unpaired peptide NH or CO group markedly prefers water to a nonpolar
milieu. The secret of burying a segment of main chain in a hydrophobic
environment is to pair all the NH and CO groups by hydrogen bonding.
This pairing is neatly accomplished in an α helix or β sheet. Van der Waals
interactions between tightly packed hydrocarbon side chains also contribute
to the stability of proteins. We can now understand why the set of
20 amino acids contains several that differ subtly in size and shape. They
provide a palette from which to choose to fill the interior of a protein
neatly and thereby maximize van der Waals interactions, which require
intimate contact.
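
One way to picture an amphipathic α helix is to place each residue around a helical wheel, 100 degrees apart, and ask whether the hydrophobic residues point in a common direction. The Python sketch below is a deliberately crude illustration and not a method from the text: the +1/−1 scoring, the set of residues treated as hydrophobic, the function name, and both sequences are assumptions made for this example.

```python
import math

HYDROPHOBIC = set("AVLIMFW")   # assumed grouping, for illustration only

def helical_moment(sequence, degrees_per_residue=100):
    """Length of the vector sum of +1 (hydrophobic) and -1 (other) scores
    placed around a helical wheel. A large value suggests that hydrophobic
    residues cluster on one face of the helix."""
    x = y = 0.0
    for index, residue in enumerate(sequence):
        score = 1.0 if residue in HYDROPHOBIC else -1.0
        angle = math.radians(index * degrees_per_residue)
        x += score * math.cos(angle)
        y += score * math.sin(angle)
    return math.hypot(x, y)

amphipathic = "LKKLLKKLLKLLKKLLKK"   # hypothetical: leucines placed on one face
uniform = "L" * 18                   # hypothetical: hydrophobic all the way around

print(round(helical_moment(amphipathic), 1))
print(round(helical_moment(uniform), 1))
```

In this toy comparison the patterned sequence gives a large moment because its leucines all fall on one side of the wheel, whereas the uniform 18-residue sequence, which covers exactly five turns, cancels to essentially zero.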




Figure 2.45 “Inside out” amino acid distribution in porin. The outside of porin (which 
contacts hydrophobic groups in membranes) is covered largely with hydrophobic residues, 
whereas the center includes a water-filled channel lined with charged and polar amino acids. 
[Drawn from 1PRN.pdb.] 


Some proteins that span biological membranes are “the exceptions that 
prove the rule” because they have the reverse distribution of hydrophobic 
and hydrophilic amino acids. For example, consider porins, proteins found 
in the outer membranes of many bacteria (Figure 2.45). Membranes are 
built largely of hydrophobic alkane chains (Section 12.2). Thus, porins are 
covered on the outside largely with hydrophobic residues that interact with 
the neighboring alkane chains. In contrast, the center of the protein contains 
many charged and polar amino acids that surround a water-filled channel 
running through the middle of the protein. Thus, because porins function 
in hydrophobic environments, they are “inside out” relative to proteins that 
function in aqueous solution. 

Certain combinations of secondary structure are present in many 
proteins and frequently exhibit similar functions. These combinations 
are called motifs or supersecondary structures. For example, an α helix
separated from another α helix by a turn, called a helix-turn-helix unit,
is found in many proteins that bind DNA (Figure 2.46). 

Some polypeptide chains fold into two or more compact regions that may be
connected by a flexible segment of polypeptide chain, rather like pearls on
a string. These compact globular units, called domains, range in size from
about 30 to 400 amino acid residues. For example, the extracellular part of
CD4, the cell-surface protein on certain cells of the immune system to which
the human immunodeficiency virus (HIV) attaches itself, comprises four
similar domains of approximately 100 amino acids each (Figure 2.47). Proteins
may have domains in common even if their overall tertiary structures are
different.

Figure 2.46 The helix-turn-helix
motif, a supersecondary structural
element. Helix-turn-helix motifs are found in
many DNA-binding proteins. [Drawn from
1LMB.pdb.]

Figure 2.47 Protein domains. The cell-surface protein CD4 consists of four
similar domains. [Drawn from 1WIO.pdb.]


Four levels of structure are frequently cited in discussions of protein
architecture. So far, we have considered three of them. Primary structure
is the amino acid sequence. Secondary structure refers to the spatial
arrangement of amino acid residues that are nearby in the sequence.
Some of these arrangements are of a regular kind, giving rise to a periodic
structure. The α helix and β strand are elements of secondary structure.
Tertiary structure refers to the spatial arrangement of amino acid
residues that are far apart in the sequence and to the pattern of disulfide
bonds. We now turn to proteins containing more than one polypeptide
chain. Such proteins exhibit a fourth level of structural organization.
Each polypeptide chain in such a protein is called a subunit. Quaternary
structure refers to the spatial arrangement of subunits and the nature of
their interactions. The simplest sort of quaternary structure
is a dimer, consisting of two identical subunits. This organization
is present in the DNA-binding protein Cro found in
a bacterial virus called λ (Figure 2.48). More-complicated
quaternary structures also are common. More than one type
of subunit can be present, often in variable numbers. For
example, human hemoglobin, the oxygen-carrying protein
in blood, consists of two subunits of one type (designated α)
and two subunits of another type (designated β), as illustrated
in Figure 2.49. Thus, the hemoglobin molecule exists
as an α2β2 tetramer. Subtle changes in the arrangement of
subunits within the hemoglobin molecule allow it to carry
oxygen from the lungs to tissues with great efficiency
(Chapter 7).

Figure 2.48 Quaternary structure. The Cro protein of
bacteriophage λ is a dimer of identical subunits. [Drawn from
5CRO.pdb.]

Viruses make the most of a limited amount of genetic 
information by forming coats that use the same kind of 
subunit repetitively in a symmetric array. The coat of rhinovirus, the virus 
that causes the common cold, includes 60 copies of each of four subunits 
(Figure 2.50). The subunits come together to form a nearly spherical shell
that encloses the viral genome.


Figure 2.49 The α2β2 tetramer of human hemoglobin. The structure of the two identical
α subunits (red) is similar to but not identical with that of the two identical β subunits (yellow). The
molecule contains four heme groups (gray with the iron atom shown in purple). (A) The ribbon
diagram highlights the similarity of the subunits and shows that they are composed mainly of
α helices. (B) The space-filling model illustrates how the heme groups occupy crevices in the
protein. [Drawn from 1A3N.pdb.]

Figure 2.50 Complex quaternary
structure. The coat of human rhinovirus, the
cause of the common cold, comprises 60
copies of each of four subunits. The three
most prominent subunits are shown as
different colors.


2.6 The Amino Acid Sequence of a Protein Determines 
Its Three-Dimensional Structure 


How is the elaborate three-dimensional structure of proteins attained? The 
classic work of Christian Anfinsen in the 1950s on the enzyme ribonuclease 
revealed the relation between the amino acid sequence of a protein and its 
conformation. Ribonuclease is a single polypeptide chain consisting of 
124 amino acid residues cross-linked by four disulfide bonds (Figure 2.51). 
Anfinsen’s plan was to destroy the three-dimensional structure of the 
enzyme and to then determine what conditions were required to restore the 
structure. 

Agents such as urea or guanidinium chloride effectively disrupt a
protein’s noncovalent bonds. Although the mechanism of action of these
agents is not fully understood, computer simulations suggest that they
replace water as the molecule solvating the protein and are then able to
disrupt the van der Waals interactions stabilizing the protein structure. The
disulfide bonds can be cleaved reversibly by reducing them with a reagent
such as β-mercaptoethanol (Figure 2.52). In the presence of a large excess
of β-mercaptoethanol, the disulfides (cystines) are fully converted into
sulfhydryls (cysteines).

Figure 2.51 Amino acid sequence of bovine ribonuclease. The four disulfide
bonds are shown in color. [After C. H. W. Hirs, S. Moore, and W. H. Stein,
J. Biol. Chem. 235 (1960):633-647.]

[Structures shown: urea; guanidinium chloride; β-mercaptoethanol]

Figure 2.52 Role of β-mercaptoethanol in reducing disulfide bonds. Note that, as the
disulfides are reduced, the β-mercaptoethanol is oxidized and forms dimers.


Most polypeptide chains devoid of cross-links assume a random-coil 
conformation in 8 M urea or 6 M guanidinium chloride. When ribonuclease 
was treated with β-mercaptoethanol in 8 M urea, the product was a fully
reduced, randomly coiled polypeptide chain devoid of enzymatic activity. 
When a protein is converted into a randomly coiled peptide without its 
normal activity, it is said to be denatured (Figure 2.53). 

Figure 2.53 Reduction and denaturation of ribonuclease. [Scheme: native
ribonuclease, treated with 8 M urea and β-mercaptoethanol, gives denatured
reduced ribonuclease.]

Figure 2.54 Reestablishing correct disulfide pairing. Native ribonuclease can
be re-formed from scrambled ribonuclease in the presence of a trace of
β-mercaptoethanol.

Anfinsen then made the critical observation that the denatured
ribonuclease, freed of urea and β-mercaptoethanol by dialysis, slowly regained
enzymatic activity. He immediately perceived the significance of this
chance finding: the sulfhydryl groups of the denatured enzyme became 
oxidized by air, and the enzyme spontaneously refolded into a catalytically 
active form. Detailed studies then showed that nearly all the original enzy- 
matic activity was regained if the sulfhydryl groups were oxidized under 
suitable conditions. All the measured physical and chemical properties of 
the refolded enzyme were virtually identical with those of the native 
enzyme. These experiments showed that the information needed to specify the 
catalytically active structure of ribonuclease is contained in its amino acid 
sequence. Subsequent studies have established the generality of this central 
principle of biochemistry: sequence specifies conformation. The dependence 
of conformation on sequence is especially significant because of the intimate 
connection between conformation and function. 

A quite different result was obtained when reduced ribonuclease was 
reoxidized while it was still in 8 M urea and the preparation was then 
dialyzed to remove the urea. Ribonuclease reoxidized in this way had only 
1% of the enzymatic activity of the native protein. Why were the outcomes 
so different when reduced ribonuclease was reoxidized in the presence and 
absence of urea? The reason is that the wrong disulfides formed pairs in 
urea. There are 105 different ways of pairing eight cysteine residues to
form four disulfides; only one of these combinations is enzymatically active. 
The 104 wrong pairings have been picturesquely termed “scrambled” ribo- 
nuclease. Anfinsen found that scrambled ribonuclease spontaneously 
converted into fully active, native ribonuclease when trace amounts of 
β-mercaptoethanol were added to an aqueous solution of the protein
(Figure 2.54). The added β-mercaptoethanol catalyzed the rearrangement
of disulfide pairings until the native structure was regained in about 
10 hours. This process was driven by the decrease in free energy as the 
scrambled conformations were converted into the stable, native conformation of 
the enzyme. The native disulfide pairings of ribonuclease thus contribute to 
the stabilization of the thermodynamically preferred structure. 
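
The figure of 105 pairings quoted above can be checked with a short counting argument: the first cysteine can choose any of 7 partners, the next unpaired cysteine any of 5, and so on. The following Python sketch is only an illustration; the function name `count_disulfide_pairings` is made up for this example and does not come from the text.

```python
from math import prod


def count_disulfide_pairings(n_cysteines: int) -> int:
    """Count the ways to pair every cysteine with exactly one partner."""
    if n_cysteines < 0 or n_cysteines % 2 != 0:
        raise ValueError("an even, non-negative number of cysteines is required")
    # First cysteine: (n - 1) possible partners; next free cysteine: (n - 3); ...
    return prod(range(n_cysteines - 1, 0, -2))


pairings = count_disulfide_pairings(8)   # 7 * 5 * 3 * 1
print(pairings)
# Share of the native pairing if all pairings were equally likely
print(f"{1 / pairings:.3%}")
```

If every pairing were equally likely, the native one would make up a little under 1% of the mixture, which is of the same order as the roughly 1% activity reported for ribonuclease reoxidized in urea. Reading the 1% figure this way is an interpretation offered here, not a claim made in the text.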

Similar refolding experiments have been performed on many other pro- 
teins. In many cases, the native structure can be generated under suitable 
conditions. For other proteins, however, refolding does not proceed effi- 
ciently. In these cases, the unfolding protein molecules usually become 
tangled up with one another to form aggregates. Inside cells, proteins called 
chaperones block such illicit interactions. Additionally, it is now evident that 
some proteins do not assume a defined structure until they interact with 
molecular partners, as we will see shortly. 


Amino acids have different propensities for forming 
alpha helices, beta sheets, and beta turns 


How does the amino acid sequence of a protein specify its three-dimensional
structure? How does an unfolded polypeptide chain acquire the form of
the native protein? These fundamental questions in biochemistry can be
approached by first asking a simpler one: What determines whether a
particular sequence in a protein forms an α helix, a β strand, or a turn? One
source of insight is to examine the frequency of occurrence of particular
amino acid residues in these secondary structures (Table 2.3). Residues
such as alanine, glutamate, and leucine tend to be present in α helices,
whereas valine and isoleucine tend to be present in β strands. Glycine,
asparagine, and proline have a propensity for being present in turns.

Table 2.3 Relative frequencies of amino acid residues in secondary structures

Amino acid    α helix    β sheet    Reverse turn
Glu           1.59       0.52       1.01
Ala           1.41       0.72       0.82
Leu           1.34       1.22       0.57
Met           1.30       1.14       0.52
Gln           1.27       0.98       0.84
Lys           1.23       0.69       1.07
Arg           1.21       0.84       0.90
His           1.05       0.80       0.81

Val           0.90       1.87       0.41
Ile           1.09       1.67       0.47
Tyr           0.74       1.45       0.76
Cys           0.66       1.40       0.54
Trp           1.02       1.35       0.65
Phe           1.16       1.33       0.59
Thr           0.76       1.17       0.96

Gly           0.43       0.58       1.77
Asn           0.76       0.48       1.34
Pro           0.34       0.31       1.32
Ser           0.57       0.96       1.22
Asp           0.99       0.39       1.24

Note: The amino acids are grouped according to their preference for α helices (top group), β sheets (middle
group), or turns (bottom group).

Source: T. E. Creighton, Proteins: Structures and Molecular Properties, 2d ed. (W. H. Freeman and Company,
1992), p. 256.

Studies of proteins and synthetic peptides have revealed some reasons
for these preferences. The α helix can be regarded as the default
conformation. Branching at the β-carbon atom, as in valine, threonine, and
isoleucine, tends to destabilize α helices because of steric clashes. These
residues are readily accommodated in β strands, in which their side chains
project out of the plane containing the main chain. Serine, aspartate, and
asparagine tend to disrupt α helices because their side chains contain
hydrogen-bond donors or acceptors in close proximity to the main chain, where
they compete for main-chain NH and CO groups. Proline tends to disrupt both
α helices and β strands because it lacks an NH group and because its ring
structure restricts its φ value to near −60 degrees. Glycine readily fits into all
structures and for that reason does not favor helix formation in particular.
Can we predict the secondary structure of a protein by using this
knowledge of the conformational preferences of amino acid
residues? Accurate predictions of secondary structure
adopted by even a short stretch of residues have proved to
be difficult. What stands in the way of more-accurate
prediction? Note that the conformational preferences of amino
acid residues are not tipped all the way to one structure (see
Table 2.3). For example, glutamate, one of the strongest
helix formers, prefers α helix to β strand by only a factor of
about three (1.59 versus 0.52 in Table 2.3). The preference ratios
of most other residues are smaller.
Indeed, some penta- and hexapeptide sequences have
been found to adopt one structure in one protein and an
entirely different structure in another (Figure 2.55).
Hence, some amino acid sequences do not uniquely
determine secondary structure. Tertiary interactions
(interactions between residues that are far apart in the
sequence) may be decisive in specifying the secondary
structure of some segments. The context is often crucial in
determining the conformational outcome. The conformation
of a protein evolved to work in a particular environment
or context. Substantial improvements in secondary
structure prediction can be achieved by using families of related sequences,
each of which adopts the same structure.

Figure 2.55 Alternative conformations of a peptide sequence. Many sequences
can adopt alternative conformations in different proteins. Here the sequence
VDLLKN shown in red assumes an α helix in one protein context (left) and a
β strand in another (right). [Drawn from (left) 3WRP.pdb and (right) 2HLA.pdb]

Figure 2.56 Transition from folded to unfolded state. Most proteins show a
sharp transition from the folded to the unfolded form on treatment with
increasing concentrations of denaturants.

Figure 2.57 Components of a partly denatured protein solution.
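
To see why residue preferences alone give weak predictions, one can average the Table 2.3 values over the hexapeptide VDLLKN from Figure 2.55. The sketch below is a deliberately naive illustration, not a real prediction method. The averaging scheme and the name `mean_propensities` are assumptions made for this example, and the Lys helix value (1.23) is taken as the intended reading of the table entry.

```python
# (alpha-helix, beta-sheet) relative frequencies from Table 2.3,
# restricted to the residues that occur in VDLLKN.
PROPENSITY = {
    "V": (0.90, 1.87),
    "D": (0.99, 0.39),
    "L": (1.34, 1.22),
    "K": (1.23, 0.69),  # helix value assumed to read 1.23 in the table
    "N": (0.76, 0.48),
}


def mean_propensities(peptide: str) -> tuple[float, float]:
    """Average helix and sheet propensities over a peptide (naive scoring)."""
    helix = sum(PROPENSITY[res][0] for res in peptide) / len(peptide)
    sheet = sum(PROPENSITY[res][1] for res in peptide) / len(peptide)
    return helix, sheet


helix_score, sheet_score = mean_propensities("VDLLKN")
print(f"helix {helix_score:.2f}, sheet {sheet_score:.2f}")
```

Both averages come out close to 1, meaning the peptide has no strong intrinsic bias either way. That is consistent with the observation that the same sequence forms a helix in one protein and a strand in another, with the context deciding.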


Protein folding is a highly cooperative process 


Proteins can be denatured by any treatment that disrupts the weak bonds 
stabilizing tertiary structure, such as heating, or by chemical denaturants 
such as urea or guanidinium chloride. For many proteins, a comparison of 
the degree of unfolding as the concentration of denaturant increases reveals 
a sharp transition from the folded, or native, form to the unfolded, or
denatured, form, suggesting that only these two conformational states are present
to any significant extent (Figure 2.56). A similar sharp transition is observed
if denaturants are removed from unfolded proteins, allowing the proteins to 
fold. 

The sharp transition seen in Figure 2.56 suggests that protein folding 
and unfolding is an “all or none” process that results from a cooperative tran- 
sition. For example, suppose that a protein is placed in conditions under 
which some part of the protein structure is thermodynamically unstable. 
As this part of the folded structure is disrupted, the interactions between it 
and the remainder of the protein will be lost. The loss of these interactions, 
in turn, will destabilize the remainder of the structure. Thus, conditions 
that lead to the disruption of any part of a protein structure are likely to 
unravel the protein completely. The structural properties of proteins pro- 
vide a clear rationale for the cooperative transition. 
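
A sharp, sigmoidal transition of this kind is what a simple two-state equilibrium produces. The sketch below assumes a linear dependence of the unfolding free energy on denaturant concentration, a common modeling choice that is not stated in the text. The function name and the parameter values (`dg_water`, `m_value`) are illustrative assumptions, not measured data.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ mol^-1 K^-1


def fraction_unfolded(denaturant_molar, dg_water=42.0, m_value=10.5, temp_k=298.0):
    """Two-state fraction unfolded, assuming dG = dg_water - m_value * [denaturant]."""
    dg_unfold = dg_water - m_value * denaturant_molar   # kJ/mol; positive favors folded
    k_unfold = math.exp(-dg_unfold / (R_KJ * temp_k))    # K = [unfolded] / [folded]
    return k_unfold / (1.0 + k_unfold)


for conc in (0.0, 3.0, 3.5, 4.0, 4.5, 5.0, 8.0):
    print(f"{conc:4.1f} M denaturant: fraction unfolded = {fraction_unfolded(conc):.3f}")
```

With these assumed parameters the midpoint lies at 4 M denaturant, and the population shifts from almost entirely folded to almost entirely unfolded within about 1 M on either side.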

The consequences of cooperative folding can be illustrated by considering
the contents of a protein solution under conditions corresponding to the
middle of the transition between the folded and the unfolded forms.
Under these conditions, the protein is “half folded.” Yet the solution will
appear to have no partly folded molecules but, instead, look as if it is a
50/50 mixture of fully folded and fully unfolded molecules (Figure 2.57).
Although the protein may appear to behave as if it exists in only two
states, this simple two-state existence is an impossibility at a molecular
level. Even simple reactions go through reaction intermediates, and so a
complex molecule such as a protein cannot simply switch from a completely
unfolded state to the native state in one step. Unstable, transient
intermediate structures must exist between the native and denatured state
(p. 53). Determining the nature of these intermediate structures is an
intense area of biochemical research.

[Figure 2.57 caption, continued: In a half-unfolded protein solution, half
the molecules are fully folded and half are fully unfolded.]


Proteins fold by progressive stabilization of intermediates rather than 
by random search 


How does a protein make the transition from an unfolded structure to a 
unique conformation in the native form? One possibility a priori would be 
that all possible conformations are tried out to find the energetically most 
favorable one. How long would such a random search take? Consider a 
small protein with 100 residues. Cyrus Levinthal calculated that, if each 
residue can assume three different conformations, the total number of
structures would be $3^{100}$, which is equal to $5 \times 10^{47}$. If it takes $10^{-13}$ s to convert
one structure into another, the total search time would be $5 \times 10^{47} \times 10^{-13}$ s,
which is equal to $5 \times 10^{34}$ s, or $1.6 \times 10^{27}$ years. Clearly, it would take much
too long for even a small protein to fold properly by randomly trying out all 
possible conformations. The enormous difference between calculated and 


actual folding times is called Levinthal’s paradox. This paradox clearly 
reveals that proteins do not fold by trying every possible conformation; 
instead, they must follow at least a partly defined folding pathway consist- 
ing of intermediates between the fully denatured protein and its native 
structure. 
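
Levinthal's estimate is easy to reproduce. The sketch below uses only the numbers given above (100 residues, three conformations per residue, $10^{-13}$ s per conversion). The function name and the 365.25-day year are choices made for this illustration.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def levinthal_search_time(residues=100, conformations_per_residue=3,
                          seconds_per_step=1e-13):
    """Return (number of structures, search time in s, search time in years)."""
    structures = conformations_per_residue ** residues
    seconds = structures * seconds_per_step
    return structures, seconds, seconds / SECONDS_PER_YEAR


structures, seconds, years = levinthal_search_time()
print(f"{structures:.1e} structures, {seconds:.1e} s, {years:.1e} years")
```

The result is many orders of magnitude longer than the times on which real proteins fold, which is the point of the paradox.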

The way out of this paradox is to recognize the power of cumulative selec- 
tion. Richard Dawkins, in The Blind Watchmaker, asked how long it would 
take a monkey poking randomly at a typewriter to reproduce Hamlet’s 
remark to Polonius, “Methinks it is like a weasel” (Figure 2.58). An
astronomically large number of keystrokes, of the order of $10^{40}$, would be
required. However, suppose that we preserved each correct character and 
allowed the monkey to retype only the wrong ones. In this case, only a few 
thousand keystrokes, on average, would be needed. The crucial difference 
between these cases is that the first employs a completely random search, 
whereas, in the second, partly correct intermediates are retained. 
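
The contrast between the two search strategies can be simulated directly. The sketch below is one possible reading of the thought experiment, under stated assumptions: a 27-key typewriter (26 letters plus the space bar), a fixed random seed, and a rule that every still-incorrect position is retyped on each pass. None of these details come from the text.

```python
import random
import string

TARGET = "METHINKS IT IS LIKE A WEASEL"
ALPHABET = string.ascii_uppercase + " "   # assumed 27-key typewriter


def cumulative_selection(target=TARGET, seed=0):
    """Retype only the wrong characters until the target line is reproduced."""
    rng = random.Random(seed)
    current = [rng.choice(ALPHABET) for _ in target]
    keystrokes = len(target)
    while "".join(current) != target:
        for i, wanted in enumerate(target):
            if current[i] != wanted:          # correct characters are preserved
                current[i] = rng.choice(ALPHABET)
                keystrokes += 1
    return "".join(current), keystrokes


line, keystrokes = cumulative_selection()
print(line, "reached after", keystrokes, "keystrokes")
print(f"distinct lines of this length: about {len(ALPHABET) ** len(TARGET):.0e}")
```

The number of distinct 28-character lines is of the order of $10^{40}$, whereas retaining correct characters brings the keystroke count down to a manageable number. The exact count depends on the seed and on the retyping rule assumed here.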

The essence of protein folding is the tendency to retain partly correct inter- 
mediates. However, the protein-folding problem is much more difficult than 
the one presented to our simian Shakespeare. First, the criterion of correct- 
ness is not a residue-by-residue scrutiny of conformation by an omniscient 
observer but rather the total free energy of the transient species. Second, 
proteins are only marginally stable. The free-energy difference between 
the folded and the unfolded states of a typical 100-residue protein is 
42 kJ mol$^{-1}$ (10 kcal mol$^{-1}$), and thus each residue contributes on
average only 0.42 kJ mol$^{-1}$ (0.1 kcal mol$^{-1}$) of energy to maintain the folded
state. This amount is less than the amount of thermal energy, which is
2.5 kJ mol$^{-1}$ (0.6 kcal mol$^{-1}$) at room temperature. This meager
stabilization energy means that correct intermediates, especially those formed early
in folding, can be lost. The analogy is that the monkey would be somewhat 
free to undo its correct keystrokes. Nonetheless, the interactions that lead to 
cooperative folding can stabilize intermediates as structure builds up. Thus, 
local regions that have significant structural preference, though not neces- 
sarily stable on their own, will tend to adopt their favored structures and, as 
they form, can interact with one another, leading to increasing stabilization.
This conceptual framework is often referred to as the nucleation-condensation 
model. 
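
The comparison between per-residue stabilization and thermal energy is a short piece of arithmetic. The sketch below takes the thermal energy as RT and room temperature as 298 K; both are assumptions for this illustration, as are the function names.

```python
GAS_CONSTANT = 8.314   # J mol^-1 K^-1
ROOM_TEMP_K = 298.0    # assumed value for "room temperature"


def per_residue_stabilization(total_kj_per_mol=42.0, residues=100):
    """Average contribution of one residue to folded-state stability (kJ/mol)."""
    return total_kj_per_mol / residues


def thermal_energy_kj(temp_k=ROOM_TEMP_K):
    """Thermal energy RT in kJ/mol."""
    return GAS_CONSTANT * temp_k / 1000.0


per_residue = per_residue_stabilization()
thermal = thermal_energy_kj()
print(f"per residue: {per_residue:.2f} kJ/mol, RT: {thermal:.2f} kJ/mol")
print(f"RT is about {thermal / per_residue:.1f} times the per-residue stabilization")
```

On these numbers, thermal energy is several times the average stabilization contributed by a single residue, which is why early intermediates are easily lost.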

A simulation of the folding of a protein, based on the nucleation-
condensation model, is shown in Figure 2.59. This model suggests that
certain pathways may be preferred. Although Figure 2.59 suggests a
discrete pathway, each of the intermediates shown represents an ensemble of
similar structures, and thus a protein follows a general rather than a precise
pathway in its transition from the unfolded to the native state. The energy
surface for the overall process of protein folding can be visualized as a
funnel (Figure 2.60). The wide rim of the funnel represents the wide range
of structures accessible to the ensemble of denatured protein molecules. As
the free energy of the population of protein molecules decreases, the
proteins move down into narrower parts of the funnel and fewer conformations
are accessible. At the bottom of the funnel is the folded state with
its well-defined conformation. Many paths can lead to this same
energy minimum.

Figure 2.58 Typing-monkey analogy. A monkey randomly poking a typewriter could
write a line from Shakespeare’s Hamlet, provided that correct keystrokes were
retained. In the two computer simulations shown, the cumulative number of
keystrokes is given at the left of each line.

Figure 2.59 Proposed folding pathway of chymotrypsin inhibitor. Local regions with
sufficient structural preference tend to adopt their favored structures initially (1). These structures
come together to form a nucleus with a nativelike, but still mobile, structure (4). This structure then
fully condenses to form the native, more rigid structure (5). [From A. R. Fersht and V. Daggett. Cell
108:573-582, 2002; with permission from Elsevier.]
[Panels (1) to (5) run from denatured to native.]

[Figure 2.60 label: Beginning of helix formation and collapse]


Prediction of three-dimensional structure from sequence
remains a great challenge


The prediction of three-dimensional structure from sequence
has proved to be extremely difficult. As we have seen, the local
sequence appears to determine only between 60 and 70% of the
secondary structure; long-range interactions are required to fix
the full secondary structure and the tertiary structure.
Investigators are exploring two fundamentally different
approaches to predicting three-dimensional structure from
amino acid sequence. The first is ab initio (Latin, “from the
beginning”) prediction, which attempts to predict the folding of
an amino acid sequence without prior knowledge about similar
sequences in known protein structures. Computer-based
calculations are employed that attempt to minimize the free energy of
a structure with a given amino acid sequence or to simulate the
folding process. The utility of these methods is limited by the
vast number of possible conformations, the marginal stability of
proteins, and the subtle energetics of weak interactions in
aqueous solution. The second approach takes advantage of our
growing knowledge of the three-dimensional structures of many
proteins. In these knowledge-based methods, an amino acid
sequence of unknown structure is examined for compatibility
with known protein structures or fragments therefrom. If a
significant match is detected, the known structure can be used as an initial
model. Knowledge-based methods have been a source of many insights into
the three-dimensional conformation of proteins of known sequence but
unknown structure.

Figure 2.60 Folding funnel. The folding funnel depicts the
thermodynamics of protein folding. The top of the funnel
represents all possible denatured conformations, that is,
maximal conformational entropy. Depressions on the sides of
the funnel represent semistable intermediates that can facilitate
or hinder the formation of the native structure, depending on
their depth. Secondary structures, such as helices, form and
collapse onto one another to initiate folding. [After D. L. Nelson
and M. M. Cox, Lehninger Principles of Biochemistry, 5th ed.
(W. H. Freeman and Company, 2008), p. 143.]
[Figure labels: Percentage of residues of protein in native conformation
(up to 100); Discrete folding intermediates; Native structure]


Some proteins are inherently unstructured and can exist 
in multiple conformations 


The discussion of protein folding thus far is based on the paradigm that a given 
protein amino acid sequence will fold into a particular three-dimensional 
structure. This paradigm holds well for many proteins, such as enzymes and 
transport proteins. However, it has been known for some time that some 
proteins can adopt two different structures, one of which results in protein 
aggregation and pathological conditions (p. 55). Such alternate structures 
originating from a unique amino acid sequence were thought to be rare, the 
exception to the paradigm. Recent work has called into question the univer- 
sality of the idea that each amino acid sequence gives rise to one structure 
for certain proteins, even under normal cellular conditions. 

Our first example is a class of proteins referred to as intrinsically unstruc- 
tured proteins (IUPs). As the name suggests, these proteins, completely or in 
part, do not have a discrete three-dimensional structure under physiological 
conditions. Indeed, an estimated 50% of eukaryotic proteins have at least 
one unstructured region greater than 30 amino acids in length. Unstructured
regions are rich in charged and polar amino acids with few hydrophobic
residues. These proteins assume a defined structure on interaction with
other proteins. This molecular versatility means that one protein can assume
different structures and interact with different partners, yielding
different biochemical functions. IUPs appear to be especially important in
signaling and regulatory pathways.

Figure 2.61 Lymphotactin exists in two conformations, which are in equilibrium. [R. L.
Tuinstra, F. C. Peterson, S. Kutlesa, E. S. Elgin, M. A. Kron, and B. F. Volkman. Proc. Natl. Acad. Sci. U.S.A.
105:5057-5062, 2008, Fig. 2A]
[Panel labels: Chemokine structure; Glycosaminoglycan-binding structure]

Another class of proteins that do not adhere to the paradigm are meta- 
morphic proteins. These proteins appear to exist in an ensemble of structures 
of approximately equal energy that are in equilibrium. Small molecules or 
other proteins may bind to a particular member of the ensemble, resulting 
in a complex having a biochemical function that differs from that of another
complex formed by the same metamorphic protein bound to a different
partner. An especially clear example of a metamorphic protein is the
cytokine lymphotactin. Cytokines are signal molecules in the immune
system that bind to receptor proteins on the surface of immune-system cells,
instigating an immunological response. Lymphotactin exists in two very
different structures that are in equilibrium (Figure 2.61). One structure is
characteristic of chemokines, consisting of a three-stranded β sheet and a
carboxyl-terminal helix. This structure binds to its receptor and activates it.
The alternative structure is an identical dimer of all β sheets. When in this
structure, lymphotactin binds to glycosaminoglycan, a complex
carbohydrate (Chapter 11). The biochemical activities of each structure are
mutually exclusive: the chemokine structure cannot bind the glycosaminoglycan,
and the β-sheet structure cannot activate the receptor. Yet, remarkably, both
activities are required for full biochemical activity of the cytokine.

Note that IUPs and metamorphic proteins effectively expand the
protein-encoding capacity of the genome. In some cases, a gene can encode
a single protein that has more than one structure and function. These 
examples also illustrate the dynamic nature of the study of biochemistry 
and its inherent excitement: even well-established ideas are often subject 
to modifications. 


Protein misfolding and aggregation are associated with some 
neurological diseases 


Understanding protein folding and misfolding is of more than academic
interest. A host of diseases, including Alzheimer disease, Parkinson
disease, Huntington disease, and transmissible spongiform encephalopathies
(prion disease), are associated with improperly folded proteins. All of these
diseases result in the deposition of protein aggregates, called amyloid fibrils
or plaques. These diseases are consequently referred to as amyloidoses. A
common feature of amyloidoses is that normally soluble proteins are
converted into insoluble fibrils rich in β sheets. The correctly folded protein is
only marginally more stable than the incorrect form. But the incorrect form
aggregates, pulling more correct forms into the incorrect form. We will
focus on the transmissible spongiform encephalopathies.

One of the great surprises in modern medicine was that certain
infectious neurological diseases were found to be transmitted by agents that were
similar in size to viruses but consisted only of protein. These diseases
include bovine spongiform encephalopathy (commonly referred to as mad cow
disease) and the analogous diseases in other organisms, including
Creutzfeldt-Jakob disease (CJD) in human beings, scrapie in sheep, and chronic
wasting disease in deer and elk. The agents causing these diseases are termed
prions. Prions are composed largely or completely of a cellular protein called
PrP, which is normally present in the brain but whose function has not been
identified. Indeed, mice lacking PrP display normal phenotypes. The infectious
prions are aggregated forms of the PrP protein termed PrP$^{SC}$.

How does the structure of the protein in the aggregated form differ from
that of the protein in its normal state in the brain? The structure of the
normal cellular protein PrP contains extensive regions of α helix and
relatively little β-strand structure. The structure of the form of the protein
present in infected brains, termed PrP$^{SC}$, has not yet been determined
because of challenges posed by its insoluble and heterogeneous nature.
However, a variety of evidence indicates that some parts of the protein that
had been in α-helical or turn conformations have been converted into
β-strand conformations (Figure 2.62). The β strands of largely planar
monomers stack on one another with their side chains tightly interwoven. A
side view shows the extensive network of hydrogen bonds between the
monomers. These fibrous protein aggregates are often referred to as amyloid
forms.

Figure 2.62 A model of the human prion protein amyloid. A detailed
model of a human prion amyloid fibril deduced from spin labeling and
electron paramagnetic resonance (EPR) spectroscopy studies shows that
protein aggregation is due to the formation of large parallel β sheets. The
arrow indicates the long axis of the fibril. [N. J. Cobb, F. D. Sonnichsen,
H. Mchaourab, and W. K. Surewicz. Proc. Natl. Acad. Sci. U.S.A. 104:
18946-18951, 2007, Fig. 4E.]

With the realization that the infectious agent in prion diseases is an
aggregated form of a protein that is already present in the brain, a model
for disease transmission emerges (Figure 2.63). Protein aggregates built of
abnormal forms of PrP act as nuclei to which other PrP molecules attach.
Prion diseases can thus be transferred from one individual organism to
another through the transfer of an aggregated nucleus, as likely happened in
the mad cow disease outbreak in the United Kingdom in the 1990s. Cattle fed
on animal feed containing material from diseased cows developed the disease
in turn.

Figure 2.63 The protein-only model for prion-disease transmission. A nucleus
consisting of proteins in an abnormal conformation grows by the addition of
proteins from the normal pool.
[Figure label: PrP$^{SC}$ nucleus]

Amyloid fibers are also seen in the brains of patients with certain nonin- 
fectious neurodegenerative diseases such as Alzheimer and Parkinson dis- 
eases. For example, the brains of patients with Alzheimer disease contain 
protein aggregates called amyloid plaques that consist primarily of a single 
polypeptide termed Aβ. This polypeptide is derived from a cellular protein
called amyloid precursor protein (APP) through the action of specific
proteases. Polypeptide Aβ is prone to form insoluble aggregates. Despite the
difficulties posed by the protein’s insolubility, a detailed structural model
for Aβ has been derived through the use of NMR techniques that can be
applied to solids rather than to materials in solution. As expected, the
structure is rich in β strands, which come together to form extended parallel
β-sheet structures (see Figure 2.63).

How do such aggregates lead to the death of the cells that harbor them? 
The answer is still controversial. One hypothesis is that the large aggregates 
themselves are not toxic but, instead, smaller aggregates of the same proteins 
may be the culprits, perhaps damaging cell membranes. 


Protein modification and cleavage confer new capabilities 


Proteins are able to perform numerous functions that rely solely on
the versatility of their 20 amino acids. In addition, many proteins are
covalently modified, through the attachment of groups other than amino 
acids, to augment their functions (Figure 2.64). For example, acetyl groups 
are attached to the amino termini of many proteins, a modification that 
makes these proteins more resistant to degradation. As discussed earlier 
(p. 44), the addition of hydroxyl groups to many proline residues stabilizes 
fibers of newly synthesized collagen. The biological significance of this 
modification is evident in the disease scurvy: a deficiency of vitamin C 
results in insufficient hydroxylation of collagen, and the abnormal collagen 
fibers that result are unable to maintain normal tissue strength. Another 
specialized amino acid produced by a finishing touch is γ-carboxyglutamate.
In vitamin K deficiency, insufficient carboxylation of glutamate in pro- 
thrombin, a clotting protein, can lead to hemorrhage (Chapter 10). Many 
proteins, especially those that are present on the surfaces of cells or are 
secreted, acquire carbohydrate units on specific asparagine residues (see 
Chapter 11). The addition of sugars makes the proteins more hydrophilic 
and able to participate in interactions with other proteins. Conversely, the 
addition of a fatty acid to an α-amino group or a cysteine sulfhydryl group
produces a more hydrophobic protein. 
Figure 2.64 Finishing touches. Some common and important covalent modifications of amino
acid side chains are shown.

Figure 2.65 Chemical rearrangement in GFP. (A) The structure of green fluorescent
protein (GFP). The rearrangement and oxidation of the sequence Ser-Tyr-Gly is the source of
fluorescence. (B) Fluorescence micrograph of a four-cell embryo (cells are outlined) from the
roundworm Caenorhabditis elegans containing a protein, PIE-1, labeled with GFP. The protein is
expressed only in the cell (top) that will give rise to the germ line. [(A) Drawn from 1GFL.pdb;
(B) courtesy of Dr. Geraldine Seydoux.]


Many hormones, such as epinephrine (adrenaline), alter the activities of 
enzymes by stimulating the phosphorylation of the hydroxyl amino acids 
serine and threonine; phosphoserine and phosphothreonine are the most ubiqui- 
tous modified amino acids in proteins. Growth factors such as insulin act by 
triggering the phosphorylation of the hydroxyl group of tyrosine residues to 
form phosphotyrosine. The phosphoryl groups on these three modified amino 
acids are readily removed; thus the modified amino acids are able to act as 
reversible switches in regulating cellular processes. The roles of phosphoryla- 
tion in signal transduction will be discussed extensively in Chapter 14. 

The preceding modifications consist of the addition of special groups to 
amino acids. Other special groups are generated by chemical rearrange- 
ments of side chains and, sometimes, the peptide backbone. For example, 
certain jellyfish produce a green fluorescent protein (Figure 2.65). The 
source of the fluorescence is a group formed by the spontaneous rearrange- 
ment and oxidation of the sequence Ser-Tyr-Gly within the center of the 
protein. This protein is of great utility to researchers as a marker within cells. 

Finally, many proteins are cleaved and trimmed after synthesis. For 
example, digestive enzymes are synthesized as inactive precursors that can 
be stored safely in the pancreas. After release into the intestine, these pre- 
cursors become activated by peptide-bond cleavage (Section 10.4). In blood 
clotting, peptide-bond cleavage converts soluble fibrinogen into insoluble 
fibrin. A number of polypeptide hormones, such as adrenocorticotropic 
hormone, arise from the splitting of a single large precursor protein. 
Likewise, many viral proteins are produced by the cleavage of large 
polyprotein precursors. We shall encounter many more examples of 
modification and cleavage as essential features of protein formation and 
function. Indeed, these finishing touches account for much of the versatility, 
precision, and elegance of protein action and regulation. 


Summary 

Protein structure can be described at four levels. The primary structure 
refers to the amino acid sequence. The secondary structure refers to the 
conformation adopted by local regions of the polypeptide chain. 
Tertiary structure describes the overall folding of the polypeptide chain. 
Finally, quaternary structure refers to the specific association of multiple 
polypeptide chains to form multisubunit complexes. 


2.1 Proteins Are Built from a Repertoire of 20 Amino Acids 

Proteins are linear polymers of amino acids. Each amino acid consists of 
a central tetrahedral carbon atom linked to an amino group, a carboxy- 
lic acid group, a distinctive side chain, and a hydrogen atom. These 
tetrahedral centers, with the exception of that of glycine, are chiral; only 
the L isomer exists in natural proteins. All natural proteins are con- 
structed from the same set of 20 amino acids. The side chains of these 
20 building blocks vary tremendously in size, shape, and the presence 
of functional groups. They can be grouped as follows: (1) hydrophobic 
side chains, including the aliphatic amino acids (glycine, alanine, 
valine, leucine, isoleucine, methionine, and proline) and the aromatic 
amino acids (phenylalanine and tryptophan); (2) polar side chains, including 
hydroxyl-containing side chains (serine, threonine, and tyrosine), 
the sulfhydryl-containing cysteine, and carboxamide-containing side 
chains (asparagine and glutamine); (3) basic side chains (lysine, arginine, 
and histidine); and (4) acidic side chains (aspartic acid and glutamic 
acid). These groupings are somewhat arbitrary and many other 
sensible groupings are possible. 
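
The grouping just described can be written down as a small lookup table. The Python sketch below is only an illustration: the names `SIDE_CHAIN_GROUPS` and `classify` are hypothetical, and the assignments simply follow this summary's grouping, which is one of several sensible choices.

```python
# Side-chain groups as listed in this summary, keyed by three-letter abbreviation.
# The group labels and the dictionary layout are assumptions made for illustration.
SIDE_CHAIN_GROUPS = {
    "hydrophobic (aliphatic)": ["Gly", "Ala", "Val", "Leu", "Ile", "Met", "Pro"],
    "hydrophobic (aromatic)": ["Phe", "Trp"],
    "polar (hydroxyl)": ["Ser", "Thr", "Tyr"],
    "polar (sulfhydryl)": ["Cys"],
    "polar (carboxamide)": ["Asn", "Gln"],
    "basic": ["Lys", "Arg", "His"],
    "acidic": ["Asp", "Glu"],
}


def classify(residue):
    """Return the group name for a three-letter amino acid abbreviation."""
    for group, members in SIDE_CHAIN_GROUPS.items():
        if residue in members:
            return group
    raise KeyError(f"unknown amino acid: {residue}")


# Every one of the 20 amino acids should appear exactly once.
total = sum(len(members) for members in SIDE_CHAIN_GROUPS.values())
print(total)
print(classify("Ser"))
```

A different textbook might, for example, place tyrosine with the aromatic side chains; changing the table is all that is needed to reflect such a choice.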


2.2 Primary Structure: Amino Acids Are Linked by Peptide Bonds to Form 
Polypeptide Chains 

The amino acids in a polypeptide are linked by amide bonds formed 
between the carboxyl group of one amino acid and the amino group of 
the next. This linkage, called a peptide bond, has several important 
properties. First, it is resistant to hydrolysis, and so proteins are remark- 
ably stable kinetically. Second, the peptide group is planar because the 
C—N bond has considerable double-bond character. Third, each pep- 
tide bond has both a hydrogen-bond donor (the NH group) and a 
hydrogen-bond acceptor (the CO group). Hydrogen bonding between 
these backbone groups is a distinctive feature of protein structure. 
Finally, the peptide bond is uncharged, which allows proteins to form 
tightly packed globular structures having significant amounts of the 
backbone buried within the protein interior. Because they are linear 
polymers, proteins can be described as sequences of amino acids. Such 
sequences are written from the amino to the carboxyl terminus. 


2.3 Secondary Structure: Polypeptide Chains Can Fold into Regular 
Structures Such As the Alpha Helix, the Beta Sheet, and Turns and Loops 

Two major elements of secondary structure are the α helix and the 
β strand. In the α helix, the polypeptide chain twists into a tightly 
packed rod. Within the helix, the CO group of each amino acid is 
hydrogen bonded to the NH group of the amino acid four residues farther 
along the polypeptide chain. In the β strand, the polypeptide chain 

2.4 Tertiary Structure: Water-Soluble Proteins Fold into Compact Structures 
with Nonpolar Cores 
The compact, asymmetric structure that individual polypeptides attain 
is called tertiary structure. The tertiary structures of water-soluble pro- 
teins have features in common: (1) an interior formed of amino acids 
with hydrophobic side chains and (2) a surface formed largely of hydro- 
philic amino acids that interact with the aqueous environment. The 
hydrophobic interactions between the interior residues are the driving 
force for the formation of the tertiary structure of water-soluble pro- 
teins. Some proteins that exist in a hydrophobic environment, such as 
in membranes, display the inverse distribution of hydrophobic and 
hydrophilic amino acids. In these proteins, the hydrophobic amino 
acids are on the surface to interact with the environment, whereas the 
hydrophilic groups are shielded from the environment in the interior of 
the protein. 


2.5 Quaternary Structure: Polypeptide Chains Can Assemble into 
Multisubunit Structures 
Proteins consisting of more than one polypeptide chain display quater- 
nary structure; each individual polypeptide chain is called a subunit. 
Quaternary structure can be as simple as two identical subunits or as 
complex as dozens of different subunits. In most cases, the subunits are 
held together by noncovalent bonds. 


2.6 The Amino Acid Sequence of a Protein Determines Its 
Three-Dimensional Structure 

The amino acid sequence determines the three-dimensional structure 
and, hence, all other properties of a protein. Some proteins can be 
unfolded completely yet refold efficiently when placed under condi- 
tions in which the folded form of the protein is stable. The amino acid 
sequence of a protein is determined by the sequences of bases in a DNA 
molecule. This one-dimensional sequence information is extended into 
the three-dimensional world by the ability of proteins to fold spontane- 
ously. Protein folding is a highly cooperative process; structural inter- 
mediates between the unfolded and folded forms do not accumulate. 

Some proteins, such as intrinsically unstructured proteins and 
metamorphic proteins, do not strictly adhere to the one-sequence—one- 
structure paradigm. Because of this versatility, these proteins expand 
the protein encoding capacity of the genome. 

The versatility of proteins is further enhanced by covalent modifica- 
tions. Such modifications can incorporate functional groups not present 
in the 20 amino acids. Other modifications are important to the regula- 
tion of protein activity. Through their structural stability, diversity, and 
chemical reactivity, proteins make possible most of the key processes 
associated with life. 


APPENDIX: Visualizing Molecular Structures II: Proteins 


Scientists have developed powerful techniques for the 
determination of protein structures, as will be considered 
in Chapter 3. In most cases, these techniques allow 
the positions of the thousands of atoms within a protein 
structure to be determined. The final results from such an 
experiment include the x, y, and z coordinates for each 
atom in the structure. These coordinate files are compiled 
in the Protein Data Bank (http://www.pdb.org) from 
which they can be readily downloaded. These structures 
comprise thousands or even tens of thousands 
of atoms. The complexity of proteins with thousands 
of atoms presents a challenge for the depiction of 
their structure. Several different types of representa- 
tions are used to portray proteins, each with its 
own strengths and weaknesses. The types that you 
will see most often in this book are space-filling mod- 
els, ball-and-stick models, backbone models, and rib- 
bon diagrams. Where appropriate, structural features 
of particular importance or relevance are noted in an 
illustration’s legend. 
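
To make the idea of a coordinate file concrete, the following Python sketch reads x, y, and z values from ATOM records laid out in the fixed-column style used by Protein Data Bank files. It is a minimal illustration, not a full parser: the helper names `format_atom_record` and `parse_atom_record` are hypothetical, only a few fields are handled, and the coordinates are made up rather than taken from a real structure.

```python
import math


def format_atom_record(serial, name, res_name, chain, res_seq, x, y, z):
    """Build a minimal fixed-width ATOM record (hypothetical helper for this sketch)."""
    return (
        f"ATOM  {serial:>5}  {name:<3} {res_name:>3} {chain}{res_seq:>4}    "
        f"{x:8.3f}{y:8.3f}{z:8.3f}"
    )


def parse_atom_record(line):
    """Read the atom name, residue, chain, and coordinates from fixed columns."""
    return {
        "atom": line[12:16].strip(),     # columns 13-16
        "residue": line[17:20].strip(),  # columns 18-20
        "chain": line[21],               # column 22
        "res_seq": int(line[22:26]),     # columns 23-26
        "x": float(line[30:38]),         # columns 31-38
        "y": float(line[38:46]),         # columns 39-46
        "z": float(line[46:54]),         # columns 47-54
    }


def distance(a, b):
    """Euclidean distance between two parsed atoms, in the units of the file."""
    return math.dist((a["x"], a["y"], a["z"]), (b["x"], b["y"], b["z"]))


# Made-up coordinates (in angstroms) for two bonded backbone atoms.
n_atom = parse_atom_record(
    format_atom_record(1, "N", "LYS", "A", 1, 3.287, 10.092, 10.329))
ca_atom = parse_atom_record(
    format_atom_record(2, "CA", "LYS", "A", 1, 2.445, 10.457, 9.182))

print(n_atom)
print(round(distance(n_atom, ca_atom), 2))
```

Every representation described in this appendix is drawn from a list of atoms like this one; the models differ only in which atoms are shown and how they are rendered.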


Space-Filling Models 

Space-filling models are the most realistic type of 
representation. Each atom is shown as a sphere with a 
size corresponding to the van der Waals radius of the 
atom (Section 1.3). Bonds are not shown explicitly 
but are represented by the intersection of the spheres 
shown when atoms are closer together than the sum of 
their van der Waals radii. All atoms are shown, 
including those that make up the backbone and those 
in the side chains. A space-filling model of lysozyme 
is depicted in Figure 2.66. 

Space-filling models convey a sense of how little 
open space there is in a protein’s structure, which 
always has many atoms in van der Waals contact with 
one another. These models are particularly useful in 
showing conformational changes in a protein from 
one set of circumstances to another. A disadvantage of 
space-filling models is that the secondary and tertiary 
structures of the protein are difficult to see. Thus, these 
models are not very effective in distinguishing one pro- 
tein from another—many space-filling models of pro- 
teins look very much alike. 


Ball-and-Stick Models 

Ball-and-stick models are not as realistic as space-filling 
models. Realistically portrayed atoms occupy more 
space, determined by their van der Waals radii, than do 
the atoms depicted in ball-and-stick models. However, 
the bonding arrangement is easier to see because the 
bonds are explicitly represented as sticks (Figure 2.67). 
A ball-and-stick model reveals a complex structure 
more clearly than a space-filling model does. However, 
the depiction is so complicated that structural features 
such as α helices or potential binding sites are difficult 
to discern. 

Because space-filling and ball-and-stick models 
depict protein structures at the atomic level, the large 
number of atoms in a complex structure makes it dif- 
ficult to discern the relevant structural features. Thus, 
representations that are more schematic—such as 
backbone models and ribbon diagrams—have been 
developed for the depiction of macromolecular structures. 
In these representations, most or all atoms are 
not shown explicitly. 

Figure 2.66 Space-filling model of lysozyme. Notice how tightly 
packed the atoms are, with little unfilled space. All atoms are shown with 
the exception of hydrogen atoms. Hydrogen atoms are often omitted 
because their positions are not readily determined by x-ray 
crystallographic methods and because their omission somewhat 
improves the clarity of the structure's depiction. 

Figure 2.67 Ball-and-stick model of lysozyme. Again, hydrogen 
atoms are omitted. 


Backbone Models 

Backbone models show only the backbone atoms of a 
molecule’s polypeptide or even only the α-carbon atom 
of each amino acid. Atoms are linked by lines representing 
bonds; if only α-carbon atoms are depicted, 
lines connect α-carbon atoms of amino acids that are 
adjacent in the amino acid sequence (Figure 2.68). In 
this book, backbone models show only the lines 
connecting the α-carbon atoms; other carbon atoms are 
not depicted. 

Figure 2.68 Backbone model of lysozyme. 

A backbone model shows the overall course of the 
polypeptide chain much better than a space-filling or 
ball-and-stick model does. However, secondary struc- 
tural elements are still difficult to see. 


Ribbon Diagrams 
Ribbon diagrams are highly schematic and most commonly 
used to accent a few dramatic aspects of protein 
structure, such as the α helix (depicted as a coiled ribbon 
or a cylinder), the β strand (a broad arrow), and 
loops (thin tubes), to provide clear views of the folding 
patterns of proteins (Figure 2.69). The ribbon diagram 
allows the course of a polypeptide chain to be traced 
and readily shows the secondary structural elements. 
Thus, ribbon diagrams of proteins that are related to 
one another by evolutionary divergence appear similar 
(see Figure 6.14), whereas unrelated proteins are clearly 
distinct. 

Figure 2.69 Ribbon diagram of lysozyme. The α helices are shown 
as coiled ribbons; β strands are depicted as arrows. More irregular 
structures are shown as thin tubes. 

In this book, coiled ribbons will be generally used to 
depict α helices. However, for membrane proteins, 
which are often quite complex, cylinders will be used 
rather than coiled ribbons. This convention will also 
make membrane proteins with their membrane-spanning 
α helices easy to recognize (see Figure 12.18). 

Bear in mind that the open appearance of ribbon 
diagrams is deceptive. As noted earlier, protein struc- 
tures are tightly packed and have little open space. The 
openness of ribbon diagrams makes them particularly 
useful as frameworks in which to highlight additional 
aspects of protein structure. Active sites, substrates, 
bonds, and other structural fragments can be included 
in ball-and-stick or space-filling form within a ribbon 
diagram (Figure 2.70). 


Figure 2.70 Ribbon diagram of lysozyme with highlights. Four 
disulfide bonds and a functionally important aspartate residue are shown 
in ball-and-stick form. 


Key Terms 


side chain (R group) (p. 27) 
L amino acid (p. 27) 
dipolar ion (zwitterion) (p. 27) 
peptide bond (amide bond) (p. 33) 
disulfide bond (p. 35) 
primary structure (p. 35) 
torsion angle (p. 37) 
phi (φ) angle (p. 37) 
psi (ψ) angle (p. 37) 
Ramachandran diagram (p. 37) 
secondary structure (p. 38) 
α helix (p. 38) 
rise (translation) (p. 39) 
β pleated sheet (p. 40) 
β strand (p. 40) 
reverse turn (β turn; hairpin turn) (p. 42) 
coiled coil (p. 43) 
heptad repeat (p. 43) 
tertiary structure (p. 45) 
domain (p. 47) 
subunit (p. 48) 

Problems 


1. Identify. Examine the following four amino acids (A—D): 


[Figure: structures of four amino acids, labeled A, B, C, and D] 


What are their names, three-letter abbreviations, and one- 
letter symbols? 


2. Properties. In reference to the amino acids shown in 
Problem 1, which are associated with the following charac- 
teristics? 


(a) Hydrophobic side chain 
(b) Basic side chain 
(c) Three ionizable groups 
(d) pKa of approximately 10 in proteins 
(e) Modified form of phenylalanine 


3. Match ’em. Match each amino acid in the left-hand col- 
umn with the appropriate side-chain type in the right-hand 
column. 


(a) Leu    (1) hydroxyl-containing 
(b) Glu    (2) acidic 
(c) Lys    (3) basic 
(d) Ser    (4) sulfur-containing 
(e) Cys    (5) nonpolar aromatic 
(f) Trp    (6) nonpolar aliphatic 


4. Solubility. In each of the following pairs of amino acids, 
identify which amino acid would be more soluble in water: 


(a) Ala, Leu; (b) Tyr, Phe; (c) Ser, Ala; (d) Trp, His. 


5. Bonding is good. Which of the following amino acids 
have R groups that have hydrogen-bonding potential? Ala, 
Gly, Ser, Phe, Glu, Tyr, Ile, and Thr. 


motif (supersecondary structure) (p. 47) 
quaternary structure (p. 48) 
cooperative transition (p. 52) 
intrinsically unstructured protein (IUP) (p. 54) 
metamorphic protein (p. 55) 
prion (p. 56) 


6. Name those components. Examine the segment of a pro- 
tein shown here. 


(a) What three amino acids are present? 

(b) Of the three, which is the N-terminal amino acid? 
(c) Identify the peptide bonds. 

(d) Identify the a-carbon atoms. 


7. Who’s charged? Draw the structure of the dipeptide Gly- 
His. What is the charge on the peptide at pH 5.5? pH 7.5? 


8. Alphabet soup. How many different polypeptides of 50 
amino acids in length can be made from the 20 common 
amino acids? 


9. Sweet tooth, but calorie conscious. Aspartame (NutraSweet), 
an artificial sweetener, is a dipeptide composed of Asp-Phe 
in which the carboxyl terminus is modified by the attach- 
ment of a methyl group. Draw the structure of Aspartame 
at pH 7. 


10. Vertebrate proteins? What is meant by the term polypep- 
tide backbone? 


11. Not a sidecar. Define the term side chain in the context 
of amino acid or protein structure. 


12. One from many. Differentiate between amino acid com- 
position and amino acid sequence. 


13. Shape and dimension. (a) Tropomyosin, a 70-kd muscle 
protein, is a two-stranded α-helical coiled coil. Estimate the 
length of the molecule. (b) Suppose that a 40-residue segment 
of a protein folds into a two-stranded antiparallel β 
structure with a 4-residue hairpin turn. What is the longest 
dimension of this motif? 


14. Contrasting isomers. Poly-L-leucine in an organic solvent 
such as dioxane is α helical, whereas poly-L-isoleucine 
is not. Why do these amino acids with the same number and 
kinds of atoms have different helix-forming tendencies? 


15. Active again. A mutation that changes an alanine residue 
in the interior of a protein to valine is found to lead to a 
loss of activity. However, activity is regained when a second 
mutation at a different position changes an isoleucine resi- 
due to glycine. How might this second mutation lead to a 
restoration of activity? 


16. Shuffle test. An enzyme that catalyzes disulfide-sulfhydryl 
exchange reactions, called protein disulfide 
isomerase (PDI), has been isolated. PDI rapidly converts 
inactive scrambled ribonuclease into enzymatically active 
ribonuclease. In contrast, insulin is rapidly inactivated by 
PDI. What does this important observation imply about 
the relation between the amino acid sequence of insulin and 
its three-dimensional structure? 


17. Stretching a target. A protease is an enzyme that cata- 
lyzes the hydrolysis of the peptide bonds of target proteins. 
How might a protease bind a target protein so that its main 
chain becomes fully extended in the vicinity of the vulner- 
able peptide bond? 


18. Often irreplaceable. Glycine is a highly conserved 
amino acid residue in the evolution of proteins. Why? 


19. Potential partners. Identify the groups in a protein that 
can form hydrogen bonds or electrostatic bonds with an 
arginine side chain at pH 7. 


20. Permanent waves. The shape of hair is determined in 
part by the pattern of disulfide bonds in keratin, its major 
protein. How can curls be induced? 


21. Location is everything 1. Most proteins have hydro- 
philic exteriors and hydrophobic interiors. Would you 
expect this structure to apply to proteins embedded in the 
hydrophobic interior of a membrane? Explain. 


22. Location is everything 2. Proteins that span biological 
membranes often contain α helices. Given that the insides 
of membranes are highly hydrophobic (Section 12.2), predict 
what type of amino acids would be in such a helix. Why 
is an α helix particularly suited to existence in the hydrophobic 
environment of the interior of a membrane? 


23. Neighborhood peer pressure? Table 2.1 shows the typical 
pKa values for ionizable groups in proteins. However, more 
than 500 pKa values have been determined for individual 
groups in folded proteins. Account for this discrepancy. 


24. Maybe size does matter. Osteogenesis imperfecta displays a 
wide range of symptoms, from mild to severe. On the 
basis of your knowledge of amino acid and collagen 
structure, propose a biochemical basis for the variety of 
symptoms. 


25. Issues of stability. Proteins are quite stable. The lifetime 
of a peptide bond in aqueous solution is nearly 1000 years. 
However, the free energy of hydrolysis of proteins is nega- 
tive and quite large. How can you account for the stability 
of the peptide bond in light of the fact that hydrolysis 
releases much energy? 


26. Minor species. For an amino acid such as alanine, the 
major species in solution at pH 7 is the zwitterionic form. 
Assume a pKa value of 8 for the amino group and a pKa 
value of 3 for the carboxylic acid. Estimate the ratio of the 
concentration of the neutral amino acid species (with the 
carboxylic acid protonated and the amino group neutral) to 
that of the zwitterionic species at pH 7 (see Section 1.3). 


27. A matter of convention. All L amino acids have an S 
absolute configuration except L-cysteine, which has the R 
configuration. Explain why L-cysteine is designated as hav- 
ing the R absolute configuration. 


28. Hidden message. Translate the following amino acid 
sequence into one-letter code: Glu-Leu-Val-Ile-Ser-Ile- 
Ser-Leu-Ile-Val-Ile-Asn-Gly-Ile-Asn-Leu-Ala-Ser-Val- 
Glu-Gly-Ala-Ser. 


29. Who goes first? Would you expect Pro—X peptide 
bonds to tend to have cis conformations like those of 
X—Pro bonds? Why or why not? 


30. Matching. For each of the amino acid derivatives shown 
here (A-E), find the matching set of φ and ψ values (a-e). 

[Figure: structures of amino acid derivatives A through E] 

(a) φ = 120°, ψ = 120° 
(b) φ = 180°, ψ = 0° 
(c) φ = 180°, ψ = 180° 
(d) φ = 0°, ψ = 180° 
(e) φ = −60°, ψ = −40° 


31. Scrambled ribonuclease. When performing his experiments 
on protein refolding, Christian Anfinsen obtained a quite dif- 
ferent result when reduced ribonuclease was reoxidized while 
it was still in 8 M urea and the preparation was then dialyzed 
to remove the urea. Ribonuclease reoxidized in this way had 
only 1% of the enzymatic activity of the native protein. Why 
were the outcomes so different when reduced ribonuclease was 
reoxidized in the presence and absence of urea? 


Exploring Proteins and Proteomes 


[Figure: MALDI-TOF mass spectrum of milk proteins. Axes: intensity versus mass/charge. Labeled peaks: lactalbumin, casein²⁺, casein, lactoglobulin.] 

Milk, a source of nourishment for all mammals, is composed, in part, of a 
variety of proteins. The protein components of milk are revealed by the 
technique of MALDI-TOF mass spectrometry, which separates molecules on 
the basis of their mass-to-charge ratio. [(Left) Okea/istockphoto.com. (Right) 
Courtesy of Dr. Brian Chait.] 


Proteins play crucial roles in nearly all biological processes: in catalysis, 
signal transmission, and structural support. This remarkable range of 
functions arises from the existence of thousands of proteins, each folded 
into a distinctive three-dimensional structure that enables it to interact with 
one or more of a highly diverse array of molecules. A major goal of biochem- 
istry is to determine how amino acid sequences specify the conformations, 
and hence functions, of proteins. Other goals are to learn how individual 
proteins bind specific substrates and other molecules, mediate catalysis, 
and transduce energy and information. 

It is often preferable to study a protein of interest after it has been sepa- 
rated from other components within the cell so that the structure and func- 
tion of this protein can be probed without any confounding effects from 
contaminants. Hence, the first step in these studies is the purification of 
the protein of interest. Proteins can be separated from one another on the 
basis of solubility, size, charge, and binding ability. After a protein has been 
purified, its amino acid sequence can be determined. Automated peptide 
sequencing and the application of recombinant DNA methods are provid- 
ing a wealth of amino acid sequence data that are opening new vistas. Many 
protein sequences, often deduced from genome sequences, are now avail- 
able in vast sequence databases. If the sequence of a purified protein has 
been archived in a publicly searchable database, the job of the investigator 
becomes much easier. The investigator need determine only a small stretch 
of amino acid sequence of the protein to find its match in the database. 
Alternatively, such a protein might be identified by matching its mass to 
those deduced for proteins in the database. Mass spectrometry provides a 
powerful method for determining the mass of a protein. 

After a protein has been purified and its identity confirmed, the chal- 
lenge remains to determine its function within a physiologically relevant 
context. Antibodies are choice probes for locating proteins in vivo and mea- 
suring their quantities. Monoclonal antibodies, able to recognize specific 
proteins, can be obtained in large amounts and used to detect and quantify 
the protein both in isolation and in cells. Peptides and proteins can be 
chemically synthesized, providing tools for research and, in some cases, 
highly pure proteins for use as drugs. Finally, x-ray crystallography and 
nuclear magnetic resonance (NMR) spectroscopy are the principal tech- 
niques for elucidating three-dimensional structure, the key determinant of 
function. 

The exploration of proteins by this array of physical and chemical tech- 
niques has greatly enriched our understanding of the molecular basis of life. 
These techniques make it possible to tackle some of the most challenging 
questions of biology in molecular terms. 


The proteome is the functional representation of the genome 


As will be discussed in Chapter 5, the complete DNA base sequences, or 
genomes, of many organisms are now available. For example, the round- 
worm Caenorhabditis elegans has a genome of 97 million bases and about 
19,000 protein-encoding genes, whereas that of the fruit fly Drosophila 
melanogaster contains 180 million bases and about 14,000 genes. The com- 
pletely sequenced human genome contains 3 billion bases and about 23,000 
genes. However, these genomes are simply inventories of the genes that 
could be expressed within a cell under specific conditions. Only a subset of 
the proteins encoded by these genes will actually be present in a given bio- 
logical context. The proteome—derived from proteins expressed by the 
genome—of an organism signifies a more complex level of information con- 
tent, encompassing the types, functions, and interactions of proteins within 
its biological environment. 

The proteome is not a fixed characteristic of the cell. Because it repre- 
sents the functional expression of information, it varies with cell type, 
developmental stage, and environmental conditions, such as the presence of 
hormones. The proteome is much larger than the genome because almost all 
gene products are proteins that can be chemically modified in a variety of 
ways. Furthermore, these proteins do not exist in isolation; they often inter- 
act with one another to form complexes with specific functional properties. 
Whereas the genome is “hard wired,” the proteome is highly dynamic. An 
understanding of the proteome is acquired by investigating, characterizing, 
and cataloging proteins. In some, but not all, cases, this process begins by 
separating a particular protein from all other biomolecules in the cell. 


3.1 The Purification of Proteins Is an Essential First Step in 
Understanding Their Function 


An adage of biochemistry is “Never waste pure thoughts on an impure pro- 
tein.” Starting from pure proteins, we can determine amino acid sequences 
and investigate a protein’s biochemical function. From the amino acid 
sequences, we can map evolutionary relationships between proteins in 
diverse organisms (Chapter 6). By using crystals grown from pure protein, 
we can obtain x-ray data that will provide us with a picture of the protein’s 
tertiary structure—the shape that determines function. 


The assay: How do we recognize the protein that we are looking for? 


Purification should yield a sample containing only one type of molecule— 
the protein in which the biochemist is interested. This protein sample may 
be only a fraction of 1% of the starting material, whether that starting mate- 
rial consists of one type of cell in culture or a particular organ from a plant 
or animal. How is the biochemist able to isolate a particular protein from a 
complex mixture of proteins? 

A protein can be purified by subjecting the impure mixture of the start- 
ing material to a series of separations based on physical properties such as 
size and charge. To monitor the success of this purification, the biochemist 
needs a test, called an assay, for some unique identifying property of the 
protein. A positive result on the assay indicates that the protein is present. 
Although assay development can be a challenging task, the more specific 
the assay, the more effective the purification. For enzymes, which are pro- 
tein catalysts (Chapter 8), the assay usually measures enzyme activity—that 
is, the ability of the enzyme to promote a particular chemical reaction. This 
activity is often measured indirectly. Consider the enzyme lactate dehydro- 
genase, which catalyzes the following reaction in the synthesis of glucose: 

Lactate + NAD⁺ ⇌ pyruvate + NADH + H⁺ 

Reduced nicotinamide adenine dinucleotide (NADH, see Figure 15.13) 
absorbs light at 340 nm, whereas oxidized nicotinamide adenine dinucleotide 
(NAD⁺) does not. Consequently, we can follow the progress of the reaction 
by examining how much light-absorbing ability is developed by a sample in 
a given period of time—for instance, within 1 minute after the addition of 
the enzyme. Our assay for enzyme activity during the purification of lactate 
dehydrogenase is thus the increase in the absorbance of light at 340 nm 
observed in 1 minute. 
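
As a rough illustration of how such an absorbance reading could be turned into an amount of product, the Python sketch below applies the Beer-Lambert law (absorbance = extinction coefficient × concentration × path length). Several things here are assumptions not stated in the text: the molar extinction coefficient of NADH at 340 nm is taken as the commonly cited 6220 M⁻¹ cm⁻¹, the cuvette path length as 1 cm, and the assay volume as 1 mL. The function name is hypothetical.

```python
# Commonly cited molar extinction coefficient of NADH at 340 nm
# (an assumption for this sketch; the value is not given in the text).
NADH_EXTINCTION_340 = 6220.0  # M^-1 cm^-1


def nadh_formed_umol_per_min(delta_a340_per_min, volume_ml=1.0, path_cm=1.0):
    """Convert a rise in A340 per minute into micromoles of NADH formed per minute."""
    # Beer-Lambert law: A = epsilon * c * l, so c = A / (epsilon * l).
    molar_per_min = delta_a340_per_min / (NADH_EXTINCTION_340 * path_cm)
    liters = volume_ml / 1000.0
    return molar_per_min * liters * 1e6  # mol -> micromol


# Hypothetical reading: absorbance rises by 0.0622 in the first minute.
rate = nadh_formed_umol_per_min(0.0622)
print(f"{rate:.4f} micromol NADH per minute")
```

Under these assumptions, a rise of 0.0622 absorbance units per minute corresponds to about 0.01 µmol of NADH formed per minute in the cuvette.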

To analyze how our purification scheme is working, we need one addi- 
tional piece of information—the amount of protein present in the mixture 
being assayed. There are various rapid and reasonably accurate means of 
determining protein concentration. With these two experimentally deter- 
mined numbers—enzyme activity and protein concentration—we then 
calculate the specific activity, the ratio of enzyme activity to the amount of 
protein in the mixture. Ideally, the specific activity will rise as the purifica- 
tion proceeds and the protein mixture being assayed consists to a greater 
and greater extent of lactate dehydrogenase. In essence, the overall goal of 
the purification is to maximize the specific activity. For a pure enzyme, the 
specific activity will have a constant value. 
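
The bookkeeping described above can be sketched in a few lines of Python. The step names and all numbers below are invented purely for illustration, and `specific_activity` is a hypothetical helper; the point is only that specific activity (activity divided by protein) should climb from step to step even as the total amount of protein falls.

```python
def specific_activity(total_activity_units, total_protein_mg):
    """Specific activity = enzyme activity / amount of protein (units per mg)."""
    if total_protein_mg <= 0:
        raise ValueError("protein amount must be positive")
    return total_activity_units / total_protein_mg


# Hypothetical purification record: (step, total protein in mg, total activity in units).
steps = [
    ("homogenate", 1000.0, 2000.0),
    ("after step 1", 200.0, 1600.0),
    ("after step 2", 10.0, 1200.0),
]

values = []
for name, protein_mg, activity_units in steps:
    value = specific_activity(activity_units, protein_mg)
    values.append(value)
    print(f"{name:>12}: {value:7.1f} units/mg")

# A purification that is working shows rising specific activity at every step.
is_improving = all(later > earlier for earlier, later in zip(values, values[1:]))
print(is_improving)
```

Note that total activity drops at each step in this made-up record, because some enzyme is always lost, yet the specific activity still rises because contaminating protein is removed faster than the enzyme itself.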


Proteins must be released from the cell to be purified 


Having found an assay and chosen a source of protein, we now fractionate 
the cell into components and determine which component is enriched in 
the protein of interest. In the first step, a homogenate is formed by disrupt- 
ing the cell membrane, and the mixture is fractionated by centrifugation, 
yielding a dense pellet of heavy material at the bottom of the centrifuge 
tube and a lighter supernatant above (Figure 3.1). The supernatant is 
again centrifuged at a greater force to yield yet another pellet and supernatant. 
The procedure, called differential centrifugation, yields several 
fractions of decreasing density, each still containing hundreds of different 
proteins. The fractions are each separately assayed for the desired activity. 
Usually, one fraction will be enriched for such activity, and it then serves 
as the source of material to which more-discriminating purification techniques 
are applied. 

Figure 3.1 Differential centrifugation. Cells are disrupted in a homogenizer and the 
resulting mixture, called the homogenate, is centrifuged in a step-by-step fashion of 
increasing centrifugal force. The denser material will form a pellet at lower centrifugal 
force than will the less-dense material. The isolated fractions can be used for further 
purification. [Photographs courtesy of Dr. S. Fleischer and Dr. B. Fleischer.] 


Proteins can be purified according to solubility, size, charge, and 
binding affinity 

Several thousand proteins have been purified in active form on the basis of 
such characteristics as solubility, size, charge, and specific binding affinity. 
Usually, protein mixtures are subjected to a series of separations, each based 
on a different property. At each step in the purification, the preparation is 
assayed and its specific activity is determined. A variety of purification 
techniques are available. 


Salting out. Most proteins are less soluble at high salt concentrations, an 
effect called salting out. The salt concentration at which a protein precipi- 
tates differs from one protein to another. Hence, salting out can be used to 
fractionate proteins. For example, 0.8 M ammonium sulfate precipitates 
fibrinogen, a blood-clotting protein, whereas a concentration of 2.4 M 
is needed to precipitate serum albumin. Salting out is also useful for con- 
centrating dilute solutions of proteins, including active fractions obtained 
from other purification steps. Dialysis can be used to remove the salt if 
necessary. 
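As a rough illustration of fractionation by salting out, the sketch below treats each protein as precipitating completely once the ammonium sulfate concentration reaches its threshold. The two thresholds are the values quoted above; the all-or-nothing behavior and the function name `fractionate` are simplifying assumptions, because real precipitation is gradual.

```python
# Simplified model: a protein precipitates once the salt concentration
# reaches its threshold (real precipitation curves are gradual).
PRECIPITATION_THRESHOLD_M = {
    "fibrinogen": 0.8,      # M ammonium sulfate, from the text
    "serum albumin": 2.4,   # M ammonium sulfate, from the text
}

def fractionate(salt_molar, thresholds=PRECIPITATION_THRESHOLD_M):
    """Return (pellet, supernatant) protein lists at a given salt concentration."""
    pellet = sorted(p for p, t in thresholds.items() if salt_molar >= t)
    supernatant = sorted(p for p, t in thresholds.items() if salt_molar < t)
    return pellet, supernatant

for conc in (0.5, 1.0, 2.5):
    pellet, supernatant = fractionate(conc)
    print(f"{conc} M: pellet={pellet}, supernatant={supernatant}")
```

At an intermediate concentration such as 1.0 M, fibrinogen is found in the pellet while serum albumin stays in the supernatant, which is the basis of the fractionation.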


Dialysis. Proteins can be separated from small molecules 
such as salt by dialysis through a semipermeable membrane, 
such as a cellulose membrane with pores (Figure 3.2). The pro- 
tein mixture is placed inside the dialysis bag, which is then 
submerged in a buffer solution that is devoid of the small mol- 
ecules to be separated away. Molecules having dimensions 
significantly greater than the pore diameter are retained inside
the dialysis bag. Smaller molecules and ions capable of passing
through the pores of the membrane diffuse down their concen- 
tration gradients and emerge in the solution outside the bag. 
This technique is useful for removing a salt or other small mol- 
ecule from a cell fractionate, but it will not distinguish between 
proteins effectively. 
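The extent of salt removal can be estimated from conservation of mass: at equilibrium, a freely diffusing solute is spread evenly over the bag and the surrounding buffer. The sketch below assumes ideal mixing, no change in volume, and fresh solute-free buffer for each round; the volumes and the name `dialysis_equilibrium` are hypothetical.

```python
def dialysis_equilibrium(c_initial, bag_volume, buffer_volume, rounds=1):
    """Concentration of a freely diffusing small solute after equilibrium.

    Each round uses fresh buffer free of the solute. Both volumes must be
    given in the same unit.
    """
    dilution = bag_volume / (bag_volume + buffer_volume)
    return c_initial * dilution ** rounds

# Hypothetical case: 10 mL of sample in 2.4 M salt, dialyzed against 990 mL.
once = dialysis_equilibrium(2.4, 10, 990)             # 100-fold dilution
twice = dialysis_equilibrium(2.4, 10, 990, rounds=2)  # 10,000-fold dilution
print(f"after one round: {once:.4f} M; after two rounds: {twice:.6f} M")
```

Proteins too large to pass through the pores are not diluted in this way, which is why repeated changes of buffer remove salt without losing the sample.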


Gel-filtration chromatography. More-discriminating separations on the
basis of size can be achieved by the technique of gel-filtration
chromatography, also known as molecular exclusion chromatography (Figure 3.3). The
sample is applied to the top of a column consisting of porous beads made of 
an insoluble but highly hydrated polymer such as dextran or agarose (which 
are carbohydrates) or polyacrylamide. Sephadex, Sepharose, and Biogel are 
commonly used commercial preparations of these beads, which are
typically 100 μm (0.1 mm) in diameter. Small molecules can enter these beads,
but large ones cannot. The result is that small molecules are distributed in 
the aqueous solution both inside the beads and between them, whereas large 
molecules are located only in the solution between the beads. Large
molecules flow more rapidly through this column and emerge first because a smaller
volume is accessible to them. Molecules that are of a size to occasionally enter 
a bead will flow from the column at an intermediate position, and small 
molecules, which take a longer, tortuous path, will exit last. 
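The elution order in gel filtration can be sketched by sorting proteins from largest to smallest. The masses below are those listed in the legend of Figure 3.6; using mass as a stand-in for molecular size (ignoring shape) and assuming that every protein falls within the fractionation range of the beads are simplifications.

```python
# Masses in kilodaltons, taken from the Figure 3.6 legend.
PROTEIN_MASS_KD = {
    "ribonuclease": 13.4,
    "thyroglobulin": 669,
    "ovalbumin": 43,
    "catalase": 232,
    "bovine serum albumin": 67,
}

def elution_order(masses_kd):
    """Largest proteins first: they are excluded from the beads and so
    travel through the smallest accessible volume."""
    return sorted(masses_kd, key=masses_kd.get, reverse=True)

print(elution_order(PROTEIN_MASS_KD))
```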


Figure 3.2 Dialysis. Protein molecules (red) are retained within the
dialysis bag, whereas small molecules (blue) diffuse down their
concentration gradient into the surrounding medium.

Figure 3.3 Gel-filtration chromatography. A mixture of proteins in a small
volume is applied to a column filled with porous beads. Because large
proteins cannot enter the internal volume of the beads, they emerge sooner
than do small ones.


Ion-exchange chromatography. To obtain a protein of high purity, one
chromatography step is usually not sufficient, because other proteins in the
crude mixture will likely co-elute with the desired material. Additional
purity can be achieved by performing sequential separations that are based
on distinct molecular properties. For example, in addition to size, proteins
can be separated on the basis of their net charge by ion-exchange
chromatography. If a protein has a net positive charge at pH 7, it will
usually bind to a column of beads containing carboxylate groups, whereas a
negatively charged protein will not (Figure 3.4). The bound protein can then
be eluted (released) by increasing the concentration of sodium chloride or
another salt in the eluting buffer; sodium ions compete with positively
charged groups on the protein for binding to the column. Proteins that have
a low density of net positive charge will tend to emerge first, followed by
those having a higher charge density. This procedure is also referred to as
cation exchange to indicate that positively charged groups will bind to the
anionic beads. Positively charged proteins (cationic proteins) can be
separated by chromatography on negatively charged carboxymethylcellulose
(CM-cellulose) columns. Conversely, negatively charged proteins (anionic
proteins) can be separated by anion exchange on positively charged
diethylaminoethylcellulose (DEAE-cellulose) columns.

Figure 3.4 Ion-exchange chromatography. This technique separates proteins
mainly according to their net charge.

Figure 3.5 Affinity chromatography. Affinity chromatography of
concanavalin A (shown in yellow) on a solid support containing covalently
attached glucose residues (G).


[Structures: carboxymethyl (CM) group, ionized form; diethylaminoethyl
(DEAE) group, protonated form.]
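The binding rules for cation and anion exchange can be summarized in a few lines. This is only a sketch: it uses the sign and size of the net charge as a stand-in for charge density, and the protein names and charges are hypothetical.

```python
def column_behavior(net_charge, column):
    """Predict whether a protein binds a CM (cation-exchange) or
    DEAE (anion-exchange) column from the sign of its net charge."""
    if column == "CM":      # negatively charged beads bind cationic proteins
        return "binds" if net_charge > 0 else "flows through"
    if column == "DEAE":    # positively charged beads bind anionic proteins
        return "binds" if net_charge < 0 else "flows through"
    raise ValueError(f"unknown column type: {column}")

def salt_elution_order(net_charges, column):
    """Bound proteins ordered from first to last eluted as salt increases:
    weakly charged proteins are displaced before highly charged ones."""
    bound = {p: q for p, q in net_charges.items()
             if column_behavior(q, column) == "binds"}
    return sorted(bound, key=lambda p: abs(bound[p]))

# Hypothetical net charges at pH 7.
charges = {"protein A": +2, "protein B": +9, "protein C": -5}
print(salt_elution_order(charges, "CM"))
print(salt_elution_order(charges, "DEAE"))
```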


Affinity chromatography. Affinity chromatography is another powerful 
means of purifying proteins that is highly selective for the protein of inter- 
est. This technique takes advantage of the high affinity of many proteins for 
specific chemical groups. For example, the plant protein concanavalin A is 
a carbohydrate-binding protein, or lectin (Section 11.4), that has affinity for 
glucose. When a crude extract is passed through a column of beads contain- 
ing covalently attached glucose residues, concanavalin A binds to the beads, 
whereas most other proteins do not (Figure 3.5). The bound concanavalin A 
can then be released from the column by adding a concentrated solution of 
glucose. The glucose in solution displaces the column-attached glucose 
residues from binding sites on concanavalin A. Affinity chromatography is 
a powerful means of isolating transcription factors—proteins that regulate 
gene expression by binding to specific DNA sequences. A protein mixture 
is passed through a column containing specific DNA sequences attached to 
a matrix; proteins with a high affinity for the sequence will bind and be 
retained. In this instance, the transcription factor is released by washing 
with a solution containing a high concentration of salt. 

In general, affinity chromatography can be effectively used to isolate a 
protein that recognizes group X by (1) covalently attaching X or a derivative 
of it to a column; (2) adding a mixture of proteins to this column, which is 
then washed with buffer to remove unbound proteins; and (3) eluting the 
desired protein by adding a high concentration of a soluble form of X or 
altering the conditions to decrease binding affinity. Affinity chromatogra- 
phy is most effective when the interaction of the protein and the molecule 
that is used as the bait is highly specific. 

The process of standard affinity chromatography can isolate proteins 
expressed from cloned genes (Section 5.2). Extra amino acids are encoded
in the cloned gene that, when expressed, serve as an affinity tag that can be
readily trapped. For example, repeats of the codon for histidine may be
added such that the expressed protein has a string of histidine residues
(called a His tag) on one end. The tagged proteins are then passed through
a column of beads containing covalently attached, immobilized nickel(II) or
other metal ions. The His tags bind tightly to the immobilized metal ions, 
binding the desired protein, while other proteins flow through the column. 
The protein can then be eluted from the column by the addition of imida- 
zole or some other chemical that binds to the metal ions and displaces the 
protein. 


High-pressure liquid chromatography. A technique called high-pressure 
liquid chromatography (HPLC) is an enhanced version of the column tech- 
niques already discussed. The column materials are much more finely 
divided and, as a consequence, possess more interaction sites and thus 
greater resolving power. Because the column is made of finer material, pres- 
sure must be applied to the column to obtain adequate flow rates. The net 
result is both high resolution and rapid separation. In a typical HPLC 
setup, a detector that monitors the absorbance of the eluate at a particular 
wavelength is placed immediately after the column. In the sample HPLC 
elution profile shown in Figure 3.6, proteins are detected by setting the 
detector to 220 nm (the characteristic absorbance wavelength of the peptide
bond). In a short span of 10 minutes, a number of sharp peaks representing
individual proteins can be readily identified. 


Proteins can be separated by gel electrophoresis and displayed 


How can we tell that a purification scheme is effective? One way is to ascer- 
tain that the specific activity rises with each purification step. Another is to 
determine that the number of different proteins in each sample declines 
at each step. The technique of electrophoresis makes the latter method 
possible. 


Gel electrophoresis. A molecule with a net charge will move in an electric 
field. This phenomenon, termed electrophoresis, offers a powerful means of 
separating proteins and other macromolecules, such as DNA and RNA. 
The velocity of migration (v) of a protein (or any molecule) in an electric 
field depends on the electric field strength (E), the net charge on the protein 
(z), and the frictional coefficient (f). 


$$ v = \frac{Ez}{f} \qquad (1) $$


The electric force Ez driving the charged molecule toward the oppo- 
sitely charged electrode is opposed by the viscous drag fv arising from fric- 
tion between the moving molecule and the medium. The frictional coeffi- 
cient f depends on both the mass and shape of the migrating molecule and 
the viscosity ($\eta$) of the medium. For a sphere of radius r,


$$ f = 6\pi\eta r \qquad (2) $$
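Equations 1 and 2 can be combined in a short numerical sketch. It treats the protein as a rigid sphere moving in free solution (no gel sieving) and works in SI units; the radius, viscosity, field strength, and charges are hypothetical values chosen only for illustration.

```python
import math

ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs per unit charge

def stokes_friction(viscosity, radius):
    """Frictional coefficient f = 6*pi*eta*r for a sphere (SI units, kg/s)."""
    return 6 * math.pi * viscosity * radius

def migration_velocity(field, net_charge, friction):
    """v = E*z/f, with z given in units of the elementary charge.
    Returns m/s; the sign gives the direction relative to the field."""
    return field * net_charge * ELEMENTARY_CHARGE / friction

# Hypothetical sphere: radius 2 nm, in water (about 1.0e-3 Pa*s), 1000 V/m.
f = stokes_friction(viscosity=1.0e-3, radius=2.0e-9)
for z in (-10, 0, 5):
    print(f"z = {z:+d}: v = {migration_velocity(1000.0, z, f):.2e} m/s")
```

A molecule with no net charge does not move, and the velocity scales directly with charge and inversely with the frictional coefficient.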


Electrophoretic separations are nearly always carried out in porous gels (or
on solid supports such as paper) because the gel serves as a molecular sieve
that enhances separation (Figure 3.7). Molecules that are small compared
with the pores in the gel readily move through the gel, whereas molecules
much larger than the pores are almost immobile. Intermediate-size molecules
move through the gel with various degrees of facility. The electric field
is applied such that proteins migrate from the negative to the positive
electrodes, typically from top to bottom. Electrophoresis is performed in a
thin, vertical slab of polyacrylamide gel. Polyacrylamide gels are choice
supporting media for electrophoresis because they are chemically inert and
readily formed by the polymerization of acrylamide with a small amount of
the cross-linking agent methylenebisacrylamide to make a three-dimensional
mesh (Figure 3.8). Electrophoresis is distinct from gel filtration in that,
because of the electric field, all of the molecules, regardless of size, are
forced to move through the same matrix.

Figure 3.6 High-pressure liquid chromatography (HPLC). Gel filtration by
HPLC clearly defines the individual proteins because of its greater
resolving power: (1) thyroglobulin (669 kd), (2) catalase (232 kd),
(3) bovine serum albumin (67 kd), (4) ovalbumin (43 kd), and
(5) ribonuclease (13.4 kd). [After K. J. Wilson and T. D. Schlabach. In
Current Protocols in Molecular Biology, vol. 2, suppl. 41, F. M. Ausubel,
R. Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, and
K. Struhl, Eds. (Wiley, 1998), p. 10.14.1.]

Figure 3.7 Polyacrylamide gel electrophoresis. (A) Gel-electrophoresis
apparatus. Typically, several samples undergo electrophoresis on one flat
polyacrylamide gel. A microliter pipette is used to place solutions of
proteins in the wells of the slab. A cover is then placed over the gel
chamber and voltage is applied. The negatively charged SDS (sodium dodecyl
sulfate)-protein complexes migrate in the direction of the anode, at the
bottom of the gel. (B) The sieving action of a porous polyacrylamide gel
separates proteins according to size, with the smallest moving most rapidly.

Figure 3.8 Formation of a polyacrylamide gel. A three-dimensional mesh is
formed by copolymerizing activated monomer (blue) and cross-linker (red).


Proteins can be separated largely on the basis of mass by electrophoresis
in a polyacrylamide gel under denaturing conditions. The mixture of
proteins is first dissolved in a solution of sodium dodecyl sulfate (SDS), an
anionic detergent that disrupts nearly all noncovalent interactions in native
proteins. β-Mercaptoethanol (2-thioethanol) or dithiothreitol is added to
reduce disulfide bonds. Anions of SDS bind to main chains at a ratio of
about one SDS anion for every two amino acid residues. The negative charge
acquired on binding SDS is usually much greater than the charge on the
native protein; the contribution of the protein to the total charge of the
SDS-protein complex is thus rendered insignificant. As a result, this
complex of SDS with a denatured protein has a large net negative charge that
is roughly proportional to the mass of the protein. The SDS-protein
complexes are then subjected to electrophoresis. When the electrophoresis is
complete, the proteins in the gel can be visualized by staining them with
silver or a dye such as Coomassie blue, which reveals a series of bands
(Figure 3.9). Radioactive labels, if they have been incorporated into
proteins, can be detected by placing a sheet of x-ray film over the gel, a
procedure called autoradiography.

Small proteins move rapidly through the gel, whereas large proteins stay 
at the top, near the point of application of the mixture. The mobility of most
polypeptide chains under these conditions decreases linearly with the
logarithm of their mass (Figure 3.10). Some carbohydrate-rich proteins
and membrane proteins do not obey this empirical relation, however. 
SDS—polyacrylamide gel electrophoresis (often referred to as SDS-PAGE) 
is rapid, sensitive, and capable of a high degree of resolution. As little as 
0.1 μg (~2 pmol) of a protein gives a distinct band when stained with
Coomassie blue, and even less (~0.02 μg) can be detected with a silver stain.
Proteins that differ in mass by about 2% (e.g., 50 and 51 kd, arising from a 
difference of about 10 amino acids) can usually be distinguished with SDS- 
PAGE. 
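Because mobility varies linearly with the logarithm of mass, the mass of an unknown polypeptide can be estimated from marker proteins run on the same gel. The sketch below fits a straight line to log10(mass) against relative mobility by least squares. The marker masses and mobilities are hypothetical, and `fit_line` and `estimate_mass_kd` are illustrative names rather than part of any standard package.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def estimate_mass_kd(mobility, standards):
    """Estimate mass from relative mobility using marker proteins.

    standards: list of (mass_kd, relative_mobility) pairs.
    """
    mobilities = [m for _, m in standards]
    log_masses = [math.log10(mass) for mass, _ in standards]
    slope, intercept = fit_line(mobilities, log_masses)
    return 10 ** (slope * mobility + intercept)

# Hypothetical markers: (mass in kd, relative mobility on the gel).
markers = [(100, 0.20), (50, 0.45), (25, 0.70), (12.5, 0.95)]
print(f"{estimate_mass_kd(0.575, markers):.1f} kd")
```

The estimate is only as good as the empirical relation itself, so it should be treated with caution for the carbohydrate-rich and membrane proteins noted above.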

We can examine the efficacy of our purification scheme by analyzing a 
part of each fraction by electrophoresis. The initial fractions will display 
dozens to hundreds of proteins. As the purification progresses, the number 
of bands will diminish, and the prominence of one of the bands should 
increase. This band should correspond to the protein of interest. 


Isoelectric focusing. Proteins can also be separated electrophoretically 
on the basis of their relative contents of acidic and basic residues. The 
isoelectric point (pI) of a protein is the pH at which its net charge is zero. 
At this pH, its electrophoretic mobility is zero because z in equation 1 is 
equal to zero. For example, the pI of cytochrome c, a highly basic electron- 
transport protein, is 10.6, whereas that of serum albumin, an acidic pro- 
tein in blood, is 4.8. Suppose that a mixture of proteins undergoes electro- 
phoresis in a pH gradient in a gel in the absence of SDS. Each protein will 
move until it reaches a position in the gel at which the pH is equal to the 
pI of the protein. This method of separating proteins according to their
isoelectric point is called isoelectric focusing. The pH gradient in the gel is 
formed first by subjecting a mixture of polyampholytes (small multi- 
charged polymers) having many different pI values to electrophoresis. 
Isoelectric focusing can readily resolve proteins that differ in pI by as little 
as 0.01, which means that proteins differing by one net charge can be 
separated (Figure 3.11). 
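The definition of the isoelectric point can be illustrated by computing the net charge of a peptide as a function of pH and searching for the pH at which it crosses zero. This sketch treats each ionizable group as independent and uses approximate pKa values typical of textbook tables; both are assumptions, and the two peptide sequences are hypothetical.

```python
# Approximate side-chain and terminal pKa values (assumed, typical textbook
# figures); real values shift with the local environment in a folded protein.
PKA_ACIDIC = {"C-term": 3.1, "D": 4.1, "E": 4.1, "C": 8.3, "Y": 10.4}
PKA_BASIC = {"N-term": 8.0, "H": 6.0, "K": 10.0, "R": 12.5}

def net_charge(sequence, ph):
    """Net charge of a peptide at a given pH (Henderson-Hasselbalch)."""
    groups = ["N-term", "C-term"] + list(sequence)
    charge = 0.0
    for g in groups:
        if g in PKA_BASIC:       # protonated form carries +1
            charge += 1.0 / (1.0 + 10 ** (ph - PKA_BASIC[g]))
        elif g in PKA_ACIDIC:    # deprotonated form carries -1
            charge -= 1.0 / (1.0 + 10 ** (PKA_ACIDIC[g] - ph))
    return charge

def isoelectric_point(sequence, low=0.0, high=14.0, tol=1e-4):
    """Find the pH of zero net charge by bisection (charge falls as pH rises)."""
    while high - low > tol:
        mid = (low + high) / 2
        if net_charge(sequence, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2

# Hypothetical peptides: one rich in lysine, one rich in glutamate.
print(round(isoelectric_point("GKKAKLK"), 2))
print(round(isoelectric_point("GEEADLE"), 2))
```

The lysine-rich peptide has a high pI and the glutamate-rich peptide a low one, mirroring the contrast between cytochrome c and serum albumin.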
Figure 3.9 Staining of proteins after electrophoresis. Proteins subjected
to electrophoresis on an SDS-polyacrylamide gel can be visualized by
staining with Coomassie blue. [Courtesy of Kodak Scientific Imaging
Systems.]

Figure 3.10 Electrophoresis can determine mass. The electrophoretic
mobility of many proteins in SDS-polyacrylamide gels decreases linearly
with the logarithm of their mass. [After K. Weber and M. Osborn, The
Proteins, vol. 1, 3d ed. (Academic Press, 1975), p. 179.]

Figure 3.11 The principle of isoelectric focusing. A pH gradient is
established in a gel before loading the sample. (A) The sample is loaded and
voltage is applied. The proteins will migrate to their isoelectric pH, the
location at which they have no net charge. (B) The proteins form bands that
can be excised and used for further experimentation.

Figure 3.12 Two-dimensional gel electrophoresis. (A) A protein sample is
initially fractionated in one dimension by isoelectric focusing as described
in Figure 3.11. The isoelectric focusing gel is then attached to an
SDS-polyacrylamide gel, and electrophoresis is performed in the second
dimension, perpendicular to the original separation. Proteins with the same
pI are now separated on the basis of mass. (B) Proteins from E. coli were
separated by two-dimensional gel electrophoresis, resolving more than a
thousand different proteins. The proteins were first separated according to
their isoelectric pH in the horizontal direction and then by their apparent
mass in the vertical direction. [(B) Courtesy of Dr. Patrick H. O'Farrell.]

Figure 3.13 Alterations in protein levels detected by two-dimensional gel
electrophoresis. Samples of normal colon mucosa and colorectal tumor tissue
from the same person were analyzed by two-dimensional gel electrophoresis.
In the gel section shown, changes in the intensity of several spots are
evident, including a dramatic increase in levels of the protein indicated by
the arrow, corresponding to the enzyme glyceraldehyde-3-phosphate
dehydrogenase. [Courtesy of Lin Quinsong © 2010, The American Society for
Biochemistry and Molecular Biology.]


Two-dimensional electrophoresis. Isoelectric focusing can be combined
with SDS-PAGE to obtain very high resolution separations. A single sample 
is first subjected to isoelectric focusing. This single-lane gel is then placed 
horizontally on top of an SDS—polyacrylamide slab. The proteins are thus 
spread across the top of the polyacrylamide gel according to how far they 
migrated during isoelectric focusing. They then undergo electrophoresis 
again in a perpendicular direction (vertically) to yield a two-dimensional 
pattern of spots. In such a gel, proteins have been separated in the horizon- 
tal direction on the basis of isoelectric point and in the vertical direction on 
the basis of mass. Remarkably, more than a thousand different proteins in 
the bacterium Escherichia coli can be resolved in a single experiment by two- 
dimensional electrophoresis (Figure 3.12). 

Proteins isolated from cells under different physiological conditions can be 
subjected to two-dimensional electrophoresis. The intensities of individual 
spots on the gels can then be compared, which indicates that the concentra- 
tions of specific proteins have changed in response to the physiological state 
(Figure 3.13). How can we discover the identity of a protein that is showing 
such responses? Although many proteins are displayed on a two-dimensional 
gel, they are not identified. It is now possible to identify proteins by coupling 
two-dimensional gel electrophoresis with mass spectrometric techniques. We 
will examine these powerful techniques shortly (Section 3.4).

Table 3.1 Quantification of a purification protocol for a fictitious protein

Step                           Total protein  Total activity  Specific activity  Yield  Purification
                               (mg)           (units)         (units mg^-1)      (%)    level
Homogenization                 15,000         150,000         10                 100    1
Salt fractionation             4,600          138,000         30                 92     3
Ion-exchange chromatography    1,278          115,500         90                 77     9
Gel-filtration chromatography  68.8           75,000          1,100              50     110
Affinity chromatography        1.75           52,500          30,000             35     3,000

A protein purification scheme can be quantitatively evaluated


To determine the success of a protein purification scheme, we monitor each
step of the procedure by determining the specific activity of the protein
mixture and by subjecting it to SDS-PAGE analysis. Consider the results for
the purification of a fictitious protein, summarized in Table 3.1 and
Figure 3.14. At each step, the following parameters are measured:


Total Protein. The quantity of protein present in a fraction is obtained by
determining the protein concentration of a part of each fraction and
multiplying by the fraction’s total volume.


Total Activity. The enzyme activity for the fraction is obtained by
measuring the enzyme activity in the volume of fraction used in the assay
and multiplying by the fraction’s total volume.


Specific Activity. This parameter is obtained by dividing total activity by
total protein.


Yield. This parameter is a measure of the activity retained after each
purification step as a percentage of the activity in the crude extract. The
amount of activity in the initial extract is taken to be 100%.


Purification Level. This parameter is a measure of the increase in purity and
is obtained by dividing the specific activity, calculated after each
purification step, by the specific activity of the initial extract.


Figure 3.14 Electrophoretic analysis of a protein purification. The
purification scheme in Table 3.1 was analyzed by SDS-PAGE. Each lane
contained 50 μg of sample. The effectiveness of the purification can be seen
as the band for the protein of interest becomes more prominent relative to
other bands. [Lanes, in order: homogenate, salt fractionation, ion-exchange
chromatography, gel-filtration chromatography, affinity chromatography.]


As we see in Table 3.1, the first purification step, salt fractionation, leads 
to an increase in purity of only 3-fold, but we recover nearly all the target 
protein in the original extract, given that the yield is 92%. After dialysis to 
lower the high concentration of salt remaining from the salt fractionation, 
the fraction is passed through an ion-exchange column. The purification
now increases to 9-fold compared with the original extract, whereas the 
yield falls to 77%. Gel-filtration chromatography brings the level of purifi- 
cation to 110-fold, but the yield is now at 50%. The final step is affinity 
chromatography with the use of a ligand specific for the target enzyme. 
This step, the most powerful of these purification procedures, results in a 
purification level of 3000-fold but lowers the yield to 35%. The SDS-PAGE 
analysis in Figure 3.14 shows that, if we load a constant amount of protein 
onto each lane after each step, the number of bands decreases in proportion
to the level of purification, and the amount of protein of interest increases as
a proportion of the total protein present. 
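The derived columns of Table 3.1 follow directly from the two measured quantities, total protein and total activity. The sketch below recomputes them from the table's values; the function name `purification_table` is illustrative. The computed figures are unrounded, so the ion-exchange and gel-filtration rows differ slightly from the rounded entries in the table.

```python
# (step, total protein in mg, total activity in units), from Table 3.1.
STEPS = [
    ("Homogenization", 15000, 150000),
    ("Salt fractionation", 4600, 138000),
    ("Ion-exchange chromatography", 1278, 115500),
    ("Gel-filtration chromatography", 68.8, 75000),
    ("Affinity chromatography", 1.75, 52500),
]

def purification_table(steps):
    """Return rows of (step, specific activity, yield %, purification level)."""
    _, protein0, activity0 = steps[0]
    initial_specific = activity0 / protein0
    rows = []
    for name, protein, activity in steps:
        specific = activity / protein               # units per mg
        yield_pct = 100 * activity / activity0      # activity retained
        level = specific / initial_specific         # fold purification
        rows.append((name, specific, yield_pct, level))
    return rows

for name, specific, yield_pct, level in purification_table(STEPS):
    print(f"{name:30s} {specific:10.0f} {yield_pct:6.0f} {level:8.0f}")
```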

A good purification scheme takes into account both purification levels 
and yield. A high degree of purification and a poor yield leave little protein 
with which to experiment. A high yield with low purification leaves many 
contaminants (proteins other than the one of interest) in the fraction and 
complicates the interpretation of subsequent experiments. 


Ultracentrifugation is valuable for separating biomolecules and 
determining their masses 


We have already seen that centrifugation is a powerful and generally appli- 
cable method for separating a crude mixture of cell components. This tech- 
nique is also valuable for the analysis of the physical properties of biomol- 
ecules. Using centrifugation, we can determine such parameters as mass 
and density, learn something about the shape of a molecule, and investigate 
the interactions between molecules. To deduce these properties from the 
centrifugation data, we require a mathematical description of how a particle 
behaves when a centrifugal force is applied. 

A particle will move through a liquid medium when subjected to a 
centrifugal force. A convenient means of quantifying the rate of movement 
is to calculate the sedimentation coefficient, s, of a particle by using the fol- 
lowing equation: 


$$ s = \frac{m(1 - \bar{v}\rho)}{f} $$


where m is the mass of the particle, $\bar{v}$ is the partial specific volume (the
reciprocal of the particle density), $\rho$ is the density of the medium, and f
is the frictional coefficient (a measure of the shape of the particle). The
$(1 - \bar{v}\rho)$ term is the buoyant force exerted by the liquid medium.

Sedimentation coefficients are usually expressed in Svedberg units (S), 
equal to $10^{-13}$ s. The smaller the S value, the more slowly a molecule moves
in a centrifugal field. The S values for a number of biomolecules and cellular 
components are listed in Table 3.2 and Figure 3.15. 

Several important conclusions can be drawn from the preceding 
equation: 


1. The sedimentation velocity of a particle depends in part on its mass. A
more massive particle sediments more rapidly than does a less massive
particle of the same shape and density.


2. Shape, too, influences the sedimentation velocity because it affects the
viscous drag. The frictional coefficient f of a compact particle is smaller than
that of an extended particle of the same mass. Hence, elongated particles
sediment more slowly than do spherical ones of the same mass.


3. A dense particle moves more rapidly than does a less dense one because
the opposing buoyant force $(1 - \bar{v}\rho)$ is smaller for the denser particle.


4. The sedimentation velocity also depends on the density of the solution
($\rho$). Particles sink when $\bar{v}\rho < 1$, float when $\bar{v}\rho > 1$, and do not move
when $\bar{v}\rho = 1$.


Table 3.2 S values and molecular weights of sample proteins

Protein                        S value (Svedberg units)  Molecular weight
Pancreatic trypsin inhibitor   1                         6,520
Cytochrome c                   1.83                      12,310
Ribonuclease A                 1.78                      13,690
Myoglobin                      1.97                      17,800
Trypsin                        2.5                       23,200
Carbonic anhydrase             3.23                      28,800
Concanavalin A                 3.8                       51,260
Malate dehydrogenase           5.76                      74,900
Lactate dehydrogenase          7.54                      146,200

Source: T. Creighton, Proteins, 2d ed. (W. H. Freeman and Company, 1993), Table 7.1.

Figure 3.15 Density and sedimentation coefficients of cellular components.
[After L. J. Kleinsmith and V. M. Kish, Principles of Cell and Molecular
Biology, 2d ed. (HarperCollins, 1995), p. 138.]
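The conclusions above can be checked numerically with the sedimentation equation. In this sketch the molar mass is that of lactate dehydrogenase from Table 3.2, whereas the partial specific volume (0.73 cm^3/g, a value often quoted for proteins), the sphere radius, and the solvent viscosity are assumptions made only for illustration.

```python
import math

AVOGADRO = 6.02214076e23   # molecules per mole
SVEDBERG = 1e-13           # seconds

def sedimentation_coefficient(molar_mass, partial_specific_volume,
                              solvent_density, friction):
    """s = m(1 - v_bar * rho) / f, returned in Svedberg units.

    molar_mass in g/mol, partial_specific_volume in cm^3/g,
    solvent_density in g/cm^3, friction in kg/s.
    """
    mass_kg = molar_mass / AVOGADRO / 1000.0          # mass of one particle
    buoyancy = 1.0 - partial_specific_volume * solvent_density
    return mass_kg * buoyancy / friction / SVEDBERG

# Hypothetical sphere of radius 4.5 nm in water (viscosity 1.0e-3 Pa*s).
f = 6 * math.pi * 1.0e-3 * 4.5e-9
for rho in (1.00, 1.37, 1.50):   # 1/0.73 is about 1.37 g/cm^3
    s = sedimentation_coefficient(146200, 0.73, rho, f)
    print(f"rho = {rho:.2f} g/cm^3: s = {s:+.2f} S")
```

The sign of s changes as the solvent density passes the reciprocal of the partial specific volume: the particle sinks in the less dense medium, barely moves near the matching density, and floats in the denser one.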


A technique called zonal, band, or most commonly gradient centrifugation
can be used to separate proteins with different sedimentation
coefficients. The first step is to form a density gradient in a centrifuge
tube. Differing proportions of a low-density solution (such as 5% sucrose)
and a high-density solution (such as 20% sucrose) are mixed to create a
linear gradient of sucrose concentration ranging from 20% at the bottom of
the tube to 5% at the top (Figure 3.16). The role of the gradient is to
prevent convective flow. A small volume of a solution containing the mixture
of proteins to be separated is placed on top of the density gradient. When
the rotor is spun, proteins move through the gradient and separate according
to their sedimentation coefficients. The time and speed of the
centrifugation are determined empirically. The separated bands, or zones, of
protein can be harvested by making a hole in the bottom of the tube and
collecting drops. The drops can be measured for protein content and
catalytic activity or another functional property. This
sedimentation-velocity technique readily separates proteins differing in
sedimentation coefficient by a factor of two or more.

Figure 3.16 Zonal centrifugation. The steps are as follows: (A) form a
density gradient, (B) layer the sample on top of the gradient, (C) place the
tube in a swinging-bucket rotor and centrifuge it, and (D) collect the
samples. [After D. Freifelder, Physical Biochemistry, 2d ed. (W. H. Freeman
and Company, 1982), p. 397.]

The mass of a protein can be directly determined by sedimentation 
equilibrium, in which a sample is centrifuged at low speed such that a con- 
centration gradient of the sample is formed. However, this sedimentation is 
counterbalanced by the diffusion of the sample from regions of high to 
low concentration. When equilibrium has been achieved, the shape of the 
final gradient depends solely on the mass of the sample. The sedimentation- 
equilibrium technique for determining mass is very accurate and can be applied 
without denaturing the protein. Thus the native quaternary structure of 
multimeric proteins is preserved. In contrast, SDS—polyacrylamide gel elec- 
trophoresis provides an estimate of the mass of dissociated polypeptide 
chains under denaturing conditions. Note that, if we know the mass of the 
dissociated components of a multimeric protein as determined by SDS— 
polyacrylamide analysis and the mass of the intact multimer as determined 
by sedimentation-equilibrium analysis, we can determine the number of 
copies of each polypeptide chain present in the protein complex. 
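For a complex built from a single kind of polypeptide chain, the comparison described above reduces to a ratio. In this sketch the native mass is that of lactate dehydrogenase from Table 3.2, whereas the subunit mass is an illustrative value of the kind that SDS-PAGE might report; a complex containing several different chains would need a separate estimate for each.

```python
def subunit_copies(native_mass, subunit_mass):
    """Estimate how many copies of one polypeptide make up a homomultimer.

    Returns the rounded copy number together with the raw mass ratio.
    """
    ratio = native_mass / subunit_mass
    return round(ratio), ratio

# Native mass from Table 3.2; the subunit mass is an illustrative value.
copies, ratio = subunit_copies(native_mass=146200, subunit_mass=36500)
print(f"ratio = {ratio:.2f}, so about {copies} copies per complex")
```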


Protein purification can be made easier with the use of recombinant 
DNA technology 


In Chapter 5, we shall consider the widespread effect of recombinant DNA 
technology on all areas of biochemistry and molecular biology. The applica- 
tion of recombinant methods to the overproduction of proteins has enabled 
dramatic advances in our understanding of their structure and function. 
Before the advent of this technology, proteins were isolated solely from their 
native sources, often requiring a large amount of tissue to obtain a sufficient 
amount of protein for analytical study. For example, the purification of 
bovine deoxyribonuclease in 1946 required nearly ten pounds of beef pan- 
creas to yield one gram of protein. As a result, biochemical studies on puri- 
fied material were often limited to abundant proteins. 

Armed with the tools of recombinant technology, the biochemist is now 
able to enjoy a number of significant advantages: 


1. Proteins can be expressed in large quantities. The homogenate serves as 
the starting point in a protein purification scheme. For recombinant sys- 
tems, a host organism that is amenable to genetic manipulation, such as the 
bacterium Escherichia coli or the yeast Pichia pastoris, is utilized to express a 
protein of interest. The biochemist can exploit the short doubling times and 
ease of genetic manipulation of such organisms to produce large amounts 
of protein from manageable amounts of culture. As a result, purification can
begin with a homogenate that is often highly enriched with the desired 
molecule. Moreover, a protein can be easily obtained regardless of its natu- 
ral abundance or its species of origin. 


2. Affinity tags can be fused to proteins. As described earlier, affinity chro- 
matography can be a highly selective step within a protein purification 
scheme. Recombinant DNA technology enables the attachment of any one 
of a number of possible affinity tags to a protein (such as the “His tag”
mentioned earlier). Hence, the benefits of affinity chromatography can be
realized even for those proteins for which a binding partner is unknown or not
easily determined. 


3. Proteins with modified primary structures can be readily generated. A 
powerful aspect of recombinant DNA technology as applied to protein
purification is the ability to manipulate genes to generate variants of a
native protein sequence (Section 5.2). We learned in Section 2.4 that many 
proteins consist of compact domains connected by flexible linker regions. 
With the use of genetic-manipulation strategies, fragments of a protein that 
encompass single domains can be generated, an advantageous approach 
when expression of the entire protein is limited by its size or solubility. 
Additionally, as we will see in Section 9.1, amino acid substitutions can be 
introduced into the active site of an enzyme to precisely probe the roles of 
specific residues within its catalytic cycle. 


3.2 Amino Acid Sequences of Proteins Can Be 
Determined Experimentally 


The amino acid sequence of a protein can be a valuable source of insight 
into its function, structure, and history. 


1. The sequence of a protein of interest can be compared with all other 
known sequences to ascertain whether significant similarities exist. A search 
for kinship between a newly sequenced protein and the millions of previ- 
ously sequenced ones takes only a few seconds on a personal computer 
(Chapter 6). If the newly isolated protein is a member of an established class 
of protein, we can begin to infer information about the protein’s structure 
and function. For instance, chymotrypsin and trypsin are members of 
the serine protease family, a clan of proteolytic enzymes that have a com- 
mon catalytic mechanism based on a reactive serine residue (Chapter 9). If 
the sequence of the newly isolated protein shows sequence similarity 
with trypsin or chymotrypsin, the result suggests that it may be a serine 
protease. 


2. Comparison of sequences of the same protein in different species yields a 
wealth of information about evolutionary pathways. Genealogical relation- 
ships between species can be inferred from sequence differences between 
their proteins. If we assume that the random mutation rate of proteins over 
time is constant, then careful sequence comparison of related proteins 
between two organisms can provide an estimate for when these two evolu- 
tionary lines diverged. For example, a comparison of serum albumins found 
in primates indicates that human beings and African apes diverged 5 mil- 
lion years ago, not 30 million years ago as was once thought. Sequence 
analyses have opened a new perspective on the fossil record and the pathway 
of human evolution. 


3. Amino acid sequences can be searched for the presence of internal repeats. 
Such internal repeats can reveal the history of an individual protein itself. 
Many proteins apparently have arisen by duplication of primordial genes 
followed by their diversification. For example, calmodulin, a ubiquitous 
calcium sensor in eukaryotes, contains four similar calcium-binding
modules that arose by gene duplication (Figure 3.17).


4. Many proteins contain amino acid sequences that serve as signals designat- 
ing their destinations or controlling their processing. For example, a protein 
destined for export from a cell or for location in a membrane contains a sig- 
nal sequence, a stretch of about 20 hydrophobic residues near the amino 
terminus that directs the protein to the appropriate membrane. Another 
protein may contain a stretch of amino acids that functions as a nuclear 
localization signal, directing the protein to the nucleus.


Figure 3.17 Repeating motifs in a protein chain. Calmodulin, a calcium sensor,
contains four similar units (shown in red, yellow, blue, and orange) in a single
polypeptide chain. Notice that each unit binds a calcium ion (shown in green).
[Drawn from 1CLL.pdb.]


5. Sequence data provide a basis for preparing antibodies specific for a protein
of interest. One or more parts of the amino acid sequence of a protein will 
elicit an antibody when injected into a mouse or rabbit. These specific anti- 
bodies can be very useful in determining the amount of a protein present in 
solution or in the blood, ascertaining its distribution within a cell, or cloning 
its gene (Section 3.3). 


6. Amino acid sequences are valuable for making DNA probes that are specific 
for the genes encoding the corresponding proteins. Knowledge of a protein’s 
primary structure permits the use of reverse genetics. DNA sequences that 
correspond to a part of the amino acid sequence can be constructed on the 
basis of the genetic code. These DNA sequences can be used as probes to 
isolate the gene encoding the protein so that the entire sequence of the 
protein can be determined. The gene in turn can provide valuable informa- 
tion about the physiological regulation of the protein. Protein sequencing is 
an integral part of molecular genetics, just as DNA cloning is central to the 
analysis of protein structure and function. We will revisit some of these 
topics in more detail in Chapter 5. 
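
Point 6 rests on the genetic code being degenerate: a stretch of amino acids can be encoded by more than one DNA sequence, so a probe must allow for every possibility. The sketch below counts how many DNA sequences could encode a short peptide. It uses the standard genetic code; the function name and the choice of example peptide are illustrative assumptions.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Number of codons that specify each amino acid (one-letter code).
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def probe_degeneracy(peptide: str) -> int:
    """Count the distinct DNA sequences that could encode `peptide`."""
    total = 1
    for residue in peptide:
        if residue not in CODONS_PER_RESIDUE or residue == "*":
            raise ValueError(f"not an amino acid: {residue!r}")
        total *= CODONS_PER_RESIDUE[residue]
    return total


# Ala-Gly-Asp-Phe-Arg-Gly in one-letter code
print(probe_degeneracy("AGDFRG"))
# Met and Trp each have a single codon
print(probe_degeneracy("MW"))
```

Stretches rich in residues with few codons, such as methionine and tryptophan, give the smallest number of candidate sequences, which is one reason they are attractive starting points when a probe is designed.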


Peptide sequences can be determined by automated Edman 
degradation 


Given the importance of determining the amino acid sequence of a protein, 
let us consider one of the methods available to the biochemist for determin- 
ing this information. Consider a simple peptide, whose composition is 
unknown to the researcher: 


Ala-Gly-Asp-Phe-Arg-Gly 


The first step is to determine the amino acid composition of the peptide. The 
peptide is hydrolyzed into its constituent amino acids by heating it in 6 M 
HCl at 110°C for 24 hours. The amino acids in solution can then be sepa- 
rated by ion-exchange chromatography. The identity of each amino acid is 
revealed by its elution volume, which is the volume of buffer used to remove 
the amino acid from the column (Figure 3.18), and its quantity is revealed
by reaction with an indicator dye such as ninhydrin or fluorescamine. After
conjugation to the indicator, the amino acid exhibits a color with an
intensity that is proportional to its concentration. A comparison of the
chromatographic pattern of our sample hydrolysate with that of a standard
mixture of amino acids would show that the amino acid composition of the
peptide is


(Ala, Arg, Asp, Gly2, Phe)


The parentheses denote that this is the amino acid composition of the
peptide, not its sequence.

Figure 3.18 Determination of amino acid composition. Different amino acids in a peptide
hydrolysate can be separated by ion-exchange chromatography on a sulfonated polystyrene resin
(such as Dowex-50). Buffers (in this case, sodium citrate) of increasing pH are used to elute the
amino acids from the column. The amount of each amino acid present is determined from the
absorbance. Aspartate, which has an acidic side chain, is first to emerge, whereas arginine, which
has a basic side chain, is the last. The original peptide is revealed to be composed of one aspartate,
one alanine, one phenylalanine, one arginine, and two glycine residues.
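
The difference between composition and sequence can be made concrete with a short sketch. The function below (its name is an illustrative assumption) counts residues the way total hydrolysis does, discarding their order, so that any rearrangement of the same residues gives the same answer.

```python
from collections import Counter


def composition(residues):
    """Return the amino acid composition in the parenthesized notation used above."""
    counts = Counter(residues)
    parts = [name if n == 1 else f"{name}{n}" for name, n in sorted(counts.items())]
    return "(" + ", ".join(parts) + ")"


peptide = ["Ala", "Gly", "Asp", "Phe", "Arg", "Gly"]
scrambled = ["Gly", "Arg", "Gly", "Phe", "Ala", "Asp"]

print(composition(peptide))
print(composition(peptide) == composition(scrambled))
```

Because both orderings give the same composition, a further method is needed to establish the order of the residues.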

The next step is to identify the N-terminal amino acid. Pehr Edman 
devised a method for labeling the amino-terminal residue and cleaving it 
from the peptide without disrupting the peptide bonds between the other 
amino acid residues. The Edman degradation sequentially removes one resi- 
due at a time from the amino end of a peptide (Figure 3.19). Phenyl isothio- 
cyanate reacts with the uncharged terminal amino group of the peptide to 
form a phenylthiocarbamoyl derivative. Then, under mildly acidic
conditions, a cyclic derivative of the terminal amino acid is liberated, which leaves
an intact peptide shortened by one amino acid. The cyclic compound is a
phenylthiohydantoin (PTH)-amino acid, which can be identified by
chromatographic methods. The Edman procedure can then be repeated on the
shortened peptide, yielding another PTH-amino acid, which can again be
identified by chromatography. Three more rounds of the Edman degrada- 
tion will reveal the complete sequence of the original hexapeptide. 
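
The cycle just described can be mimicked with a short, idealized simulation. It assumes that every round goes to completion, which real chemistry does not, and the function name is an illustrative assumption.

```python
def edman_degradation(peptide):
    """Idealized Edman cycles.

    Each round releases the amino-terminal residue as a PTH derivative and
    leaves the rest of the peptide intact for the next round.
    Returns a list of (round number, released PTH-amino acid, shortened peptide).
    """
    remaining = list(peptide)
    rounds = []
    while remaining:
        n_terminal = remaining.pop(0)  # label and cleave the amino-terminal residue
        rounds.append((len(rounds) + 1, f"PTH-{n_terminal}", "-".join(remaining)))
    return rounds


hexapeptide = ["Ala", "Gly", "Asp", "Phe", "Arg", "Gly"]
for number, released, shortened in edman_degradation(hexapeptide):
    print(number, released, shortened or "(nothing left)")
```

Reading the released derivatives in order gives the sequence, which is the information that the composition alone cannot supply.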

The development of automated sequencers has markedly decreased the
time required to determine protein sequences. By repeated Edman
degradations, the amino acid sequence of some 50 residues in a protein can be
determined. Gas-phase sequenators can analyze picomole quantities of
peptides and proteins with the use of high-pressure liquid chromatography
to identify each amino acid as it is released (Figure 3.20). This high
sensitivity makes it feasible to analyze the sequence of a protein sample eluted from
a single band of an SDS-polyacrylamide gel.

Figure 3.19 The Edman degradation. The labeled amino-terminal residue (PTH-alanine in the
first round) can be released without hydrolyzing the rest of the peptide. Hence, the amino-terminal
residue of the shortened peptide (Gly-Asp-Phe-Arg-Gly) can be determined in the second round.
Three more rounds of the Edman degradation reveal the complete sequence of the original peptide.

Figure 3.20 Separation of PTH-amino acids. PTH-amino acids can be rapidly
separated by high-pressure liquid chromatography (HPLC). In this HPLC profile, a
mixture of PTH-amino acids is clearly resolved into its components. An unknown amino acid
can be identified by its elution position relative to the known ones.


Proteins can be specifically cleaved into small peptides to 
facilitate analysis 


In principle, it should be possible to sequence an entire protein by using the 
Edman method. In practice, the peptides cannot be much longer than about 
50 residues, because not all peptides in the reaction mixture release the 
amino acid derivative at each step. For instance, if the efficiency of release 
for each round were 98%, the proportion of “correct” amino acid released 
after 60 rounds would be (0.98⁶⁰), or about 0.3, a hopelessly impure mix. This
obstacle can be circumvented by cleaving a protein into smaller peptides 
that can be sequenced. Protein cleavage can be achieved by chemical 
reagents, such as cyanogen bromide, or proteolytic enzymes, such as trypsin. 
Table 3.3 gives several other ways of specifically cleaving polypeptide 
chains. Note that these methods are sequence specific: they disrupt the pro- 
tein backbone at particular amino acid residues in a predictable manner. 
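
The loss of signal with each round can be checked with a few lines of arithmetic. The sketch below uses the simple model stated above, in which the same fraction of peptides releases the correct residue in every round; the function name is an illustrative assumption.

```python
def correct_fraction(efficiency: float, rounds: int) -> float:
    """Fraction of peptides still releasing the correct residue after `rounds` cycles."""
    return efficiency ** rounds


for n in (10, 30, 60):
    print(n, round(correct_fraction(0.98, n), 2))
```

Even a release efficiency of 98% per round leaves less than a third of the molecules in register after 60 rounds, which is why long proteins are first cut into shorter peptides.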


Table 3.3 Specific cleavage of polypeptides


Reagent                        Cleavage site

Chemical cleavage
  Cyanogen bromide             Carboxyl side of methionine residues
  O-Iodosobenzoate             Carboxyl side of tryptophan residues
  Hydroxylamine                Asparagine-glycine bonds
  2-Nitro-5-thiocyanobenzoate  Amino side of cysteine residues

Enzymatic cleavage
  Trypsin                      Carboxyl side of lysine and arginine residues
  Clostripain                  Carboxyl side of arginine residues
  Staphylococcal protease      Carboxyl side of aspartate and glutamate residues
                               (glutamate only under certain conditions)
  Thrombin                     Carboxyl side of arginine
  Chymotrypsin                 Carboxyl side of tyrosine, tryptophan, phenylalanine,
                               leucine, and methionine
  Carboxypeptidase A           Amino side of C-terminal amino acid (not arginine,
                               lysine, or proline)


The peptides obtained by specific chemical or enzy- 
matic cleavage are separated by some type of chroma- 
tography. The sequence of each purified peptide is then 
determined by the Edman method. At this point, the 
amino acid sequences of segments of the protein are 
known, but the order of these segments is not yet 
defined. How can we order the peptides to obtain the 
primary structure of the original protein? The neces- 
sary additional information is obtained from overlap 
peptides (Figure 3.21). A second enzyme is used to split 
the polypeptide chain at different linkages. For example,
chymotrypsin cleaves preferentially on the carboxyl side of
aromatic and some other bulky nonpolar residues (Chapter 9).
Because these chymotryptic peptides overlap two or more tryptic
peptides, they can be used to establish the order of the peptides.
The entire amino acid sequence of the polypeptide chain is then
known.

Figure 3.21 Overlap peptides. The peptide obtained by
chymotryptic digestion overlaps two tryptic peptides, establishing
their order.
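
The overlap strategy can be sketched as a small in-silico digestion. The example below is a simplification under stated assumptions: the nine-residue sequence is hypothetical, cleavage is treated as complete and governed only by the residue on the amino side of the cut, chymotrypsin is limited to the aromatic residues, and the function names are invented for illustration.

```python
from itertools import permutations


def cleave(sequence, residues):
    """Cut on the carboxyl side of every residue listed in `residues`."""
    fragments, start = [], 0
    for i, aa in enumerate(sequence[:-1]):  # a cut after the last residue changes nothing
        if aa in residues:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


def order_by_overlap(peptides, overlap_peptides):
    """Return every ordering of `peptides` that contains all `overlap_peptides`."""
    candidates = []
    for perm in permutations(peptides):
        joined = "".join(perm)
        if all(p in joined for p in overlap_peptides):
            candidates.append(joined)
    return candidates


protein = "TFVKAAWGK"                   # hypothetical sequence, one-letter code
tryptic = cleave(protein, "KR")         # carboxyl side of Lys and Arg
chymotryptic = cleave(protein, "FWY")   # carboxyl side of aromatic residues

print(tryptic)
print(chymotryptic)
# Pretend the order of the tryptic peptides is unknown, then recover it.
print(order_by_overlap(sorted(tryptic), chymotryptic))
```

Only one arrangement of the tryptic peptides contains the chymotryptic peptide that spans their junction, so the order is fixed. Trying every permutation is practical only for a handful of peptides; it is used here for clarity, not efficiency.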


Additional steps are necessary if the initial protein sample actually
consists of several polypeptide chains. SDS-gel electrophoresis under reducing
conditions should display the number of chains. Alternatively, the number of
distinct N-terminal amino acids could be determined. After a protein has
been identified as being made up of two or more polypeptide chains,
denaturing agents, such as urea or guanidine hydrochloride, are used to
dissociate chains held together by noncovalent bonds. The dissociated chains must
be separated from one another before sequence determination can begin.
Polypeptide chains linked by disulfide bonds are separated by reduction
with thiols such as β-mercaptoethanol or dithiothreitol. To prevent the
cysteine residues from recombining, they are then alkylated with
iodoacetate to form stable S-carboxymethyl derivatives (Figure 3.22). Sequencing
can then be performed as already described.

Figure 3.22 Disulfide-bond reduction. Polypeptides linked by disulfide bonds can be separated
by reduction with dithiothreitol followed by alkylation to prevent them from re-forming.


Genomic and proteomic methods are complementary


Thousands of proteins have been sequenced by the Edman degradation of 
peptides derived from specific cleavages. Nevertheless, heroic effort is 
required to elucidate the sequence of large proteins, those with more than 
1000 residues. For sequencing such proteins, a complementary experimen- 
tal approach based on recombinant DNA technology is often more effi- 
cient. As will be discussed in Chapter 5, long stretches of DNA can be 
cloned and sequenced, and the nucleotide sequence can be translated to 
reveal the amino acid sequence of the protein encoded by the gene (Fig- 
ure 3.23). Recombinant DNA technology is producing a wealth of amino 
acid sequence information at a remarkable rate. 


GGG TTC TTG GGA GCA GCA GGA AGC ACT ATG GGC GCA


Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 


Figure 3.23 DNA sequence yields the amino acid sequence. The complete nucleotide 
sequence of HIV-1 (human immunodeficiency virus), the cause of AIDS (acquired immune 
deficiency syndrome), was determined within a year after the isolation of the virus. A part of the 
DNA sequence specified by the RNA genome of the virus is shown here with the corresponding 
amino acid sequence (deduced from a knowledge of the genetic code). 
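
The deduction of an amino acid sequence from a DNA sequence is a table lookup, codon by codon. The sketch below applies the standard genetic code to the twelve codons of Figure 3.23 (GGG TTC TTG GGA GCA GCA GGA AGC ACT ATG GGC GCA). The function and variable names are illustrative assumptions, and the reading frame is taken as given.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "Stop",
}


def translate(dna: str):
    """Translate a coding-strand DNA sequence, read from its first base."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]


print(" ".join(translate("GGG TTC TTG GGA GCA GCA GGA AGC ACT ATG GGC GCA")))
```

The output is the sequence of the nascent polypeptide only; it carries no information about modifications made after translation.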


Even with the use of the DNA base sequence to determine primary 
structure, there is still a need to work with isolated proteins. The amino acid 
sequence deduced by reading the DNA sequence is that of the nascent 
protein, the direct product of the translational machinery. However, many 
proteins undergo posttranslational modifications after their syntheses. Some 
have their ends trimmed, and others arise by cleavage of a larger initial 
polypeptide chain. Cysteine residues in some proteins are oxidized to form 
disulfide links, connecting either parts within a chain or separate polypep- 
tide chains. Specific side chains of some proteins are altered. Amino acid 
sequences derived from DNA sequences are rich in information, but they 
do not disclose these modifications. Chemical analyses of proteins in their 
mature form are needed to delineate the nature of these changes, which are 
critical for the biological activities of most proteins. Thus, genomic and 
proteomic analyses are complementary approaches to elucidating the structural 
basis of protein function. 


3.3 Immunology Provides Important Techniques with
Which to Investigate Proteins 


The purification of a protein enables the biochemist to explore its function 
and structure within a precisely controlled environment. However, the iso- 
lation of a protein removes it from its native context within the cell, where 
its activity is most physiologically relevant. Advances in the field of immu- 
nology (Chapter 34) have enabled the use of antibodies as critical reagents 
for exploring the functions of proteins within the cell. The exquisite speci- 
ficity of antibodies for their target proteins provides a means to tag a specific 
protein so that it can be isolated, quantified, or visualized. 


Antibodies to specific proteins can be generated 


Immunological techniques begin with the generation of antibodies to a
particular protein. An antibody (also called an immunoglobulin, Ig) is itself a
protein (Figure 3.24); it is synthesized by an animal in response to the presence
of a foreign substance, called an antigen. Antibodies have specific and high
affinity for the antigens that elicited their synthesis. The binding of antibody
and antigen is a step in the immune response that protects the animal from
infection (Chapter 34). Foreign proteins, polysaccharides, and nucleic acids
can be antigens. Small foreign molecules, such as synthetic peptides, also can
elicit antibodies, provided that the small molecule is attached to a
macromolecular carrier. An antibody recognizes a specific group or cluster of amino
acids on the target molecule called an antigenic determinant or epitope. The
specificity of the antibody-antigen interaction is a consequence of the
shape complementarity between the two surfaces (Figure 3.25). Animals
have a very large repertoire of antibody-producing cells, each producing an
antibody that contains a unique surface for antigen recognition. When an
antigen is introduced into an animal, it is recognized by a select few cells from
this population, stimulating the proliferation of these cells. This process
ensures that more antibodies of the appropriate specificity are produced.

Figure 3.24 Antibody structure. (A) Immunoglobulin G (IgG) consists of four
chains, two heavy chains (blue) and two light chains (red), linked by disulfide
bonds. The heavy and light chains come together to form Fab domains, which have
the antigen-binding sites at the ends. The two heavy chains form the Fc domain.
Notice that the Fab domains are linked to the Fc domain by flexible linkers.
(B) A more schematic representation of an IgG molecule. [Drawn from 1IGT.pdb.]

Figure 3.25 Antigen-antibody interactions. A protein antigen, in this
case lysozyme, binds to the end of an Fab domain of an antibody. Notice that the
end of the antibody and the antigen have complementary shapes, allowing a large
amount of surface to be buried on binding. [Drawn from 3HFL.pdb.]

Immunological techniques depend on the ability to generate antibodies 
to a specific antigen. To obtain antibodies that recognize a particular pro- 
tein, a biochemist injects the protein into a rabbit twice, 3 weeks apart. The 
injected protein acts as an antigen, stimulating the reproduction of cells 
producing antibodies that recognize it. Blood is drawn from the immunized 
rabbit several weeks later and centrifuged to separate blood cells from the 
supernatant, or serum. The serum, called an antiserum, contains antibodies 
to all antigens to which the rabbit has been exposed. Only some of them will 
be antibodies to the injected protein. Moreover, antibodies that recognize 
a particular antigen are not a single molecular species. For instance, 
2,4-dinitrophenol (DNP) was used as an antigen to generate antibodies. 
Analyses of anti-DNP antibodies revealed a wide range of binding
affinities; the dissociation constants ranged from about 0.1 nM to 1 μM.
Correspondingly, a large number of bands were evident when anti-DNP
antibody was subjected to isoelectric focusing. These results indicate that 
cells are producing many different antibodies, each recognizing a different 
surface feature of the same antigen. These antibodies are termed polyclonal, 
referring to the fact that they are derived from multiple antibody-producing 
cell populations (Figure 3.26). The heterogeneity of polyclonal antibodies 
can be advantageous for certain applications, such as the detection of a pro- 
tein of low abundance, because each protein molecule can be bound by more 
than one antibody at multiple distinct antigenic sites.

Figure 3.26 Polyclonal and monoclonal antibodies. Most antigens have several
epitopes. Polyclonal antibodies are heterogeneous mixtures of antibodies, each
specific for one of the various epitopes on an antigen. Monoclonal antibodies are all
identical, produced by clones of a single antibody-producing cell. They recognize one
specific epitope. [After R. A. Goldsby, T. J. Kindt, and B. A. Osborne, Kuby Immunology,
4th ed. (W. H. Freeman and Company, 2000), p. 154.]


Monoclonal antibodies with virtually any desired specificity can be 
readily prepared 


The discovery of a means of producing monoclonal antibodies of virtually 
any desired specificity was a major breakthrough that intensified the power 
of immunological approaches. As with impure proteins, working with an
impure mixture of antibodies makes it difficult to interpret data. The ideal
would be to isolate a clone of cells producing a single, identical antibody. 
The problem is that antibody-producing cells isolated from an organism 
have short life spans. 

Immortal cell lines that produce monoclonal antibodies do exist. These 
cell lines are derived from a type of cancer, multiple myeloma, which is 
a malignant disorder of antibody-producing cells. In this cancer, a single 
transformed plasma cell divides uncontrollably, generating a very large 
number of cells of a single kind. Such a group of cells is a clone because 
the cells are descended from the same cell and have identical properties. 
The identical cells of the myeloma secrete large amounts of immunoglobulin 
of a single kind generation after generation. These antibodies were useful for 
elucidating antibody structure, but nothing is known about their specificity 
and so they are useless for the immunological methods described in the 
next pages. 

César Milstein and Georges Köhler discovered that large amounts of
antibodies of nearly any desired specificity can be obtained by fusing a
short-lived antibody-producing cell with an immortal myeloma cell. An
antigen is injected into a mouse, and its spleen is removed several weeks
later (Figure 3.27). A mixture of plasma cells from this spleen is fused in
vitro with myeloma cells. Each of the resulting hybrid cells, called
hybridoma cells, indefinitely produces the identical antibody specified by the
parent cell from the spleen. Hybridoma cells can then be screened by a
specific assay for the antigen-antibody interaction to determine which ones
produce antibodies of the preferred specificity. Collections of cells shown to
produce the desired antibody are subdivided and reassayed. This process is
repeated until a pure cell line, a clone producing a single antibody, is
isolated. These positive cells can be grown in culture medium or injected into
mice to induce myelomas. Alternatively, the cells can be frozen and stored
for long periods.

Figure 3.27 Preparation of monoclonal antibodies. Hybridoma cells are formed by
the fusion of antibody-producing cells and myeloma cells. The hybrid cells are allowed to
proliferate by growing them in selective medium. They are then screened to
determine which ones produce antibody of the desired specificity. [After C. Milstein.
Monoclonal antibodies. Copyright © 1980 by Scientific American, Inc. All rights reserved.]

Figure 3.28 Fluorescence micrograph of a developing Drosophila embryo. The
embryo was stained with a fluorescence-labeled monoclonal antibody for the
DNA-binding protein encoded by engrailed, an essential gene in specifying the body plan.
[Courtesy of Dr. Nipam Patel and Dr. Corey Goodman.]

The hybridoma method of producing monoclonal antibodies has opened 
new vistas in biology and medicine. Large amounts of identical antibodies 
with tailor-made specificities can be readily prepared. They are sources of 
insight into relations between antibody structure and specificity. Moreover, 
monoclonal antibodies can serve as precise analytical and preparative 
reagents. Proteins that guide development have been identified with the use 
of monoclonal antibodies as tags (Figure 3.28). Monoclonal antibodies 
attached to solid supports can be used as affinity columns to purify scarce 
proteins. This method has been used to purify interferon (an antiviral pro- 
tein) 5000-fold from a crude mixture. Clinical laboratories are using mono- 
clonal antibodies in many assays. For example, the detection in blood of 
isozymes that are normally localized in the heart points to a myocardial 
infarction (heart attack). Blood transfusions have been made safer by anti- 
body screening of donor blood for viruses that cause AIDS (acquired 
immune deficiency syndrome), hepatitis, and other infectious diseases. 
Monoclonal antibodies can be used as therapeutic agents. For example, 
trastuzumab (Herceptin) is a monoclonal antibody useful for treating some 
forms of breast cancer. 


Proteins can be detected and quantified by using an enzyme-linked 
immunosorbent assay 


Antibodies can be used as exquisitely specific analytic reagents to quantify 
the amount of a protein or other antigen present in a biological sample. The 
enzyme-linked immunosorbent assay (ELISA) makes use of an enzyme that 
reacts with a colorless substrate to produce a colored product. The enzyme 
is covalently linked to a specific antibody that recognizes a target antigen. If 
the antigen is present, the antibody-enzyme complex will bind to it and, on
addition of the substrate, the enzyme will catalyze the reaction, generating
the colored product. Thus, the presence of the colored product indicates the
presence of the antigen. Rapid and convenient, ELISAs can detect less than
a nanogram (10⁻⁹ g) of a specific protein. ELISA can be performed with
either polyclonal or monoclonal antibodies, but the use of monoclonal anti- 
bodies yields more-reliable results. 


We will consider two among the several types of ELISA. The indirect
ELISA is used to detect the presence of antibody and is the basis of the
test for HIV infection. The HIV test detects the presence of antibodies that 
recognize viral core protein antigens. Viral core proteins are adsorbed to the 
bottom of a well. Antibodies from the person being tested are then added to 
the coated well. Only someone infected with HIV will have antibodies that 
bind to the antigen. Finally, enzyme-linked antibodies to human antibodies 
(e.g., enzyme-linked goat antibodies that recognize human antibodies) are 
allowed to react in the well, and unbound antibodies are removed by 
washing. Substrate is then applied. An enzyme reaction yielding a colored 
product suggests that the enzyme-linked antibodies were bound to human 
antibodies, which in turn implies that the patient has antibodies to the viral 
antigen (Figure 3.29A). Moreover, this assay is quantitative: the rate of the 
color-formation reaction is proportional to the amount of antibody
originally present.

Figure 3.29 Indirect ELISA and sandwich ELISA. (A) In indirect ELISA, the production of color
indicates the amount of an antibody to a specific antigen. (B) In sandwich ELISA, the production of 
color indicates the quantity of antigen. [After R. A. Goldsby, T. J. Kindt, and B. A. Osborne, Kuby 
Immunology, 4th ed. (W. H. Freeman and Company, 2000), p. 162.] 


The sandwich ELISA is used to detect antigen rather than antibody. 
Antibody to a particular antigen is first adsorbed to the bottom of a well. 
Next, solution containing the antigen (such as blood or urine, in medical 
diagnostic tests) is added to the well and binds to the antibody. Finally, a 
second, different antibody to the antigen is added. This antibody is enzyme 
linked and is processed as described for indirect ELISA. In this case, the 
rate of color formation is directly proportional to the amount of antigen 
present. Consequently, it permits the measurement of small quantities of 
antigen (Figure 3.29B). 
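
Because the rate of color formation is taken to be directly proportional to the amount of antigen, an unknown sample can be read against standards of known amount. The sketch below illustrates that calculation only. The standard amounts, the rates, and the function names are all hypothetical, and the straight line through the origin is the simplifying assumption stated in the text, which would hold only within the linear range of a real assay.

```python
def fit_proportional(amounts, rates):
    """Least-squares slope for the model rate = slope * amount (line through the origin)."""
    return sum(a * r for a, r in zip(amounts, rates)) / sum(a * a for a in amounts)


def estimate_amount(rate, slope):
    """Convert an observed rate of color formation into an amount of antigen."""
    return rate / slope


# Hypothetical standards: nanograms of antigen and the observed rate (absorbance per minute).
standard_ng = [0.5, 1.0, 2.0, 4.0]
standard_rate = [0.05, 0.10, 0.20, 0.40]

slope = fit_proportional(standard_ng, standard_rate)
print(round(estimate_amount(0.25, slope), 2), "ng in the unknown sample")
```

The same reasoning applies to the indirect ELISA, with the amount of antibody taking the place of the amount of antigen.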


Western blotting permits the detection of proteins separated by 
gel electrophoresis 


Very small quantities of a protein of interest in a cell or in body fluid can be 
detected by an immunoassay technique called western blotting (Figure 3.30). 
A sample is subjected to electrophoresis on an SDS-polyacrylamide gel. A
polymer sheet is pressed against the gel, transferring the resolved proteins on
the gel to the sheet, which makes the proteins more accessible for reaction.
An antibody that is specific for the protein of interest is added to the sheet
and reacts with the antigen. The antibody-antigen complex on the sheet can
then be detected by rinsing the sheet with a second antibody specific for the 
first (e.g., goat antibody that recognizes mouse antibody). A radioactive or 
fluorescent label on the second antibody enables the identification and quan- 
titation of the protein of interest. Alternatively, an enzyme on the second 
antibody generates a colored product, as in the ELISA method. Western 
blotting makes it possible to find a protein in a complex mixture, the prover- 
bial needle in a haystack. It is the basis for the test for infection by hepatitis 
C, where it is used to detect a core protein of the virus. This technique is also 
very useful in monitoring protein purification and in the cloning of genes.

Figure 3.30 Western blotting. Proteins on an SDS-polyacrylamide gel are transferred to
a polymer sheet and stained with radioactive antibody. A band corresponding to the protein
to which the antibody binds appears in the autoradiogram.

Figure 3.31 Actin filaments. Fluorescence micrograph of actin filaments in a cell stained
with an antibody specific to actin. [Courtesy of Dr. Elias Lazarides.]


Fluorescent markers make the visualization of proteins in the
cell possible


Biochemistry is often performed in test tubes or polyacrylamide gels. 
However, most proteins function in the context of a cell. Fluorescent 
markers provide a powerful means of examining proteins in their biological 
context. Cells can be stained with fluorescence-labeled antibodies and 
examined by fluorescence microscopy to reveal the location of a protein of 
interest. For example, arrays of parallel bundles are evident in cells stained 
with antibody specific for actin, a protein that polymerizes into filaments 
(Figure 3.31). Actin filaments are constituents of the cytoskeleton, the 
internal scaffolding of cells that controls their shape and movement. By 
tracking protein location, fluorescent markers also provide clues to protein 
function. For instance, the glucocorticoid receptor protein binds to the 
steroid hormone cortisone. The receptor was linked to green fluorescent 
protein (GFP), a naturally fluorescent protein isolated from the jellyfish 
Aequorea victoria (Chapter 2). Fluorescence microscopy revealed that, in 
the absence of the hormone, the receptor is located in the cytoplasm (Figure 
3.32A). On addition of the steroid, the receptor is translocated to the 
Figure 3.32 Nuclear localization of a steroid receptor. (A) The receptor, made visible by 
attachment of the green fluorescent protein, is located predominantly in the cytoplasm of the 
cultured cell. (B) Subsequent to the addition of corticosterone (a glucocorticoid steroid), the 
receptor moves into the nucleus. [Courtesy of Dr. William B. Pratt.] 


nucleus, where it binds to DNA (Figure 3.32B). These results suggested 
that glucocorticoid receptor protein is a transcription factor that controls 
gene expression. 

The highest resolution of fluorescence microscopy is about 0.2 μm 
(200 nm, or 2000 Å), a limit set by the wavelength of visible light. Finer spatial resolution 
can be achieved by electron microscopy if the antibodies are tagged with 
electron-dense markers. For example, antibodies conjugated to clusters 
of gold or to ferritin (which has an electron-dense core rich in iron) are 
highly visible under the electron microscope. Immunoelectron microscopy 
can define the position of antigens to a resolution of 10 nm (100 Å) or finer 
(Figure 3.33). 


3.4 Mass Spectrometry Is a Powerful Technique for the 
Identification of Peptides and Proteins 


In many instances, the study of a particular biological process in its native 
context is advantageous. For example, if we are interested in a pathway that 
is localized to the nucleus of a cell, we might conduct studies on an isolated 
nuclear extract. In these experiments, identification of the proteins present 
in the sample is often critical. Antibody-based techniques, such as the 
ELISA method described in Section 3.3, can be very helpful toward this 
goal. However, these techniques are limited to the detection of proteins for 
which an antibody is already available. Mass spectrometry enables the 
highly precise and sensitive measurement of the atomic composition of a 
particular molecule, or analyte, without prior knowledge of its identity. 
Originally, this method was relegated to the study of the chemical composi- 
tion and molecular mass of gases or volatile liquids. However, technological 
advances in the past two decades have dramatically expanded the utility of 
mass spectrometry to the study of proteins, even those found at very low 
concentrations within highly complex mixtures, such as the contents of a 
particular cell type. 


The mass of a protein can be precisely determined by mass spectrometry 


Mass spectrometry enables the highly accurate and sensitive detection of 
the mass of an analyte. This information can be used to determine the iden- 
tity and chemical state of the molecule of interest. Mass spectrometers oper- 
ate by converting analyte molecules into gaseous, charged forms (gas-phase 
ions). Through the application of electrostatic potentials, the ratio of the 
mass of each ion to its charge (the mass-to-charge ratio, or m/z) can be mea- 
sured. Although a wide variety of techniques employed by mass spectrom- 
eters are used in current practice, each of them comprises three essential 
components: the ion source, the mass analyzer, and the detector. Let us 
consider the first two in greater detail, because improvements in them 
have contributed most significantly to the analysis of biological samples. 
The ion source achieves the first critical step in mass spectrometric 
analysis: conversion of the analyte into gas-phase ions (ionization). Until 
recently, proteins could not be ionized efficiently because of their high 
molecular weights and low volatility. However, the development of tech- 
niques such as matrix-assisted laser desorption/ionization (MALDI) and 
electrospray ionization (ESI) has enabled the clearing of this significant 
hurdle. In MALDI, the analyte is evaporated to dryness in the presence of 
a volatile, aromatic compound (the matrix) that can absorb light at specific 
wavelengths. A laser pulse tuned to one of these wavelengths excites and 
Figure 3.33 Immunoelectron microscopy. 
The opaque particles (150-Å, or 15-nm, 
diameter) in this electron micrograph are 
clusters of gold atoms bound to antibody 
molecules. These membrane vesicles from 
the synapses of neurons contain a channel 
protein that is recognized by the specific 
antibody. [Courtesy of Dr. Peter Sargent] 


vaporizes the matrix, converting some of the analyte into the gas phase. 
Subsequent gaseous collisions enable the intermolecular transfer of charge, 
ionizing the analyte. In ESI, a solution of the analyte is passed through an 
electrically charged nozzle. Droplets of the analyte, now charged, emerge 
from the nozzle into a chamber of very low pressure, evaporating the solvent 
and ultimately yielding the ionized analyte. 

The newly formed analyte ions then enter the mass analyzer, where they 
are distinguished on the basis of their mass-to-charge ratios. There are a 
number of different types of mass analyzers. For this discussion, we will 
consider one of the simplest, the time-of-flight (TOF) mass analyzer, in 
which ions are accelerated through an elongated chamber under a fixed 
electrostatic potential. Given two ions of identical net charge, the lighter 
ion will require less time to traverse the chamber than will the heavier ion. 
The mass of each ion can be determined by measuring the time required for 
each ion to pass through the chamber. 
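
The dependence of flight time on mass can be made concrete with a short calculation. The sketch below assumes an idealized instrument, which simplifies the description above: each ion gains kinetic energy zeV from a single accelerating potential V and then coasts through a field-free tube of length L, so that t = L × sqrt(m / (2zeV)). The 20-kV potential, the 1-m tube, and the function names are illustrative assumptions, not the settings of any particular spectrometer.

```python
import math

DALTON_KG = 1.66053906660e-27           # kilograms per dalton
ELEMENTARY_CHARGE_C = 1.602176634e-19   # coulombs per unit charge


def flight_time_s(mass_da, charge, accel_volts=20_000.0, tube_m=1.0):
    """Time to drift tube_m meters after gaining charge * e * V of kinetic energy."""
    energy_j = charge * ELEMENTARY_CHARGE_C * accel_volts
    speed_m_s = math.sqrt(2.0 * energy_j / (mass_da * DALTON_KG))
    return tube_m / speed_m_s


def mass_from_time_da(time_s, charge, accel_volts=20_000.0, tube_m=1.0):
    """Invert flight_time_s: recover the ion mass from a measured flight time."""
    energy_j = charge * ELEMENTARY_CHARGE_C * accel_volts
    speed_m_s = tube_m / time_s
    return 2.0 * energy_j / speed_m_s ** 2 / DALTON_KG


if __name__ == "__main__":
    # Singly charged ions of the two proteins discussed in this section.
    for name, mass_da in (("insulin", 5733.5), ("beta-lactoglobulin", 18388.0)):
        t = flight_time_s(mass_da, charge=1)
        print(f"{name}: {t * 1e6:.1f} microseconds")
```

Under these assumptions, the flight time grows with the square root of the mass-to-charge ratio, which is why the lighter ion arrives first and why a doubly charged ion arrives earlier than a singly charged ion of the same mass.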

The sequential action of the ion source and the mass analyzer enables the 
highly sensitive measurement of the mass of potentially massive ions, such 
as those of proteins. Consider an example of a MALDI ion source coupled 
to a TOF mass analyzer: the MALDI-TOF mass spectrometer (Figure 3.34). 
Gas-phase ions generated by the MALDI ion source pass directly into the 
TOF analyzer, where the mass-to-charge ratios are recorded. In Figure 3.35, 
the MALDI-TOF mass spectrum of a mixture of 5 pmol each of insulin 
and lactoglobulin is shown. The masses determined by MALDI-TOF are 
5733.9 and 18,364, respectively. A comparison with the calculated values of 
5733.5 and 18,388 reveals that MALDI-TOF is clearly an accurate means 
of determining protein mass. 
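
The agreement between measured and calculated masses can be put on a common scale by expressing the deviation in parts per million (ppm). The following minimal sketch uses only the four mass values quoted above; the function name is an illustrative choice.

```python
def relative_error_ppm(measured, calculated):
    """Signed deviation of a measured mass from its calculated value, in ppm."""
    return (measured - calculated) / calculated * 1e6


# (measured, calculated) masses quoted in the text
MEASUREMENTS = {
    "insulin": (5733.9, 5733.5),
    "beta-lactoglobulin": (18364.0, 18388.0),
}

if __name__ == "__main__":
    for name, (measured, calculated) in MEASUREMENTS.items():
        print(f"{name}: {relative_error_ppm(measured, calculated):+.0f} ppm")
```

By this measure, the insulin mass is within about 70 ppm of the calculated value and the lactoglobulin mass is within about 0.13%.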
Figure 3.34 MALDI-TOF mass spectrometry. (1) The protein sample, embedded in an 
appropriate matrix, is ionized by the application of a laser beam. (2) An electric field accelerates the 
ions through the flight tube toward the detector. (3) The lightest ions arrive first. (4) The ionizing 
laser pulse also triggers a clock that measures the time of flight (TOF) for the ions. [After J. T. 
Watson, Introduction to Mass Spectrometry, 3d ed. (Lippincott-Raven, 1997), p. 279.] 
Figure 3.35 MALDI-TOF mass spectrum of insulin and β-lactoglobulin. A mixture of 5 pmol 
each of insulin (I) and β-lactoglobulin (L) was ionized by MALDI, which produces predominantly 
singly charged molecular ions from peptides and proteins: the insulin ion (I + H)⁺ and the 
lactoglobulin ion (L + H)⁺. Molecules with multiple charges, such as those for β-lactoglobulin 
indicated by the blue arrows, as well as small quantities of a singly charged dimer of insulin, 
(2I + H)⁺, also are produced. [After J. T. Watson, Introduction to Mass Spectrometry, 3d ed. 
(Lippincott-Raven, 1997), p. 282.] 


In the ionization process, a family of ions, each of the same mass but car- 
rying different total net charges, is formed from a single analyte. Because 
the mass spectrometer detects ions on the basis of their mass-to-charge 
ratio, these ions will appear as separate peaks in the mass spectrum. For 
example, in the mass spectrum of β-lactoglobulin shown in Figure 3.35, 
peaks near m/z = 18,388 (corresponding to the +1 charged ion) and m/z = 
9,194 (corresponding to the +2 charged ion) are visible (indicated by the 
blue arrows). Although multiple peaks for the same ion may appear to be a 
nuisance, they enable the spectrometrist to measure the mass of an analyte 
ion more than once in a single experiment, improving the overall precision 
of the calculated result. 
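
The relation between the peaks of one analyte can be sketched in a few lines. The example assumes that each charge is carried by an added proton, so that an ion of neutral mass M and charge n appears at m/z = (M + n × 1.007276)/n; the peak list passed to `averaged_mass` is hypothetical and is not read from Figure 3.35.

```python
PROTON_MASS_DA = 1.007276  # mass of one proton, in daltons


def mz_of(neutral_mass_da, charge):
    """m/z of an analyte that carries `charge` extra protons."""
    return (neutral_mass_da + charge * PROTON_MASS_DA) / charge


def neutral_mass_from(mz, charge):
    """Neutral mass implied by one peak of known charge."""
    return charge * (mz - PROTON_MASS_DA)


def averaged_mass(peaks):
    """Average the independent mass estimates from (m/z, charge) pairs."""
    estimates = [neutral_mass_from(mz, charge) for mz, charge in peaks]
    return sum(estimates) / len(estimates)


if __name__ == "__main__":
    for charge in (1, 2, 3):
        print(f"+{charge}: m/z = {mz_of(18388.0, charge):.1f}")
    # Hypothetical measured peaks for the +1 and +2 ions of one protein.
    print(f"averaged mass: {averaged_mass([(18389.0, 1), (9195.1, 2)]):.1f}")
```

Each peak yields its own estimate of the neutral mass, and averaging the estimates is one simple way to combine them.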


Peptides can be sequenced by mass spectrometry 


Earlier in this chapter, the Edman degradation was presented as a method 
for identifying the sequence of a peptide. Mass spectrometry of peptide 
fragments is an alternative to Edman degradation as a means of sequencing 
proteins. Ions of proteins that have been analyzed by a mass spectrometer, 
the precursor ions, can be broken into smaller peptide chains by bombard- 
ment with atoms of an inert gas such as helium or argon. These new frag- 
ments, or product ions, can be passed through a second mass analyzer for 
further mass characterization. The utilization of two mass analyzers 
arranged in this manner is referred to as tandem mass spectrometry. 
Importantly, the product-ion fragments are formed in chemically 
predictable ways that can provide clues to the amino acid sequence of the 
precursor ion. For peptide analytes, product ions can be formed such that indi- 
vidual amino acid residues are cleaved from the precursor ion (Figure 3.36A). 
Hence, a family of ions is detected; each ion represents a fragment of the 
original peptide with one or more amino acids removed from one end. 
Figure 3.36 Peptide sequencing by tandem mass spectrometry. (A) Within the mass 
spectrometer, peptides can be fragmented by bombardment with inert gaseous ions to generate a 
family of product ions in which individual amino acids have been removed from one end. As drawn 
here, the carboxyl fragment of the cleaved peptide bond is ionized. (B) The product ions are 
detected in the second mass analyzer. The mass differences between the peaks indicate the 
sequence of amino acids in the precursor ion. [After H. Steen and M. Mann. Nat. Rev. Mol. Cell Biol. 
5:699-711, 2004.] 


Figure 3.36B depicts a representative mass spectrum from a fragmented 
peptide. The mass differences between the product ions indicate the amino 
acid sequence of the precursor peptide ion. 
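
Reading a sequence from such a spectrum amounts to matching each gap between neighboring peaks to the mass of one amino acid residue. The sketch below is a simplified illustration: it assumes a clean ladder of singly charged product ions from a single ion series, uses standard monoisotopic residue masses, and builds its example ladder from a hypothetical peptide rather than from the data in Figure 3.36. Leucine and isoleucine have identical masses and are both reported as L.

```python
# Monoisotopic residue masses in daltons (I has the same mass as L).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def ladder_from(residues, start=19.01784):
    """Build an idealized product-ion ladder by adding one residue per peak."""
    peaks, total = [start], start
    for aa in residues:
        total += RESIDUE_MASS[aa]
        peaks.append(total)
    return peaks


def read_ladder(peaks, tol=0.01):
    """Assign a residue to each gap between adjacent peaks ('?' if none fits)."""
    ordered = sorted(peaks)
    residues = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        gap = heavier - lighter
        matches = [aa for aa, mass in RESIDUE_MASS.items() if abs(mass - gap) <= tol]
        residues.append(matches[0] if len(matches) == 1 else "?")
    return "".join(residues)


if __name__ == "__main__":
    peaks = ladder_from("GASPV")      # hypothetical peptide
    print(read_ladder(peaks))         # residues in the order they extend the ladder
```

The residues are reported in the order in which they lengthen the fragments; which end of the peptide that order starts from depends on which ion series is observed.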


Individual proteins can be identified by mass spectrometry 


The combination of mass spectrometry with the chromatographic and 
peptide-cleavage techniques described earlier in this chapter enables highly 
sensitive protein identification in complex biological mixtures. When a 
protein is cleaved by chemical or enzymatic methods (see Table 3.3), a specific 
and predictable family of peptide fragments is formed. We learned in 
Chapter 2 that each protein has a unique, precisely defined amino acid 
sequence. Hence, the identity of the individual peptides formed from this 
cleavage reaction—and, importantly, their corresponding masses—is a dis- 
tinctive signature for that particular protein. Protein cleavage, followed by 
chromatographic separation and mass spectrometry, enables rapid identifi- 
cation and quantitation of these signatures, even if they are present at very 
low concentrations. 
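
The idea of a peptide-mass signature can be illustrated with a small in silico digestion. The sketch below assumes the simplest rule for trypsin (cleavage on the carboxyl side of every lysine and arginine, with no exceptions and no missed cleavages) and compares a list of observed masses with the masses predicted for each candidate sequence. The sequences, the tolerance, and the function names are hypothetical and serve only to show the logic.

```python
# Monoisotopic residue masses in daltons.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER_DA = 18.010565  # added once per free peptide


def tryptic_peptides(sequence):
    """Split after every K or R (simplified trypsin rule)."""
    peptides, start = [], 0
    for i, aa in enumerate(sequence):
        if aa in "KR":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])
    return peptides


def peptide_mass(peptide):
    """Neutral monoisotopic mass of a peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER_DA


def match_fraction(observed, sequence, tol=0.02):
    """Fraction of observed masses explained by the predicted digest."""
    predicted = [peptide_mass(p) for p in tryptic_peptides(sequence)]
    hits = sum(any(abs(o - p) <= tol for p in predicted) for o in observed)
    return hits / len(observed)


if __name__ == "__main__":
    candidates = {"protein_1": "GASKVLRPE", "protein_2": "MTKWWRFY"}  # hypothetical
    observed = [peptide_mass(p) for p in tryptic_peptides("GASKVLRPE")]
    best = max(candidates, key=lambda name: match_fraction(observed, candidates[name]))
    print(best)
```

Real identification software scores matches statistically and allows for modifications and incomplete cleavage; the comparison of predicted with observed masses is the common core.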

As an example of the power of this proteomic approach, consider the 
analysis of the nuclear-pore complex from yeast, which facilitates the transport 
of large molecules into and out of the nucleus. This huge macromolecular 


complex was purified from yeast cells by careful procedures. The purified 
complex was fractionated by HPLC followed by gel electrophoresis. Individual 
bands from the gel were isolated, cleaved with trypsin, and analyzed by 
MALDI-TOF mass spectrometry. The fragments produced were compared 
with amino acid sequences deduced from the DNA sequence of the yeast 
genome as shown in Figure 3.37. A total of 174 nuclear-pore proteins were 
identified in this manner. Many of these proteins had not previously been 
identified as being associated with the nuclear pore despite years of study. 
Furthermore, mass spectrometric methods are sensitive enough to detect 
essentially all components of the pore if they are present in the samples used. 
Thus, a complete list of the components constituting this macromolecular 
complex could be obtained in a straightforward manner. Proteomic analysis 
of this type is growing in power as mass spectrometric and biochemical frac- 
tionation methods are refined. 
3.5 Peptides Can Be Synthesized by Automated 
Solid-Phase Methods 


Peptides of defined sequence can be synthesized to assist in biochemical 
analysis. These peptides are valuable tools for several purposes. 


1. Synthetic peptides can serve as antigens to stimulate the formation of spe- 
cific antibodies. Suppose we want to isolate the protein expressed by a spe- 
cific gene. Peptides can be synthesized that match the translation of part of 
the gene’s nucleic acid sequence, and antibodies can be generated that tar- 
get these peptides. These antibodies can then be used to isolate the intact 
protein or localize it within the cell. 


2. Synthetic peptides can be used to isolate receptors for many hormones and 
other signal molecules. For example, white blood cells are attracted to bacteria 
by formylmethionyl (fMet) peptides released in the breakdown of bacterial 
proteins. Synthetic formylmethionyl peptides have been useful in identifying 
the white blood cell’s cell-surface receptor for this class of peptide. 
Figure 3.37 Proteomic analysis by mass 
spectrometry. This mass spectrum was 
obtained by analyzing a trypsin-treated band in 
a gel derived from a yeast nuclear-pore sample. 
Many of the peaks were found to match the 
masses predicted for peptide fragments from 
three proteins (Nup120p, Kap122p, and 
Kap120p) within the yeast genome. The band 
corresponded to an apparent molecular mass of 
100 kd. [From M. P. Rout, J. D. Aitchison, 

A. Suprapto, K. Hjertaas, Y. Zhao, and B. T. Chait. 
J. Cell Biol. 148:635-651, 2000.] 
Figure 3.38 Vasopressin and a synthetic 
vasopressin analog. Structural formulas of 
(A) vasopressin, a peptide hormone that 
stimulates water resorption, and 
(B) 1-desamino-8-D-arginine vasopressin, 
a more stable synthetic analog of this 
antidiuretic hormone. 
Moreover, synthetic peptides can be attached to agarose beads to prepare 
affinity chromatography columns for the purification of receptor proteins 
that specifically recognize the peptides. 


3. Synthetic peptides can serve as drugs. Vasopressin is a peptide hormone 
that stimulates the reabsorption of water in the distal tubules of 
the kidney, leading to the formation of more-concentrated urine. Patients 
with diabetes insipidus are deficient in vasopressin (also called antidiuretic 
hormone), and so they excrete large volumes of dilute urine (more than 
5 liters per day) and are continually thirsty. This defect can be treated by 
administering 1-desamino-8-D-arginine vasopressin, a synthetic analog of 
the missing hormone (Figure 3.38). This synthetic peptide is degraded 
in vivo much more slowly than vasopressin and does not increase blood 
pressure. 
4. Finally, studying synthetic peptides can help define the rules governing the 
three-dimensional structure of proteins. We can ask whether a particular 
sequence by itself tends to fold into an α helix, a β strand, or a hairpin turn 
or behaves as a random coil. The peptides created for such studies can incor- 
porate amino acids not normally found in proteins, allowing more variation 
in chemical structure than is possible with the use of only 20 amino acids. 


How are these peptides constructed? The amino group of one amino acid 
is linked to the carboxyl group of another. However, a unique product is 
formed only if a single amino group and a single carboxyl group are avail- 
able for reaction. Therefore, it is necessary to block some groups and to 
activate others to prevent unwanted reactions. First, the carboxyl-terminal 
amino acid is attached to an insoluble resin by its carboxyl group, effectively 
protecting it from further peptide-bond-forming reactions (Figure 3.39). 
Figure 3.39 Solid-phase peptide synthesis. The steps are (1) anchoring of the 
carboxyl-terminal amino acid to the resin, (2) deprotection of the amino terminus, and 
(3) coupling of the next amino acid. Steps 2 and 3 are repeated for 
each added amino acid. Finally, in step 4, the completed peptide is released from the resin. 


The α-amino group of this amino acid is blocked with a protecting group 
such as a tert-butyloxycarbonyl (t-Boc) group. The t-Boc protecting group 
of this amino acid is then removed with trifluoroacetic acid. 

The next amino acid (in the protected t-Boc form) and dicyclohexylcar- 
bodiimide (DCC) are added together. At this stage, only the carboxyl group 
of the incoming amino acid and the amino group of the resin-bound amino 
acid are free to form a peptide bond. DCC reacts with the carboxyl group of 
the incoming amino acid, activating it for the peptide-bond-forming reac- 
tion. After the peptide bond has formed, excess reagents and dicyclohexyl- 
urea are washed away, leaving the desired dipeptide product attached to 
the beads. Additional amino acids are linked by the same sequence of reac- 
tions. At the end of the synthesis, the peptide is released from the beads by 
Figure 3.40 An x-ray crystallographic 
experiment. An x-ray source generates a 
beam, which is diffracted by a crystal. The 
resulting diffraction pattern is collected on a 
detector. 


the addition of hydrofluoric acid (HF), which cleaves the carboxyl ester 
anchor without disrupting peptide bonds. Protecting groups on potentially 
reactive side chains, such as that of lysine, also are removed at this time. 

A major advantage of this solid-phase method, first developed by 
R. Bruce Merrifield, is that the desired product at each stage is bound to 
beads that can be rapidly filtered and washed, and so there is no need to 
purify intermediates. All reactions are carried out in a single vessel, elimi- 
nating losses caused by repeated transfers of products. This cycle of 
reactions can be readily automated, which makes it feasible to routinely 
synthesize peptides containing about 50 residues in good yield and purity. 
In fact, the solid-phase method has been used to synthesize interferons 
(155 residues) that have antiviral activity and ribonuclease (124 residues) 
that is catalytically active. The protecting groups and cleavage agents may 
be varied for increased flexibility or convenience. 
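
Why chain length limits stepwise synthesis can be seen from a simple estimate. If every coupling cycle succeeds with the same fractional yield, the fraction of chains that carry the complete sequence falls geometrically with the number of cycles. The sketch below assumes a uniform per-cycle yield and ignores losses in the deprotection and cleavage steps; the yields of 95% and 99% are illustrative, not measured values.

```python
def overall_yield(n_residues, step_yield):
    """Fraction of full-length chains after n_residues - 1 coupling cycles."""
    return step_yield ** (n_residues - 1)


if __name__ == "__main__":
    # 50 residues (routine), 124 (ribonuclease), 155 (interferon)
    for n_residues in (50, 124, 155):
        for step_yield in (0.95, 0.99):
            fraction = overall_yield(n_residues, step_yield)
            print(f"{n_residues} residues at {step_yield:.0%} per cycle: {fraction:.1%}")
```

With these assumptions, a 50-residue peptide is obtained in roughly 61% yield at 99% per cycle but in less than 10% yield at 95% per cycle, which shows why nearly quantitative coupling at every step is essential.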

Synthetic peptides can be linked to create even longer molecules. With 
the use of specially developed peptide-ligation methods, proteins of 100 
amino acids or more can be synthesized in very pure form. These methods 
enable the construction of even sharper tools for examining protein struc- 
ture and function. 


3.6 Three-Dimensional Protein Structure Can Be Determined 
by X-ray Crystallography and NMR Spectroscopy 


Elucidation of the three-dimensional structure of a protein is often the 
source of a tremendous amount of insight into its corresponding function, 
inasmuch as the specificity of active sites and binding sites is defined by the 
precise atomic arrangement within these regions. For example, knowledge 
of the structure of a protein enables the biochemist to predict its mechanism 
of action, the effects of mutations on its function, and the desired features of 
drugs that may inhibit or augment its activity. X-ray crystallography and 
nuclear magnetic resonance spectroscopy are the two most important tech- 
niques for elucidating the conformation of proteins. 


X-ray crystallography reveals three-dimensional structure in atomic detail 


X-ray crystallography was the first method developed to determine protein 
structure in atomic detail. This technique provides the clearest visualization 
of the precise three-dimensional positions of most atoms within a protein. 
Of all forms of radiation, x-rays provide the best resolution for the determination 
of molecular structures because their wavelength approximately 
corresponds to the length of a covalent bond. The three components in an x-ray 
crystallographic analysis are a protein crystal, a source of x-rays, and a detec- 
tor (Figure 3.40). 

X-ray crystallography first requires the preparation of a protein or pro- 
tein complex in crystal form, in which all protein molecules are oriented in 
a fixed, repeated arrangement with respect to one another. Slowly adding 
ammonium sulfate or another salt to a concentrated solution of protein to 
reduce its solubility favors the formation of highly ordered crystals—the 
process of salting out discussed on page 68. For example, myoglobin crys- 
tallizes in 3 M ammonium sulfate. Protein crystallization can be quite chal- 
lenging: a concentrated solution of highly pure material is required and it 
is often difficult to predict which experimental conditions will yield the 
most-effective crystals. Methods for screening many different crystalliza- 
tion conditions using a small amount of protein sample have been devel- 
oped. Typically, hundreds of conditions must be tested to obtain crystals 
fully suitable for crystallographic studies. Nevertheless, increasingly large 


and complex proteins have been crystallized. For example, poliovirus, an 
8500-kd assembly of 240 protein subunits surrounding an RNA core, has 
been crystallized and its structure solved by x-ray methods. Crucially, 
proteins frequently crystallize in their biologically active configuration. 
Enzyme crystals may display catalytic activity if the crystals are suffused 
with substrate. 

After a suitably pure crystal of protein has been obtained, a source of 
x-rays is required. A beam of x-rays of wavelength 1.54 Å is produced by 
accelerating electrons against a copper target. Equipment suitable for generat- 
ing x-rays in this manner is available in many laboratories. Alternatively, 
x-rays can be produced by synchrotron radiation, the acceleration of electrons 
in circular orbits at speeds close to the speed of light. Synchrotron-generated 
x-ray beams are much more intense than those generated by electrons hitting 
copper. Several facilities throughout the world generate synchrotron radiation, 
such as the Advanced Photon Source at Argonne National Laboratory 
outside Chicago and the Photon Factory in Tsukuba City, Japan. 

When a narrow beam of x-rays is directed at the protein crystal, most of 
the beam passes directly through the crystal while a small part is scattered 
in various directions. These scattered, or diffracted, x-rays can be detected 
by x-ray film or by a solid-state electronic detector. The scattering pattern 
provides abundant information about protein structure. The basic physical 
principles underlying the technique are: 


1. Electrons scatter x-rays. The amplitude of the wave scattered by an atom 
is proportional to its number of electrons. Thus, a carbon atom scatters six 
times as strongly as a hydrogen atom does. 


2. The scattered waves recombine. Each diffracted beam comprises waves 
scattered by each atom in the crystal. The scattered waves reinforce one 
another at the film or detector if they are in phase (in step) there, and they 
cancel one another if they are out of phase. 


3. The way in which the scattered waves recombine depends only on the atomic 
arrangement. 


The protein crystal is mounted and positioned in a precise orientation 
with respect to the x-ray beam and the film. The crystal is rotated so that the 
beam can strike the crystal from many directions. This rotational motion 
results in an x-ray photograph consisting of a regular array of spots called 
reflections. The x-ray photograph shown in Figure 3.41 is a two-dimensional 
section through a three-dimensional array of 25,000 reflections. The inten- 
sities and positions of these reflections are the basic experimental data of 
an x-ray crystallographic analysis. Each reflection is formed from a wave 
with an amplitude proportional to the square root of the observed intensity 
of the spot. Each wave also has a phase—that is, the timing of its crests and 
troughs relative to those of other waves. Additional experiments or calcula- 
tions must be performed to determine the phases corresponding to each 
reflection. 

The next step is to reconstruct an image of the protein from the observed 
reflections. In light microscopy or electron microscopy, the diffracted 
beams are focused by lenses to directly form an image. However, appropri- 
ate lenses for focusing x-rays do not exist. Instead, the image is formed by 
applying a mathematical relation called a Fourier transform to the measured 
amplitudes and calculated phases of every observed reflection. The image 
obtained is referred to as the electron-density map. It is a three-dimensional 
graphic representation of where the electrons are most densely localized and 
is used to determine the positions of the atoms in the crystallized molecule 
Figure 3.41 An x-ray diffraction pattern. 
X-ray precession photograph from a crystal of 
myoglobin. [Mel Pollinger/Fran Heyl Associates.] 
Figure 3.42 Interpretation of an 
electron-density map. (A) A segment of 
an electron-density map is drawn as a three-dimensional 
contour plot, in which the 
regions inside the “cage” represent the 
regions of highest electron density. 
(B) A model of the protein is built into this 
map so as to maximize the placement of 
atoms within this density. [Drawn from 
1FCH.pdb.] 


Figure 3.43 Resolution affects the 
quality of an image. The effect of resolution 
on the quality of a reconstructed image is 
shown by an optical analog of x-ray diffraction: 
(A) a photograph of the Parthenon; (B) an 
optical diffraction pattern of the Parthenon; (C 
and D) images reconstructed from the pattern 
in part B. More data were used to obtain 
image D than image C, which accounts for 
the higher quality of image D. [Courtesy of 
Dr. Thomas Steitz (part A) and Dr. David 
DeRosier (part B).] 


(Figure 3.42). Critical to the interpretation of the map is its resolution, 
which is determined by the number of scattered intensities used in the 
Fourier transform. The fidelity of the image depends on this resolution, as 
shown by the optical analogy in Figure 3.43. A resolution of 6 Å reveals 
the course of the polypeptide chain but few other structural details. The 
reason is that polypeptide chains pack together so that their centers are 
between 5 Å and 10 Å apart. Maps at higher resolution are needed to delineate 
groups of atoms, which lie between 2.8 Å and 4.0 Å apart, and individual 
atoms, which are between 1.0 Å and 1.5 Å apart. The ultimate resolution 
of an x-ray analysis is determined by the degree of perfection of the 
crystal. For proteins, this limiting resolution is often about 2 Å. 
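
The link between the number of reflections and the resolution of the map can be explored with a one-dimensional toy model. The sketch below is not a crystallographic program: it assumes a hypothetical "crystal" with three point atoms in a repeating unit, calculates the wave for each reflection directly from the known positions (so the phases are known rather than determined experimentally), and then sums the waves, each with amplitude equal to the square root of its intensity, to rebuild the density. The atom positions and electron counts are invented for illustration.

```python
import cmath
import math

# Hypothetical atoms: (fractional position in the repeat, number of electrons)
ATOMS = [(0.20, 6), (0.26, 8), (0.70, 7)]


def structure_factor(h):
    """Wave for reflection h: sum of the waves scattered by every atom."""
    return sum(z * cmath.exp(2j * math.pi * h * x) for x, z in ATOMS)


def density(x, h_max):
    """Fourier synthesis of the density at x from reflections -h_max..h_max."""
    total = 0.0
    for h in range(-h_max, h_max + 1):
        f = structure_factor(h)
        amplitude = abs(f)          # square root of the reflection's intensity
        phase = cmath.phase(f)      # known here; must be determined in practice
        total += amplitude * math.cos(phase - 2 * math.pi * h * x)
    return total


if __name__ == "__main__":
    for h_max in (2, 30):
        values = [density(x, h_max) for x in (0.20, 0.23, 0.26)]
        print(h_max, [round(v, 1) for v in values])
```

With only a few reflections, the density midway between the two neighboring atoms is as high as at the atoms themselves, so they appear as a single blob; with many reflections, the density falls sharply between them and the atoms are resolved.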




Nuclear magnetic resonance spectroscopy can reveal 
the structures of proteins in solution 


X-ray crystallography is the most powerful method for determining protein 
structures. However, some proteins do not readily crystallize. Furthermore, 
although structures present in crystallized proteins very closely represent 
those of proteins free of the constraints imposed by the crystalline environ- 
ment, structures in solution can be sources of additional insights. Nuclear 
magnetic resonance (NMR) spectroscopy is unique in being able to reveal the 
atomic structure of macromolecules in solution, provided that highly concentrated 
solutions (~1 mM, or 15 mg ml⁻¹ for a 15-kd protein) can be 
obtained. This technique depends on the fact that certain atomic nuclei are 
intrinsically magnetic. Only a limited number of isotopes display this prop- 
erty, called spin, and those most important to biochemistry are listed in 
Table 3.4. The simplest example is the hydrogen nucleus (¹H), which is a 
proton. The spinning of a proton generates a magnetic moment. This 
moment can take either of two orientations, or spin states (called α and β), 
when an external magnetic field is applied (Figure 3.44). The energy difference 
between these states is proportional to the strength of the imposed 
magnetic field. The α state has a slightly lower energy because it is aligned 
with this applied field. Hence, in a given population of nuclei, slightly more 
will occupy the α state (by a factor of the order of 1.00001 in a typical 
experiment). A spinning proton in an α state can be raised to an excited state 
(β state) by applying a pulse of electromagnetic radiation (a radio-frequency, 
or RF, pulse), provided that the frequency corresponds to the energy difference 
between the α and the β states. In these circumstances, the spin will 
change from α to β; in other words, resonance will be obtained. 
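
The size of the population difference follows from the Boltzmann distribution. The sketch below assumes that the energy gap equals Planck's constant times the proton resonance frequency, that this frequency is proportional to the applied field (about 42.58 MHz per tesla for ¹H), and that the sample is at 298 K. The field strengths chosen are illustrative assumptions.

```python
import math

PLANCK_J_S = 6.62607015e-34
BOLTZMANN_J_K = 1.380649e-23
PROTON_MHZ_PER_TESLA = 42.577478  # 1H resonance frequency per unit field


def resonance_frequency_hz(field_tesla):
    """Radio frequency that matches the alpha-beta energy gap of a proton."""
    return PROTON_MHZ_PER_TESLA * 1e6 * field_tesla


def population_ratio(field_tesla, temp_k=298.0):
    """Boltzmann ratio of alpha-state to beta-state nuclei at equilibrium."""
    gap_j = PLANCK_J_S * resonance_frequency_hz(field_tesla)
    return math.exp(gap_j / (BOLTZMANN_J_K * temp_k))


if __name__ == "__main__":
    for field_tesla in (1.41, 11.74):   # roughly 60-MHz and 500-MHz instruments
        print(f"{field_tesla} T: {resonance_frequency_hz(field_tesla) / 1e6:.0f} MHz, "
              f"ratio {population_ratio(field_tesla):.6f}")
```

Because the excess in the α state is so small, NMR is an insensitive technique, which is one reason why concentrated samples and strong magnetic fields are needed.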

These properties can be used to examine the chemical surroundings of 
the hydrogen nucleus. The flow of electrons around a magnetic nucleus 
generates a small local magnetic field that opposes the applied field. The 
degree of such shielding depends on the surrounding electron density. 
Consequently, nuclei in different environments will change states, or reso- 
nate, at slightly different field strengths or radiation frequencies. A reso- 
nance spectrum for a molecule is obtained by keeping the magnetic field 
constant and varying the frequency of the electromagnetic radiation. The 
nuclei of the perturbed sample absorb electromagnetic radiation at a fre- 
quency that can be measured. The different frequencies, termed chemical 
shifts, are expressed in fractional units δ (parts per million, or ppm) relative 
to the shifts of a standard compound, such as a water-soluble derivative of 
tetramethylsilane, that is added with the sample. For example, a -CH₃ 
proton typically exhibits a chemical shift (δ) of 1 ppm, compared with a 
chemical shift of 7 ppm for an aromatic proton. The chemical shifts of most 
protons in protein molecules fall between 0 and 9 ppm (Figure 3.45). Most 
protons in many proteins can be resolved by using this technique of one- 
dimensional NMR. With this information, we can then deduce changes to a 
particular chemical group under different conditions, such as the conformational 
change of a protein from a disordered structure to an α helix in 
response to a change in pH. 
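
The conversion between a measured frequency and a chemical shift is a simple ratio. The sketch below assumes the usual definition, in which δ is the difference between the sample and reference frequencies divided by the reference frequency and multiplied by one million; the 500-MHz reference frequency is an illustrative value.

```python
def chemical_shift_ppm(sample_hz, reference_hz):
    """Chemical shift (delta) of a resonance relative to the reference, in ppm."""
    return (sample_hz - reference_hz) / reference_hz * 1e6


def offset_hz(shift_ppm, reference_hz):
    """Frequency offset from the reference for a given chemical shift."""
    return shift_ppm * reference_hz / 1e6


if __name__ == "__main__":
    reference_hz = 500e6  # assumed resonance frequency of the standard
    for label, shift_ppm in (("methyl proton", 1.0), ("aromatic proton", 7.0)):
        print(f"{label}: {offset_hz(shift_ppm, reference_hz):.0f} Hz from the reference")
```

Expressing shifts in ppm rather than in hertz makes them independent of the field strength of the instrument, and so spectra recorded on different spectrometers can be compared directly.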

We can garner even more information by examining how the spins on 
different protons affect their neighbors. By inducing a transient magnetiza- 
tion in a sample through the application of a radio-frequency pulse, we can 
alter the spin on one nucleus and examine the effect on the spin of a neigh- 
boring nucleus. Especially revealing is a two-dimensional spectrum obtained 
by nuclear Overhauser enhancement spectroscopy (NOESY), which graphi- 
cally displays pairs of protons that are in close proximity, even if they are not 
close together in the primary structure. The basis for this technique is the 


Table 3.4 Biologically important nuclei giving NMR signals

Nucleus    Natural abundance (% by weight of the element)
¹H         99.984
²H         0.016
¹³C        1.108
¹⁵N        0.365

Figure 3.45 One-dimensional NMR spectra. (A) ¹H-NMR spectrum of ethanol (CH3CH2OH)
shows that the chemical shifts for the hydrogen are clearly resolved. (B) ¹H-NMR spectrum of a
55 amino acid fragment of a protein having a role in RNA splicing shows a greater degree of
complexity. A large number of peaks are present and many overlap. [(A) After C. Branden and
J. Tooze, Introduction to Protein Structure (Garland, 1991), p. 280; (B) courtesy of Dr. Barbara
Amann and Dr. Wesley McDermott.]


Figure 3.46 The nuclear Overhauser effect. The nuclear Overhauser effect (NOE) identifies
pairs of protons that are in close proximity. (A) Schematic representation of a polypeptide chain
highlighting five particular protons. Protons 2 and 5 are in close proximity (~4 Å apart), whereas
other pairs are farther apart. (B) A highly simplified NOESY spectrum. The diagonal shows five
peaks corresponding to the five protons in part A. The peak above the diagonal and the
symmetrically related one below reveal that proton 2 is close to proton 5.

nuclear Overhauser effect (NOE), an interaction between nuclei that is
proportional to the inverse sixth power of the distance between them.
Magnetization is transferred from an excited nucleus to an unexcited one if
the two nuclei are less than about 5 Å apart (Figure 3.46A). In other words,
the effect provides a means of detecting the location of atoms relative to one
another in the three-dimensional structure of the protein. The peaks that lie
along the diagonal of a NOESY spectrum (shown in white in Figure 3.46B)
correspond to those present in a one-dimensional NMR experiment. The
peaks apart from the diagonal (shown in red in Figure 3.46B), referred to as
off-diagonal peaks or cross-peaks, provide crucial new information: they
identify pairs of protons that are less than 5 Å apart. A two-dimensional
NOESY spectrum for a protein comprising 55 amino acids is shown in
Figure 3.47. The large number of off-diagonal peaks reveals short
proton-proton distances. The three-dimensional structure of a protein can be
reconstructed with the use of such proximity relations. Structures are
calculated such that protons that must be separated by less than 5 Å on the
basis of NOESY spectra are close to one another in the three-dimensional
structure (Figure 3.48). If a sufficient number of distance constraints are
applied, the three-dimensional structure can nearly be determined uniquely.
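
The steep distance dependence is what makes the NOE useful as a proximity ruler. The sketch below simply evaluates the inverse-sixth-power relation stated above. The function name and the 2-Å reference distance are assumptions made for illustration, and measured NOE intensities also depend on factors such as molecular motion that are ignored here.

```python
def relative_noe(distance_angstrom: float, reference_angstrom: float = 2.0) -> float:
    """NOE strength relative to a proton pair at the reference distance (r**-6 scaling)."""
    if distance_angstrom <= 0 or reference_angstrom <= 0:
        raise ValueError("distances must be positive")
    return (reference_angstrom / distance_angstrom) ** 6


# Doubling the distance from 2 to 4 angstroms weakens the effect 64-fold.
for r in (2.0, 3.0, 4.0, 5.0, 6.0):
    print(f"{r:.1f} angstroms: relative NOE = {relative_noe(r):.4f}")
```

On this scale a pair of protons 5 Å apart gives less than half a percent of the signal of a pair 2 Å apart, which is consistent with cross-peaks being observed only for protons closer than about 5 Å.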
Figure 3.47 Detecting short proton-proton distances. A NOESY spectrum for a
55 amino acid domain from a protein having a role in RNA splicing. Each
off-diagonal peak corresponds to a short proton-proton separation. This
spectrum reveals hundreds of such short proton-proton distances, which can be
used to determine the three-dimensional structure of this domain.
[Courtesy of Dr. Barbara Amann and Dr. Wesley McDermott.]



Figure 3.48 Structures calculated on the 
basis of NMR constraints. (A) NOESY 
observations show that protons (connected 
by dotted red lines) are close to one another 
in space. (B) A three-dimensional structure 
calculated with these proton pairs constrained 
to be close together. 


In practice, a family of related structures is generated by NMR
spectroscopy for three reasons (Figure 3.49). First, not enough constraints may
be experimentally accessible to fully specify the structure. Second, the
distances obtained from analysis of the NOESY spectrum are only approximate.
Finally, the experimental observations are made not on single molecules but on
a large number of molecules in solution that may have slightly different
structures at any given moment. Thus, the family of structures generated from
NMR structure analysis indicates the range of conformations for the protein in
solution. At present, NMR spectroscopy can determine the structures of only
relatively small proteins (<40 kd), but its resolving power is certain to
increase. The power of NMR has been greatly enhanced by the ability of
recombinant DNA technology to produce proteins labeled uniformly or at
specific sites with ¹³C, ¹⁵N, and ²H (Chapter 5).

Figure 3.49 A family of structures. A set of 25 structures for a 28 amino acid
domain from a zinc-finger DNA-binding protein. The red line traces the average
course of the protein backbone. Each of these structures is consistent with
hundreds of constraints derived from NMR experiments. The differences between
the individual structures are due to a combination of imperfections in the
experimental data and the dynamic nature of proteins in solution.
[Courtesy of Dr. Barbara Amann.]

The structures of nearly 60,000 proteins had been elucidated by x-ray 
crystallography and NMR spectroscopy by the end of 2009, and several 
new structures are now determined each day. The coordinates are collected 
at the Protein Data Bank (www.pdb.org), and the structures can be accessed 
for visualization and analysis. Knowledge of the detailed molecular archi- 
tecture of proteins has been a source of insight into how proteins recognize 
and bind other molecules, how they function as enzymes, how they fold, 
and how they evolved. This extraordinarily rich harvest is continuing at a 
rapid pace and is greatly influencing the entire field of biochemistry as well 
as other biological and physical sciences. 


Summary 


The rapid progress in gene sequencing has advanced another goal of 
biochemistry—elucidation of the proteome. The proteome is the com- 
plete set of proteins expressed and includes information about how they 
are modified, how they function, and how they interact with other 
molecules. 


3.1 The Purification of Proteins Is an Essential First Step in Understanding Their Function

Proteins can be separated from one another and from other molecules 
on the basis of such characteristics as solubility, size, charge, and
binding affinity. SDS-polyacrylamide gel electrophoresis separates the
polypeptide chains of proteins under denaturing conditions largely
according to mass. Proteins can also be separated electrophoretically on the
basis of net charge by isoelectric focusing in a pH gradient. 
Ultracentrifugation and gel-filtration chromatography resolve proteins 
according to size, whereas ion-exchange chromatography separates 
them mainly on the basis of net charge. The high affinity of many pro- 
teins for specific chemical groups is exploited in affinity chromatogra- 
phy, in which proteins bind to columns containing beads bearing cova- 
lently linked substrates, inhibitors, or other specifically recognized 
groups. The mass of a protein can be determined by sedimentation- 
equilibrium measurements. 


3.2 Amino Acid Sequences of Proteins Can Be Determined Experimentally 
Amino acid sequences are rich in information concerning the kinship of 
proteins, their evolutionary relationships, and diseases produced by 
mutations. Knowledge of a sequence provides valuable clues to
conformation and function. The amino acid composition of a protein can be
ascertained by hydrolyzing the protein into its constituent amino acids
in 6 M HCl at 110°C. The amino acids can be separated by ion-exchange
chromatography and quantitated by their reaction with ninhydrin or
fluorescamine. Amino acid sequences can be determined by Edman
degradation, which removes one amino acid at a time from the amino
end of a peptide. Longer polypeptide chains are broken into shorter
ones for analysis by specifically cleaving them with reagents such as
cyanogen bromide, which splits peptide bonds on the carboxyl side of
methionine residues, or the enzyme trypsin, which cleaves on the
carboxyl side of lysine and arginine residues.


3.3 Immunology Provides Important Techniques with Which to Investigate Proteins
Proteins can be detected and quantitated by highly specific antibodies;
monoclonal antibodies are especially useful because they are
homogeneous. Enzyme-linked immunosorbent assays and western blots of
SDS-polyacrylamide gels are used extensively. Proteins can also be
localized within cells by immunofluorescence microscopy and
immunoelectron microscopy.


3.4 Mass Spectrometry Is a Powerful Technique for the Identification of Peptides and Proteins

Techniques such as matrix-assisted laser desorption/ionization
(MALDI) and electrospray ionization (ESI) allow the generation of
ions of proteins and peptides in the gas phase. The mass of such protein
ions can be determined with great accuracy and precision. Masses 
determined by these techniques act as protein name tags because the 
mass of a protein or peptide is precisely determined by its amino acid 
composition and, hence, by its sequence. Tandem mass spectrometry 
is an alternative to Edman degradation that enables the rapid and 
highly accurate sequencing of peptides. Mass spectrometric techniques 
are central to proteomics because they make it possible to analyze the 
constituents of large macromolecular assemblies or other collections of 
proteins. 


3.5 Peptides Can Be Synthesized by Automated Solid-Phase Methods 
Polypeptide chains can be synthesized by automated solid-phase meth- 
ods in which the carboxyl end of the growing chain is linked to an 
insoluble support. The carboxyl group of the incoming amino acid is 
activated by dicyclohexylcarbodiimide and joined to the amino group of 
the growing chain. Synthetic peptides can serve as drugs and as antigens 
to stimulate the formation of specific antibodies. They can also be 
sources of insight into the relation between amino acid sequence and 
conformation. 


3.6 Three-Dimensional Protein Structure Can Be Determined by X-ray Crystallography and NMR Spectroscopy

X-ray crystallography and nuclear magnetic resonance spectroscopy 
have greatly enriched our understanding of how proteins fold, recog- 
nize other molecules, and catalyze chemical reactions. X-ray crystal- 
lography is possible because electrons scatter x-rays. The diffraction 
pattern produced can be analyzed to reveal the arrangement of atoms 
in a protein. The three-dimensional structures of tens of thousands of 
proteins are now known in atomic detail. Nuclear magnetic resonance 
spectroscopy reveals the structure and dynamics of proteins in solu- 
tion. The chemical shift of nuclei depends on their local environment. 
Furthermore, the spins of neighboring nuclei interact with each other 
in ways that provide definitive structural information. This
information can be used to determine complete three-dimensional structures
of proteins.

Key Terms 
proteome (p. 66)
assay (p. 67)
specific activity (p. 67)
homogenate (p. 67)
salting out (p. 68)
dialysis (p. 69)
gel-filtration chromatography (p. 69)
ion-exchange chromatography (p. 70)
cation exchange (p. 70)
anion exchange (p. 70)
affinity chromatography (p. 70)
high-pressure liquid chromatography (HPLC) (p. 71)
gel electrophoresis (p. 71)
isoelectric point (p. 73)
isoelectric focusing (p. 73)
two-dimensional electrophoresis (p. 74)
sedimentation coefficient (Svedberg unit, S) (p. 76)
Edman degradation (p. 81)
phenyl isothiocyanate (p. 81)
overlap peptide (p. 82)
antibody (p. 84)
antigen (p. 85)
antigenic determinant (epitope) (p. 85)
polyclonal antibody (p. 86)
monoclonal antibody (p. 86)
enzyme-linked immunosorbent assay (ELISA) (p. 88)
western blotting (p. 89)
fluorescence microscopy (p. 90)
green fluorescent protein (GFP) (p. 90)
matrix-assisted laser desorption/ionization (MALDI) (p. 91)
electrospray ionization (ESI) (p. 91)
time-of-flight (TOF) mass analyzer (p. 92)
tandem mass spectrometry (p. 93)
solid-phase method (p. 98)
x-ray crystallography (p. 98)
Fourier transform (p. 99)
electron-density map (p. 99)
nuclear magnetic resonance (NMR) spectroscopy (p. 101)
chemical shift (p. 101)


Problems 


1. Valuable reagents. The following reagents are often used 
in protein chemistry: 


CNBr              Trypsin           Ninhydrin
Urea              Performic acid    Phenyl isothiocyanate
Mercaptoethanol   6 N HCl           Chymotrypsin


Which one is the best suited for accomplishing each of the 
following tasks? 


(a) Determination of the amino acid sequence of a small 
peptide. 

(b) Reversible denaturation of a protein devoid of disulfide 
bonds. Which additional reagent would you need if disul- 
fide bonds were present? 


(c) Hydrolysis of peptide bonds on the carboxyl side of 
aromatic residues. 


(d) Cleavage of peptide bonds on the carboxyl side of 
methionines. 


(e) Hydrolysis of peptide bonds on the carboxyl side of 
lysine and arginine residues. 


2. Finding an end. Anhydrous hydrazine (H2N-NH2) has
been used to cleave peptide bonds in proteins. What are the 
reaction products? How might this technique be used to 
identify the carboxyl-terminal amino acid? 


3. Crafting a new breakpoint. Ethyleneimine reacts with 
cysteine side chains in proteins to form S-aminoethyl 
derivatives. The peptide bonds on the carboxyl side of 
these modified cysteine residues are susceptible to hydroly- 
sis by trypsin. Why? 


4. Spectrometry. The absorbance A of a solution is defined as

A = log₁₀(I₀/I)

in which I₀ is the incident-light intensity and I is the
transmitted-light intensity. The absorbance is related to
the molar absorption coefficient (extinction coefficient) ε
(in M⁻¹ cm⁻¹), concentration c (in M), and path length l
(in cm) by

A = εlc

The absorption coefficient of myoglobin at 580 nm is
15,000 M⁻¹ cm⁻¹. What is the absorbance of a 1 mg ml⁻¹
solution across a 1-cm path? What percentage of the incident
light is transmitted by this solution?
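
For readers who want to check their arithmetic, the two relations in this problem can be written as a pair of small Python functions. The sketch below uses made-up input values (an absorption coefficient of 10,000 M⁻¹ cm⁻¹ and a 50 μM solution) rather than the myoglobin data, so the problem itself is left to the reader; the function names are likewise illustrative.

```python
def absorbance(epsilon_per_molar_cm: float, conc_molar: float, path_cm: float = 1.0) -> float:
    """Absorbance from the relation A = epsilon * l * c."""
    return epsilon_per_molar_cm * path_cm * conc_molar


def percent_transmitted(a: float) -> float:
    """Percentage of incident light transmitted, from A = log10(I0 / I)."""
    return 100.0 * 10.0 ** (-a)


# Hypothetical example values, not the data of the problem.
a = absorbance(epsilon_per_molar_cm=10_000.0, conc_molar=5e-5, path_cm=1.0)
print(f"A = {a:.2f}, transmitted = {percent_transmitted(a):.1f}%")
```

Note that the concentration must be in molar units; a concentration given in mg ml⁻¹ has to be divided by the molecular mass of the protein before it is used.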


5. It's in the bag. Suppose that you precipitate a protein
with 1 M (NH4)2SO4 and that you wish to reduce the
concentration of the (NH4)2SO4. You take 1 ml of your
sample and dialyze it in 1000 ml of buffer. At the end of
dialysis, what is the concentration of (NH4)2SO4 in your
sample? How could you further lower the (NH4)2SO4
concentration?


6. Too much or not enough. Why do proteins precipitate at 
high salt concentrations? Although many proteins precipi- 
tate at high salt concentrations, some proteins require salt to 
dissolve in water. Explain why some proteins require salt to 
dissolve. 


7. A slow mover. Tropomyosin, a 70-kd muscle protein,
sediments more slowly than does hemoglobin (65 kd).
Their sedimentation coefficients are 2.6S and 4.31S,
respectively. Which structural feature of tropomyosin accounts
for its slow sedimentation?


8. Sedimenting spheres. What is the dependence of the sedi- 
mentation coefficient s of a spherical protein on its mass? 
How much more rapidly does an 80-kd protein sediment 
than does a 40-kd protein? 


9. Frequently used in shampoos. The detergent sodium 
dodecyl sulfate (SDS) denatures proteins. Suggest how SDS 
destroys protein structure. 


10. Size estimate. The relative electrophoretic mobilities of 
a 30-kd protein and a 92-kd protein used as standards on an 
SDS-polyacrylamide gel are 0.80 and 0.41, respectively. 
What is the apparent mass of a protein having a mobility of 
0.62 on this gel? 


11. Unexpected migration. Some proteins migrate anoma- 
lously in SDS-PAGE gels. For instance, the molecular weight 
determined from an SDS-PAGE gel is sometimes very dif- 
ferent from the molecular weight determined from the amino 
acid sequence. Suggest an explanation for this discrepancy. 


12. Sorting cells. Fluorescence-activated cell sorting 
(FACS) is a powerful technique for separating cells accord- 
ing to their content of particular molecules. For example, a 
fluorescence-labeled antibody specific for a cell-surface 
protein can be used to detect cells containing such a mole- 
cule. Suppose that you want to isolate cells that possess a 
receptor enabling them to detect bacterial degradation pro- 
ducts. However, you do not yet have an antibody directed 
against this receptor. Which fluorescence-labeled mole- 
cule would you prepare to identify such cells? 


13. Column choice. (a) The octapeptide AVGWRVKS was 
digested with the enzyme trypsin. Which method would be 
most appropriate for separating the products: ion-exchange 
or gel-filtration chromatography? Explain. (b) Suppose that 
the peptide was digested with chymotrypsin. What would 
be the optimal separation technique? Explain. 


14. Power(ful) tools. Monoclonal antibodies can be conju- 
gated to an insoluble support by chemical methods. Explain 
how these antibody-bound beads can be exploited for pro- 
tein purification. 


15. Assay development. You wish to isolate an enzyme from 
its native source and need a method for measuring its activ- 
ity throughout the purification. However, neither the sub- 
strate nor the product of the enzyme-catalyzed reaction can 
be detected by spectroscopy. You discover that the product 
of the reaction is highly antigenic when injected into mice. 
Propose a strategy to develop a suitable assay for this 
enzyme. 


16. Making more enzyme? In the course of purifying an 
enzyme, a researcher performs a purification step that 
results in an increase in the total activity to a value greater 
than that present in the original crude extract. Explain how 
the amount of total activity might increase. 


17. Divide and conquer. The determination of the mass of a 
protein by mass spectrometry often does not allow its 
unique identification among possible proteins within a 
complete proteome, but determination of the masses of all 
fragments produced by digestion with trypsin almost always 
allows unique identification. Explain. 


18. Know your limits. Which two amino acids are indistin- 
guishable in peptide sequencing by the tandem mass spec- 
trometry method described in this chapter and why? 


19. Protein purification problem. Complete the following 
table. 


Purification                    Total protein  Total activity  Specific activity  Purification  Yield
procedure                       (mg)           (units)         (units mg⁻¹)       level         (%)
Crude extract                   20,000         4,000,000                          1             100
(NH4)2SO4 precipitation          5,000         3,000,000
DEAE-cellulose chromatography    1,500         1,000,000
Gel-filtration chromatography      500           750,000
Affinity chromatography             45           675,000
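
The bookkeeping behind a purification table can be sketched in a few lines of Python. The example below uses two hypothetical steps with invented numbers, so that the table above remains an exercise; the class and function names are assumptions. Specific activity is total activity divided by total protein, purification level is the specific activity relative to that of the crude extract, and yield is the activity retained relative to the crude extract.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Step:
    name: str
    total_protein_mg: float
    total_activity_units: float


def purification_table(steps: List[Step]) -> List[Dict[str, object]]:
    """Compute specific activity, purification level, and yield for each step.

    The first step is treated as the crude extract (level 1, yield 100%).
    """
    crude = steps[0]
    crude_specific = crude.total_activity_units / crude.total_protein_mg
    rows = []
    for step in steps:
        specific = step.total_activity_units / step.total_protein_mg
        rows.append({
            "step": step.name,
            "specific_activity": specific,
            "purification_level": specific / crude_specific,
            "yield_percent": 100.0 * step.total_activity_units / crude.total_activity_units,
        })
    return rows


# Hypothetical numbers for illustration only.
example = [
    Step("Crude extract", 1000.0, 50_000.0),
    Step("After one column", 100.0, 40_000.0),
]
for row in purification_table(example):
    print(row)
```

In this invented example the column step removes 90% of the protein while retaining 80% of the activity, so the enzyme is purified 8-fold.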


20. The challenge of flexibility. Structures of proteins com- 
prising domains separated by flexible linker regions can be 
quite difficult to solve by x-ray crystallographic methods. 
Why might this be the case? What are possible experimen- 
tal approaches to circumvent this barrier? 


Chapter Integration Problems 


21. Quaternary structure. A protein was purified to
homogeneity. Determination of the mass by gel-filtration
chromatography yields 60 kd. Chromatography in the presence of
6 M urea yields a 30-kd species. When the chromatography
is repeated in the presence of 6 M urea and 10 mM
β-mercaptoethanol, a single molecular species of 15 kd
results. Describe the structure of the molecule.


22. Helix-coil transitions. (a) NMR measurements have
shown that poly-L-lysine is a random coil at pH 7 but
becomes α helical as the pH is raised above 10. Account for
this pH-dependent conformational transition. (b) Predict
the pH dependence of the helix-coil transition of
poly-L-glutamate.


23. Peptide mass determination. You have isolated a protein 
from the bacterium E. coli and seek to confirm its identity 
by trypsin digestion and mass spectrometry. Determination 
of the masses of several peptide fragments has enabled you 
to deduce the identity of the protein. However, there is a 
discrepancy with one of the peptide fragments, which 
you believe should have the sequence MLNSFK and an 
(M + H)⁺ value of 739.38. In your experiments, you
repeatedly obtain an (M + H)⁺ value of 767.38. What is
the cause of this discrepancy and what does it tell you 
about the region of the protein from which this peptide is 
derived? 
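
The expected (M + H)⁺ value quoted in this problem can be reproduced by summing residue masses. The sketch below assumes standard monoisotopic residue masses, which are not given in the chapter, and lists only the six residues needed for this peptide. It adds one water for the free amino and carboxyl termini and one proton for the charge. Interpreting the discrepancy is left to the reader.

```python
# Assumed monoisotopic residue masses (Da); only the residues of MLNSFK are listed.
RESIDUE_MASS = {
    "M": 131.04049,
    "L": 113.08406,
    "N": 114.04293,
    "S": 87.03203,
    "F": 147.06841,
    "K": 128.09496,
}
WATER = 18.01056   # H2O added for the free N- and C-termini
PROTON = 1.00728   # mass of the proton that gives the +1 charge


def singly_protonated_mass(sequence: str) -> float:
    """Monoisotopic (M + H)+ value of an unmodified linear peptide."""
    residues = sum(RESIDUE_MASS[aa] for aa in sequence)  # KeyError for unlisted residues
    return residues + WATER + PROTON


print(f"{singly_protonated_mass('MLNSFK'):.2f}")
```

The result agrees with the value of 739.38 expected for the unmodified peptide, so the observed value must reflect something other than an arithmetic slip.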
24. Peptides on a chip. Large numbers of different peptides
can be synthesized in a small area on a solid support. This
high-density array can then be probed with a
fluorescence-labeled protein to find out which peptides are recognized.
The binding of an antibody to an array of 1024 different
peptides occupying a total area the size of a thumbnail is
shown in the adjoining illustration. How would you
synthesize such a peptide array? (Hint: Use light instead of acid
to deprotect the terminal amino group in each round of
synthesis.)


Fluorescence scan of an array of 1024 peptides in a 1.6-cm²
area. Each synthesis site is a 400-μm square. A fluorescently labeled
monoclonal antibody was added to the array to identify peptides that are
recognized. The height and color of each square denote the fluorescence
intensity. [After S. P. A. Fodor et al., Science 251(1991):767.]


25. Exchange rate. The amide hydrogen atoms of peptide 
bonds within proteins can exchange with protons in the 
solvent. In general, amide hydrogen atoms in buried regions 
of proteins and protein complexes exchange more slowly 
than those on the solvent-accessible surface do. Deter- 
mination of these rates can be used to explore the protein- 
folding reaction, probe the tertiary structure of proteins, 
and identify the regions of protein-protein interfaces. These 
exchange reactions can be followed by studying the behav- 
ior of the protein in solvent that has been labeled with 
deuterium (²H), a stable isotope of hydrogen. What two
methods described in this chapter could be readily 
applied to the study of hydrogen—deuterium exchange rates 
in proteins? 


Data Interpretation Problems 


26. Protein sequencing 1. Determine the sequence of hexa- 
peptide on the basis of the following data. Note: When the 
sequence is not known, a comma separates the amino acids 
(see Table 3.3). 


Amino acid composition: (2R,A,S,V,Y)
N-terminal analysis of the hexapeptide: A
Trypsin digestion: (R,A,V) and (R,S,Y)
Carboxypeptidase digestion: No digestion.
Chymotrypsin digestion: (A,R,V,Y) and (R,S)


27. Protein sequencing 2. Determine the sequence of a pep- 
tide consisting of 14 amino acids on the basis of the follow- 
ing data. 


Amino acid composition: (4S,2L,F,G,I,K,M,T,W,Y)
N-terminal analysis: S
Carboxypeptidase digestion: L
Trypsin digestion: (3S,2L,F,I,M,T,W) (G,K,S,Y)
Chymotrypsin digestion: (F,I,S) (G,K,L) (L,S) (M,T) (S,W) (S,Y)
N-terminal analysis of (F,I,S) peptide: S
Cyanogen bromide treatment: (2S,F,G,I,K,L,M*,T,Y) (2S,L,W)

M*, methionine detected as homoserine


28. Applications of two-dimensional electrophoresis. Performic 
acid cleaves the disulfide linkage of cystine and converts the 
sulfhydryl groups into cysteic acid residues, which are then 
no longer capable of disulfide-bond formation. 


Consider the following experiment: You suspect that a pro- 
tein containing three cysteine residues has a single disul- 
fide bond. You digest the protein with trypsin and subject 
the mixture to electrophoresis along one end of a sheet of 
paper. After treating the paper with performic acid, you 
subject the sheet to electrophoresis in the perpendicular 
direction and stain it with ninhydrin. How would the paper 
appear if the protein did not contain any disulfide bonds? 
If the protein contained a single disulfide bond? Propose 
an experiment to identify which cysteine residues form the 


disulfide bond. 
DNA, RNA, and the Flow of Genetic Information


DNA and RNA are long linear polymers, called nucleic acids, that carry
information in a form that can be passed from one generation to the
next. These macromolecules consist of a large number of linked nucleotides, 
each composed of a sugar, a phosphate, and a base. Sugars linked by phos- 
phates form a common backbone that plays a structural role, whereas the 
sequence of bases along a nucleic acid chain carries genetic information. The 
DNA molecule has the form of a double helix, a helical structure consisting 
of two complementary nucleic acid strands. Each strand serves as the tem- 
plate for the other in DNA replication. The genes of all cells and many viruses 
are made of DNA. 

Genes specify the kinds of proteins that are made by cells, but DNA 
is not the direct template for protein synthesis. Rather, a DNA strand is 
copied into a class of RNA molecules called messenger RNA (mRNA), the 
information-carrying intermediates in protein synthesis. This process of 
transcription is followed by translation, the synthesis of proteins according to 
instructions given by mRNA templates. Thus, the flow of genetic informa- 
tion, or gene expression, in normal cells is 


DNA --Transcription--> RNA --Translation--> Protein


Having genes in common accounts for the resemblance of a mother to her daughters. Genes
must be expressed to exert an effect, and proteins regulate such expression. One such regulatory
protein, a zinc-finger protein (zinc ion is blue, protein is red), is shown bound to a control region
of DNA (black). [(Left) Barnaby Hall/Photonica. (Right) Drawn from 1AAY.pdb.]

A Nucleic Acid Consists of Four Kinds of Bases Linked to a Sugar-Phosphate Backbone
A Pair of Nucleic Acid Chains with
Most Eukaryotic Genes Are Mosaics of Introns and Exons

This flow of information depends on the genetic code, which defines the
relation between the sequence of bases in DNA (or its mRNA transcript)
and the sequence of amino acids in a protein. The code is nearly the same in
all organisms: a sequence of three bases, called a codon, specifies an amino
acid. There is another step in the expression of most eukaryotic genes,
which are mosaics of nucleic acid sequences called introns and exons. Both
are transcribed, but before translation takes place, introns are cut out of
newly synthesized RNA molecules, leaving mature RNA molecules with
continuous exons. The existence of introns and exons has crucial
implications for the evolution of proteins.
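
The idea that an mRNA is read three bases at a time can be made concrete with a short Python sketch. It assumes that the input is the coding (nontemplate) DNA strand written 5' to 3', so the transcript has the same sequence with uracil in place of thymine. The function names and the nine-base example are hypothetical.

```python
from typing import List


def transcribe(coding_strand: str) -> str:
    """mRNA sequence corresponding to a coding-strand DNA sequence (T replaced by U)."""
    return coding_strand.upper().replace("T", "U")


def codons(mrna: str) -> List[str]:
    """Split an mRNA into consecutive three-base codons, ignoring any incomplete tail."""
    usable = len(mrna) - len(mrna) % 3
    return [mrna[i:i + 3] for i in range(0, usable, 3)]


mrna = transcribe("ATGGCCTAA")
print(mrna, codons(mrna))
```

Each codon would then be looked up in the genetic code to find the amino acid that it specifies; that lookup table is omitted here.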


4.1 A Nucleic Acid Consists of Four Kinds of Bases Linked 
to a Sugar—Phosphate Backbone 


The nucleic acids DNA and RNA are well suited to function as the carriers
of genetic information by virtue of their covalent structures. These
macromolecules are linear polymers built up from similar units connected end
to end (Figure 4.1). Each monomer unit within the polymer is a nucleotide. A
single nucleotide unit consists of three components: a sugar, a phosphate,
and one of four bases. The sequence of bases in the polymer uniquely
characterizes a nucleic acid and constitutes a form of linear information:
information analogous to the letters that spell a person's name.

Figure 4.1 Polymeric structure of nucleic acids.

Figure 4.2 Ribose and deoxyribose. Atoms in sugar units are numbered with
primes to distinguish them from atoms in bases (see Figure 4.4).


RNA and DNA differ in the sugar component and one of the bases 


The sugar in deoxyribonucleic acid (DNA) is deoxyribose. The prefix deoxy 
indicates that the 2'-carbon atom of the sugar lacks the oxygen atom that is 
linked to the 2’-carbon atom of ribose, as shown in Figure 4.2. Note that 
sugar carbons are numbered with primes to differentiate them from atoms 
in the bases. The sugars in both nucleic acids are linked to one another by 
phosphodiester bridges. Specifically, the 3’-hydroxyl (3’-OH) group of the 
sugar moiety of one nucleotide is esterified to a phosphate group, which is, 
in turn, joined to the 5'-hydroxyl group of the adjacent sugar. The chain of 
sugars linked by phosphodiester bridges is referred to as the backbone of the 
nucleic acid (Figure 4.3). Whereas the backbone is constant in a nucleic 
acid, the bases vary from one monomer to the next. Two of the bases of
DNA are derivatives of purine, namely adenine (A) and guanine (G), and two
are derivatives of pyrimidine, namely cytosine (C) and thymine (T), as shown
in Figure 4.4.

Figure 4.3 Backbones of DNA and RNA. The backbones of these nucleic acids are formed by
3'-to-5' phosphodiester linkages. A sugar unit is highlighted in red and a phosphate group in blue.

Figure 4.4 Purines and pyrimidines. [Structures shown: purine, adenine, and guanine (purines);
pyrimidine, cytosine, uracil, and thymine (pyrimidines).]

Ribonucleic acid (RNA), like DNA, is a long unbranched polymer 
consisting of nucleotides joined by 3’-to-5’ phosphodiester linkages (see 
Figure 4.3). The covalent structure of RNA differs from that of DNA in two 
respects. First, the sugar units in RNA are riboses rather than deoxyriboses. 
Ribose contains a 2'-hydroxyl group not present in deoxyribose. Second, 
one of the four major bases in RNA is uracil (U) instead of thymine (T). 

Note that each phosphodiester bridge has a negative charge. This nega- 
tive charge repels nucleophilic species such as hydroxide ions; consequently, 
phosphodiester linkages are much less susceptible to hydrolytic attack 
than are other esters such as carboxylic acid esters. This resistance is crucial 
for maintaining the integrity of information stored in nucleic acids. The 
absence of the 2'-hydroxyl group in DNA further increases its resistance to 
hydrolysis. The greater stability of DNA probably accounts for its use 
rather than RNA as the hereditary material in all modern cells and in many 
viruses. 


Nucleotides are the monomeric units of nucleic acids 


The building blocks of nucleic acids and the precursors of these building 
blocks play many other roles throughout the cell—for instance, as energy 
currency and as molecular signals. Consequently, it is important to be 
familiar with the nomenclature of nucleotides and their precursors. A unit 
consisting of a base bonded to a sugar is referred to as a nucleoside. The four 
nucleoside units in RNA are called adenosine, guanosine, cytidine, and uri- 
dine, whereas those in DNA are called deoxyadenosine, deoxyguanosine, 
deoxycytidine, and thymidine. In each case, N-9 of a purine or N-1 of a 
pyrimidine is attached to C-1' of the sugar by an N-glycosidic linkage 
(Figure 4.5). The base lies above the plane of sugar when the structure is 
written in the standard orientation; that is, the configuration of the 
N-glycosidic linkage is β (Section 11.1).
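
The nucleoside names listed above amount to a small lookup table, which can be sketched in Python as follows. The dictionary and function names are illustrative assumptions; the names themselves are those given in the text.

```python
# Nucleoside names keyed by the one-letter symbol of the base.
RNA_NUCLEOSIDES = {"A": "adenosine", "G": "guanosine", "C": "cytidine", "U": "uridine"}
DNA_NUCLEOSIDES = {
    "A": "deoxyadenosine",
    "G": "deoxyguanosine",
    "C": "deoxycytidine",
    "T": "thymidine",  # no "deoxy" prefix by convention
}


def nucleoside_name(base: str, nucleic_acid: str) -> str:
    """Name of the nucleoside formed by a base in RNA or DNA."""
    table = {"RNA": RNA_NUCLEOSIDES, "DNA": DNA_NUCLEOSIDES}[nucleic_acid.upper()]
    return table[base.upper()]


print(nucleoside_name("A", "DNA"), nucleoside_name("U", "RNA"))
```

Asking for uracil in DNA or thymine in RNA raises a KeyError in this sketch, which mirrors the rule that these bases are not among the four major bases of the respective nucleic acid.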

A nucleotide is a nucleoside joined to one or more phosphoryl groups by
an ester linkage. Nucleotide triphosphates, nucleosides joined to three
phosphoryl groups, are the monomers (the building blocks) that are linked to
form RNA and DNA. The four nucleotide units that link to form DNA are
nucleotide monophosphates called deoxyadenylate, deoxyguanylate,
deoxycytidylate, and thymidylate. Note that thymidylate contains deoxyribose;
by convention, the prefix deoxy is not added because thymine-containing
nucleotides are only rarely found in RNA. Similarly, the most common
nucleotides that link to form RNA are the nucleotide monophosphates
adenylate, guanylate, cytidylate, and uridylate.

Figure 4.4 caption, continued: Atoms within bases are numbered without primes.
Uracil is present in RNA instead of thymine.

Figure 4.5 β-Glycosidic linkage in a nucleoside.

Figure 4.7 Structure of a DNA chain. The chain has a 5' end, which is usually
attached to a phosphoryl group, and a 3' end, which is usually a free
hydroxyl group.

Another means of denoting a nucleotide is the base name with the suffix
“ate”. This nomenclature does not describe the number of phosphoryl
groups or the site of attachment to carbon of the ribose. A more precise
nomenclature is also commonly used. A compound formed by the attachment
of a phosphoryl group to C-5’ of a nucleoside sugar (the most common
site of phosphate esterification) is called a nucleoside 5'-phosphate or a
5'-nucleotide. In this naming system for nucleotides, the number of
phosphoryl groups and the attachment site are designated. Look, for example, at
adenosine 5'-triphosphate (ATP; Figure 4.6). This nucleotide is tremen- 
dously important because, in addition to being a building block for RNA, it 
is the most commonly used energy currency. The energy released from 
cleavage of the triphosphate group is used to power many cellular processes 
(Chapter 15). Another nucleotide is deoxyguanosine 3’-monophosphate 
(3'-dGMP; see Figure 4.6). This nucleotide differs from ATP in that it 
contains guanine rather than adenine, contains deoxyribose rather than 
ribose (indicated by the prefix “d”), contains one rather than three
phosphoryl groups, and has the phosphoryl group esterified to the hydroxyl
group in the 3’ rather than the 5’ position.

Figure 4.6 Nucleotides adenosine 5’-triphosphate (5'-ATP) and deoxyguanosine
3’-monophosphate (3’-dGMP).


Scientific communication frequently requires the sequence of a nucleic 
acid—in some cases, a sequence thousands of nucleotides in length—to be 
written like that on page 17. Rather than writing the cumbersome chemical 
structures, scientists have adopted the use of abbreviations. The abbreviated 
notations pApCpG or ACG denote a trinucleotide of DNA consisting of 
the building blocks deoxyadenylate monophosphate, deoxycytidylate 
monophosphate, and deoxyguanylate monophosphate linked by a
phosphodiester bridge, where “p” denotes a phosphoryl group (Figure 4.7). The
5’ end will often have a phosphoryl group attached to the 5’-OH group. 
Note that, like a polypeptide (Section 2.2), a DNA chain has directionality, 
commonly called polarity. One end of the chain has a free 5’-OH group (or 
a 5'-OH group attached to a phosphoryl group) and the other end has a free 
3'-OH group, neither of which is linked to another nucleotide. By conven- 
tion, the base sequence is written in the 5'-to-3' direction. Thus, ACG indi- 
cates that the unlinked 5’-OH group is on deoxyadenylate, whereas the 
unlinked 3’-OH group is on deoxyguanylate. Because of this polarity, ACG 
and GCA correspond to different compounds. 
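
The abbreviated notation and its polarity can be made concrete with a short sketch. The Python below is only an illustration: the function names `to_p_notation` and `describe_ends` are hypothetical, and the sketch assumes, as in the example above, that the 5' end carries a phosphoryl group.

```python
# Names of the four DNA nucleotide units, as given in the nomenclature above.
DNA_NUCLEOTIDES = {
    "A": "deoxyadenylate",
    "G": "deoxyguanylate",
    "C": "deoxycytidylate",
    "T": "thymidylate",
}


def to_p_notation(sequence: str) -> str:
    """Expand a 5'-to-3' base sequence such as 'ACG' into 'pApCpG'.

    Assumption: the 5' end is phosphorylated, so every base is preceded by 'p'.
    """
    return "".join("p" + base for base in sequence.upper())


def describe_ends(sequence: str) -> tuple:
    """Return (nucleotide at the 5' end, nucleotide at the 3' end)."""
    sequence = sequence.upper()
    return DNA_NUCLEOTIDES[sequence[0]], DNA_NUCLEOTIDES[sequence[-1]]


print(to_p_notation("ACG"))   # pApCpG
print(describe_ends("ACG"))   # ('deoxyadenylate', 'deoxyguanylate')
print(describe_ends("GCA"))   # ('deoxyguanylate', 'deoxyadenylate')
```

Because the sequence is always read 5' to 3', reversing the letters swaps the nucleotides at the two ends, which is why ACG and GCA name different compounds.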


DNA molecules are very long 


A striking characteristic of naturally occurring DNA molecules is their 
length. A DNA molecule must comprise many nucleotides to carry the 
genetic information necessary for even the simplest organisms. For exam- 
ple, the DNA of a virus such as polyoma, which can cause cancer in certain 
organisms, consists of two intertwined strands of DNA, each 5100 nucle- 
otides in length. The E. coli genome is a single DNA molecule consisting of 
two chains of 4.6 million nucleotides each (Figure 4.8). 

The DNA molecules of higher organisms can be much larger. The 
human genome comprises approximately 3 billion nucleotides in each 
chain of DNA, divided among 24 distinct molecules of DNA called 
chromosomes (22 autosomal chromosomes plus the X and Y sex chromo- 
somes) of different sizes. One of the largest known DNA molecules is found 
in the Indian muntjac, an Asiatic deer; its genome is nearly as large as the 
human genome but is distributed on only 3 chromosomes (Figure 4.9). The 
largest of these chromosomes has two chains of more than 1 billion nucle- 
otides each. If such a DNA molecule could be fully extended, it would 
stretch more than 1 foot in length. Some plants contain even larger DNA 
molecules. 
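
The lengths quoted above can be checked with simple arithmetic. The sketch below assumes a rise of 3.4 Å per base pair, the spacing of stacked bases in B-form DNA described in Section 4.2; the function name and the dictionary of examples are illustrative, not part of any standard library.

```python
RISE_PER_BASE_PAIR_M = 3.4e-10   # assumed rise per base pair: 3.4 Å, in meters
METERS_PER_FOOT = 0.3048


def contour_length_m(base_pairs: float) -> float:
    """Length of a fully extended double helix with the given number of base pairs."""
    return base_pairs * RISE_PER_BASE_PAIR_M


# Chain lengths quoted in the text (nucleotides per strand).
examples = {
    "polyoma virus": 5100,
    "E. coli genome": 4.6e6,
    "largest Indian muntjac chromosome": 1e9,
}

for name, base_pairs in examples.items():
    length = contour_length_m(base_pairs)
    print(f"{name}: {length:.3g} m ({length / METERS_PER_FOOT:.3g} ft)")
```

Under this assumption, a chromosome of 1 billion base pairs extends to about 0.34 m, slightly more than 1 foot, in agreement with the estimate in the text.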


Figure 4.9 The Indian muntjac and its chromosomes. Cells from a female Indian muntjac 
(right) contain three pairs of very large chromosomes (stained orange). The cell shown is a hybrid 
containing a pair of human chromosomes (stained green) for comparison. [(Left) M. Birkhead, 
OSF/Animals Animals. (Right) J.-Y. Lee, M. Koi, E. J. Stanbridge, M. Oshimura, A. T. Kumamoto,
and A. P. Feinberg. Nat. Genet. 7:30, 1994.] 


4.2 A Pair of Nucleic Acid Chains with Complementary 
Sequences Can Form a Double-Helical Structure 


As discussed in Chapter 1, the covalent structure of nucleic acids accounts 
for their ability to carry information in the form of a sequence of bases along 
a nucleic acid chain. The bases on the two separate nucleic acid strands form 
specific base pairs in such a way that a helical structure is formed. The 
double-helical structure of DNA facilitates the replication of the genetic 
material—that is, the generation of two copies of a nucleic acid from one. 


The double helix is stabilized by hydrogen bonds and van 
der Waals interactions 


The ability of nucleic acids to form specific base pairs was discovered in the 
course of studies directed at determining the three-dimensional structure of 
DNA. Maurice Wilkins and Rosalind Franklin obtained x-ray diffraction 
photographs of fibers of DNA (Figure 4.10). The characteristics of these 
diffraction patterns indicated that DNA is formed of two chains that wind 
in a regular helical structure. From these data and others, James Watson and


Figure 4.8 Electron micrograph of part of 
the E. coli genome. [Dr. Gopal Murti/ 
Science Photo Library/Photo Researchers.] 


Figure 4.10 X-ray diffraction photograph
of a hydrated DNA fiber. When crystals of a
biomolecule are irradiated with x-rays, the x-rays
are diffracted and these diffracted x-rays are
seen as a series of spots, called reflections, on
a screen behind the crystal. The structure of the
molecule can be determined by the pattern of
the reflections (Section 3.6). In regard to DNA
crystals, the central cross is diagnostic of a
helical structure. The strong arcs on the
meridian arise from the stack of nucleotide
bases, which are 3.4 Å apart. [Courtesy of
Dr. Maurice Wilkins.]


Figure 4.11 Watson–Crick model of
double-helical DNA. One polynucleotide chain
is shown in blue and the other in red.
The purine and pyrimidine bases are shown in
lighter colors than those of the
sugar–phosphate backbone.
(A) Side view. The structure repeats along the
helical axis (vertical) at intervals of 34 Å,
which corresponds to 10 nucleotides on each
chain. (B) Axial view, looking down the helix axis.


Francis Crick deduced a structural model for DNA that 
accounted for the diffraction pattern and was the source of 
some remarkable insights into the functional properties of 
nucleic acids (Figure 4.11). 

The features of the Watson—Crick model of DNA 
deduced from the diffraction patterns are: 


1. Two helical polynucleotide chains are coiled around a 
common axis with a right-handed screw sense (p. 39). The 
chains are antiparallel, meaning that they have opposite 
polarity. 


2. The sugar—phosphate backbones are on the outside 
and the purine and pyrimidine bases lie on the inside of 
the helix. 


3. The bases are nearly perpendicular to the helix axis,
and adjacent bases are separated by 3.4 Å. This spacing is
readily apparent in the DNA diffraction pattern (see
Figure 4.10). The helical structure repeats every 34 Å, and
so there are 10 bases (= 34 Å per repeat/3.4 Å per base)
per turn of helix. Each base is rotated 36 degrees from the
one below it (360 degrees per full turn/10 bases per
turn).


4. The diameter of the helix is 20 Å.


Figure 4.12 Structures of the base pairs
proposed by Watson and Crick.

How is such a regular structure able to accommodate an arbitrary
sequence of bases, given the different sizes and shapes of the purines and
pyrimidines? In attempting to answer this question, Watson and Crick
discovered that guanine can be paired with cytosine and adenine with thymine
to form base pairs that have essentially the same shape (Figure 4.12). These
base pairs are held together by specific hydrogen bonds, which, although
weak (4–21 kJ mol⁻¹, or 1–5 kcal mol⁻¹), stabilize the helix because of their
large numbers in a DNA molecule. These base-pairing rules account for the
observation, originally made by Erwin Chargaff in 1950, that the ratios of
adenine to thymine and of guanine to cytosine are nearly the same in all
species studied, whereas the adenine-to-guanine ratio varies considerably
(Table 4.1).
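
The link between the base-pairing rules and Chargaff's ratios can be illustrated with a short sketch. The Python below builds a duplex from one strand and its Watson–Crick partner and then counts bases; the example strand is arbitrary and hypothetical, not data from any organism, and the function names are chosen only for this illustration.

```python
from collections import Counter

# Watson-Crick pairing rules for DNA.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def partner_strand(strand: str) -> str:
    """Return the antiparallel partner strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(strand))


def base_ratios(strand: str) -> dict:
    """Base ratios for the duplex formed by a strand and its partner.

    Assumes the strand contains all four bases, so no count is zero.
    """
    counts = Counter(strand + partner_strand(strand))
    return {
        "A:T": counts["A"] / counts["T"],
        "G:C": counts["G"] / counts["C"],
        "A:G": counts["A"] / counts["G"],
    }


print(base_ratios("ATGCGATTAACGT"))  # arbitrary example strand
```

Whatever strand is supplied, pairing forces the A:T and G:C ratios of the duplex to equal 1, whereas the A:G ratio depends on the sequence, as in Table 4.1.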

Inside the helix, the bases are essentially stacked one on top of another
(Figure 4.13). The stacking of base pairs contributes to the stability of the
double helix in two ways. First, the double helix is stabilized by the
hydrophobic effect (p. 9). The hydrophobic bases cluster in the interior of the
helix away from the surrounding water, whereas the more polar surfaces are
exposed to water. This arrangement is reminiscent of protein folding, where
hydrophobic amino acids are in the protein’s interior and the hydrophilic
amino acids are on the exterior (Section 2.4). The hydrophobic effect stacks
the bases on top of one another. Second, the stacked base pairs attract one
another through van der Waals forces (p. 8), appropriately referred to as
stacking forces, further contributing to stabilization of the helix. The energy
associated with a single van der Waals interaction is quite small, typically
from 2 to 4 kJ mol⁻¹ (0.5–1.0 kcal mol⁻¹). In the double helix, however, a large
number of atoms are in van der Waals contact, and the net effect, summed
over these atom pairs, is substantial. In addition, base stacking in DNA is
favored by the conformations of the somewhat rigid five-membered rings of
the backbone sugars.


Table 4.1 Base compositions experimentally determined for a variety of organisms

Organism               A:T     G:C     A:G
Human being            1.00    1.00    1.56
Salmon                 1.02    1.02    1.43
Wheat                  1.00    0.97    1.22
Yeast                  1.03    1.02    1.67
Escherichia coli       1.09    0.99    1.05
Serratia marcescens    0.95    0.86    0.70


Figure 4.13 Axial view of DNA. Base pairs are
stacked nearly one on top of another in the
double helix.


DNA can assume a variety of structural forms 


Watson and Crick based their model (known as 
the B-DNA helix) on x-ray diffraction patterns 
of highly hydrated DNA fibers, which provided 
information about properties of the double 
helix that are averaged over its constituent resi- 
dues. Under physiological conditions, most 
DNA is in the B form. X-ray diffraction studies 
of less-hydrated DNA fibers revealed a different 
form called A-DNA. Like B-DNA, A-DNA is 
a right-handed double helix made up of anti- 
parallel strands held together by Watson—Crick 
base-pairing. The A-form helix is wider and 
shorter than the B-form helix, and its base pairs 
are tilted rather than perpendicular to the helix 
axis (Figure 4.14). 

Figure 4.14 B-form and A-form DNA. Space-filling models of 10 base pairs
of B-form and A-form DNA depict their right-handed helical structures. Notice that the
B-form helix is longer and narrower than the A-form helix. The carbon atoms of the
backbone are shown in white. [Drawn from 1BNA.pdb and 1DNZ.pdb.]

Figure 4.15 Sugar pucker. In A-form DNA,
the C-3’ carbon atom lies above the
approximate plane defined by the four other
sugar nonhydrogen atoms (called C-3’ endo).
In B-form DNA, each deoxyribose is in a
C-2'-endo conformation, in which C-2’ lies out of
the plane.

Figure 4.16 Z-DNA. DNA oligomers
such as CGCGCG adopt an alternative
conformation under some conditions. This
conformation is called Z-DNA because the
phosphoryl groups zigzag along the backbone.
[Drawn from 131D.pdb.]


If the A-form helix were simply a property
of dehydrated DNA, it would be of little
significance. However, double-stranded regions
of RNA and at least some RNA–DNA hybrids
adopt a double-helical form very similar to that
of A-DNA. What is the biochemical basis for
differences between the two forms of DNA?
Many of the structural differences between
B-DNA and A-DNA arise from different
puckerings of their ribose units (Figure 4.15). In
A-DNA, C-3' lies out of the plane (a conformation
referred to as C-3' endo) formed by the
other four atoms of the ring; in B-DNA, C-2'
lies out of the plane (a conformation called C-2'
endo). The C-3'-endo puckering in A-DNA leads to an 11-degree tilting of
the base pairs away from perpendicular to the helix. RNA helices are further 
induced to take the A-DNA form because of steric hindrance from the 
2'-hydroxyl group: the 2'-oxygen atom would be too close to three atoms of
the adjoining phosphoryl group and to one atom in the next base. In an
A-form helix, in contrast, the 2’-oxygen atom projects outward, away from
other atoms. The phosphoryl and other groups in the A-form helix bind
fewer H₂O molecules than do those in B-DNA. Hence, dehydration favors
the A form. 


Z-DNA is a left-handed double helix in which backbone 
phosphates zigzag 


Alexander Rich and his associates discovered a third type of DNA helix 
when they solved the structure of CGCGCG. They found that this 
hexanucleotide forms a duplex of antiparallel strands held together by 
Watson—Crick base-pairing, as expected. What was surprising, however, 
was that this double helix was left-handed, in contrast with the right-handed 
screw sense of the A-DNA and B-DNA helices. Furthermore, the phos- 
phates in the backbone zigzagged; hence, they called this new form Z-DNA 
(Figure 4.16). 

The existence of Z-DNA shows that DNA is a flexible, dynamic mole- 
cule. Although the biological role of Z-DNA is still under investigation, 
Z-DNA-binding proteins required for viral pathogenesis have been isolated 
from poxviruses, including variola, the agent of smallpox. The properties of 
A-, B-, and Z-DNA are compared in Table 4.2. 


Table 4.2 Comparison of A-, B-, and Z-DNA

                                              Helix type
                                A               B               Z
Shape                           Broadest        Intermediate    Narrowest
Rise per base pair              2.3 Å           3.4 Å           3.8 Å
Helix diameter                  25.5 Å          23.7 Å          18.4 Å
Screw sense                     Right-handed    Right-handed    Left-handed
Glycosidic bond*                anti            anti            Alternating anti and syn
Base pairs per turn of helix    11              10.4            12
Pitch per turn of helix         25.3 Å          35.4 Å          45.6 Å
Tilt of base pairs from
perpendicular to helix axis     19 degrees      1 degree        9 degrees


*Syn and anti refer to the orientation of the N-glycosidic bond between the base and deoxyribose. In the
anti orientation, the base extends away from the deoxyribose. In the syn orientation, the base is above the
deoxyribose. Pyrimidines can be in the anti orientation only, whereas purines can be anti or syn.
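
The rows of Table 4.2 are not independent: the pitch of a helix is the rise per base pair multiplied by the number of base pairs per turn. The sketch below checks this relation for the three helix types. It is a minimal illustration; the dictionary and function names are hypothetical, Z-DNA is taken to have 12 base pairs per turn, and the rotation per base pair is reported as a magnitude only, ignoring screw sense.

```python
# (rise per base pair in Å, base pairs per turn) for each helix type.
# Assumption: Z-DNA has 12 base pairs per turn.
HELIX_TYPES = {
    "A": (2.3, 11.0),
    "B": (3.4, 10.4),
    "Z": (3.8, 12.0),
}


def pitch_angstroms(rise: float, base_pairs_per_turn: float) -> float:
    """Pitch (length of one full turn) = rise per base pair x base pairs per turn."""
    return rise * base_pairs_per_turn


def rotation_per_base_pair(base_pairs_per_turn: float) -> float:
    """Average rotation between adjacent base pairs, in degrees (magnitude only)."""
    return 360.0 / base_pairs_per_turn


for name, (rise, bp_per_turn) in HELIX_TYPES.items():
    print(
        f"{name}-DNA: pitch {pitch_angstroms(rise, bp_per_turn):.1f} Å, "
        f"rotation {rotation_per_base_pair(bp_per_turn):.1f} degrees per base pair"
    )
```

Rounded to one decimal place, the computed pitches are 25.3, 35.4, and 45.6 Å for the A, B, and Z forms, respectively.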


Figure 4.17 Electron micrographs 
of circular DNA from mitochondria. 
(A) Relaxed form. (B) Supercoiled form. 
[Courtesy of Dr. David Clayton.] 


Some DNA molecules are circular and supercoiled 


The DNA molecules in human chromosomes are linear. However, electron 
microscopic and other studies have shown that intact DNA molecules from 
bacteria and archaea are circular (Figure 4.17A). The term circular refers to 
the continuity of the DNA chains, not to their geometric form. DNA mol- 
ecules inside cells necessarily have a very compact shape. Note that the 
E. coli chromosome, fully extended, would be about 1000 times as long as 
the greatest diameter of the bacterium. 

A closed DNA molecule has a property unique to circular DNA. The 
axis of the double helix can itself be twisted or supercoiled into a superhelix 
(Figure 4.17B). A circular DNA molecule without any superhelical turns is 
known as a relaxed molecule. Supercoiling is biologically important for two 
reasons. First, a supercoiled DNA molecule is more compact than its relaxed 
counterpart. Second, supercoiling may hinder or favor the capacity of the 
double helix to unwind and thereby affect the interactions between DNA and 
other molecules. These topological features of DNA will be considered fur- 
ther in Chapter 28. 


Single-stranded nucleic acids can adopt elaborate structures 


Single-stranded nucleic acids often fold back on themselves to form well- 
defined structures. Such structures are especially prominent in RNA and 
RNA-containing complexes such as the ribosome—a large complex of 
RNAs and proteins on which proteins are synthesized. 

The simplest and most common structural motif formed is a stem-loop,
created when two complementary sequences within a single strand come
together to form double-helical structures (Figure 4.18). In many cases, these
double helices are made up entirely of Watson–Crick base pairs. In other
cases, however, the structures include mismatched base pairs or unmatched
bases that bulge out from the helix. Such mismatches destabilize the local
structure but introduce deviations from the standard double-helical
structure that can be important for higher-order folding and for function
(Figure 4.19).

Figure 4.18 Stem-loop structures. Stem-loop
structures can be formed from single-stranded
DNA and RNA molecules.

Figure 4.19 Complex structure of an RNA molecule. A single-stranded RNA molecule can fold
back on itself to form a complex structure. (A) The nucleotide sequence showing Watson–Crick
base pairs and other nonstandard base pairings in stem-loop structures. (B) The three-dimensional
structure and one important long-range interaction between three bases. In the three-dimensional
structure at the left, cytidine nucleotides are shown in blue, adenosine in red, guanosine in black,
and uridine in green. In the detailed projection, hydrogen bonds within the Watson–Crick base pair
are shown as dashed black lines; additional hydrogen bonds are shown as dashed green lines.

Single-stranded nucleic acids can adopt structures that are more com- 
plex than simple stem-loops through the interaction of more widely sepa- 
rated bases. Often, three or more bases interact to stabilize these structures. 
In such cases, hydrogen-bond donors and acceptors that do not participate 
in Watson–Crick base pairs participate in hydrogen bonds to form nonstandard
pairings. Metal ions such as magnesium ion (Mg²⁺) often assist in the
stabilization of these more elaborate structures. These complex structures 
allow RNA to perform a host of functions that the double-stranded DNA 
molecule cannot. Indeed, the complexity of some RNA molecules rivals 
that of proteins, and these RNA molecules perform a number of functions 
that had formerly been thought the private domain of proteins. 


4.3 The Double Helix Facilitates the Accurate Transmission 
of Hereditary Information 


The double-helical model of DNA and the presence of specific base pairs 
immediately suggested how the genetic material might replicate. The 
sequence of bases of one strand of the double helix precisely determines the 
sequence of the other strand: a guanine base on one strand is always paired 
with a cytosine base on the other strand, and so on. Thus, separation of a 
double helix into its two component chains would yield two single-stranded 
templates onto which new double helices could be constructed, each of 
which would have the same sequence of bases as the parent double helix. 
Consequently, as DNA is replicated, one of the chains of each daughter 
DNA molecule is newly synthesized, whereas the other is passed unchanged 
from the parent DNA molecule. This distribution of parental atoms is 
achieved by semiconservative replication. 


Differences in DNA density established the validity of the 
semiconservative-replication hypothesis 


Matthew Meselson and Franklin Stahl carried out a critical test of this
hypothesis in 1958. They labeled the parent DNA with ¹⁵N, a heavy isotope
of nitrogen, to make it denser than ordinary DNA. The labeled DNA
was generated by growing E. coli for many generations in a medium
that contained ¹⁵NH₄Cl as the sole nitrogen source. After the incorporation
of heavy nitrogen was complete, the bacteria were abruptly transferred
to a medium that contained ¹⁴N, the ordinary isotope of nitrogen.
The question asked was: What is the distribution of ¹⁴N and ¹⁵N in the
DNA molecules after successive rounds of replication?

The distribution of ¹⁴N and ¹⁵N was revealed by the technique of
density-gradient equilibrium sedimentation. A small amount of DNA was
dissolved in a concentrated solution of cesium chloride having a density
close to that of the DNA (1.7 g cm⁻³). This solution was centrifuged until
it was nearly at equilibrium. At that point, the opposing processes of
sedimentation and diffusion created a gradient in the concentration of cesium
chloride across the centrifuge cell. The result was a stable density gradient
ranging from 1.66 to 1.76 g cm⁻³. The DNA molecules in this density
gradient were driven by centrifugal force into the region where the solution’s
density was equal to their own. The DNA yielded a narrow band that was
detected by its absorption of ultraviolet light. A mixture of ¹⁴N DNA and
¹⁵N DNA molecules gave clearly separate bands because they differ in
density by about 1% (Figure 4.20).

Figure 4.20 Resolution of ¹⁴N DNA and
¹⁵N DNA by density-gradient
centrifugation. (A) Ultraviolet-absorption
photograph of a centrifuged cell showing the
two distinct bands of DNA. (B) Densitometric
tracing of the absorption photograph. [From
M. Meselson and F. W. Stahl. Proc. Natl. Acad.
Sci. U.S.A. 44:671-682, 1958.]

Figure 4.21 Detection of
semiconservative replication of
E. coli DNA by density-gradient
centrifugation. The position of a band of
DNA depends on its content of ¹⁴N and ¹⁵N.
After 1.0 generation, all of the DNA molecules
were hybrids containing equal amounts of ¹⁴N
and ¹⁵N. [From M. Meselson and F. W. Stahl.
Proc. Natl. Acad. Sci. U.S.A. 44:671-682,
1958.]

Figure 4.22 Diagram of semiconservative
replication. Parental DNA is shown in blue
and newly synthesized DNA in red. Panels, from
top to bottom: original parent molecule,
first-generation daughter molecules, and
second-generation daughter molecules. [After M.
Meselson and F. W. Stahl. Proc. Natl. Acad.
Sci. U.S.A. 44:671-682, 1958.]


DNA was extracted from the bacteria at various times after they were
transferred from a ¹⁵N to a ¹⁴N medium. Analysis of these samples by the
density-gradient technique showed that there was a single band of DNA
after one generation. The density of this band was precisely halfway between
the densities of the ¹⁴N DNA and ¹⁵N DNA bands (Figure 4.21). The
absence of ¹⁵N DNA indicated that parental DNA was not preserved as an
intact unit after replication. The absence of ¹⁴N DNA indicated that all the
daughter DNA derived some of their atoms from the parent DNA. This
proportion had to be half because the density of the hybrid DNA band
was halfway between the densities of the ¹⁴N DNA and ¹⁵N DNA bands.

After two generations, there were equal amounts of two bands of DNA.
One was hybrid DNA, and the other was ¹⁴N DNA. Meselson and Stahl
concluded from these incisive experiments that replication was
semiconservative, and so each new double helix contains a parent strand and a newly
synthesized strand. Their results agreed perfectly with the Watson–Crick
model for DNA replication (Figure 4.22).
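
The band pattern predicted by semiconservative replication can be reproduced with a very small simulation. The sketch below is an idealized model rather than a description of the experimental procedure: each duplex is represented only by the isotope labels of its two strands, every new strand is assumed to be made entirely from ¹⁴N, and the names `replicate` and `band` are hypothetical.

```python
from collections import Counter


def replicate(duplexes):
    """One round of semiconservative replication in 14N medium.

    Each duplex (strand_1, strand_2) separates, and each parental strand
    is paired with a newly synthesized 14N strand.
    """
    daughters = []
    for strand_1, strand_2 in duplexes:
        daughters.append((strand_1, "14N"))
        daughters.append((strand_2, "14N"))
    return daughters


def band(duplex):
    """Classify a duplex by the density band it would occupy."""
    labels = set(duplex)
    if labels == {"15N"}:
        return "heavy"
    if labels == {"14N"}:
        return "light"
    return "hybrid"


population = [("15N", "15N")]  # fully labeled parental DNA
for generation in range(4):
    counts = Counter(band(duplex) for duplex in population)
    fractions = {name: counts[name] / len(population)
                 for name in ("heavy", "hybrid", "light")}
    print(f"generation {generation}: {fractions}")
    population = replicate(population)
```

In this model, generation 1 consists entirely of hybrid duplexes and generation 2 is half hybrid and half light, matching the bands that Meselson and Stahl observed.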


The double helix can be reversibly melted 


In DNA replication and other processes, the two strands of the double helix 
must be separated from each other, at least in a local region. The two strands 
of a DNA helix readily come apart when the hydrogen bonds between base 
pairs are disrupted. In the laboratory, the double helix can be disrupted by 
heating a solution of DNA or by adding acid or alkali to ionize its bases. The 
dissociation of the double helix is called melting because it occurs abruptly 
at a certain temperature. The melting temperature (Tm) of DNA is defined as
the temperature at which half the helical structure is lost. Inside cells, how- 
ever, the double helix is not melted by the addition of heat. Instead, proteins 
called helicases use chemical energy (from ATP) to disrupt the helix 
(Chapter 28). 

Stacked bases in nucleic acids absorb less ultraviolet light than do 
unstacked bases, an effect called hypochromism. Thus, the melting of nucleic 
acids is readily monitored by measuring their absorption of light, which is 
maximal at a wavelength of 260 nm (Figure 4.23). 

Separated complementary strands of nucleic acids spontaneously reassociate
to form a double helix when the temperature is lowered below Tm.
This renaturation process is sometimes called annealing. The facility with
which double helices can be melted and then reassociated is crucial for the
biological functions of nucleic acids.

Figure 4.23 Hypochromism. (A) Single-stranded DNA absorbs light more effectively than does
double-helical DNA. (B) The absorbance of a DNA solution at a wavelength of 260 nm increases
when the double helix is melted into single strands.
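
Because absorbance at 260 nm rises as the helix melts, the melting temperature can be estimated from a melting curve as the temperature at which half of the total absorbance increase has occurred. The sketch below shows one way to do this by linear interpolation. The curve is synthetic, generated from an arbitrary sigmoid purely for illustration; it is not experimental data, and the function name and parameter values are assumptions.

```python
import math


def melting_temperature(temperatures, absorbances):
    """Estimate the melting temperature from a melting curve.

    Assumes the lowest absorbance corresponds to fully double-helical DNA,
    the highest to fully single-stranded DNA, and that absorbance rises
    with temperature. Returns the temperature at which half of the rise
    has occurred, using linear interpolation between data points.
    """
    low, high = min(absorbances), max(absorbances)
    fractions = [(a - low) / (high - low) for a in absorbances]
    for i in range(len(fractions) - 1):
        f0, f1 = fractions[i], fractions[i + 1]
        if f0 <= 0.5 <= f1 and f1 > f0:
            t0, t1 = temperatures[i], temperatures[i + 1]
            return t0 + (0.5 - f0) * (t1 - t0) / (f1 - f0)
    raise ValueError("absorbance never crosses the half-melted point")


# Synthetic melting curve (made-up values): a sigmoid centered at 80 degrees C.
temperatures = list(range(60, 101, 5))
absorbances = [1.0 + 0.4 / (1.0 + math.exp(-(t - 80) / 2.0)) for t in temperatures]

print(f"Estimated melting temperature: "
      f"{melting_temperature(temperatures, absorbances):.1f} degrees C")
```

The estimate recovers the midpoint built into the synthetic curve; with real data, the choice of baselines for the fully helical and fully melted states would need more care.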


The ability to melt and reanneal DNA reversibly in the laboratory pro- 
vides a powerful tool for investigating sequence similarity. For instance, 
DNA molecules from two different organisms can be melted and allowed to 
reanneal, or hybridize, in the presence of each other. If the sequences are 
similar, hybrid DNA duplexes, with DNA from each organism contribut- 
ing a strand of the double helix, can form. The degree of hybridization is an 
indication of the relatedness of the genomes and hence the organisms. 
Similar hybridization experiments with RNA and DNA can locate genes in 
a cell’s DNA that correspond to a particular RNA. We will return to this 
important technique in Chapter 5. 


4.4 DNA Is Replicated by Polymerases That Take 
Instructions from Templates 


We now turn to the molecular mechanism of DNA replication. The full 
replication machinery in a cell comprises more than 20 proteins engaged 
in intricate and coordinated interplay. In 1958, Arthur Kornberg and his 
colleagues isolated from E. coli the first known of the enzymes, called 
DNA polymerases, that promote the formation of the bonds joining units 
of the DNA backbone. E. coli has a number of DNA polymerases, desig- 
nated by roman numerals, that participate in DNA replication and repair 
(Chapter 28). 


DNA polymerase catalyzes phosphodiester-bridge formation 


DNA polymerases catalyze the step-by-step addition of deoxyribonucleotide 
units to a DNA chain (Figure 4.24). The reaction catalyzed, in its simplest 
form, is 


$$(\mathrm{DNA})_n + \mathrm{dNTP} \rightleftharpoons (\mathrm{DNA})_{n+1} + \mathrm{PP_i}$$


where dNTP stands for any deoxyribonucleotide and PPᵢ is a pyrophosphate
ion.
DNA synthesis has the following characteristics: 


1. The reaction requires all four activated precursors (that is, the
deoxynucleoside 5'-triphosphates dATP, dGTP, dCTP, and TTP) as well as
Mg²⁺ ion.


2. The new DNA chain is assembled directly on a preexisting DNA template. 
DNA polymerases catalyze the formation of a phosphodiester linkage effi- 
ciently only if the base on the incoming nucleoside triphosphate is comple- 
mentary to the base on the template strand. Thus, DNA polymerase is a 
template-directed enzyme that synthesizes a product with a base sequence 
complementary to that of the template.

Figure 4.24 Polymerization reaction catalyzed by DNA polymerases.

Figure 4.25 Chain-elongation reaction. DNA polymerases catalyze the formation of a
phosphodiester bridge.


3. DNA polymerases require a primer to begin synthesis. A primer strand hav- 
ing a free 3’-OH group must be already bound to the template strand. The 
chain-elongation reaction catalyzed by DNA polymerases is a nucleophilic 
attack by the 3’-OH terminus of the growing chain on the innermost phos- 
phorus atom of the deoxynucleoside triphosphate (Figure 4.25). A phos- 
phodiester bridge is formed and pyrophosphate is released. The subsequent 
hydrolysis of pyrophosphate to yield two ions of orthophosphate (Pᵢ) by
pyrophosphatase helps drive the polymerization forward. Elongation of the 
DNA chain proceeds in the 5'-to-3' direction. 


4. Many DNA polymerases are able to correct mistakes in DNA by removing 
mismatched nucleotides. These polymerases have a distinct nuclease activity 
that allows them to excise incorrect bases by a separate reaction. This nucle- 
ase activity contributes to the remarkably high fidelity of DNA replication, 
which has an error rate of less than 10⁻⁸ per base pair.
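
The characteristics listed above can be summarized in a toy model of primer extension. The Python below is a sketch of the logic only, not of the enzyme's chemistry: the function name `extend_primer` is hypothetical, the template is written 3' to 5' so that it aligns base by base with the primer written 5' to 3', and proofreading is not modeled.

```python
# Watson-Crick pairing rules used to select each incoming nucleotide.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def extend_primer(template_3_to_5: str, primer_5_to_3: str) -> str:
    """Extend a primer along a template, returning the product 5' to 3'.

    The primer must already be base-paired with the 3' end of the template;
    new nucleotides are then added one at a time to its free 3' end, so the
    chain grows in the 5'-to-3' direction.
    """
    # A primer that does not pair with the template cannot be extended.
    for template_base, primer_base in zip(template_3_to_5, primer_5_to_3):
        if COMPLEMENT[template_base] != primer_base:
            raise ValueError("primer is not complementary to the template")

    product = list(primer_5_to_3)
    for template_base in template_3_to_5[len(primer_5_to_3):]:
        # Only the nucleotide complementary to the template base is added.
        product.append(COMPLEMENT[template_base])
    return "".join(product)


# Template 3'-TACGGCTA-5' with primer 5'-ATG-3'.
print(extend_primer("TACGGCTA", "ATG"))  # ATGCCGAT
```

The product is complementary to the template at every position, which is the sense in which DNA polymerase takes instructions from its template.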


The genes of some viruses are made of RNA 


Genes in all cellular organisms are made of DNA. The same is true for some 
viruses but, for others, the genetic material is RNA. Viruses are genetic ele- 
ments enclosed in protein coats that can move from one cell to another but 
are not capable of independent growth. A well-studied example of an RNA 
virus is the tobacco mosaic virus, which infects the leaves of tobacco plants. 
This virus consists of a single strand of RNA (6390 nucleotides) surrounded 
by a protein coat of 2130 identical subunits. An RNA polymerase that takes 
direction from an RNA template, called an RNA-directed RNA polymerase, 
copies the viral RNA. The infected cells die because of virus-instigated 
programmed cell death; in essence, the virus instructs the cell to commit 
suicide. Cell death results in discoloration in the tobacco leaf in a variegated 
pattern, hence the name mosaic virus. 

Another important class of RNA virus comprises the retroviruses, so 
called because the genetic information flows from RNA to DNA rather 
than from DNA to RNA. This class includes human immunodeficiency 
virus 1 (HIV-1), the cause of AIDS, as well as a number of RNA viruses that 
produce tumors in susceptible animals. Retrovirus particles contain two 
copies of a single-stranded RNA molecule. On entering the cell, the RNA
is copied into DNA through the action of a viral enzyme called reverse
transcriptase (Figure 4.26). The resulting double-helical DNA version of the
viral genome can become incorporated into the chromosomal DNA of the 
host and is replicated along with the normal cellular DNA. At a later time, 
the integrated viral genome is expressed to form viral RNA and viral pro- 
teins, which assemble into new virus particles. 


4.5 Gene Expression Is the Transformation of DNA 
Information into Functional Molecules 


The information stored as DNA becomes useful when it is expressed in the 
production of RNA and proteins. This rich and complex topic is the subject 
of several chapters later in this book, but here we introduce the basics of 
gene expression. DNA can be thought of as archival information, stored and 
manipulated judiciously to minimize damage (mutations). It is expressed in 
two steps. First, an RNA copy is made that encodes directions for protein 
synthesis. This messenger RNA can be thought of as a photocopy of the 
original information: it can be made in multiple copies, used, and then 
disposed of. Second, the information in messenger RNA is translated to 
synthesize functional proteins. Other types of RNA molecules exist to 
facilitate this translation. 


Several kinds of RNA play key roles in gene expression 


Scientists used to believe that RNA played a passive role in gene expression, 
as a mere conveyor of information. However, recent investigations have 
shown that RNA plays a variety of roles, from catalysis to regulation. Cells 
contain several kinds of RNA (Table 4.3): 


1. Messenger RNA (mRNA) is the template for protein synthesis, or translation.
An mRNA molecule may be produced for each gene or group of 
genes that is to be expressed in E. coli, whereas a distinct mRNA is 


Table 4.3 RNA molecules in E. coli 

                        Relative      Sedimentation                    Number of
Type                    amount (%)    coefficient (S)   Mass (kd)      nucleotides
Ribosomal RNA (rRNA)    80            23                1.2 × 10³      3700
                                      16                0.55 × 10³     1700
                                      5                 3.6 × 10¹      120
Transfer RNA (tRNA)     15            4                 2.5 × 10¹      75
Messenger RNA (mRNA)    5                               Heterogeneous
Figure 4.26 Flow of information from 
RNA to DNA in retroviruses. The RNA 
genome of a retrovirus is converted into DNA 
by reverse transcriptase, an enzyme brought 
into the cell by the infecting virus particle. 
Reverse transcriptase possesses several 
activities and catalyzes the synthesis of a 
complementary DNA strand, the digestion of 
the RNA, and the subsequent synthesis of the 
DNA strand. 
Kilobase (kb) 
A unit of length equal to 1000 base pairs of 
a double-stranded nucleic acid molecule (or 
1000 bases of a single-stranded molecule). 
One kilobase of double-stranded DNA 
has a length of 0.34 μm at its maximal 
extension (called the contour length) and a 
mass of about 660 kd. 


produced for each gene in eukaryotes. Consequently, mRNA is a heteroge- 
neous class of molecules. In prokaryotes, the average length of an mRNA 
molecule is about 1.2 kilobases (kb). In eukaryotes, mRNA has structural 
features, such as stem-loop structures, that regulate the efficiency of trans- 
lation and the lifetime of the mRNA. 


2. Transfer RNA (tRNA) carries amino acids in an activated form to the 
ribosome for peptide-bond formation, in a sequence dictated by the mRNA 
template. There is at least one kind of tRNA for each of the 20 amino acids. 
Transfer RNA consists of about 75 nucleotides (having a mass of about 
25 kd). 


3. Ribosomal RNA (rRNA) is the major component of ribosomes (Chap- 
ter 30). In prokaryotes, there are three kinds of rRNA, called 23S, 16S, and 
5S RNA because of their sedimentation behavior. One molecule of each of 
these species of rRNA is present in each ribosome. Ribosomal RNA was 
once believed to play only a structural role in ribosomes. We now know that 
rRNA is the actual catalyst for protein synthesis. 


Ribosomal RNA is the most abundant of these three types of RNA. 
Transfer RNA comes next, followed by messenger RNA, which constitutes 
only 5% of the total RNA. Eukaryotic cells contain additional small RNA 
molecules. 


4. Small nuclear RNA (snRNA) molecules participate in the splicing of 
RNA exons. 


5. A small RNA molecule is an essential component of the signal-recognition 
particle, an RNA-protein complex in the cytoplasm that helps guide newly 
synthesized proteins to intracellular compartments and extracellular 
destinations. 


6. Micro RNA (miRNA) is a class of small (about 21 nucleotides) noncoding
RNAs that bind to complementary mRNA molecules and inhibit their 
translation. 


7. Small interfering RNA (siRNA) is a class of small RNA molecules that 
bind to mRNA and facilitate its degradation. Micro RNA and small inter- 
fering RNA also provide scientists with powerful experimental tools for 
inhibiting the expression of specific genes in the cell. 


8. RNA is a component of telomerase, an enzyme that maintains the telomeres
(ends) of chromosomes during DNA replication. 


In this chapter, we will consider rRNA, mRNA, and tRNA. 


All cellular RNA is synthesized by RNA polymerases 


The synthesis of RNA from a DNA template is called transcription and is 
catalyzed by the enzyme RNA polymerase (Figure 4.27). RNA polymerase 
catalyzes the initiation and elongation of RNA chains. The reaction 
catalyzed by this enzyme is 


$(\mathrm{RNA})_{n} + \text{ribonucleoside triphosphate} \rightleftharpoons (\mathrm{RNA})_{n+1} + \mathrm{PP_i}$

RNA polymerase requires the following components: 


1. A template. The preferred template is double-stranded DNA. Single-stranded
DNA also can serve as a template. RNA, whether single or double 
stranded, is not an effective template; nor are RNA-DNA hybrids. 


2. Activated precursors. All four ribonucleoside triphosphates (ATP, GTP, 
UTP, and CTP) are required. 


Figure 4.27 RNA polymerase. This large enzyme comprises many subunits, including β 
(red) and β′ (yellow), which form a “claw” that holds the DNA to be transcribed. Notice that the 
active site includes a Mg²⁺ ion (green) at the center of the structure. The curved tubes making 
up the protein in the image represent the backbone of the polypeptide chain. [Drawn from 1L9Z.pdb.] 


3. A divalent metal ion. Either Mg²⁺ or Mn²⁺ is effective. 


The synthesis of RNA is like that of DNA in several respects (Figure 4.28). 
First, the direction of synthesis is 5’ → 3’. Second, the mechanism of 
elongation is similar: the 3’-OH group at the terminus of the growing chain 
makes a nucleophilic attack on the innermost phosphoryl group of the 
incoming nucleoside triphosphate. Third, the synthesis is driven forward by 
the hydrolysis of pyrophosphate. In contrast with DNA polymerase, 
however, RNA polymerase does not require a primer. In addition, the 
ability of RNA polymerase to correct mistakes is not as extensive as that of 
DNA polymerase. 


All three types of cellular RNA—mRNA, tRNA, and rRNA—are 
synthesized in E. coli by the same RNA polymerase according to 
Figure 4.28 Transcription mechanism of the chain-elongation 
reaction catalyzed by RNA polymerase. 
Table 4.4 Base composition (percentage) of RNA synthesized from a viral 
DNA template 

DNA template (plus, or coding,
strand of φX174)                  RNA product
A  25                             U  25
T  33                             A  32
G  24                             C  23
C  18                             G  20


instructions given by a DNA template. In mammalian cells, there is a 
division of labor among several different kinds of RNA polymerases. 
We shall return to these RNA polymerases in Chapter 29. 


RNA polymerases take instructions from DNA templates 


RNA polymerase, like the DNA polymerases described earlier, takes 
instructions from a DNA template. The earliest evidence was the finding 
that the base composition of newly synthesized RNA is the complement 
of that of the DNA template strand, as exemplified by the RNA synthesized
from a template of single-stranded DNA from the φX174 virus 
(Table 4.4). Hybridization experiments also revealed that RNA synthesized 
by RNA polymerase is complementary to its DNA template. In these 
experiments, DNA is melted and allowed to reassociate in the presence of 
mRNA. RNA-DNA hybrids will form if the RNA and DNA have comple- 
mentary sequences. The strongest evidence for the fidelity of transcription 
came from base-sequence studies. For instance, the nucleotide sequence of 
a segment of the gene encoding the enzymes required for tryptophan 
synthesis was determined with the use of DNA-sequencing techniques 
(Section 5.1). Likewise, the sequence of the mRNA for the corresponding 
gene was determined. The results showed that the RNA sequence is the 
precise complement of the DNA template sequence (Figure 4.29). 


5'-GCGGCGACGCGCAGUUAAUCCCACAGCCGCCAGUUCCGCUGGCGGCAU-3'   mRNA 
3'-CGCCGCTGCGCGTCAATTAGGGTGTCGGCGGTCAAGGCGACCGCCGTA-5'   Template strand of DNA 
5'-GCGGCGACGCGCAGTTAATCCCACAGCCGCCAGTTCCGCTGGCGGCAT-3'   Coding strand of DNA 


Consensus sequence 


Not all base sequences of promoter sites are 
identical. However, they do possess common 
features, which can be represented by an 
idealized consensus sequence. Each base 
in the consensus sequence TATAAT is found 
in most prokaryotic promoters. Nearly all 
promoter sequences differ from this 
consensus sequence at only one or 
two bases. 


Figure 4.30 Promoter sites for transcription in (A) prokaryotes and 
(B) eukaryotes. Consensus sequences are shown. The first nucleotide to be 
transcribed is numbered +1. The adjacent nucleotide on the 5' side is 
numbered −1. The sequences shown are those of the coding strand of DNA. 


Figure 4.29 Complementarity between mRNA and DNA. The base sequence of mRNA (red) 
is the complement of that of the DNA template strand (blue). The sequence shown here is from 
the tryptophan operon, a segment of DNA containing the genes for five enzymes that catalyze the 
synthesis of tryptophan. The other strand of DNA (black) is called the coding strand because it has 
the same sequence as the RNA transcript except for thymine (T) in place of uracil (U). 
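
The complementarity shown in Figure 4.29 can be checked mechanically. The short Python sketch below is illustrative only: the function name `transcribe` is an assumption made for this example, and the code does nothing more than apply the base-pairing rules (A with U, T with A, G with C, C with G) to the template strand written 3' to 5'.

```python
# Base-pairing rules for reading a DNA template strand into RNA.
PAIR = {"A": "U", "T": "A", "G": "C", "C": "G"}

def transcribe(template_3to5: str) -> str:
    """Return the mRNA (5'->3') made from a DNA template written 3'->5'."""
    return "".join(PAIR[base] for base in template_3to5)

# Template and coding strands from Figure 4.29 (tryptophan operon segment).
template = "CGCCGCTGCGCGTCAATTAGGGTGTCGGCGGTCAAGGCGACCGCCGTA"  # 3'->5'
coding = "GCGGCGACGCGCAGTTAATCCCACAGCCGCCAGTTCCGCTGGCGGCAT"    # 5'->3'

mrna = transcribe(template)
print(mrna)
# The transcript equals the coding strand with U in place of T.
print(mrna == coding.replace("T", "U"))
```

The second line printed confirms the statement in the caption: the coding strand and the transcript differ only in thymine versus uracil.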


Transcription begins near promoter sites and ends at terminator sites 


RNA polymerase must detect and transcribe discrete genes from within 
large stretches of DNA. What marks the beginning of the unit to be tran- 
scribed? DNA templates contain regions called promoter sites that specifi- 
cally bind RNA polymerase and determine where transcription begins. In 
bacteria, two sequences on the 5’ (upstream) side of the first nucleotide to 


[Figure 4.30 diagram. (A) Prokaryotic promoter site: the −35 region and the 
Pribnow box (−10) lie upstream of the start of RNA (+1). (B) Eukaryotic 
promoter site: the CAAT box (−75, sometimes present) and the TATA box 
(Hogness box, −25) lie upstream of the start of RNA (+1).] 


be transcribed function as promoter sites (Figure 4.30A). One of them, 
called the Pribnow box, has the consensus sequence TATAAT and is 
centered at −10 (10 nucleotides on the 5’ side of the first nucleotide 
transcribed, which is denoted by +1). The other, called the −35 region, has 
the consensus sequence TTGACA. The first nucleotide transcribed is 
usually a purine. 
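
Because real promoters usually differ from the consensus at one or two bases, a search for promoter elements is a search for the closest match rather than an exact one. The following Python sketch shows the idea under simple assumptions: the helper names `mismatches` and `best_match` and the DNA fragment are invented for illustration, and the score is a plain count of differing positions.

```python
def mismatches(site: str, consensus: str) -> int:
    """Count positions at which a candidate site differs from the consensus."""
    return sum(a != b for a, b in zip(site, consensus))

def best_match(dna: str, consensus: str) -> tuple[int, str, int]:
    """Slide the consensus along the coding strand and return
    (offset, site, mismatch count) for the closest match."""
    k = len(consensus)
    windows = [(i, dna[i:i + k]) for i in range(len(dna) - k + 1)]
    offset, site = min(windows, key=lambda w: mismatches(w[1], consensus))
    return offset, site, mismatches(site, consensus)

# Hypothetical coding-strand fragment, invented for illustration only.
fragment = "GGC" + "TTGACA" + "TCCGATCGGATCCGGC" + "TATAAT" + "GCGCCGCA"

print(best_match(fragment, "TTGACA"))  # -35 region consensus
print(best_match(fragment, "TATAAT"))  # Pribnow box consensus
```

A site such as TATGAT would score one mismatch against TATAAT and would still be a plausible Pribnow box by the criterion given in the text.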

Eukaryotic genes encoding proteins have promoter sites with a TATAAA 
consensus sequence, called a TATA box or a Hogness box, centered at about 
−25 (Figure 4.30B). Many eukaryotic promoters also have a CAAT box 
with a GGNCAATCT consensus sequence centered at about −75. The 
transcription of eukaryotic genes is further stimulated by enhancer sequences, 
which can be quite distant (as many as several kilobases) from the start site, 
on either its 5’ or its 3’ side. 

RNA polymerase proceeds along the DNA template, transcribing one of 
its strands until it synthesizes a terminator sequence. This sequence encodes 
a termination signal, which in E. coli is a base-paired hairpin on the newly 
synthesized RNA molecule (Figure 4.31). This hairpin is formed by base- 
pairing of self-complementary sequences that are rich in G and C. Nascent 
RNA spontaneously dissociates from RNA polymerase when this hairpin is 
followed by a string of U residues. Alternatively, RNA synthesis can be 
terminated by the action of rho, a protein. Less is known about the termina- 
tion of transcription in eukaryotes. A more detailed discussion of the initia- 
tion and termination of transcription will be given in Chapter 29. The 
important point now is that discrete start and stop signals for transcription are 
encoded in the DNA template. 
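
The termination signal described above (a G+C-rich, self-complementary hairpin followed by a run of U residues) can be expressed as a simple pattern search. The Python sketch below makes strong simplifying assumptions that are not part of the text: fixed stem and loop lengths, perfect base-pairing in the stem, and an invented transcript. The function name `find_terminator` is hypothetical.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def reverse_complement(rna: str) -> str:
    """Sequence that would pair with `rna` in an antiparallel stem."""
    return "".join(COMPLEMENT[b] for b in reversed(rna))

def find_terminator(rna: str, stem: int = 6, loop: int = 4, min_u: int = 4) -> int:
    """Return the index where a stem-loop-stem followed by a U run starts, or -1.

    Assumptions for this sketch: fixed stem and loop lengths, perfect
    pairing in the stem, and a stem in which at least half the bases are G or C.
    """
    span = 2 * stem + loop
    for i in range(len(rna) - span - min_u + 1):
        left = rna[i:i + stem]
        right = rna[i + stem + loop:i + span]
        tail = rna[i + span:i + span + min_u]
        gc_rich = 2 * sum(b in "GC" for b in left) >= stem
        if right == reverse_complement(left) and gc_rich and tail == "U" * min_u:
            return i
    return -1

# Hypothetical 3' end of a transcript, invented for illustration only.
transcript = "AAUC" + "GCCGCC" + "AGUU" + "GGCGGC" + "UUUUUU"
print(find_terminator(transcript))
```

Real terminators vary in stem length, loop size, and pairing quality, so a practical search would score candidate hairpins rather than demand an exact pattern.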

In eukaryotes, the messenger RNA is modified after transcription 
(Figure 4.32). A “cap” structure is attached to the 5’ end, and a sequence of 
adenylates, the poly(A) tail, is added to the 3’ end. These modifications will 
be presented in detail in Chapter 29. 


[Figure 4.32 diagram: 5' cap, coding region, and poly(A) tail at the 3' end.] 


Transfer RNAs are the adaptor molecules in protein synthesis 


We have seen that mRNA is the template for protein synthesis. How then 
does it direct amino acids to become joined in the correct sequence to form 
a protein? In 1958, Francis Crick wrote: 


RNA presents mainly a sequence of sites where hydrogen bonding could 
occur. One would expect, therefore, that whatever went onto the template in 
a specific way did so by forming hydrogen bonds. It is therefore a natural 
hypothesis that the amino acid is carried to the template by an adaptor 
molecule, and that the adaptor is the part that actually fits onto the 
RNA. In its simplest form, one would require twenty adaptors, one for each 
amino acid. 


This highly innovative hypothesis soon became established as fact. The 
adaptors in protein synthesis are transfer RNAs. The structure and reactions 
of these remarkable molecules will be considered in detail in Chapter 30. 
For the moment, it suffices to note that tRNAs contain an amino acid- 
attachment site and a template-recognition site. A tRNA molecule carries a 
specific amino acid in an activated form to the site of protein synthesis. The 
carboxyl group of this amino acid is esterified to the 3’- or 2’-hydroxyl 
group of the ribose unit at the 3’ end of the tRNA chain (Figure 4.33). The 
Figure 4.31 Base sequence of the 3’ end 
of an mRNA transcript in E. coli. A stable 
hairpin structure is followed by a sequence of 
uridine (U) residues. 


Figure 4.32 Modification of mRNA. 
Messenger RNA in eukaryotes is modified 
after transcription. A nucleotide “cap” structure 
is added to the 5’ end, and a poly(A) tail is 
added at the 3’ end. 




Figure 4.33 Attachment of an amino acid 
to a tRNA molecule. The amino acid 
(shown in blue) is esterified to the 3’-hydroxyl 
group of the terminal adenylate of tRNA. 
[Figure 4.34 diagram labels: amino acid; amino acid-attachment site (3' end); 
phosphorylated 5' terminus (5' p); anticodon.] 


Figure 4.34 General structure of an aminoacyl-tRNA. The amino acid is attached at the 3’ 
end of the RNA. The anticodon is the template-recognition site. Notice that the tRNA has a 
cloverleaf structure with many hydrogen bonds (green dots) between bases. 


joining of an amino acid to a tRNA molecule to form an aminoacyl-tRNA is 
catalyzed by a specific enzyme called an aminoacyl-tRNA synthetase. This 
esterification reaction is driven by ATP cleavage. There is at least one spe- 
cific synthetase for each of the 20 amino acids. The template-recognition 
site on tRNA is a sequence of three bases called an anticodon (Figure 4.34). 
The anticodon on tRNA recognizes a complementary sequence of three 
bases, called a codon, on mRNA. 
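
Codon recognition by an anticodon follows the same pairing rules as a double helix, with the two strands running in opposite directions. As a minimal sketch, assuming strict Watson-Crick pairing at all three positions (real tRNAs relax this at one position and contain modified bases), the anticodon for a codon can be derived as follows. The function name `anticodon_for` is an assumption for this example.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def anticodon_for(codon: str) -> str:
    """Return the anticodon (written 5'->3') that pairs with an mRNA codon (5'->3')."""
    # The strands pair antiparallel, so take the complement of the reversed codon.
    return "".join(COMPLEMENT[base] for base in reversed(codon))

print(anticodon_for("AUG"))  # CAU
```

Both sequences are written 5' to 3', which is why the anticodon reads as the reverse complement of the codon rather than as its simple complement.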


4.6 Amino Acids Are Encoded by Groups of Three Bases 
Starting from a Fixed Point 


The genetic code is the relation between the sequence of bases in DNA (or 
its RNA transcripts) and the sequence of amino acids in proteins. 
Experiments by Marshall Nirenberg, Har Gobind Khorana, Francis Crick, 
Sydney Brenner, and others established the following features of the 
genetic code by 1961: 


1. Three nucleotides encode an amino acid. Proteins are built from a basic set 
of 20 amino acids, but there are only four bases. Simple calculations show 
that a minimum of three bases is required to encode at least 20 amino acids. 
Genetic experiments showed that an amino acid is in fact encoded by a group 
of three bases, or codon. 


2. The code is nonoverlapping. Consider a base sequence ABCDEF. In an 
overlapping code, ABC specifies the first amino acid, BCD the next, CDE 
the next, and so on. In a nonoverlapping code, ABC designates the first 
amino acid, DEF the second, and so forth. Genetic experiments again 
established the code to be nonoverlapping. 


3. The code has no punctuation. In principle, one base (denoted as Q) might 
serve as a “comma” between groups of three bases. 


... QABCQDEFQGHIQJKLQ... 


However, this is not the case. Rather, the sequence of bases is read sequentially 
from a fixed starting point, without punctuation. 


4. The genetic code is degenerate. Most amino acids are encoded by more 
than one codon. There are 64 possible base triplets and only 20 amino acids, 
and in fact 61 of the 64 possible triplets specify particular amino acids. 
Three triplets (called stop codons) designate the termination of translation. 
Thus, for most amino acids, there is more than one code word. 
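
The "simple calculations" in point 1 and the two reading schemes in point 2 can be made concrete with a few lines of Python. This is only a sketch; the function names are chosen for illustration.

```python
def words(n_bases: int, alphabet: int = 4) -> int:
    """Number of distinct code words of a given length built from four bases."""
    return alphabet ** n_bases

for n in (1, 2, 3):
    # 4 and 16 words are too few for 20 amino acids; 64 is enough.
    print(n, words(n))

def read_overlapping(seq: str, size: int = 3) -> list[str]:
    """Each successive base starts a new code word."""
    return [seq[i:i + size] for i in range(len(seq) - size + 1)]

def read_nonoverlapping(seq: str, size: int = 3) -> list[str]:
    """Code words are read in consecutive, separate groups."""
    return [seq[i:i + size] for i in range(0, len(seq) - size + 1, size)]

print(read_overlapping("ABCDEF"))     # ['ABC', 'BCD', 'CDE', 'DEF']
print(read_nonoverlapping("ABCDEF"))  # ['ABC', 'DEF']
```

With 64 triplets available for 20 amino acids and termination, some redundancy is unavoidable, which is the degeneracy described in point 4.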


Major features of the genetic code 


All 64 codons have been deciphered (Table 4.5). Because the code is highly 
degenerate, only tryptophan and methionine are encoded by just one triplet 
each. Each of the other 18 amino acids is encoded by two or more. Indeed, 
leucine, arginine, and serine are specified by six codons each. The number 
of codons for a particular amino acid correlates with its frequency of occur- 
rence in proteins. 

Codons that specify the same amino acid are called synonyms. For 
example, CAU and CAC are synonyms for histidine. Note that syn- 
onyms are not distributed haphazardly throughout the genetic code. In 
Table 4.5, an amino acid specified by two or more synonyms occupies a 
single box (unless it is specified by more than four synonyms). The 
amino acids in a box are specified by codons that have the same first two 
bases but differ in the third base, as exemplified by GUU, GUC, GUA, 
and GUG. Thus, most synonyms differ only in the last base of the triplet. 
Inspection of the code shows that XYC and XYU always encode the same 
amino acid, whereas XYG and XYA usually encode the same amino acid. 
The structural basis for these equivalences of codons will become evident 


Table 4.5 The genetic code 

First position              Second position              Third position
(5' end)            U        C        A        G         (3' end)

U                   Phe      Ser      Tyr      Cys       U
                    Phe      Ser      Tyr      Cys       C
                    Leu      Ser      Stop     Stop      A
                    Leu      Ser      Stop     Trp       G

C                   Leu      Pro      His      Arg       U
                    Leu      Pro      His      Arg       C
                    Leu      Pro      Gln      Arg       A
                    Leu      Pro      Gln      Arg       G

A                   Ile      Thr      Asn      Ser       U
                    Ile      Thr      Asn      Ser       C
                    Ile      Thr      Lys      Arg       A
                    Met      Thr      Lys      Arg       G

G                   Val      Ala      Asp      Gly       U
                    Val      Ala      Asp      Gly       C
                    Val      Ala      Glu      Gly       A
                    Val      Ala      Glu      Gly       G

Note: The codon AUG on mRNA specifies methionine, whereas CAU specifies 
histidine. UAA, UAG, and UGA are termination signals. AUG is part of the 
initiation signal, in addition to coding for internal methionine residues. 
when we consider the nature of the anticodons of tRNA molecules 
(Section 30.3). 
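
The regularities just described can be verified directly from the code table. The Python sketch below encodes Table 4.5 as a dictionary (one-letter amino acid symbols, with `*` for stop; the variable names are assumptions for this example) and then counts synonyms and tests the third-base rules.

```python
from collections import Counter

# Table 4.5 in compact form: first, second, and third bases each run U, C, A, G.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {a + b + c: AMINO[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)}

# Number of codons (synonyms) for each amino acid, stops excluded.
synonyms = Counter(aa for aa in CODE.values() if aa != "*")
print(sorted(synonyms.items(), key=lambda item: item[1]))

PREFIXES = [a + b for a in BASES for b in BASES]  # the 16 possible "XY" pairs

# XYU and XYC always encode the same amino acid.
print(all(CODE[xy + "U"] == CODE[xy + "C"] for xy in PREFIXES))

# XYA and XYG usually do; list the exceptions.
print([xy for xy in PREFIXES if CODE[xy + "A"] != CODE[xy + "G"]])
```

The exceptions printed by the last line are UG (UGA is a stop signal, UGG is tryptophan) and AU (AUA is isoleucine, AUG is methionine), the same two boxes that contain the only single-codon amino acids.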

What is the biological significance of the extensive degeneracy of the 
genetic code? If the code were not degenerate, 20 codons would designate 
amino acids and 44 would lead to chain termination. The probability of 
mutating to chain termination would therefore be much higher with a 
nondegenerate code. Chain-termination mutations usually lead to inactive 
proteins, whereas substitutions of one amino acid for another are usually 
rather harmless. Moreover, the code is constructed such that a change in 
any single nucleotide base of a codon results in a synonym or an amino acid 
with similar chemical properties. Thus, degeneracy minimizes the deleterious 
effects of mutations. 
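
One way to see how degeneracy buffers mutations is to enumerate every single-base change of every amino acid-specifying codon and classify the outcome. The sketch below does this for the standard code of Table 4.5; the category names and the helper `classify` are assumptions for this example, and the tally says nothing about chemical similarity between the exchanged amino acids.

```python
from collections import Counter

BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {a + b + c: AMINO[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)}

def classify(codon: str, mutant: str) -> str:
    """Label a codon change as synonymous, missense, or nonsense (new stop)."""
    before, after = CODE[codon], CODE[mutant]
    if after == before:
        return "synonymous"
    return "nonsense" if after == "*" else "missense"

tally = Counter()
for codon, amino_acid in CODE.items():
    if amino_acid == "*":
        continue  # start only from codons that specify an amino acid
    for position in range(3):
        for base in BASES:
            if base != codon[position]:
                mutant = codon[:position] + base + codon[position + 1:]
                tally[classify(codon, mutant)] += 1

print(dict(tally))
```

Of the 549 possible single-base changes (61 codons, 9 changes each), 134 are synonymous, 392 replace one amino acid with another, and only 23 create a stop signal.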


Messenger RNA contains start and stop signals for protein synthesis 


Messenger RNA is translated into proteins on ribosomes—large molecular 
complexes assembled from proteins and ribosomal RNA. How is mRNA 
interpreted by the translation apparatus? The start signal for protein 
synthesis is complex in bacteria. Polypeptide chains in bacteria start with 
a modified amino acid—namely, formylmethionine (fMet). A specific 
tRNA, the initiator tRNA, carries fMet. This fMet-tRNA recognizes the 
codon AUG or, less frequently, GUG. However, AUG is also the codon 
for an internal methionine residue, and GUG is the codon for an internal 
valine residue. Hence, the signal for the first amino acid in a prokaryotic 
polypeptide chain must be more complex than that for all subsequent ones. 
AUG (or GUG) is only part of the initiation signal (Figure 4.35). In bacteria, 
the initiating AUG (or GUG) codon is preceded several nucleotides away 
by a purine-rich sequence, called the Shine-Dalgarno sequence, that base-pairs
with a complementary sequence in a ribosomal RNA molecule 
(Section 30.3). In eukaryotes, the AUG closest to the 5’ end of an mRNA 
molecule is usually the start signal for protein synthesis. This particular 
AUG is read by an initiator tRNA conjugated to methionine. After the ini- 
tiator AUG has been located, the reading frame is established—groups of 
three nonoverlapping nucleotides are defined, beginning with the initiator 
AUG codon. 

As already mentioned, UAA, UAG, and UGA designate chain termina- 
tion. These codons are read not by tRNA molecules but rather by specific 
proteins called release factors (Section 30.3). Binding of a release factor to the 
ribosome releases the newly synthesized protein. 
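
The start and stop rules can be combined into a small translation routine. The Python sketch below follows the eukaryotic rule stated above (the first AUG from the 5' end sets the reading frame) and stops at the first in-frame UAA, UAG, or UGA. It is a simplified model: the function name `translate` and the message are invented for illustration, and the bacterial requirement for a Shine-Dalgarno sequence is not modeled.

```python
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {a + b + c: AMINO[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)}

def translate(mrna: str) -> str:
    """Translate from the first AUG to the first in-frame stop codon."""
    start = mrna.find("AUG")
    if start == -1:
        return ""  # no start signal, no protein
    protein = []
    # The initiator AUG fixes the reading frame: nonoverlapping triplets.
    for i in range(start, len(mrna) - 2, 3):
        amino_acid = CODE[mrna[i:i + 3]]
        if amino_acid == "*":
            break  # stop codons are read by release factors, not by tRNA
        protein.append(amino_acid)
    return "".join(protein)

# Hypothetical message: leader, start codon, three codons, stop codon, trailer.
message = "GGAGGUAACU" + "AUG" + "GCUUGGCAU" + "UAA" + "GGC"
print(translate(message))  # MAWH
```

Shifting the start by one or two bases would change every downstream triplet, which is why locating the initiator AUG is equivalent to establishing the reading frame.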


[Figure 4.35 diagram. (A) Prokaryotic start signal: a purine-rich region of 
the mRNA near −10 base-pairs with ribosomal RNA, and the AUG at +1 is read as 
fMet at the start of the protein. (B) Eukaryotic start signal: the first AUG 
from the 5' end (cap) of the mRNA, at +1, is read as methionine (H2N-Met) at 
the start of the protein.] 


Figure 4.35 Initiation of protein synthesis. Start signals are required for the initiation of protein 
synthesis in (A) prokaryotes and (B) eukaryotes. 


The genetic code is nearly universal 


Is the genetic code the same in all organisms? This question was answered 
by examining the base sequences of many wild-type and mutant genes, as 
well as the amino acid sequences of their encoded proteins. For each mutant, 
the nucleotide change in the gene leads to a change in the amino acid as 
predicted by the genetic code. Furthermore, mRNAs can be correctly trans- 
lated by the protein-synthesizing machinery of very different species. For 
example, human hemoglobin mRNA is correctly translated by a wheat- 
germ extract, and bacteria efficiently express recombinant DNA molecules 
encoding human proteins such as insulin. These experimental findings 
strongly suggested that the genetic code is universal. 

A surprise was encountered when the sequence of human mitochondrial 
DNA became known. Ribosomes read UGA in human mitochondria as a 
codon for tryptophan rather than as a stop signal (Table 4.6). Furthermore, 
AGA and AGG are read as stop signals rather than as codons for arginine, 
and AUA is read as a codon for methionine instead of isoleucine. 
Mitochondria of other species, such as those of yeast, also have genetic 
codes that differ slightly from the standard one. The genetic code of mito- 
chondria can differ from that of the rest of the cell because mitochondrial 
DNA encodes a distinct set of tRNAs. Do any cellular protein-synthesizing 
systems deviate from the standard genetic code? At least 16 organisms 
deviate from the standard genetic code. Ciliated protozoa differ from most 
organisms in reading UAA and UAG as codons for amino acids rather than 
as stop signals; UGA is their sole termination signal. Thus, the genetic code 
is nearly but not absolutely universal. Variations clearly exist in mitochondria 
and in species, such as ciliates, that branched off very early in eukaryotic 
evolution. It is interesting to note that two of the codon reassignments in 
human mitochondria diminish the information content of the third base of 
the triplet. For instance, in the common genetic code, AUG encodes 
methionine only while AUA is a codon for isoleucine. However, in human 
mitochondria both AUA and AUG specify methionine. Most variations 
from the standard genetic code are in the direction of a simpler code. 
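
The human mitochondrial code of Table 4.6 can be represented as the standard code with four reassigned codons. The Python sketch below translates one invented message under both codes to show the difference; the names `STANDARD`, `MITOCHONDRIAL`, and `translate_frame` are assumptions for this example.

```python
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {a + b + c: AMINO[16 * i + 4 * j + k]
            for i, a in enumerate(BASES)
            for j, b in enumerate(BASES)
            for k, c in enumerate(BASES)}

# Table 4.6: the distinctive codons of human mitochondria.
MITOCHONDRIAL = {**STANDARD, "UGA": "W", "AUA": "M", "AGA": "*", "AGG": "*"}

def translate_frame(mrna: str, code: dict[str, str]) -> str:
    """Translate from the first base until a stop codon under the given code."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = code[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# Hypothetical message: AUG AUA UGA GCU AGA UUU
message = "AUGAUAUGAGCUAGAUUU"
print(translate_frame(message, STANDARD))       # MI
print(translate_frame(message, MITOCHONDRIAL))  # MMWA
```

Under the standard code, translation stops at UGA after two residues; under the mitochondrial code, UGA is read as tryptophan and translation continues until AGA.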

Why has the code remained nearly invariant through billions of years 
of evolution, from bacteria to human beings? A mutation that altered the 
reading of mRNA would change the amino acid sequence of most, if not all, 
proteins synthesized by that particular organism. Many of these changes 
would undoubtedly be deleterious, and so there would be strong selection 
against a mutation with such pervasive consequences. 


4.7 Most Eukaryotic Genes Are Mosaics of 
Introns and Exons 


In bacteria, polypeptide chains are encoded by a continuous array of triplet 
codons in DNA. For many years, genes in higher organisms also were 
assumed to be continuous; the DNA sequence encoding a gene had a 
discrete beginning and ending with no other interrupting, noncoding DNA 
sequences. This view was unexpectedly shattered in 1977, when investiga- 
tors, including Philip Sharp and Richard Roberts, discovered that several 
genes are discontinuous. The mosaic nature of eukaryotic genes was revealed 
by electron microscopic studies of hybrids formed between mRNA and a 
segment of DNA containing the corresponding gene (Figure 4.36). For 
example, the gene for the β chain of hemoglobin is interrupted within its 
amino acid-coding sequence by a long intron of 550 base pairs and a short 
one of 120 base pairs. Thus, the β-globin gene is split into three coding 
Table 4.6 Distinctive codons of human mitochondria 

            Standard    Mitochondrial
Codon       code        code
UGA         Stop        Trp
UGG         Trp         Trp
AUA         Ile         Met
AUG         Met         Met
AGA         Arg         Stop
AGG         Arg         Stop
Figure 4.37 Structure of the β-globin gene. 
Figure 4.38 Transcription and processing 
of the β-globin gene. The gene is 
transcribed to yield the primary transcript, 
which is modified by cap and poly(A) 
addition. The introns in the primary RNA 
transcript are removed to form the mRNA. 
Figure 4.36 Detection of introns by electron microscopy. An mRNA molecule (shown in red) 
is hybridized to genomic DNA containing the corresponding gene. (A) A single loop of 
single-stranded DNA (shown in blue) is seen if the gene is continuous. (B) Two loops of single-stranded 
DNA (blue) and a loop of double-stranded DNA (blue and green) are seen if the gene contains an 
intron. Additional loops are evident if more than one intron is present. 


sequences (Figure 4.37). The average human gene has 8 
introns, and some have more than 100. Intron size ranges 
from 50 to 10,000 nucleotides. 


RNA processing generates mature RNA 


At what stage in gene expression are introns removed? 
Newly synthesized RNA chains (pre-mRNA or primary 
transcript) isolated from nuclei are much larger than the mRNA molecules 
derived from them; in regard to β-globin RNA, the former consists 
of approximately 1600 nucleotides and the latter approximately 900 nucleotides. 
In fact, the primary transcript of the β-globin gene contains two 
regions that are not present in the mRNA. These regions in the primary transcript 
are excised, and the coding sequences are simultaneously linked by a 
precise splicing enzyme to form the mature mRNA (Figure 4.38). Regions that 
are removed from the primary transcript are called introns (for intervening 
sequences), whereas those that are retained in the mature RNA are called 
exons (for expressed sequences). A common feature in the expression of 
discontinuous, or split, genes is that their exons are ordered in the same 
sequence in mRNA as in DNA. Thus, the codons in split genes, like con- 
tinuous genes, are in the same linear order as the amino acids in the poly- 
peptide products. 

Splicing is a complex operation that is carried out by spliceosomes, which 
are assemblies of proteins and small RNA molecules. RNA plays the 
catalytic role (Section 29.3). This enzymatic machinery recognizes signals 
in the nascent RNA that specify the splice sites. Introns nearly always begin 


[Figure 4.39 diagram: an intron flanked by the 5' splice site and the 3' splice site.] 

Figure 4.39 Consensus sequence for the splicing of mRNA precursors. 


with GU and end with an AG that is preceded by a pyrimidine- 
rich tract (Figure 4.39). This consensus sequence is part of the 
signal for splicing. 
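
The bookkeeping of splicing, namely removing introns while keeping exons in their original order, can be sketched in Python as follows. The sequences and the function name `splice` are invented for illustration, intron positions are supplied by hand rather than discovered, and the only signal checked is the GU ... AG boundary rule stated above.

```python
def splice(pre_mrna: str, introns: list[tuple[int, int]]) -> str:
    """Remove introns given as (start, end) index pairs; exons keep their order."""
    exons, cursor = [], 0
    for start, end in sorted(introns):
        intron = pre_mrna[start:end]
        # Consensus check from the text: introns begin with GU and end with AG.
        if not (intron.startswith("GU") and intron.endswith("AG")):
            raise ValueError(f"intron {start}-{end} lacks GU...AG boundaries")
        exons.append(pre_mrna[cursor:start])
        cursor = end
    exons.append(pre_mrna[cursor:])
    return "".join(exons)

# Hypothetical primary transcript: exon, intron (with a pyrimidine-rich
# tract before the final AG), exon.
exon1, intron1, exon2 = "AUGGCU", "GUAAGUCCUUUUCCAG", "UGGCAUUAA"
pre_mrna = exon1 + intron1 + exon2

mature = splice(pre_mrna, [(len(exon1), len(exon1) + len(intron1))])
print(mature)  # AUGGCUUGGCAUUAA
```

Because exons are joined in the order in which they occur in the primary transcript, the codons of the mature mRNA remain colinear with the gene.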


Many exons encode protein domains 


Most genes of higher eukaryotes, such as birds and mammals, 
are split. Lower eukaryotes, such as yeast, have a much higher 
proportion of continuous genes. In prokaryotes, split genes are 
extremely rare. Have introns been inserted into genes in the 
evolution of higher organisms? Or have introns been removed 
from genes to form the streamlined genomes of prokaryotes and 
simple eukaryotes? Comparisons of the DNA sequences of 
genes encoding proteins that are highly conserved in evolution 
suggest that introns were present in ancestral genes and were 
lost in the evolution of organisms that have become optimized for 
very rapid growth, such as prokaryotes. The positions of introns in some 
genes are at least 1 billion years old. Furthermore, a common mechanism of 
splicing developed before the divergence of fungi, plants, and vertebrates, 
as shown by the finding that mammalian cell extracts can splice yeast 
RNA. 

Figure 4.40 Exon shuffling. Exons can be readily shuffled by 
recombination of DNA to expand the genetic repertoire. 

What advantages might split genes confer? Many exons encode discrete 
structural and functional units of proteins. An attractive hypothesis is that 
new proteins arose in evolution by the rearrangement of exons encoding discrete 
structural elements, binding sites, and catalytic sites, a process called exon 
shuffling. Because it preserves functional units but allows them to interact in 
new ways, exon shuffling is a rapid and efficient means of generating novel 
genes (Figure 4.40). Introns are extensive regions in which DNA can break 
and recombine with no deleterious effect on encoded proteins. In contrast, 
the exchange of sequences between different exons usually leads to loss of 
function. 

Another advantage conferred by split genes is the potential for generating a 
series of related proteins by splicing a nascent RNA transcript in different ways. 
For example, a precursor of an antibody-producing cell forms an antibody 
that is anchored in the cell’s plasma membrane (Figure 4.41). The attached 
antibody recognizes a specific foreign antigen, an event that leads to cell 
differentiation and proliferation. The activated antibody-producing cells 
then splice their nascent RNA transcript in an alternative manner to form 
soluble antibody molecules that are secreted rather than retained on the cell 
surface. We see here a clear-cut example of a benefit conferred by the 
complex arrangement of introns and exons in higher organisms. Alternative 
splicing is a facile means of forming a set of proteins that are variations of a 
basic motif according to a developmental program without requiring a gene for 
each protein. 
Figure 4.41 Alternative splicing. 
Alternative splicing generates mRNAs that 
are templates for different forms of a 
protein: (A) a membrane-bound antibody 
on the surface of a lymphocyte and (B) its 
soluble counterpart, exported from the cell. 
The membrane-bound antibody is 
anchored to the plasma membrane by a 
helical segment (highlighted in yellow) that 
is encoded by its own exon. 
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Summary 


4.1 


4.2 


4.3 


A Nucleic Acid Consists of Four Kinds of Bases Linked to a 
Sugar—Phosphate Backbone 

DNA and RNA are linear polymers of a limited number of monomers. 
In DNA, the repeating units are nucleotides, with the sugar being a 
deoxyribose and the bases being adenine (A), thymine (T), guanine (G), 
and cytosine (C). In RNA, the sugar is a ribose and the base uracil (U) 
is used in place of thymine. DNA is the molecule of heredity in all 
prokaryotic and eukaryotic organisms. In viruses, the genetic material is 
either DNA or RNA. 


4.2 A Pair of Nucleic Acid Chains with Complementary Sequences Can
Form a Double-Helical Structure

All cellular DNA consists of two very long, helical polynucleotide 
chains coiled around a common axis. The sugar–phosphate backbone of
each strand is on the outside of the double helix, whereas the purine and 
pyrimidine bases are on the inside. The two chains are held together by 
hydrogen bonds between pairs of bases: adenine is always paired with 
thymine, and guanine is always paired with cytosine. Hence, one strand 
of a double helix is the complement of the other. The two strands of the 
double helix run in opposite directions. Genetic information is encoded 
in the precise sequence of bases along a strand. 
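
The pairing rules just summarized mean that either strand fully specifies the other. As a minimal sketch (the function name and the example sequence are illustrative assumptions, not taken from the text), the complement of a strand can be written in the conventional 5' → 3' direction by pairing each base and reversing the order, because the two strands run in opposite directions:

```python
# Watson-Crick pairing rules for DNA: A pairs with T, G pairs with C.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(strand: str) -> str:
    """Return the complementary strand, written 5'->3'.

    The input is assumed to be written 5'->3'. Because the strands are
    antiparallel, the complement is read in the reverse order.
    """
    return "".join(PAIRS[base] for base in reversed(strand.upper()))


example = "ATGGCA"  # hypothetical sequence, chosen only for illustration
print(reverse_complement(example))  # TGCCAT
```

Applying the function twice returns the starting sequence, which is one way of stating that each strand is the complement of the other.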

DNA is a structurally dynamic molecule that can exist in a variety 
of helical forms: A-DNA, B-DNA (the classic Watson–Crick helix),
and Z-DNA. In A-, B-, and Z-DNA, two antiparallel chains are held
together by Watson–Crick base pairs and stacking interactions between
bases in the same strand. A- and B-DNA are right-handed helices. 
In B-DNA, the base pairs are nearly perpendicular to the helix axis. 
Z-DNA is a left-handed helix. Most of the DNA in a cell is in the 
B-form. 

Double-stranded DNA can also wrap around itself to form a super- 
coiled structure. The supercoiling of DNA has two important conse- 
quences. Supercoiling compacts the DNA and, because supercoiled 
DNA is partly unwound, it is more accessible for interactions with other 
biomolecules. 

Single-stranded nucleic acids, most notably RNA, can form compli- 
cated three-dimensional structures that may contain extensive double- 
helical regions that arise from the folding of the chain into hairpins. 


4.3 The Double Helix Facilitates the Accurate Transmission of
Hereditary Information

The structural nature of the double helix readily accounts for the accu- 
rate replication of genetic material because the sequence of bases in one 
strand determines the sequence of bases in the other strand. In replica- 
tion, the strands of the helix separate and a new strand complementary 
to each of the original strands is synthesized. Thus, two new double 
helices are generated, each composed of one strand from the original 
molecule and one newly synthesized strand. This mode of replication is 
called semiconservative replication because each new helix retains one 
of the original strands. 

In order for replication to take place, the strands of the double helix 
must be separated. In vitro, heating a solution of double-helical DNA 
separates the strands, a process called melting. On cooling, the strands 
reanneal and re-form the double helix. In the cell, special proteins tem- 
porarily separate the strands in replication. 


4.4 DNA Is Replicated by Polymerases That Take
Instructions from Templates

In the replication of DNA, the two strands of a double helix unwind and 
separate as new chains are synthesized. Each parent strand acts as a 
template for the formation of a new complementary strand. The replication
of DNA is a complex process carried out by many proteins, including
several DNA polymerases. The activated precursors in the synthesis
of DNA are the four deoxyribonucleoside 5'-triphosphates. The new
strand is synthesized in the 5’ → 3’ direction by a nucleophilic attack
by the 3’-hydroxyl terminus of the primer strand on the innermost
phosphorus atom of the incoming deoxyribonucleoside triphosphate.
Most important, DNA polymerases catalyze the formation of a phos- 
phodiester linkage only if the base on the incoming nucleotide is 
complementary to the base on the template strand. In other words, 
DNA polymerases are template-directed enzymes. The genes of some 
viruses, such as tobacco mosaic virus, are made of single-stranded 
RNA. An RNA-directed RNA polymerase mediates the replication 
of this viral RNA. Retroviruses, exemplified by HIV-1, have a single- 
stranded RNA genome that undergoes reverse transcription into 
double-stranded DNA by reverse transcriptase, an RNA-directed 
DNA polymerase. 


4.5 Gene Expression Is the Transformation of DNA Information into
Functional Molecules

The flow of genetic information in normal cells is from DNA to RNA 
to protein. The synthesis of RNA from a DNA template is called tran- 
scription, whereas the synthesis of a protein from an RNA template is 
termed translation. Cells contain several kinds of RNA, among which 
are messenger RNA (mRNA), transfer RNA (tRNA), and ribosomal 
RNA (rRNA), which vary in size from 75 to more than 5000 nucle- 
otides. All cellular RNA is synthesized by RNA polymerases according 
to instructions given by DNA templates. The activated intermediates 
are ribonucleoside triphosphates and the direction of synthesis, like that 
of DNA, is 5’ → 3’. RNA polymerase differs from DNA polymerase in
not requiring a primer. 
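
Transcription follows the same template logic as replication, with uracil taking the place of thymine. The sketch below is an illustration under stated assumptions: the function name and the sample template are hypothetical, and the template strand is assumed to be written 5' → 3', so the transcript is read off in the opposite direction.

```python
# Base pairing between a DNA template and the RNA being synthesized.
DNA_TO_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe(template_5to3: str) -> str:
    """Return the RNA transcript (5'->3') of a DNA template strand (5'->3')."""
    # The transcript is antiparallel to the template, hence the reversal.
    return "".join(DNA_TO_RNA[base] for base in reversed(template_5to3.upper()))


print(transcribe("TACGGT"))  # ACCGUA
```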


4.6 Amino Acids Are Encoded by Groups of Three Bases
Starting from a Fixed Point

The genetic code is the relation between the sequence of bases in DNA 
(or its RNA transcript) and the sequence of amino acids in proteins. 
Amino acids are encoded by groups of three bases (called codons) start- 
ing from a fixed point. Sixty-one of the 64 codons specify particular 
amino acids, whereas the other 3 codons (UAA, UAG, and UGA) are 
signals for chain termination. Thus, for most amino acids, there is more 
than one code word. In other words, the code is degenerate. The genetic 
code is nearly the same in all organisms. Natural mRNAs contain start 
and stop signals for translation, just as genes do for directing where 
transcription begins and ends. 
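
The features of the code listed above (triplet codons, a fixed starting point, 61 sense codons, and 3 stop codons) can be made concrete with a short sketch. The table below is the standard genetic code written compactly; the function name, the one-letter amino acid symbols, and the sample mRNA are illustrative choices rather than anything prescribed by the text.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code in one-letter symbols ("*" marks a stop codon),
# listed for codons in the order UUU, UUC, UUA, UUG, UCU, ..., GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna: str) -> str:
    """Read triplets from the first base and stop at the first stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


stops = sorted(c for c, aa in CODON_TABLE.items() if aa == "*")
print(stops)                      # ['UAA', 'UAG', 'UGA']
print(translate("AUGGCUUGGUAA"))  # MAW
```

Counting the entries for a single amino acid (for example, six codons for leucine but only one for tryptophan) shows the degeneracy of the code directly.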


4.7 Most Eukaryotic Genes Are Mosaics of Introns and Exons

Most genes in higher eukaryotes are discontinuous. Coding sequences 
in these split genes, called exons, are separated by noncoding sequences, 
called introns, which are removed in the conversion of the primary tran- 
script into mRNA and other functional mature RNA molecules. Split 
genes, like continuous genes, are colinear with their polypeptide prod- 
ucts. A striking feature of many exons is that they encode functional 
domains in proteins. New proteins probably arose in the course of
evolution by the shuffling of exons. Introns may have been present in
primordial genes but were lost in the evolution of such fast-growing
organisms as bacteria and yeast.

Problems


1. A t instead of an s? Differentiate between a nucleoside 
and a nucleotide. 


2. A lovely pair. What is a Watson–Crick base pair?


3. Chargaff rules! Biochemist Erwin Chargaff was the first 
to note that, in DNA, [A] = [T] and [G] = [C], equalities 
now called Chargaff’s rule. Using this rule, determine the
percentages of all the bases in DNA that is 20% thymine. 


4. But not always. A single strand of RNA is 20% U. What
can you predict about the percentages of the remaining 
bases? 


5. Complements. Write the complementary sequence (in 
the standard 5’ → 3’ notation) for (a) GATCAA, (b)
TCGAAG, (c) ACGCGT, and (d) TACCAT. 


6. Compositional constraint. The composition (in mole- 
fraction units) of one of the strands of a double-helical 
DNA molecule is [A] = 0.30 and [G] = 0.24. (a) What can 
you say about [T] and [C] for the same strand? (b) What 
can you say about [A], [G], [T], and [C] of the complemen- 
tary strand? 


7. Size matters. Why are GC and AT the only base pairs 
permissible in the double helix? 


8. Strong, but not strong enough. Why does heat denature, 
or melt, DNA in solution? 


9. Uniqueness. The human genome contains 3 billion nucle- 
otides arranged in a vast array of sequences. What is the 
minimum length of a DNA sequence that will, in all prob- 
ability, appear only once in the human genome? You need 
consider only one strand and may assume that all four 
nucleotides have the same probability of appearance. 


10. Coming and going. What does it mean to say that the 
DNA chains in a double helix have opposite polarity? 


11. All for one. If the forces—hydrogen bonds and stacking 
forces—holding a helix together are weak, why is it difficult 
to disrupt a double helix? 


12. Overcharged. DNA in the form of a double helix must 
be associated with cations, usually Mg²⁺. Why is this
requirement the case? 


13. Not quite from A to Z. Describe the three forms that a 
double helix can assume. 


14. Lost DNA. The DNA of a deletion mutant of λ bacteriophage
has a length of 15 μm instead of 17 μm. How
many base pairs are missing from this mutant?


15. An unseen pattern. What result would Meselson and 
Stahl have obtained if the replication of DNA were conser- 
vative (i.e., the parental double helix stayed together)? Give 
the expected distribution of DNA molecules after 1.0 and 
2.0 generations for conservative replication. 


16. Tagging DNA. (a) Suppose that you want to radioac- 
tively label DNA but not RNA in dividing and growing 
bacterial cells. Which radioactive molecule would you add 
to the culture medium? (b) Suppose that you want to pre- 
pare DNA in which the backbone phosphorus atoms are 
uniformly labeled with ³²P. Which precursors should be
added to a solution containing DNA polymerase and 
primed template DNA? Specify the position of radioactive 
atoms in these precursors. 


17. Finding a template. A solution contains DNA polymerase
and the Mg²⁺ salts of dATP, dGTP, dCTP, and
TTP. The following DNA molecules are added to aliquots
of this solution. Which of them would lead to DNA synthe- 
sis? (a) A single-stranded closed circle containing 1000 
nucleotide units. (b) A double-stranded closed circle con- 
taining 1000 nucleotide pairs. (c) A single-stranded closed 
circle of 1000 nucleotides base-paired to a linear strand of 
500 nucleotides with a free 3’-OH terminus. (d) A double- 
stranded linear molecule of 1000 nucleotide pairs with a free 
3'-OH group at each end. 


18. Retrograde. What is a retrovirus and how does informa- 
tion flow for a retrovirus differ from that for the infected 
cell? 


19. The right start. Suppose that you want to assay reverse 
transcriptase activity. If polyriboadenylate is the template 
in the assay, what should you use as the primer? Which 
radioactive nucleotide should you use to follow chain 
elongation? 


20. Essential degradation. Reverse transcriptase has ribo- 
nuclease activity as well as polymerase activity. What is the 
role of its ribonuclease activity? 


21. Virus hunting. You have purified a virus that infects 
turnip leaves. Treatment of a sample with phenol removes 
viral proteins. Application of the residual material to scraped 
leaves results in the formation of progeny virus particles. 
You infer that the infectious substance is a nucleic acid. 
Propose a simple and highly sensitive means of determining 
whether the infectious nucleic acid is DNA or RNA. 


22. Mutagenic consequences. Spontaneous deamination of 
cytosine bases in DNA takes place at low but measurable 
frequency. Cytosine is converted into uracil by loss of its 
amino group. After this conversion, which base pair occu- 
pies this position in each of the daughter strands resulting 
from one round of replication? Two rounds of replication? 


23. Information content. (a) How many different 8-mer 
sequences of DNA are there? (Hint: There are 16 possible 
dinucleotides and 64 possible trinucleotides.) We can quantify
the information-carrying capacity of nucleic acids in the
following way. Each position can be one of four bases, corresponding
to two bits of information (2² = 4). Thus, a
chain of 5100 nucleotides corresponds to 2 × 5100 =
10,200 bits, or 1275 bytes (1 byte = 8 bits). (b) How many
bits of information are stored in an 8-mer DNA sequence? 
In the E. coli genome? In the human genome? (c) Compare 
each of these values with the amount of information that 
can be stored on a computer compact disc, or CD (about 
700 megabytes). 


24. Key polymerases. Compare DNA polymerase and RNA 
polymerase from E. coli in regard to each of the following 
features: (a) activated precursors, (b) direction of chain 
elongation, (c) conservation of the template, and (d) need 
for a primer. 

25. Family resemblance. Differentiate among mRNA,
rRNA, and tRNA.


26. Encoded sequences. (a) Write the sequence of the mRNA 
molecule synthesized from a DNA template strand having 
the following sequence. 


5'-ATCGTACCGTTA-3’ 


(b) What amino acid sequence is encoded by the following 
base sequence of an mRNA molecule? Assume that the 
reading frame starts at the 5’ end.


5'-UUGCCUAGUGAUUGGAUG-3’ 


(c) What is the sequence of the polypeptide formed on 
addition of poly(UUAC) to a cell-free protein-synthesizing 
system? 


27. A tougher chain. RNA is readily hydrolyzed by alkali, 
whereas DNA is not. Why? 


28. A picture is worth a thousand words. Write a reaction 
sequence showing why RNA is more susceptible to nucleo- 
philic attack than DNA. 


29. Flowing information. What is meant by the phrase gene 
expression? 


30. We can all agree on that. What is a consensus 
sequence? 


31. A potent blocker. How does cordycepin (3'-deoxyade- 
nosine) block the synthesis of RNA? 


32. Silent RNA. The code word GGG cannot be deci- 
phered in the same way as can UUU, CCC, and AAA, 
because poly(G) does not act as a template. Poly(G) forms a 
triple-stranded helical structure. Why is it an ineffective 
template? 


33. Sometimes it is not so bad. What is meant by the degen- 
eracy of the genetic code? 


34. In fact, it can be good. What is the biological benefit of 
a degenerate genetic code? 


35. To bring together as associates. Match the components 
in the right-hand column with the appropriate process in 
the left-hand column. 


(a) Replication ___        RNA polymerase
(b) Transcription ___      DNA polymerase
(c) Translation ___        Ribosome
                           dNTP
                           tRNA
                           NTP
                           mRNA
                           primer
                           rRNA
                           promoter

36. A lively contest. Match the components in the right-hand
column with the appropriate process in the left-hand
column.

(a) fMet               1. continuous message
(b) Shine-Dalgarno     2. removed
(c) intron             3. the first of many
(d) exon               4. uniter
(e) pre-mRNA           5. joined
(f) mRNA               6. locate the start
(g) spliceosome        7. discontinuous message


37. Two from one. Synthetic RNA molecules of defined 
sequence were instrumental in deciphering the genetic 
code. Their synthesis first required the synthesis of DNA 
molecules to serve as templates. H. Gobind Khorana syn- 
thesized, by organic-chemical methods, two complemen- 
tary deoxyribonucleotides, each with nine residues: 
d(TAC)3 and d(GTA)3. Partly overlapping duplexes that 
formed on mixing these oligonucleotides then served as 
templates for the synthesis by DNA polymerase of long, 
repeating double-helical DNA chains. The next step was 
to obtain long polyribonucleotide chains with a sequence 
complementary to only one of the two DNA strands. 
How did Khorana obtain only poly(UAC)? Only
poly(GUA)?


38. Triple entendre. The RNA transcript of a region of T4 
phage DNA contains the sequence 5'-AAAUGAGGA-3’. 
This sequence encodes three different polypeptides. What 
are they? 


39. Valuable synonyms. Proteins generally have low contents 
of Met and Trp, intermediate contents of His and Cys, and 
high contents of Leu and Ser. What is the relation between 
the number of an amino acid’s codons and the frequency 
with which the amino acid is present in proteins? What 
might be the selective advantage of this relation? 


40. A new translation. A transfer RNA with a UGU 
anticodon is enzymatically conjugated to ¹⁴C-labeled
cysteine. The cysteine unit is then chemically modified to 
alanine (with the use of Raney nickel, which removes the 
sulfur atom of cysteine). The altered aminoacyl-tRNA is 
added to a protein-synthesizing system containing normal 
components except for this tRNA. The mRNA added to 
this mixture contains the following sequence: 


5'-UUUUGCCAUGUUUGUGCU-3' 


What is the sequence of the corresponding radiolabeled 
peptide? 


41. A tricky exchange. Define exon shuffling and explain 
why its occurrence might be an evolutionary advantage. 


42. The unity of life. What is the significance of the fact that 
human mRNA can be accurately translated in E. coli? 


Chapter Integration Problems 


43. Back to the bench. A protein chemist told a molecular 
geneticist that he had found a new mutant hemoglobin in 
which aspartate replaced lysine. The molecular geneticist 
expressed surprise and sent his friend scurrying back to the 
laboratory. (a) Why did the molecular geneticist doubt the 
reported amino acid substitution? (b) Which amino acid 
substitutions would have been more palatable to the molec- 
ular geneticist? 


44. Eons apart. The amino acid sequences of a yeast protein 
and a human protein having the same function are found to 
be 60% identical. However, the corresponding DNA 
sequences are only 45% identical. Account for this differing 
degree of identity. 


Data Interpretation Problems 


45. 3 is greater than 2. The adjoining illustration graphs the 
relation between the percentage of GC base pairs in DNA 
and the melting temperature. Account for these results. 

[After R. J. Britten and D. E. Kohne, Science 161:529-540, 1968.]


46. Blast from the past. The illustration below is a graph
called a C₀t curve (pronounced “cot”). The y-axis shows
the percentage of DNA that is double stranded. The
x-axis is the product of the concentration of DNA and the
time required for the double-stranded molecules to form.
Explain why the mixture of poly(A) and poly(U) and the
three DNAs shown vary in the C₀t value required to
completely anneal. MS2 and T4 are bacterial viruses
(bacteriophages) with genome sizes of 3569 and 168,903 bp,
respectively. The E. coli genome is 4.6 × 10⁶ bp.

CHAPTER 5

Exploring Genes and Genomes


Processes such as the development from a caterpillar 
into a butterfly entail dramatic changes in patterns of 
gene expression. The expression levels of thousands of 
genes can be monitored through the use of DNA arrays. 


At the right, a DNA microarray reveals the expression 
levels of more than 12,000 human genes; the brightness 
of each spot indicates the expression level of the 
corresponding gene. [(Left) Cathy Keifer/istockphoto. 
com. (Right) Agilent Technologies.] 


Since its emergence in the 1970s, recombinant DNA technology has
revolutionized biochemistry. The genetic endowment of organisms can now
be precisely changed in designed ways. Recombinant DNA technology is 
the fruit of several decades of basic research on DNA, RNA, and viruses. It 
depends, first, on having enzymes that can cut, join, and replicate DNA and 
those that can reverse transcribe RNA. Restriction enzymes cut very long 
DNA molecules into specific fragments that can be manipulated; DNA 
ligases join the fragments together. Many kinds of restriction enzymes are 
available. By applying this assortment cleverly, researchers can treat DNA 
sequences as modules that can be moved at will from one DNA molecule to 
another. Thus, recombinant DNA technology is based on the use of 
enzymes that act on nucleic acids as substrates. 

A second foundation is the base-pairing language that allows comple- 
mentary sequences to recognize and bind to each other. Hybridization with 
complementary DNA (cDNA) or RNA probes is a sensitive means of 
detecting specific nucleotide sequences. In recombinant DNA technology, 
base-pairing is used to construct new combinations of DNA as well as to 
detect and amplify particular sequences. 

Third, powerful methods have been developed for determining the 
sequence of nucleotides in DNA. These methods have been harnessed to 
sequence complete genomes: first, small genomes from viruses; then, larger
genomes from bacteria; and, finally, eukaryotic genomes, including the 
3-billion-base-pair human genome. Scientists are just beginning to exploit 
the enormous information content of these genome sequences. 

Finally, recombinant DNA technology critically depends on our ability 
to deliver foreign DNA into host organisms. For example, DNA fragments 
can be inserted into plasmids, where they can be replicated within a short 
period of time in their bacterial hosts. In addition, viruses efficiently deliver 
their own DNA (or RNA) into hosts, subverting them either to replicate the 
viral genome and produce viral proteins or to incorporate viral DNA into 
the host genome. 

These new methods have wide-ranging benefits across a broad spectrum 
of disciplines, including biotechnology, agriculture, and medicine. Among 
these benefits is the dramatic expansion of our understanding of human 
disease. Throughout this chapter, a specific disorder, amyotrophic lateral 
sclerosis (ALS), will be used to illustrate the effect that recombinant DNA 
technology has had on our knowledge of disease mechanisms. ALS was first 
described clinically in 1869 by the French neurologist Jean-Martin Charcot 
as a fatal neurodegenerative disease of progressive weakening and atrophy 
of voluntary muscles. ALS is commonly referred to as Lou Gehrig’s Disease, 
for the baseball legend whose career and life were prematurely cut short as 
a result of this devastating disease. For many years, little progress had been 
made in the study of the mechanisms underlying ALS. As we shall see, 
significant advances have been made with the use of research tools facili- 
tated by recombinant DNA technology. 


5.1 The Exploration of Genes Relies on Key Tools 


The rapid progress in biotechnology—indeed its very existence—is a result 
of a few key techniques. 


1. Restriction-Enzyme Analysis. Restriction enzymes are precise molecular 
scalpels that allow an investigator to manipulate DNA segments. 


2. Blotting Techniques. Southern and northern blots are used to separate and 
characterize DNA and RNA, respectively. The western blot, which uses 
antibodies to characterize proteins, was described in Chapter 3. 


3. DNA Sequencing. The precise nucleotide sequence of a molecule of 
DNA can be determined. Sequencing has yielded a wealth of information 
concerning gene architecture, the control of gene expression, and protein 
structure. 


4. Solid-Phase Synthesis of Nucleic Acids. Precise sequences of nucleic acids 
can be synthesized de novo and used to identify or amplify other nucleic 
acids. 


5. The Polymerase Chain Reaction (PCR). The polymerase chain reaction 
leads to a billionfold amplification of a segment of DNA. One molecule of 
DNA can be amplified to quantities that permit characterization and 
manipulation. This powerful technique can be used to detect pathogens and 
genetic diseases, determine the source of a hair left at the scene of a crime, 
and resurrect genes from the fossils of extinct organisms. 
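
To put the billionfold amplification quoted for PCR in perspective, the following sketch assumes an idealized reaction in which the number of copies exactly doubles in every cycle. That assumption, like the function name, is ours and is not stated in the list above; real reactions fall somewhat short of perfect doubling.

```python
import math


def copies_after(cycles: int, start: int = 1) -> int:
    """Copies present after a number of cycles, assuming ideal doubling."""
    return start * 2 ** cycles


# Smallest number of ideal cycles that gives at least a billionfold increase.
cycles_needed = math.ceil(math.log2(1e9))
print(cycles_needed, copies_after(cycles_needed))  # 30 1073741824
```

Under this assumption, about 30 cycles are enough to turn a single molecule into roughly a billion copies.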


A final set of techniques relies on the computer, without which it would
be impossible to catalog, access, and characterize the abundant information
generated by the techniques just outlined. Such uses of the computer will be
presented in Chapter 6.


Restriction enzymes split DNA into specific fragments 


Restriction enzymes, also called restriction endonucleases, recognize specific 
base sequences in double-helical DNA and cleave, at specific places, both 
strands of that duplex. To biochemists, these exquisitely precise scalpels are 
marvelous gifts of nature. They are indispensable for analyzing chromo- 
some structure, sequencing very long DNA molecules, isolating genes, and 
creating new DNA molecules that can be cloned. Werner Arber and 
Hamilton Smith discovered restriction enzymes, and Daniel Nathans pio- 
neered their use in the late 1960s. 

Restriction enzymes are found in a wide variety of prokaryotes. Their 
biological role is to cleave foreign DNA molecules. Many restriction 
enzymes recognize specific sequences of four to eight base pairs and hydro- 
lyze a phosphodiester bond in each strand in this region. A striking charac- 
teristic of these cleavage sites is that they almost always possess twofold 
rotational symmetry. In other words, the recognized sequence is palindromic, 
or an inverted repeat, and the cleavage sites are symmetrically positioned. 
For example, the sequence recognized by a restriction enzyme from 
Streptomyces achromogenes is 


     Cleavage site
           ↓
5'  C-C-G-C-G-G  3'
3'  G-G-C-G-C-C  5'
       ↑
 Cleavage site

(The symmetry axis passes through the center of the sequence.)


In each strand, the enzyme cleaves the C–G phosphodiester bond on the 3’
side of the symmetry axis. As we shall see in Chapter 9, this symmetry cor- 
responds to that of the structures of the restriction enzymes themselves. 
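
Twofold rotational symmetry has a simple operational meaning: a recognition site reads the same 5' → 3' on both strands, so the sequence equals its own reverse complement. The sketch below checks this for the site shown above and for two sites from Figure 5.1; the function names and the asymmetric control sequence are illustrative assumptions.

```python
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Complementary strand written 5'->3'."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def has_twofold_symmetry(site: str) -> bool:
    """True if the site reads identically 5'->3' on both strands."""
    return site.upper() == reverse_complement(site)


# CCGCGG is the site discussed above; GGATCC and GAATTC appear in Figure 5.1.
# GATTAC is a made-up asymmetric sequence included for contrast.
for site in ("CCGCGG", "GGATCC", "GAATTC", "GATTAC"):
    print(site, has_twofold_symmetry(site))
```

Only the last, arbitrary sequence fails the test.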

Several hundred restriction enzymes have been purified and character- 
ized. Their names consist of a three-letter abbreviation for the host organ- 
ism (e.g., Eco for Escherichia coli, Hin for Haemophilus influenzae, Hae for 
Haemophilus aegyptius) followed by a strain designation (if needed) and a 
roman numeral (if more than one restriction enzyme from the same strain 
has been identified). The specificities of several of these enzymes are shown 
in Figure 5.1. 

Palindrome

A word, sentence, or verse that reads the same from right to left as it
does from left to right.

Radar
Senile felines
Do geese see God?
Roma tibi subito motibus ibit amor

Derived from the Greek palindromos, “running back again.”

5' GGATCC 3'
3' CCTAGG 5'    BamHI

5' GAATTC 3'
3' CTTAAG 5'    EcoRI

5' GCGC 3'
3' CGCG 5'

5' CTCGAG 3'

Figure 5.1 Specificities of some restriction endonucleases. The sequences
that are recognized by these enzymes contain a twofold axis of symmetry.
The two strands in these regions are related by a 180-degree rotation about
the axis marked by the green symbol. The cleavage sites are denoted by red
arrows. The abbreviated name of each restriction enzyme is given at the
right of the sequence that it recognizes. Note that the cuts may be
staggered or even.

Restriction enzymes are used to cleave DNA molecules into specific fragments
that are more readily analyzed and manipulated than the entire parent
molecule. For example, the 5.1-kb circular duplex DNA of the tumor-producing
SV40 virus is cleaved at one site by EcoRI, at four sites by HpaI, and
at 11 sites by HindIII. A piece of DNA, called a restriction fragment, produced
by the action of one restriction enzyme can be specifically cleaved into
smaller fragments by another restriction enzyme. The pattern of such fragments
can serve as a fingerprint of a DNA molecule, as will be considered
shortly. Indeed, complex chromosomes containing hundreds of millions of
base pairs can be mapped by using a series of restriction enzymes.
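
The way a set of fragment lengths serves as a fingerprint can be illustrated with a toy digest. This is a sketch under stated assumptions: the 20-base sequence is invented, the function names are hypothetical, and the EcoRI cut is taken to fall after the first G of GAATTC. Note that a circular molecule cut at n sites yields n fragments, whereas a linear one yields n + 1.

```python
def cut_positions(seq: str, site: str, offset: int) -> list[int]:
    """Indices in seq where the top strand is cut.

    offset is the number of bases of the site that lie 5' of the cut.
    Sites spanning the origin of a circular sequence are not detected.
    """
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + offset)
        start = seq.find(site, start + 1)
    return positions


def fragment_lengths(seq: str, site: str, offset: int, circular: bool = False) -> list[int]:
    """Lengths of the fragments produced by a complete digest."""
    cuts = cut_positions(seq, site, offset)
    if not cuts:
        return [len(seq)]
    if circular:
        n = len(seq)
        # Each fragment runs from one cut to the next; the last wraps around.
        return [((cuts[(i + 1) % len(cuts)] - cuts[i]) % n) or n
                for i in range(len(cuts))]
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


toy = "AAGAATTCTTTTGAATTCAA"  # invented sequence with two GAATTC sites
print(fragment_lengths(toy, "GAATTC", 1))                 # [3, 10, 7]
print(fragment_lengths(toy, "GAATTC", 1, circular=True))  # [10, 10]
```

Two molecules that differ by even one recognition site give different lists of lengths, which is what makes the pattern diagnostic.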


Restriction fragments can be separated by gel electrophoresis 
and visualized 


Figure 5.2 Gel-electrophoresis pattern of a restriction digest. This gel
shows the fragments produced by cleaving SV40 DNA with each of three
restriction enzymes. These fragments were made fluorescent by staining
the gel with ethidium bromide. [Courtesy of Dr. Jeffrey Sklar.]

Small differences between related DNA molecules can be readily detected
because their restriction fragments can be separated and displayed by gel
electrophoresis. In Chapter 3, we considered the use of gel electrophoresis
to separate protein molecules (Section 3.1). Because the phosphodiester
backbone of DNA is highly negatively charged, this technique is also 
suitable for the separation of nucleic acid fragments. For most gels, the 
shorter the DNA fragment, the farther the migration. Polyacrylamide gels 
are used to separate, by size, fragments containing as many as 1000 base 
pairs, whereas more-porous agarose gels are used to resolve mixtures of 
larger fragments (as large as 20 kb). An important feature of these gels is 
their high resolving power. In certain kinds of gels, fragments differing in 
length by just one nucleotide of several hundred can be distinguished. 
Bands or spots of radioactive DNA in gels can be visualized by autoradiog- 
raphy. Alternatively, a gel can be stained with ethidium bromide, which 
fluoresces an intense orange when bound to a double-helical DNA mole- 
cule (Figure 5.2). A band containing only 50 ng of DNA can be readily 
seen. 

A restriction fragment containing a specific base sequence can be 
identified by hybridizing it with a labeled complementary DNA strand 
(Figure 5.3). A mixture of restriction fragments is separated by electro- 
phoresis through an agarose gel, denatured to form single-stranded DNA, and 
transferred to a nitrocellulose sheet. The positions of the DNA fragments 
in the gel are preserved on the nitrocellulose sheet, where they are 
exposed to a ³²P-labeled single-stranded DNA probe. The probe hybridizes
with a restriction fragment having a complementary sequence, and autoradiography
then reveals the position of the restriction-fragment–probe
duplex. A particular fragment amid a million others can be readily identi- 
fied in this way. This powerful technique is named Southern blotting, for its 
inventor Edwin Southern. 
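
The selectivity of the probe rests entirely on base pairing, and the search it performs can be mimicked in a few lines. The sketch below is a deliberate simplification: it demands a perfect match, the fragment and probe sequences are invented, and the function names are hypothetical. Because each duplex fragment is denatured before transfer, the probe may pair with either of its two strands, so both are checked.

```python
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Complementary strand written 5'->3'."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def hybridizing_fragments(fragments: list[str], probe: str) -> list[str]:
    """Fragments (given as one strand, 5'->3') that the probe can pair with."""
    target = reverse_complement(probe)
    hits = []
    for fragment in fragments:
        top = fragment.upper()
        # Probe pairs with the listed strand if that strand contains the
        # probe's complement, or with the other strand if the listed strand
        # contains the probe sequence itself.
        if target in top or probe.upper() in top:
            hits.append(fragment)
    return hits


fragments = ["GGATCCATGCA", "TTTTAAAACCCC", "CCTGCATGGA"]  # invented
print(hybridizing_fragments(fragments, "TGCAT"))
# ['GGATCCATGCA', 'CCTGCATGGA']
```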

Similarly, RNA molecules can be separated by gel electrophoresis, and 
specific sequences can be identified by hybridization subsequent to their 
transfer to nitrocellulose. This analogous technique for the analysis of RNA 
has been whimsically termed northern blotting. A further play on words 
accounts for the term western blotting, which refers to a technique for detect- 
ing a particular protein by staining with specific antibody (Section 3.3). 
Southern, northern, and western blots are also known respectively as DNA, 
RNA, and protein blots. 

Figure 5.3 Southern blotting. A DNA fragment containing a specific sequence can be identified
by separating a mixture of fragments by electrophoresis, transferring them to nitrocellulose, and
hybridizing with a ³²P-labeled probe complementary to the sequence. The fragment containing the
sequence is then visualized by autoradiography.


DNA can be sequenced by controlled termination of replication 


The analysis of DNA structure and its role in gene expression has also
been markedly facilitated by the development of powerful techniques for
the sequencing of DNA molecules. The key to DNA sequencing is the gen- 
eration of DNA fragments whose length depends on the last base in the 
sequence. Collections of such fragments can be generated through the con- 
trolled termination of replication (Sanger dideoxy method), a method devel- 
oped by Frederick Sanger and coworkers. This technique has superseded 
alternative methods because of its simplicity. The same procedure is per- 
formed on four reaction mixtures at the same time. In all these mixtures, a 
DNA polymerase is used to make the complement of a particular sequence 
within a single-stranded DNA molecule. The synthesis is primed by a 
chemically synthesized fragment that is complementary to a part of the 
sequence known from other studies. In addition to the four deoxyribonucle- 
oside triphosphates (radioactively labeled), each reaction mixture contains a 
small amount of the 2',3'-dideoxy analog of one of the nucleotides, a differ- 
ent nucleotide for each reaction mixture. 


2', 3'-Dideoxy analog 


The incorporation of this analog blocks further growth of the new chain 
because it lacks the 3'-hydroxyl terminus needed to form the next phos- 
phodiester bond. The concentration of the dideoxy analog is low enough 
that chain termination will take place only occasionally. The polymerase 
will insert the correct nucleotide sometimes and the dideoxy analog other 
times, stopping the reaction. For instance, if the dideoxy analog of dATP 
is present, fragments of various lengths are produced, but all will be termi- 
nated by the dideoxy analog (Figure 5.4). Importantly, this dideoxy analog 
of dATP will be inserted only where a T was located in the DNA being 
sequenced. Thus, the fragments of different length will correspond to the 
positions of T. Four such sets of chain-terminated fragments (one for each 
dideoxy analog) then undergo electrophoresis, and the base sequence of the 
new DNA is read from the autoradiogram of the four 
lanes. 
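
The logic of the four-lane readout can be sketched in a few lines of Python. This is only an illustrative model, not a laboratory protocol: it uses the template and primer drawn in Figure 5.4, assumes that every possible termination product is formed and cleanly resolved, and the function names (new_strand, termination_lengths, read_gel) are invented for this sketch.

```python
# Toy model of the Sanger dideoxy readout (names are hypothetical).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def new_strand(template_3_to_5):
    """Strand made by the polymerase, written 5'->3', for a template written 3'->5'."""
    return "".join(COMPLEMENT[base] for base in template_3_to_5)


def termination_lengths(template_3_to_5, primer, dideoxy_base):
    """Lengths of the fragments that end in the given dideoxy analog."""
    product = new_strand(template_3_to_5)
    if not product.startswith(primer):
        raise ValueError("primer does not match the start of the new strand")
    # Termination can occur only at positions added after the primer.
    return [i + 1 for i in range(len(primer), len(product))
            if product[i] == dideoxy_base]


def read_gel(template_3_to_5, primer):
    """Return the four lanes and the sequence read from shortest to longest fragment."""
    lanes = {base: termination_lengths(template_3_to_5, primer, base)
             for base in "ACGT"}
    ordered = sorted((length, base)
                     for base, lengths in lanes.items()
                     for length in lengths)
    return lanes, "".join(base for _, base in ordered)


template = "GAATTCGCTAATGC"  # 3'->5', as drawn in Figure 5.4
primer = "CTTAA"             # 5'->3'
lanes, sequence = read_gel(template, primer)
print(lanes["A"])   # fragment lengths in the ddATP lane
print(sequence)     # bases added after the primer, read 5'->3'
```

In this model the ddATP lane holds fragments of 9 and 12 nucleotides, which correspond to the two products drawn in Figure 5.4, and each of them ends opposite a T in the template.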

Fluorescence detection is a highly effective alternative 
to autoradiography because it eliminates the use of radioac- 
tive reagents and can be readily automated. A fluorescent 
tag is incorporated into each dideoxy analog—a differently 
colored one for each of the four chain terminators (e.g., a 
blue emitter for termination at A and a red one for termination at C). With
the use of a mixture of terminators, a single reaction can be performed and
the resulting fragments are separated by a technique known as capillary
electrophoresis, in which the mixture is passed through a very narrow tube
at high voltage to achieve efficient separation within a short time. As the
DNA fragments emerge from the capillary, they are detected by their
fluorescence; the sequence of their colors directly gives the base sequence
(Figure 5.5). Sequences of as many as 500 bases can be determined in this
way. Indeed, modern DNA-sequencing instruments can sequence more than
1 million bases per day with the use of this method.


DNA to be sequenced

3'-GAATTCGCTAATGC
5'-CTTAA
Primer

DNA polymerase I
Labeled dATP, TTP, dCTP, dGTP
Dideoxy analog of dATP

3'-GAATTCGCTAATGC
5'-CTTAAGCGATTA
+
3'-GAATTCGCTAATGC
5'-CTTAAGCGA

New DNA strands are separated
and subjected to electrophoresis


Figure 5.4 Strategy of the chain-termination method for sequencing DNA.
Fragments are produced by adding the 2',3'-dideoxy analog of a dNTP to each
of four polymerization mixtures. For example, the addition of the dideoxy
analog of dATP (shown in red) results in fragments ending in A. The strand
cannot be extended past the dideoxy analog.


ATAGTGTCACCTAAATAGCTTGGCGTAATCATGGTCATAGC
100 110 120 130

Figure 5.5 Fluorescence detection of oligonucleotide fragments produced by
the dideoxy method. A sequencing reaction is performed with four
chain-terminating dideoxy nucleotides, each labeled with a tag that
fluoresces at a different wavelength (e.g., red for T). Each of the four
colors represents a different base in a chromatographic trace produced by
fluorescence measurements at four wavelengths. [After A. J. F. Griffiths
et al., An Introduction to Genetic Analysis, 8th ed. (W. H. Freeman and
Company, 2005).]


DNA probes and genes can be synthesized by automated 
solid-phase methods 


DNA strands, like polypeptides (Section 3.4), can be syn- 
thesized by the sequential addition of activated monomers 
to a growing chain that is linked to an insoluble support. 
The activated monomers are protected deoxyribonucleoside 
3’-phosphoramidites. In step 1, the 3’-phosphorus atom of 
this incoming unit becomes joined to the 5’-oxygen atom of 
the growing chain to form a phosphite triester (Figure 5.6). 
The 5’-OH group of the activated monomer is unreactive
because it is blocked by a dimethoxytrityl (DMT) protecting
group, and the 3’-phosphoryl group is rendered unreactive
by attachment of the β-cyanoethyl (βCE) group. Likewise,
amino groups on the purine and pyrimidine bases are blocked.
Coupling is carried out under anhydrous conditions because
water reacts with phosphoramidites. In step 2, the phosphite
triester (in which P is trivalent) is oxidized by iodine to form
a phosphotriester (in which P is pentavalent). In step 3, the
DMT protecting group on the 5’-OH group of the growing chain
is removed by the addition of dichloroacetic acid, which leaves
other protecting groups intact. The DNA chain is now elongated
by one unit and ready for another cycle of addition. Each cycle
takes only about 10 minutes and usually elongates more than 99%
of the chains.

Dimethoxytrityl (DMT) group

Figure 5.6 Solid-phase synthesis of a DNA chain by the phosphite triester method. The
activated monomer added to the growing chain is a deoxyribonucleoside 3’-phosphoramidite
containing a dimethoxytrityl (DMT) protecting group on its 5’-oxygen atom, a β-cyanoethyl (βCE)
protecting group on its 3’-phosphoryl oxygen atom, and a protecting group on the base.

This solid-phase approach is ideal for the synthesis of DNA, as it is for
polypeptides, because the desired product stays on the insoluble support
until the final release step. All the reactions take place in a single vessel, and
excess soluble reagents can be added to drive reactions to completion. At the
end of each step, soluble reagents and by-products are washed away from
the resin that bears the growing chains. At the end of the synthesis, NH3 is
added to remove all protecting groups and release the oligonucleotide from
the solid support. Because elongation is never 100% complete, the new
DNA chains are of diverse lengths; the desired chain is the longest one.
The sample can be purified by high-pressure liquid chromatography or by
electrophoresis on polyacrylamide gels. DNA chains of as many as 100
nucleotides can be readily synthesized by this automated method.
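
Why purification is needed for long oligonucleotides can be seen from a short calculation. The sketch below is a simplification under stated assumptions: every coupling succeeds with the same probability, the first nucleoside is taken to be already attached to the support (so a chain of n nucleotides needs n - 1 couplings), and 0.99 is used as the per-cycle efficiency because the text gives "more than 99%". The function name is hypothetical.

```python
def full_length_fraction(length_nt, coupling_efficiency=0.99):
    """Fraction of chains that reach full length if every coupling
    succeeds independently with the given efficiency."""
    # Assumption: the first nucleoside is pre-attached to the support,
    # so only (length_nt - 1) coupling cycles are needed.
    couplings = length_nt - 1
    return coupling_efficiency ** couplings


for length in (20, 50, 100):
    fraction = full_length_fraction(length)
    print(f"{length:3d} nucleotides: {fraction:.1%} of chains are full length")
```

Under these assumptions, somewhat more than a third of the chains in a 100-nucleotide synthesis are full length, which is consistent with the need to separate the desired product from shorter chains.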

The ability to rapidly synthesize DNA chains of any selected sequence 
opens many experimental avenues. For example, a synthesized oligonucleotide
labeled at one end with ³²P or a fluorescent tag can be used to search
for a complementary sequence in a very long DNA molecule or even in a 
genome consisting of many chromosomes. The use of labeled oligonucle- 
otides as DNA probes is powerful and general. For example, a DNA probe 
that can base-pair to a known complementary sequence in a chromosome 
can serve as the starting point of an exploration of adjacent uncharted 
DNA. Such a probe can be used as a primer to initiate the replication of 
neighboring DNA by DNA polymerase. An exciting application of the 
solid-phase approach is the synthesis of new tailor-made genes. New 
proteins with novel properties can now be produced in abundance by the 
expression of synthetic genes. Finally, the synthetic scheme heretofore 
described can be slightly modified for the solid-phase synthesis of RNA 
oligonucleotides, which can be very powerful reagents for the degradation 
of specific mRNA molecules in living cells by a technique known as RNA 
interference (Section 5.4). 
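
The idea of searching a long DNA molecule with a labeled oligonucleotide can be illustrated computationally. The following is a minimal sketch that treats hybridization as exact base-pairing only; real probes tolerate some mismatches depending on conditions. The sequences and function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_probe_sites(probe, strand):
    """0-based positions on `strand` (5'->3') where the probe can base-pair.

    A probe pairs with a stretch of the strand that reads as the
    reverse complement of the probe.
    """
    site = reverse_complement(probe)
    hits = []
    start = strand.find(site)
    while start != -1:
        hits.append(start)
        start = strand.find(site, start + 1)
    return hits


strand = "ATGGCGTAAGCTTGGCATTT"   # hypothetical stretch of a long DNA molecule
probe = "GCCAAGCTTAC"             # hypothetical synthetic probe, 5'->3'
print(find_probe_sites(probe, strand))
```

A probe located in this way marks a known position from which neighboring DNA can then be explored, for instance by using the probe as a primer.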


Selected DNA sequences can be greatly amplified by the polymerase 
chain reaction 


In 1984, Kary Mullis devised an ingenious method called the polymerase 
chain reaction (PCR) for amplifying specific DNA sequences. Consider a 
DNA duplex consisting of a target sequence surrounded by nontarget 
DNA. Millions of copies of the target sequences can be readily obtained by 
PCR if the flanking sequences of the target are known. PCR is carried out 
by adding the following components to a solution containing the target 
sequence: (1) a pair of primers that hybridize with the flanking sequences of 
the target, (2) all four deoxyribonucleoside triphosphates (dNTPs), and 
(3) a heat-stable DNA polymerase. A PCR cycle consists of three steps 
(Figure 5.7). 


1. Strand Separation. The two strands of the parent DNA molecule are 
separated by heating the solution to 95°C for 15 s. 


2. Hybridization of Primers. The solution is then abruptly cooled to 54°C to 
allow each primer to hybridize to a DNA strand. One primer hybridizes to 
the 3' end of the target on one strand, and the other primer hybridizes to the 
3’ end on the complementary target strand. Parent DNA duplexes do not 
form, because the primers are present in large excess. Primers are typically 
from 20 to 30 nucleotides long. 


3. DNA Synthesis. The solution is then heated to 72°C, the optimal temperature
for heat-stable polymerases. One such enzyme is Taq DNA polymerase,
which is derived from Thermus aquaticus, a thermophilic bacterium
that lives in hot springs. The polymerase elongates both primers in the
direction of the target sequence because DNA synthesis is in the 5’-to-3’
direction. DNA synthesis takes place on both strands but extends
beyond the target sequence.


Figure 5.7 The first cycle in the polymerase chain reaction (PCR). A cycle
consists of three steps: strand separation, the hybridization of primers,
and the extension of primers by DNA synthesis.


These three steps—strand separation, hybridization of primers, 
and DNA synthesis—constitute one cycle of the PCR amplifica- 
tion and can be carried out repetitively just by changing the tem- 
perature of the reaction mixture. The thermostability of the poly- 
merase makes it feasible to carry out PCR in a closed container; no 
reagents are added after the first cycle. At the completion of the 
second cycle, four duplexes containing the target sequence have
been generated (Figure 5.8). Of the eight DNA strands comprising 
these duplexes, two short strands constitute only the target 
sequence—the sequence including and bounded by the primers. 
Subsequent cycles will amplify the target sequence exponentially. 
Ideally, after n cycles, the desired sequence is amplified 2ⁿ-fold.
The amplification is a millionfold after 20 cycles and a billionfold 
after 30 cycles, which can be carried out in less than an hour. 
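
The bookkeeping behind these numbers can be followed with a small idealized model. It assumes perfect doubling in every cycle and distinguishes three kinds of strands: the parent strands, "long" strands that are bounded by a primer at one end only, and "short" strands that contain only the target sequence. The function name and the labels are invented for this sketch.

```python
def pcr_strand_counts(cycles, parent_duplexes=1):
    """Count strand classes after a number of ideal PCR cycles."""
    parent = 2 * parent_duplexes  # original strands, never increase
    long_strands = 0              # copied from parent strands
    short_strands = 0             # bounded by both primers: target only
    for _ in range(cycles):
        # Copying a parent strand gives a long strand; copying a long or a
        # short strand gives a short strand, because synthesis stops at the
        # primer-defined end of the template.
        new_long = parent
        new_short = long_strands + short_strands
        long_strands += new_long
        short_strands += new_short
    return {"parent": parent, "long": long_strands, "short": short_strands}


for n in (1, 2, 3, 20, 30):
    counts = pcr_strand_counts(n)
    duplexes = sum(counts.values()) // 2
    print(n, counts, "duplexes:", duplexes)
```

After two cycles the model gives eight strands in four duplexes, two of them short, as stated in the text. The long strands increase by only two per cycle, whereas the short strands soon dominate; the total number of duplexes is 2ⁿ, which is 1,048,576 after 20 cycles and 1,073,741,824 after 30.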

Several features of this remarkable method for amplifying 
DNA are noteworthy. First, the sequence of the target need not be 
known. All that is required is knowledge of the flanking sequences 
so that complementary primers can be synthesized. Second, the 
target can be much larger than the primers. Targets larger than 
10 kb have been amplified by PCR. Third, primers do not have to 
be perfectly matched to flanking sequences to amplify targets. With 
the use of primers derived from a gene of known sequence, it is 
possible to search for variations on the theme. In this way, families 
of genes are being discovered by PCR. Fourth, PCR is highly 
specific because of the stringency of hybridization at relatively 
high temperature. Stringency is the required closeness of the match 
between primer and target, which can be controlled by tempera- 
ture and salt. At high temperatures, only the DNA between 
hybridized primers is amplified. A gene constituting less than a 
millionth of the total DNA of a higher organism is accessible by 
PCR. Fifth, PCR is exquisitely sensitive. A single DNA molecule 
can be amplified and detected. 
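
How a primer pair follows from the flanking sequences alone can be sketched as follows. This is a toy example under explicit assumptions: the sequence is hypothetical, the primers are only 8 nucleotides long to keep the example readable (the text gives 20 to 30 nucleotides as typical), and matching is exact. One primer is identical to the start of the target on the strand shown; the other is the reverse complement of the end of the target, so that each primer pairs with the 3' end of the target on one strand.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(_COMPLEMENT)[::-1]


def design_primers(top_strand, target_start, target_end, length=8):
    """Return (forward, reverse) primers, both written 5'->3'."""
    forward = top_strand[target_start:target_start + length]
    reverse = reverse_complement(top_strand[target_end - length:target_end])
    return forward, reverse


def amplified_region(top_strand, forward, reverse):
    """Sequence including and bounded by the two primers, or None."""
    start = top_strand.find(forward)
    if start == -1:
        return None
    end = top_strand.find(reverse_complement(reverse), start)
    if end == -1:
        return None
    return top_strand[start:end + len(reverse)]


target = "GCTAGCTTACGGATCCATGCAAGTCC"   # hypothetical target
top = "AAAA" + target + "TTTT"           # target surrounded by nontarget DNA
forward, reverse = design_primers(top, 4, 4 + len(target))
print(forward, reverse)
print(amplified_region(top, forward, reverse) == target)
```

Only the two ends of the target enter the design; the sequence between the primers is never consulted, which mirrors the first feature noted above.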


PCR is a powerful technique in medical diagnostics, forensics, 
and studies of molecular evolution 


PCR can provide valuable diagnostic information in medicine. 
Bacteria and viruses can be readily detected with the use of spe- 
cific primers. For example, PCR can reveal the presence of small 
amounts of DNA from the human immunodeficiency virus (HIV) 
in persons who have not yet mounted an immune response to this 
pathogen. In these patients, assays designed to detect antibodies 
against the virus would yield a false negative test result. Finding 
Mycobacterium tuberculosis bacilli in tissue specimens is slow and 
laborious. With PCR, as few as 10 tubercle bacilli per million 
human cells can be readily detected. PCR is a promising method 
for the early detection of certain cancers. This technique can identify
mutations of certain growth-control genes, such as the ras
genes (Chapter 14). The capacity to greatly amplify selected regions of
DNA can also be highly informative in monitoring cancer chemotherapy.
Tests using PCR can detect when cancerous cells have been eliminated and
treatment can be stopped; they can also detect a relapse and the need to
immediately resume treatment. PCR is ideal for detecting leukemias caused
by chromosomal rearrangements.

Figure 5.8 Multiple cycles of the polymerase chain reaction. The two short
strands produced at the end of the third cycle (along with longer strands not shown)
represent the target sequence. Subsequent cycles will amplify the target sequence
exponentially and the parent sequence arithmetically.

PCR is also having an effect in forensics and legal medicine. An indi- 
vidual DNA profile is highly distinctive because many genetic loci are 
highly variable within a population. For example, variations at one specific
location determine a person’s HLA type (human leukocyte antigen type;
Section 34.5); organ transplants are rejected when the HLA types of the 
donor and recipient are not sufficiently matched. PCR amplification of 
multiple genes is being used to establish biological parentage in disputed 
paternity and immigration cases. Analyses of blood stains and semen samples
by PCR have helped to establish guilt or innocence in numerous assault and
rape cases. The root of a single shed hair found at a crime scene contains 
enough DNA for typing by PCR (Figure 5.9). 

DNA is a remarkably stable molecule, particularly when shielded from 
air, light, and water. Under such circumstances, large fragments of DNA 
can remain intact for thousands of years or longer. PCR provides an ideal 
method for amplifying such ancient DNA molecules so that they can be 
detected and characterized (Section 6.5). PCR can also be used to amplify 
DNA from microorganisms that have not yet been isolated and cultured. As 
will be discussed in Chapter 6, sequences from these PCR products can be 
sources of considerable insight into evolutionary relationships between 
organisms. 


The tools for recombinant DNA technology have been used to identify 
disease-causing mutations 


Let us consider how the techniques just described have been utilized in 
concert to study ALS, introduced at the beginning of this chapter. Five 
percent of all patients suffering from ALS have family members who also 
have been diagnosed with the disease. A heritable disease pattern is indica- 
tive of a strong genetic component of disease causation. To identify these 
disease-causing genetic alterations, researchers identify polymorphisms 
(instances of genetic variation) within an affected family that correlate with 
the emergence of disease. Polymorphisms may themselves cause disease or 
be closely linked to another genetic alteration that does. One class of poly- 
morphisms are restriction-fragment-length polymorphisms (RFLPs), which 
are mutations within restriction sites that change the sizes of DNA frag- 
ments produced by the appropriate restriction enzyme. Using restriction 
digests and Southern blots of the DNA from members of ALS-affected 
families, researchers identified RFLPs that were found preferentially in 
those family members with a positive diagnosis. For some of these families, 
strong evidence was obtained for the disease-causing mutation within a 
specific region of chromosome 21. 
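
The way a mutation within a restriction site changes the fragment pattern can be illustrated with a short sketch. The two alleles below are hypothetical, EcoRI (site GAATTC, cut after the G) is used merely as a familiar example of "the appropriate restriction enzyme", and the function name is invented here.

```python
def fragment_lengths(dna, site="GAATTC", cut_offset=1):
    """Lengths of the fragments produced by cutting a linear DNA
    (one strand shown, 5'->3') at every occurrence of the site."""
    cuts = []
    pos = dna.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = dna.find(site, pos + 1)
    edges = [0] + cuts + [len(dna)]
    return [right - left for left, right in zip(edges, edges[1:])]


# Hypothetical alleles: allele_b carries a point mutation (GAATTC -> GAGTTC)
# that abolishes the second restriction site.
allele_a = "A" * 10 + "GAATTC" + "C" * 20 + "GAATTC" + "T" * 15
allele_b = "A" * 10 + "GAATTC" + "C" * 20 + "GAGTTC" + "T" * 15

print(fragment_lengths(allele_a))
print(fragment_lengths(allele_b))
```

The loss of one site merges two fragments into a single longer one. On a Southern blot probed for this region, the two alleles would therefore give different band patterns, which is what makes such a polymorphism usable as a marker.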

After the probable location of one disease-causing gene had been identi- 
fied, this same research group compared the locations of the ALS-associated 
RFLPs with the known sequence of chromosome 21. They noted that this 
chromosomal locus contains the SOD1 gene, which encodes the Cu/Zn 
superoxide dismutase protein SOD1, an enzyme important for the protec- 
tion of cells against oxidative damage (Section 18.3). PCR amplification of 
regions of the SOD1 gene from the DNA of affected family members, 
followed by Sanger dideoxy sequencing of the targeted fragment, enabled 
the identification of 11 disease-causing mutations from 13 different families.
This work was pivotal for focusing further inquiry into the roles that
superoxide dismutase and its corresponding mutant forms play in the pathology
of ALS.

Figure 5.9 DNA and forensics. DNA isolated from bloodstains on the pants and
shirt of a defendant was amplified by PCR, then compared with DNA from the
victim as well as the defendant by using gel electrophoresis and
autoradiography. DNA from the bloodstains on the defendant's clothing
matched the pattern of the victim but not that of the defendant. The
frequency of a coincidental match of the DNA pattern on the clothing and the
victim is approximately 1 in 33 billion. Lanes λ, 1 kb, and TS refer to
control DNA samples; lane D, DNA from the defendant; jeans and shirt, DNA
isolated from bloodstains on defendant's pants and shirt (two different
amounts analyzed); V, DNA sample from victim's blood. [Courtesy of
Cellmark Diagnostics, Germantown, Maryland.]


5.2 Recombinant DNA Technology Has Revolutionized
All Aspects of Biology


The pioneering work of Paul Berg, Herbert Boyer, and Stanley Cohen in the
early 1970s led to the development of recombinant DNA technology, which
has taken biology from an exclusively analytical science to a synthetic one.
New combinations of unrelated genes can be constructed in the laboratory
by applying recombinant DNA techniques. These novel combinations can
be cloned (amplified many-fold) by introducing them into suitable cells,
where they are replicated by the DNA-synthesizing machinery of the host.
The inserted genes are often transcribed and translated in their new setting.
What is most striking is that the genetic endowment of the host can be
permanently altered in a designed way.


Restriction enzymes and DNA ligase are key tools in forming
recombinant DNA molecules


Let us begin by seeing how novel DNA molecules can be constructed in the
laboratory. An essential tool for the manipulation of recombinant DNA is a
vector, a DNA molecule that can replicate autonomously in an appropriate
host organism. Vectors are designed to enable the rapid, covalent insertion
of DNA fragments of interest. Plasmids (naturally occurring circles of
DNA that act as accessory chromosomes in bacteria) and bacteriophage
lambda (λ phage), a virus, are choice vectors for cloning in E. coli. The
vector can be prepared for accepting a new DNA fragment by cleaving it at
a single specific site with a restriction enzyme. For example, the plasmid
pSC101, a 9.9-kb double-helical circular DNA molecule, is split at a unique
site by the EcoRI restriction enzyme. The staggered cuts made by this
enzyme produce complementary single-stranded ends, which have specific
affinity for each other and hence are known as cohesive or sticky ends. Any
DNA fragment can be inserted into this plasmid if it has the same cohesive
ends. Such a fragment can be prepared from a larger piece of DNA by
using the same restriction enzyme as was used to open the plasmid DNA
(Figure 5.10). The single-stranded ends of the fragment are then
complementary to those of the cut plasmid. The DNA fragment and the cut
plasmid can be annealed and then joined by DNA ligase, which catalyzes the
formation of a phosphodiester bond at a break in a DNA chain. DNA ligase
requires a free 3’-hydroxyl group and a 5’-phosphoryl group. Furthermore,
the chains joined by ligase must be in a double helix. An energy source such
as ATP or NAD+ is required for the joining reaction, as will be discussed in
Chapter 28.

GAATTC
CTTAAG

Cleave with EcoRI restriction enzyme

Anneal DNA fragments and rejoin with DNA ligase

Figure 5.10 Joining of DNA molecules by the cohesive-end method. Two DNA
molecules, cleaved with a common restriction enzyme such as EcoRI, can be
ligated to form recombinant molecules.

What if the target DNA is not naturally flanked by the appropriate
restriction sites? How is the fragment cut and annealed to the vector? The
cohesive-end method for joining DNA molecules can still be used in these
cases by adding a short, chemically synthesized DNA linker that can be
cleaved by restriction enzymes. First, the linker is covalently joined to
the ends of a DNA fragment. For example,
the 5’ ends of a decameric linker and a DNA molecule are phosphorylated
by polynucleotide kinase and then joined by the ligase from T4 phage
(Figure 5.11). This ligase can form a covalent bond between blunt-ended
(flush-ended) double-helical DNA molecules. Cohesive ends are produced
when these terminal extensions are cut by an appropriate restriction
enzyme. Thus, cohesive ends corresponding to a particular restriction enzyme
can be added to virtually any DNA molecule. We see here the fruits of
combining enzymatic and synthetic chemical approaches in crafting new DNA
molecules.
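
The formation of cohesive ends from a linker can be followed on the top strand alone. The sketch below is an illustration under simplifying assumptions: only one strand is modeled, the insert sequence is hypothetical, and the linker is the top strand of the decameric linker shown in Figure 5.11. EcoRI is modeled as cutting between the G and the A of each GAATTC site; the function names are invented for this example.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(_COMPLEMENT)[::-1]


def ecori_cut(top_strand):
    """Cut one strand (5'->3') between G and A of every GAATTC site."""
    pieces, start = [], 0
    pos = top_strand.find("GAATTC")
    while pos != -1:
        pieces.append(top_strand[start:pos + 1])
        start = pos + 1
        pos = top_strand.find("GAATTC", pos + 1)
    pieces.append(top_strand[start:])
    return pieces


linker = "CGGAATTCGG"        # top strand of the linker in Figure 5.11
insert = "ATGCCATTGACA"      # hypothetical blunt-ended DNA fragment
construct = linker + insert + linker   # after ligation of a linker to each end

pieces = ecori_cut(construct)
print(pieces)

# The four bases exposed by the staggered cut read the same on both strands,
# which is why any two EcoRI-cut ends can pair with each other.
overhang = "AATT"
print(reverse_complement(overhang) == overhang)
```

The middle piece begins with AATTCGG and ends with CGG, matching the top strand of the final product drawn in Figure 5.11.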


5’ (P)-                -OH 3’
3’ HO-                 -(P) 5’
DNA fragment or vector

5’ (P)- CGGAATTCGG -OH 3’
3’ HO- GGCTTAAGCC -(P) 5’
Decameric linker

T4 ligase

5’ (P)- CGGAATTCGG          CGGAATTCGG -OH 3’
3’ HO- GGCTTAAGCC          GGCTTAAGCC -(P) 5’

EcoRI restriction enzyme

5’ (P)- AATTCGG          CGG -OH 3’
3’ HO- GCC          GGCTTAA -(P) 5’

Plasmids and lambda phage are choice vectors for DNA cloning 
in bacteria 


Many plasmids and bacteriophages have been ingeniously modified by 
researchers to enhance the delivery of recombinant DNA molecules into 
bacteria and to facilitate the selection of bacteria harboring these vectors. As 
already mentioned, plasmids are circular double-stranded DNA molecules 
that occur naturally in some bacteria. They range in size from two to several 
hundred kilobases. Plasmids carry genes for the inactivation of antibiotics, 
the production of toxins, and the breakdown of natural products. These 
accessory chromosomes can replicate independently of the host chromo- 
some. In contrast with the host genome, they are dispensable under certain 
conditions. A bacterial cell may have no plasmids at all or it may house as 
many as 20 copies of a plasmid. 

Many plasmids have been optimized for a particular experimental task. 
For example, one class of plasmids, known as cloning vectors, is particularly 
suitable for the rapid insertion and replication of a collection of DNA frag- 
ments. The creative placement of antibiotic-resistance genes or reporter 
genes or both within these plasmids enables the rapid identification of those 
vectors that harbor the desired DNA insert. For example, in pBR322, one 
of the first plasmids used for this purpose, insertion of DNA at the SalI or
BamHI restriction site (Figure 5.12) inactivates the gene for tetracycline 
resistance, an effect called insertional inactivation. Cells containing pBR322 
with a DNA insert at one of these restriction sites are resistant to ampicillin 
but sensitive to tetracycline, and so they can be readily selected. Another
class of plasmids has been optimized for use as expression vectors for the
production of large amounts of protein. In addition to antibiotic-resistance 
genes, they contain promoter sequences designed to drive the transcription 
of large amounts of a protein-coding DNA sequence. Often, these vectors
contain sequences flanking the cloning site that simplify the addition of
fusion tags to the protein of interest (Section 3.1), greatly facilitating the
purification of the overexpressed protein. Both types of plasmid vectors
often feature a polylinker region that includes many unique restriction sites
within its sequence (Figure 5.13). This polylinker can be cleaved with many
different restriction enzymes or combinations of enzymes, providing great
versatility in the DNA fragments that can be inserted.

Figure 5.11 Formation of cohesive ends. Cohesive ends can be formed by the
addition and cleavage of a chemically synthesized linker.

Figure 5.12 Genetic map of the plasmid pBR322. This plasmid carries two
genes for antibiotic resistance. Like all other plasmids, it is a circular
duplex DNA.

Figure 5.13 A polylinker in the plasmid pUC18. The plasmid pUC18 includes a
polylinker within a gene for β-galactosidase (often called the lacZ gene).
Insertion of a DNA fragment into one of the many restriction sites within
this polylinker can be detected by the absence of β-galactosidase activity.
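
The selection logic of insertional inactivation in pBR322, described above, amounts to a simple decision rule. The sketch below encodes only that rule; the function name and the category labels are hypothetical, and it assumes that the insert lies within the tetracycline-resistance gene and that growth on each antibiotic has been scored as yes or no.

```python
def classify_colony(grows_on_ampicillin, grows_on_tetracycline):
    """Interpret the growth of a colony after transformation with pBR322."""
    if not grows_on_ampicillin:
        # Without the ampicillin-resistance gene the cell carries no plasmid.
        return "no plasmid"
    if grows_on_tetracycline:
        # Both resistance genes are intact, so nothing was inserted.
        return "plasmid without insert"
    # Ampicillin resistant but tetracycline sensitive:
    # the tetracycline-resistance gene has been interrupted.
    return "plasmid with insert"


for amp, tet in [(False, False), (True, True), (True, False)]:
    print(amp, tet, "->", classify_colony(amp, tet))
```

Only the third case, resistance to ampicillin combined with sensitivity to tetracycline, identifies the cells that harbor the desired recombinant plasmid.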

Another widely used vector, λ phage, enjoys a choice of lifestyles: this
bacteriophage can destroy its host or it can become part of its host 
(Figure 5.14). In the lytic pathway, viral functions are fully expressed: viral 
DNA and proteins are quickly produced and packaged into virus particles, 
leading to the lysis (destruction) of the host cell and the sudden appearance 
of about 100 progeny virus particles, or virions. In the lysogenic pathway, the 
phage DNA becomes inserted into the host-cell genome and can be repli- 
cated together with host-cell DNA for many generations, remaining inac- 
tive. Certain environmental changes can trigger the expression of this dor- 
mant viral DNA, which leads to the formation of progeny viruses and lysis 
of the host. Large segments of the 48-kb DNA of λ phage are not essential
for productive infection and can be replaced by foreign DNA, thus making
λ phage an ideal vector.

Figure 5.14 Alternative infection modes for λ phage. Lambda phage can multiply within a
host and lyse it (lytic pathway) or its DNA can become integrated into the host genome (lysogenic
pathway), where it is dormant until activated.

Figure 5.15 Mutant λ phage as a cloning vector. The packaging process selects DNA
molecules that contain an insert.


Mutant λ phages designed for cloning have been constructed. An especially
useful one called λgt-λβ contains only two EcoRI cleavage sites
instead of the five normally present (Figure 5.15). After cleavage, the
middle segment of this λ DNA molecule can be removed. The two remaining
pieces of DNA (called arms) have a combined length equal to 72% of a normal
genome length. This amount of DNA is too little to be packaged into a
particle, which can take up only DNA measuring from 78% to 105% of a
normal genome. However, a suitably long DNA insert (such as 10 kb)
between the two ends of λ DNA enables such a recombinant DNA molecule
(93% of normal length) to be packaged. Nearly all infectious λ particles
formed in this way will contain an inserted piece of foreign DNA. Another
advantage of using these modified viruses as vectors is that they enter
bacteria much more easily than do plasmids. Among the variety of λ
mutants that have been constructed for use as cloning vectors, one of them,
called a cosmid, is essentially a hybrid of λ phage and a plasmid that can
serve as a vector for large DNA inserts (as large as 45 kb).
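
The packaging constraint on this λ vector can be checked with a little arithmetic. The sketch below takes the figures from the text (a 48-kb genome, arms equal to 72% of the genome, and a packaging window of 78% to 105%) and assumes that lengths simply add; the constant and function names are invented for this example.

```python
GENOME_KB = 48.0        # normal λ genome length, from the text
ARMS_FRACTION = 0.72    # combined length of the two arms
MIN_FRACTION = 0.78     # smallest DNA that can be packaged
MAX_FRACTION = 1.05     # largest DNA that can be packaged


def packaged_fraction(insert_kb):
    """Length of arms plus insert, as a fraction of the normal genome."""
    return ARMS_FRACTION + insert_kb / GENOME_KB


def can_be_packaged(insert_kb):
    return MIN_FRACTION <= packaged_fraction(insert_kb) <= MAX_FRACTION


def insert_window_kb():
    """Smallest and largest insert (kb) compatible with packaging."""
    return ((MIN_FRACTION - ARMS_FRACTION) * GENOME_KB,
            (MAX_FRACTION - ARMS_FRACTION) * GENOME_KB)


print(f"10-kb insert: {packaged_fraction(10):.0%} of normal length")
print("arms alone packaged?", can_be_packaged(0))
low, high = insert_window_kb()
print(f"acceptable inserts: about {low:.1f} to {high:.1f} kb")
```

With these numbers a 10-kb insert brings the recombinant molecule to about 93% of normal length, in agreement with the text, whereas the arms alone fall below the packaging limit. The same arithmetic places the acceptable insert size between roughly 3 and 16 kb.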


Bacterial and yeast artificial chromosomes


Much larger pieces of DNA can be propagated in bacterial artificial
chromosomes (BACs) or yeast artificial chromosomes (YACs). BACs are highly
engineered versions of the E. coli fertility factor (F factor) that can
include inserts as large as 300 kb. YACs contain a centromere, an
autonomously replicating sequence (ARS, where replication begins), a pair of
telomeres (normal ends of eukaryotic chromosomes), selectable marker genes,
and a cloning site (Figure 5.16). Inserts as large as 1000 kb can be cloned
into YAC vectors.

Figure 5.16 Diagram of a yeast artificial chromosome (YAC). These vectors
include features necessary for replication and stability in yeast cells.
[Diagram labels: telomere; autonomously replicating sequence (ARS);
centromere; DNA insert (100 to 1000 kb); telomere.]


Specific genes can be cloned from digests of genomic DNA


Ingenious cloning and selection methods have made it possible to isolate
small stretches of DNA in a genome containing more than 3 × 10⁶ kb. The
approach is to prepare a large collection (library) of DNA fragments and
then to identify those members of the collection that have the gene of
interest. Hence, to clone a gene that is present just once in an entire
genome, two critical components must be available: a specific
oligonucleotide probe for the gene of interest and a DNA library that can be
screened rapidly.


Figure 5.17 Probes generated from a 
protein sequence. A probe can be 
generated by synthesizing all possible 
oligonucleotides encoding a particular 
sequence of amino acids. Because of the 
degeneracy of the genetic code, 256 distinct 
oligonucleotides must be synthesized to 
ensure that the probe matching the sequence 
of seven amino acids in this example is 
present.

How is a specific probe obtained? In one approach, a probe for a gene
can be prepared if a part of the amino acid sequence of the protein encoded 
by the gene is known. Peptide sequencing of a purified protein (Chapter 3) 
or knowledge of the sequence of a homologous protein from a related species 
(Chapter 6) are two potential sources of such information. However, a prob- 
lem arises because a single peptide sequence can be encoded by a number of 
different oligonucleotides (Figure 5.17). Thus, for this purpose, peptide 
sequences containing tryptophan and methionine are preferred, because 
these amino acids are specified by a single codon, whereas other amino acid 
residues have between two and six codons (see Table 4.5). All the DNA 
sequences (or their complements) that encode the selected peptide sequence 
are synthesized by the solid-phase method and made radioactive by
phosphorylating their 5’ ends with ³²P.
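
The preference for peptides rich in tryptophan and methionine follows directly from counting codons. The sketch below uses the number of codons for each amino acid in the standard genetic code and multiplies them along a peptide to obtain the number of distinct oligonucleotides that would have to be synthesized. The two peptides are hypothetical examples, not the one shown in Figure 5.17, and the function name is invented here.

```python
from math import prod

# Number of codons for each amino acid in the standard genetic code.
CODON_COUNTS = {
    "Ala": 4, "Arg": 6, "Asn": 2, "Asp": 2, "Cys": 2,
    "Gln": 2, "Glu": 2, "Gly": 4, "His": 2, "Ile": 3,
    "Leu": 6, "Lys": 2, "Met": 1, "Phe": 2, "Pro": 4,
    "Ser": 6, "Thr": 4, "Trp": 1, "Tyr": 2, "Val": 4,
}


def probe_pool_size(peptide):
    """Number of distinct oligonucleotides that encode the peptide."""
    return prod(CODON_COUNTS[residue] for residue in peptide)


favorable = ["Trp", "Met", "Asp", "Cys", "Met"]     # hypothetical peptide
unfavorable = ["Leu", "Ser", "Arg", "Gly", "Ala"]   # hypothetical peptide

print(probe_pool_size(favorable))
print(probe_pool_size(unfavorable))
```

For peptides of the same length, the pool of possible probes differs by nearly three orders of magnitude, which is why a stretch of sequence with few alternative codons is chosen whenever one is available.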

Alternatively, probes can be obtained from the corresponding mRNA 
from cells in which it is abundant. For example, precursors of red blood cells 
contain large amounts of mRNA for hemoglobin, and plasma cells are rich 
in mRNAs for antibody molecules. The mRNAs from these cells can be 
fractionated by size to enrich for the mRNA of interest. As will be described 
shortly, a DNA complementary to this mRNA can be synthesized in vitro 
and cloned to produce a highly specific probe. 

To prepare the DNA library, a sample containing many copies of total 
genomic DNA is first mechanically sheared or partly digested by restriction 
enzymes into large fragments (Figure 5.18). This process yields a nearly 
random population of overlapping DNA fragments. These fragments are 
then separated by gel electrophoresis to isolate the set of all fragments that 
are about 15 kb long. Synthetic linkers are attached to the ends of these
fragments, cohesive ends are formed, and the fragments are then inserted into a
vector, such as λ phage DNA, prepared with the same cohesive ends. E. coli
bacteria are then infected with these recombinant phages. These phages
replicate themselves and then lyse their bacterial hosts. The resulting lysate
contains fragments of human DNA housed in a sufficiently large number of
virus particles to ensure that nearly the entire genome is represented. These
phages constitute a genomic library. Phages can be propagated indefinitely,
and so the library can be used repeatedly over long periods.

Figure 5.18 Creation of a genomic library. A genomic library can be created
from a digest of a whole complex genome. On fragmentation of the genomic DNA
into overlapping segments, the DNA is inserted into the λ phage vector
(shown in yellow). Packaging into virions and amplification by infection in
E. coli yields a genomic library.

This genomic library is then screened to find the very small number of 
phages harboring the gene of interest. For the human genome, a calculation 
shows that a 99% probability of success requires screening about 500,000 
clones; hence, a very rapid and efficient screening process is essential. Rapid 
screening can be accomplished by DNA hybridization. 
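The text quotes the number of clones without showing the calculation. One commonly used estimate treats each clone as an independent random sample that covers a fraction f of the genome, so that the number of clones needed for a probability P of including a given sequence is N = ln(1 - P) / ln(1 - f). The sketch below applies this formula; the insert length and genome size are assumptions made for illustration and may differ from those behind the figure quoted above, so only the order of magnitude should be compared.

```python
import math


def clones_needed(probability, insert_bp, genome_bp):
    """Clones required so that a given sequence is present with the stated
    probability, assuming randomly placed inserts of uniform length."""
    fraction = insert_bp / genome_bp  # fraction of the genome in one clone
    return math.ceil(math.log(1 - probability) / math.log1p(-fraction))


# Assumed values: 15-kb inserts and a haploid genome of 3 billion base pairs.
n_clones = clones_needed(0.99, 15_000, 3_000_000_000)
print(n_clones)
```

Under these assumptions the estimate falls in the range of several hundred thousand to a million clones, which is why a rapid screening method is essential. Longer inserts reduce the number proportionally.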

A dilute suspension of the recombinant phages is first plated on a lawn 
of bacteria (Figure 5.19). Where each phage particle has landed and infected 
a bacterium, a plaque containing identical phages develops on the plate. A 
replica of this master plate is then made by applying a sheet of nitrocellu- 
lose. Infected bacteria and phage DNA released from lysed cells adhere to 
the sheet in a pattern of spots corresponding to the plaques. Intact bacteria 
on this sheet are lysed with NaOH, which also serves to denature the DNA 
so that it becomes accessible for hybridization with a ³²P-labeled probe. The
presence of a specific DNA sequence in a single spot on the replica can be 
detected by using a radioactive complementary DNA or RNA molecule as 
a probe. Autoradiography then reveals the positions of spots harboring 
recombinant DNA. The corresponding plaques are picked out of the intact 
master plate and grown. A single investigator can readily screen a million 
clones in a day. This method makes it possible to isolate virtually any gene, 
provided that a probe is available. 

Figure 5.19 Screening a genomic library for a specific gene. Here, a plate
is tested for plaques containing gene a of Figure 5.18.

Figure 5.20 Formation of a cDNA duplex. A complementary DNA (cDNA) duplex
is created from mRNA by using reverse transcriptase to synthesize a cDNA
strand, first along the mRNA template and then, after digestion of the
mRNA, along that same newly synthesized cDNA strand.


Complementary DNA prepared from mRNA can be expressed 
in host cells 


The preparation of eukaryotic DNA libraries presents unique challenges, 
especially if the researcher is interested primarily in the protein-coding 
region of a particular gene. Recall that most mammalian genes are mosaics 
of introns and exons. These interrupted genes cannot be expressed by bac- 
teria, which lack the machinery to splice introns out of the primary tran- 
script. However, this difficulty can be circumvented by causing bacteria to 
take up recombinant DNA that is complementary to mRNA, where the 
intronic sequences have been removed. 

The key to forming complementary DNA is the enzyme reverse tran- 
scriptase. As discussed in Section 4.3, a retrovirus uses this enzyme to form 
a DNA-RNA hybrid in replicating its genomic RNA. Reverse transcriptase 
synthesizes a DNA strand complementary to an RNA template if the tran- 
scriptase is provided with a DNA primer that is base-paired to the RNA 
and contains a free 3’-OH group. We can use a simple sequence of linked 
thymidine [oligo(T)] residues as the primer. This oligo(T) sequence pairs 
with the poly(A) sequence at the 3’ end of most eukaryotic mRNA mole- 
cules (Section 4.4), as shown in Figure 5.20. The reverse transcriptase then 
synthesizes the rest of the cDNA strand in the presence of the four
deoxyribonucleoside triphosphates. The RNA strand of this RNA-DNA hybrid
is subsequently hydrolyzed by raising the pH. Unlike RNA, DNA is resistant
to alkaline hydrolysis. The single-stranded DNA is converted into
double-stranded DNA by creating another primer site. The enzyme termi- 
nal transferase adds nucleotides—for instance, several residues of dG—to 
the 3’ end of DNA. Oligo(dC) can bind to dG residues and prime the 
synthesis of the second DNA strand. Synthetic linkers can be added to this 
double-helical DNA for ligation to a suitable vector. Complementary DNA 
for all mRNA that a cell contains can be made, inserted into vectors, and 
then inserted into bacteria. Such a collection is called a cDNA library. 

Complementary DNA molecules can be inserted into expression vectors
to enable the production of the corresponding protein of interest. Clones of 
cDNA can be screened on the basis of their capacity to direct the synthesis 
of a foreign protein in bacteria, a technique referred to as expression cloning. 
A radioactive antibody specific for the protein of interest can be used to 
identify colonies of bacteria that express the corresponding protein product 
(Figure 5.21). As described earlier, spots of bacteria on a replica plate are 
lysed to release proteins, which bind to an applied nitrocellulose filter. With 
the addition of ¹²⁵I-labeled antibody specific for the protein of interest,
autoradiography reveals the location of the desired colonies on the master
plate. This immunochemical screening approach can be used whenever a
protein is expressed and a corresponding antibody is available.

Figure 5.21 Screening of cDNA clones. A method of screening for cDNA
clones is to identify expressed products by staining with specific antibody.

Complementary DNA has many applications beyond the generation of 
genetic libraries. The overproduction and purification of most eukaryotic 
proteins in prokaryotic cells necessitates the insertion of cDNA into plas- 
mid vectors. For example, proinsulin, a precursor of insulin, is synthesized 
by bacteria harboring plasmids that contain DNA complementary to 
mRNA for proinsulin (Figure 5.22). Indeed, bacteria produce much of the 
insulin used today by millions of diabetics. 

Figure 5.22 Synthesis of proinsulin by bacteria. Proinsulin, a precursor of
insulin, can be synthesized by transformed (genetically altered) clones of
E. coli. The clones contain the mammalian proinsulin gene.


Proteins with new functions can be created through directed changes
in DNA


Much has been learned about genes and proteins by analyzing the effects 
that mutations have on their structure and function. In the classic genetic 
approach, mutations are generated randomly throughout the genome of a 
host organism, and those individuals exhibiting a phenotype of interest are 
selected. Analysis of these mutants then reveals which genes are altered, and 
DNA sequencing identifies the precise nature of the changes. Recombinant 
DNA technology now makes the creation of specific mutations feasible in vitro. 
We can construct new genes with designed properties by making three 
kinds of directed changes: deletions, insertions, and substitutions. 


Deletions. A specific deletion can be produced by cleaving a plasmid at two 
sites with a restriction enzyme and ligating to form a smaller circle. This 
simple approach usually removes a large block of DNA. A smaller deletion 
can be made by cutting a plasmid at a single site. The ends of the linear 
DNA are then digested by an exonuclease that removes nucleotides from 
both strands. The shortened piece of DNA is then ligated to form a circle 
that is missing a short length of DNA about the restriction site. 


Substitutions: oligonucleotide-directed mutagenesis. Mutant proteins with 
single amino acid substitutions can be readily produced by oligonucleotide- 
directed mutagenesis (Figure 5.23). Suppose that we want to replace a 
particular serine residue with cysteine. This mutation can be made if (1) we 
have a plasmid containing the gene or cDNA for the protein and (2) we 
know the base sequence around the site to be altered. If the serine of interest 
is encoded by TCT, mutation of the central base from C to G yields the 
TGT codon, which encodes cysteine. This type of mutation is called a point 
mutation because only one base is altered. To introduce this mutation into 
our plasmid, we prepare an oligonucleotide primer that is complementary to 
this region of the gene except that it contains TGT instead of TCT. The two 
strands of the plasmid are separated, and the primer is then annealed to the 
complementary strand. The mismatch of 1 of 15 base pairs is tolerable if the 
annealing is carried out at an appropriate temperature. After annealing to 
the complementary strand, the primer is elongated by DNA polymerase, 
and the double-stranded circle is closed by adding DNA ligase. Subsequent 
replication of this duplex yields two kinds of progeny plasmid, half with the 
original TCT sequence and half with the mutant TGT sequence. Expression 
of the plasmid containing the new TGT sequence will produce a protein 
with the desired substitution of cysteine for serine at a unique site. We will 
encounter many examples of the use of oligonucleotide-directed mutagen- 
esis to precisely alter regulatory regions of genes and to produce proteins 
with tailor-made features. 
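The design of the mutagenic primer can be sketched in a few lines. This is an illustration only: the 15-base region shown is hypothetical (only the TCT and TGT codons come from the text), and the helper names are our own.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Sequence of the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def mutagenic_primer(region, codon_start, new_codon):
    """Copy the coding-strand region, replacing one codon with new_codon."""
    return region[:codon_start] + new_codon + region[codon_start + 3:]


def mismatches(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical 15-base stretch of the coding strand; the serine codon TCT
# occupies positions 6-8 (0-based).
region = "GGAACTTCTGCAGTC"
primer = mutagenic_primer(region, 6, "TGT")   # TCT (Ser) -> TGT (Cys)
template = reverse_complement(region)          # strand the primer anneals to

print(primer)                       # primer carrying the point mutation
print(mismatches(primer, region))   # mismatched positions out of 15
```

The primer differs from the wild-type sequence at a single position, which corresponds to the 1-in-15 mismatch described above.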

Figure 5.23 Oligonucleotide-directed mutagenesis. A primer containing a
mismatched nucleotide is used to produce a desired change in the DNA
sequence.


Insertions: cassette mutagenesis. In cassette mutagenesis, a variety of
mutations, including insertions, deletions, and multiple point mutations,
can be introduced into the gene of interest. A plasmid harboring the
original gene is cut with a pair of restriction enzymes to remove a short
segment (Figure 5.24). A synthetic double-stranded oligonucleotide (the
cassette) carrying the genetic alterations of interest is prepared with
cohesive ends that are complementary to the ends of the cut plasmid.
Ligation of the cassette into the plasmid yields the desired mutated gene
product.


Designer genes. Novel proteins can also be created by splicing together
gene segments that encode domains that are not associated in nature. For
example, a gene for an antibody can be joined to a gene for a toxin to
produce a chimeric protein that kills cells that are recognized by the
antibody. These immunotoxins are being evaluated as anticancer agents.
Furthermore, noninfectious coat proteins of viruses can be produced in
large amounts by recombinant DNA methods. They can serve as synthetic
vaccines that are safer than conventional vaccines prepared by inactivating
pathogenic viruses. A subunit of the hepatitis B virus produced in yeast is
proving to be an effective vaccine against this debilitating viral disease.
Finally, entirely new genes can be synthesized de novo by the solid-phase
method. These genes can encode proteins with no known counterparts in
nature.

Recombinant methods enable the exploration of the functional effects
of disease-causing mutations


The application of recombinant DNA technology to the production of
mutated proteins has had a significant effect in the study of ALS. Recall
that genetic studies had identified a number of ALS-inducing mutations
within the gene encoding Cu/Zn superoxide dismutase. As we shall learn in
Section 18.3, SOD1 catalyzes the conversion of the superoxide radical anion
into hydrogen peroxide, which, in turn, is converted into molecular oxygen
and water by catalase. To study the potential effect of ALS-causing
mutations on SOD1 structure and function, the SOD1 gene was isolated from a
human cDNA library by PCR amplification. The amplified fragments
containing the gene were then digested by an appropriate restriction
enzyme and inserted into a similarly digested plasmid vector. Mutations
corresponding to those observed in ALS patients were introduced into these
plasmids by oligonucleotide-directed mutagenesis, and the protein products
were expressed and assayed for their catalytic activity. Surprisingly, these
mutations did not significantly alter the enzymatic activity of the
corresponding recombinant proteins. These observations have led to the
prevailing notion that these mutations impart toxic properties to SOD1.
Although the nature of this toxicity is not yet completely understood, one
hypothesis is that mutant SOD1 is prone to form toxic aggregates in the
cytoplasm of neuronal cells.

Figure 5.24 Cassette mutagenesis. DNA is cleaved at a pair of unique
restriction sites by two different restriction endonucleases. A synthetic
oligonucleotide with ends that are complementary to these sites (the
cassette) is then ligated to the cleaved DNA. The method is highly
versatile because the inserted DNA can have any desired sequence.


5.3 Complete Genomes Have Been Sequenced 
and Analyzed 


The methods just described are extremely effective for the isolation and 
characterization of fragments of DNA. However, the genomes of organisms 
ranging from viruses to human beings contain longer sequences of DNA, 
arranged in very specific ways crucial for their integrated functions. Is it 
possible to sequence complete genomes and analyze them? For small 
genomes, this sequencing was accomplished soon after DNA-sequencing
methods were developed. Sanger and coworkers determined the complete
sequence of the 5,386 bases in the DNA of the φX174 DNA virus in
1977, just a quarter century after Sanger’s pioneering elucidation of the
amino acid sequence of a protein. This tour de force was followed several 
years later by the determination of the sequence of human mitochondrial 
DNA, a double-stranded circular DNA molecule containing 16,569 base 
pairs. It encodes 2 ribosomal RNAs, 22 transfer RNAs, and 13 proteins. 
Many other viral genomes were sequenced in subsequent years. However, 
the genomes of free-living organisms presented a great challenge because 
even the simplest comprises more than 1 million base pairs. Thus, sequenc- 
ing projects require both rapid sequencing techniques and efficient meth- 
ods for assembling many short stretches of 300 to 500 base pairs into a 
complete sequence. 


The genomes of organisms ranging from bacteria to multicellular 
eukaryotes have been sequenced 


With the development of automatic DNA sequencers based on fluorescent 
dideoxynucleotide chain terminators, high-volume, rapid DNA sequencing 
became a reality. The genome sequence of the bacterium Haemophilus influ- 
enzae was determined in 1995 by using a “shotgun” approach. The genomic 
DNA was sheared randomly into fragments that were then sequenced. 
Computer programs assembled the complete sequence by matching up 
overlapping regions between fragments. The H. influenzae genome com- 
prises 1,830,137 base pairs and encodes approximately 1,740 proteins 
(Figure 5.25). Using similar approaches, investigators have determined the 
sequences of more than 100 bacterial and archaeal species including key 
model organisms such as E. coli, Salmonella typhimurium, and Archaeoglobus
fulgidus, as well as pathogenic organisms such as Yersinia pestis (causing
bubonic plague) and Bacillus anthracis (anthrax). 
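The assembly step of the shotgun approach, matching up overlapping regions between fragments, can be illustrated with a toy example. The sketch below uses a simple greedy strategy (repeatedly merging the pair of reads with the longest overlap). It is not the algorithm used for the H. influenzae project, and it ignores sequencing errors, reads from the opposite strand, and repeated sequences, all of which real assemblers must handle. The reads shown are invented.

```python
def overlap(a, b, min_len):
    """Length of the longest suffix of a that matches a prefix of b,
    or 0 if no match of at least min_len bases exists."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads, min_len=3):
    """Merge reads pairwise by longest overlap until no overlaps remain."""
    reads = list(dict.fromkeys(reads))  # drop exact duplicates
    # Drop reads wholly contained in another read.
    reads = [r for r in reads if not any(r != s and r in s for s in reads)]
    while len(reads) > 1:
        best_len, best_i, best_j = 0, None, None
        for i, a in enumerate(reads):
            for j, b in enumerate(reads):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_len:
                        best_len, best_i, best_j = n, i, j
        if best_len == 0:
            break  # remaining reads do not overlap; leave as separate contigs
        merged = reads[best_i] + reads[best_j][best_len:]
        reads = [r for k, r in enumerate(reads) if k not in (best_i, best_j)]
        reads.append(merged)
    return reads


# Invented reads covering a 21-base "genome".
reads = ["TAGCCTAGGA", "ATGGCGTACG", "GTACGTTAGCC"]
contigs = greedy_assemble(reads)
print(contigs)
```

With these three overlapping reads, the procedure reconstructs a single contiguous sequence regardless of the order in which the reads are supplied.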

The first eukaryotic genome to be completely sequenced was that of 
baker’s yeast, Saccharomyces cerevisiae, in 1996. The yeast genome com- 
prises approximately 12 million base pairs, distributed on 16 chromosomes, 
and encodes more than 6,000 proteins. This achievement was followed in 
1998 by the first complete sequencing of the genome of a multicellular
organism, the nematode Caenorhabditis elegans, which contains 97 million
base pairs. This genome includes more than 19,000 genes. The genomes
of many additional organisms widely used in biological and biomedical
research have now been sequenced, including those of the fruit fly
Drosophila melanogaster, the model plant Arabidopsis thaliana, the mouse,
the rat, and the dog. Note that the sequencing of a complex genome
proceeds in various stages from “draft” through “completed” to “finished.”
Even after a sequence has been declared “finished,” some sections, such as
the repetitive sequences that make up heterochromatin, may be missing
because these DNA sequences are very difficult to manipulate with the use
of standard techniques.

Figure 5.25 A complete genome. The diagram depicts the genome of
Haemophilus influenzae, the first complete genome of a free-living organism
to be sequenced. The genome encodes more than 1700 proteins and 70 RNA
molecules. The likely function of approximately one-half of the proteins
was determined by comparisons with sequences of proteins already
characterized in other species. [From R. D. Fleischmann et al.,
Science 269:496-512, 1995; scan courtesy of The Institute for Genomic
Research.]


The sequencing of the human genome has been finished 


The ultimate goal of much of genomics research has been the sequencing 
and analysis of the human genome. Given that the human genome com- 
prises approximately 3 billion base pairs of DNA distributed among 24 
chromosomes, the challenge of producing a complete sequence was daunt- 
ing. However, through an organized international effort of academic labo- 
ratories and private companies, the human genome has now progressed 
from a draft sequence first reported in 2001 to a finished sequence reported 
in late 2004 (Figure 5.26). 

The human genome is a rich source of information about many aspects 
of humanity including biochemistry and evolution. Analysis of the genome 
will continue for many years to come. Developing an inventory of protein- 
encoding genes is one of the first tasks. At the beginning of the genome- 
sequencing project, the number of such genes was estimated to be approxi- 
mately 100,000. With the availability of the completed (but not finished) 
genome, this estimate was reduced to between 30,000 and 35,000. With the 
finished sequence, the estimate fell to 20,000 to 25,000. We will use the 
estimate of 23,000 throughout this book. The reduction in this estimate is 
due, in part, to the realization that there are a large number of pseudogenes, 
many of which are formerly functional genes that have picked up mutations 
and are no longer expressed. For example, more than half of the genomic
regions that correspond to olfactory receptors, key molecules responsible
for our sense of smell, are pseudogenes (Section 33.1). The corresponding
regions in the genomes of other primates and rodents encode functional
olfactory receptors. Nonetheless, the surprisingly small number of genes
belies the complexity of the human proteome. Many genes encode more than
one protein through mechanisms such as alternative splicing of mRNA and
posttranslational modifications of proteins. The different proteins encoded
by a single gene often display important variations in functional
properties.

Mobile genetic elements, which are related to retroviruses (Section 4.3),
have inserted themselves throughout the genome over time. Most of these
elements have accumulated mutations and are no longer functional. More
than 1 million Alu sequences, each approximately 300 bases in length, are
present in the human genome. Alu sequences are examples of SINEs, short
interspersed elements. The human genome also includes nearly 1 million
LINEs, long interspersed elements, DNA sequences that can be as long as
10 kilobase pairs (kbp). The roles of these elements as neutral genetic
parasites or instruments of genome evolution are under current
investigation.

Figure 5.26 The human genome. The human genome is arrayed on 46
chromosomes: 22 pairs of autosomes and the X and Y sex chromosomes. The
locations of several genes associated with important pathways in
biochemistry are highlighted.


“Next-generation” sequencing methods enable the rapid determination 
of a whole genome sequence 


Since the introduction of the Sanger dideoxy method in the mid-1970s,
significant advances have been made in DNA-sequencing technologies, enabling
the readout of progressively longer sequences with higher fidelity and short- 
er run times. The recent development of “next-generation” sequencing 
methods has extended this capability to formerly unforeseen levels. By com- 
bining technological breakthroughs in the handling of very small amounts of 
liquid, high-resolution optics, and computing power, these methods enable 
the parallel sequencing of more than 400,000 individual DNA fragments, at 
several hundred bases per fragment. Hence, a single 10-hour sequencing 
experiment can generate more than 100,000,000 bases (100 megabases). 
Although significant hurdles remain, this sequencing capacity suggests that 
the rapid sequencing of anyone’s genome at low cost is a very real possibility. 
Individual genome sequences will provide information about genetic varia- 
tion within populations and may usher in an era of personalized medicine, 
when these data can be used to guide treatment decisions. 
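The throughput figures quoted above follow from simple arithmetic, sketched here. The read length of 250 bases is an assumed stand-in for "several hundred bases per fragment," and the coverage depths are illustrative values that do not come from the text.

```python
def bases_per_run(fragments, read_length):
    """Total bases read in one parallel sequencing experiment."""
    return fragments * read_length


def runs_for_coverage(genome_bp, coverage, fragments, read_length):
    """Number of runs needed to read the genome `coverage` times over."""
    total_needed = genome_bp * coverage
    per_run = bases_per_run(fragments, read_length)
    return -(-total_needed // per_run)  # ceiling division on integers


per_run = bases_per_run(400_000, 250)          # assumed 250-base reads
runs_1x = runs_for_coverage(3_000_000_000, 1, 400_000, 250)
runs_10x = runs_for_coverage(3_000_000_000, 10, 400_000, 250)
print(per_run, runs_1x, runs_10x)
```

Under these assumptions, a single run yields 100 megabases, so reading each base of a 3-billion-base-pair genome even once would take dozens of such runs. This is one way to see why hurdles remained at the time of writing.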


Comparative genomics has become a powerful research tool 


Comparisons with genomes from other organisms are a source of insight 
into the human genome. The sequencing of the genome of the chimpanzee, 
our closest living relative, is nearing completion. The genomes of other 
mammals that are widely used in biological research, such as the mouse and
the rat, have been completed. Comparisons reveal that an astonishing 99%
of human genes have counterparts in these rodent genomes. However, these
genes have been substantially reassorted among chromosomes in the
estimated 75 million years of evolution since humans and rodents had a
common ancestor (Figure 5.27).

Figure 5.27 Genome comparison. A schematic comparison of the human genome
and the mouse genome shows reassortment of large chromosomal fragments.

The genomes of other organisms also have been determined specifically 
for use in comparative genomics. For example, the genomes of two species 
of puffer fish, Takifugu rubripes and Tetraodon nigroviridis, have been deter- 
mined. These genomes were selected because they are very small and lack 
much of the intergenic DNA present in such abundance in the human 
genome. The puffer fish genomes include fewer than 400 megabase pairs 
(Mbp), one-eighth of the number in the human genome, yet the puffer fish 
and human genomes contain essentially the same number of genes. 
Comparison of the genomes of these species with that of humans revealed 
more than 1000 formerly unrecognized human genes. Furthermore, com- 
parison of the two species of puffer fish, which had a common ancestor 
approximately 25 million years ago, is a source of insight into more-recent 
events in evolution. Comparative genomics is a powerful tool, both for 
interpreting the human genome and for understanding major events in the 
origin of genera and species. 


5.4 Eukaryotic Genes Can Be Quantitated and Manipulated 
with Considerable Precision 


After a gene of interest has been identified, cloned, and sequenced, it is 
often desirable to understand how that gene and its corresponding protein 
product function in the context of a whole cell or organism. It is now pos- 
sible to determine how the expression of a particular gene is regulated, how 
mutations in the gene affect the function of the corresponding protein 
product, and how the behavior of an entire cell or model organism is altered 
by the introduction of mutations within specific genes. Levels of transcrip- 
tion of large families of genes within cells and tissues can be readily quanti- 
tated and compared across a range of environmental conditions. Eukaryotic 
genes can be introduced into bacteria, and the bacteria can be used as facto- 
ries to produce a desired protein product. DNA can also be introduced into 
the cells of higher organisms. Genes introduced into animals are valuable 
tools for examining gene action, and they are the basis of gene therapy. 
Genes introduced into plants can make the plants resistant to pests, able to 
grow in harsh conditions, or carry greater quantities of essential nutrients. 
The manipulation of eukaryotic genes holds much promise as a source of 
medical and agricultural benefits, but it is also a source of controversy. 


Gene-expression levels can be comprehensively examined 


Most genes are present in the same quantity in every cell—namely, one copy 
per haploid cell or two copies per diploid cell. However, the level at which a 
gene is expressed, as indicated by mRNA quantities, can vary widely, rang- 
ing from no expression to hundreds of mRNA copies per cell. Gene- 
expression patterns vary from cell type to cell type, distinguishing, for 
example, a muscle cell from a nerve cell. Even within the same cell, gene- 
expression levels may vary as the cell responds to changes in physiological 
circumstances. Note that mRNA levels sometimes correlate with the levels 
of proteins expressed, but this correlation does not always hold. Thus, care 
must be exercised when interpreting the results of mRNA levels alone. 

The quantity of individual mRNA transcripts can be determined by
quantitative PCR (qPCR), or real-time PCR. RNA is first isolated from the
cell or tissue of interest. With the use of reverse transcriptase, cDNA is
prepared from this RNA sample. In one qPCR approach, the transcript of
interest is PCR amplified with the appropriate primers in the presence of
the dye SYBR Green I, which fluoresces brightly when bound to
double-stranded DNA. In the initial PCR cycles, not enough duplex is
present to allow a detectable fluorescence signal. However, after repeated
PCR cycles, the fluorescence intensity exceeds the detection threshold and
continues to rise as the number of duplexes corresponding to the transcript
of interest increases (Figure 5.28). Importantly, the cycle number at which
the fluorescence becomes detectable over a defined threshold (the C_T) is
inversely related to the number of copies of the original template: the
more template present at the start, the fewer cycles are needed to reach
the threshold. After the relation between the original copy number and the
C_T has been established with the use of a known standard, subsequent qPCR
experiments can be used to determine the number of copies of any desired
transcript in the original sample, provided the appropriate primers are
available.

[...] of the original cDNA template. [After N. J. Walker,
Science 296:557-559, 2002.]

Figure 5.29 Gene-expression analysis with the use of microarrays. The
expression levels of thousands of genes can be simultaneously analyzed by
using DNA microarrays (gene chips). Here, an analysis of 1733 genes in 84
breast-tumor samples reveals that the tumors can be divided into distinct
classes on the basis of their gene-expression patterns. Red corresponds to
gene induction and green corresponds to gene repression. [After C. M. Perou
et al., Nature 406:747-752, 2000.]
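The use of a known standard to convert a threshold cycle into a copy number can be sketched as follows. Because the amount of product roughly doubles with each cycle, the threshold cycle is commonly modeled as a straight line in the logarithm of the starting copy number. The sketch fits that line to a dilution series and inverts it for an unknown sample. The standard-curve values are invented for illustration, and the function names are our own.

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of C_T = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate the starting copy number."""
    return 10 ** ((ct - intercept) / slope)


# Invented dilution series: known template copies and measured C_T values.
standard_copies = [1e3, 1e4, 1e5, 1e6]
standard_ct = [30.0, 26.7, 23.4, 20.1]

slope, intercept = fit_standard_curve(standard_copies, standard_ct)
unknown = copies_from_ct(25.05, slope, intercept)
print(slope, intercept, unknown)
```

The negative slope reflects the inverse relation between threshold cycle and template abundance: each tenfold increase in starting copies lowers the threshold cycle by a little more than three cycles in this invented data set.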

Although qPCR is a powerful technique for quantitation of a small 
number of transcripts in any given experiment, we can now use our knowl- 
edge of complete genome sequences to investigate an entire transcriptome, 
the pattern and level of expression of all genes in a particular cell or tissue. 
One of the most powerful methods developed to date for this purpose is 
based on hybridization. Oligonucleotides or cDNAs are affixed to a solid 
support such as a microscope slide, creating a DNA microarray. Fluores- 
cently labeled cDNA is hybridized to the slide to reveal the expression level 
for each gene, identifiable by its known position within the microarray 
(Figure 5.29). The intensity of the fluorescent spot on the chip reveals the 
extent of the transcription of a particular gene. DNA chips have been pre- 
pared that contain oligonucleotides complementary to all known protein- 
encoding genes, 6200 in number, within the yeast genome (Figure 5.30). An 
analysis of mRNA pools with the use of these chips revealed, for example, 
that approximately 50% of all yeast genes are expressed at steady-state levels 
of 0.1 to 1.0 mRNA copy per cell. This method readily detected variations 
in expression levels displayed by specific genes under different growth 
conditions. 

Microarray analyses can be quite informative in the study of
gene-expression changes in diseased mammals compared with their healthy
counterparts. As noted earlier, although ALS-causing mutations within the
SOD1 gene had been identified, the mechanism by which the mutant SOD1
protein ultimately leads to motor-neuron loss remains a mystery. Many
research groups have used microarray analysis of neuronal cells isolated
from humans and mice carrying SOD1 mutations to search for clues into
the pathways of disease progression and to suggest potential avenues
for treatment. These studies have implicated the participation of a variety
of biochemical pathways, including immunological activation, handling of
oxidative stress, and protein degradation, in the cellular response to the
mutant, toxic forms of SOD1.


New genes inserted into eukaryotic cells can be efficiently expressed 


Bacteria are ideal hosts for the amplification of DNA molecules. They can 
also serve as factories for the production of a wide range of prokaryotic and 
eukaryotic proteins. However, bacteria lack the necessary enzymes to carry 
out posttranslational modifications such as the specific cleavage of poly- 
peptides and the attachment of carbohydrate units. Thus, many eukaryotic 
genes can be correctly expressed only in eukaryotic host cells. The intro- 
duction of recombinant DNA molecules into cells of higher organisms can 
also be a source of insight into how their genes are organized and expressed. 
How are genes turned on and off in embryological development? How does 
a fertilized egg give rise to an organism with highly differentiated cells that 
are organized in space and time? These central questions of biology can 
now be fruitfully approached by expressing foreign genes in mammalian 
cells. 

Recombinant DNA molecules can be introduced into animal cells in 
several ways. In one method, foreign DNA molecules precipitated by 
calcium phosphate are taken up by animal cells. A small fraction of the 
imported DNA becomes stably integrated into the chromosomal DNA. 
The efficiency of incorporation is low, but the method is useful because it is 
easy to apply. In another method, DNA is microinjected into cells. A
fine-tipped (0.1-μm-diameter) glass micropipette containing a solution of
foreign DNA is inserted into a nucleus (Figure 5.31). A skilled
investigator can inject hundreds of cells per hour. About 2% of injected
mouse cells are viable and contain the new gene. In a third method, viruses
are used to introduce new genes into animal cells. The most effective
vectors are retroviruses,
which replicate through DNA intermediates, the reverse of 
the normal flow of information. A striking feature of the life 
cycle of a retrovirus is that the double-helical DNA form of 
its genome, produced by the action of reverse transcriptase, 
becomes randomly incorporated into host chromosomal 
DNA. This DNA version of the viral genome, called proviral 
DNA, can be efficiently expressed by the host cell and repli- 
cated along with normal cellular DNA. Retroviruses do not 
usually kill their hosts. Foreign genes have been efficiently 
introduced into mammalian cells by infecting them with vec- 
tors derived from the Moloney murine leukemia virus, which 
can accept inserts as long as 6 kb. Some genes introduced by 
this retroviral vector into the genome of a transformed host 
cell are efficiently expressed. 

Two other viral vectors are extensively used. Vaccinia virus, a large
DNA-containing virus, replicates in the cytoplasm of mammalian cells, where
it shuts down host-cell protein synthesis. Baculovirus infects insect cells,
which can be conveniently cultured. Insect larvae infected with this virus
can serve as efficient protein factories. Vectors based on these
large-genome viruses have been engineered to express DNA inserts efficiently.

Figure 5.30 Monitoring changes in yeast gene expression. This microarray
analysis shows levels of gene expression for yeast genes under different
conditions. [After V. R. Iyer et al., Nature 409:533–538, 2001.]

Figure 5.31 Microinjection of DNA. Cloned plasmid DNA is being
microinjected into the male pronucleus of a fertilized mouse egg.

Figure 5.32 Transgenic mice. Mice expressing human SOD1 harboring a known
ALS-causing mutation exhibit a phenotype similar to the human disease,
including the loss of motor neurons, voluntary muscle weakness, and
paralysis. [After C. S. Lobsinger et al., PNAS 104:7319–7326, 2007.
Copyright 2007 National Academy of Sciences, U.S.A.]


Transgenic animals harbor and express genes introduced into their 
germ lines 


As shown in Figure 5.31, plasmids harboring foreign genes can be microin- 
jected into the male pronucleus of fertilized mouse eggs, which are then 
inserted into the uterus of a foster-mother mouse. A subset of the resulting 
embryos in this host will then harbor the foreign gene; these embryos may 
develop into mature animals. Southern blotting of the DNA of the progeny 
can be used to determine which offspring carry the introduced gene. These 
transgenic mice are a powerful means of exploring the role of a specific gene 
in the development, growth, and behavior of an entire organism. Transgenic 
animals often serve as useful models for a particular disease process, 
enabling researchers to test the efficacy and safety of a newly developed 
therapy. 

Let us return to our example of ALS. Research groups have generated 
transgenic mouse lines that express forms of human superoxide dismutase 
that harbor mutations matching those identified in earlier genetic analyses. 
Many of these strains exhibit a clinical picture similar to that observed in 
ALS patients: progressive weakness of voluntary muscles and eventual 
paralysis, motor-neuron loss, and rapid progression to death (Figure 5.32). 
Since their first characterization in 1994, these strains continue to serve as 
valuable sources of information for the exploration of the mechanism, and 
potential treatment, of ALS. 

Gene disruption provides clues to gene function


A gene’s function can also be probed by inactivating the gene and looking 
for resulting abnormalities. Powerful methods have been developed for 
accomplishing gene disruption (also called gene knockout) in organisms such 
as yeast and mice. These methods rely on the process of homologous recom- 
bination. Through this process, regions of strong sequence similarity 
exchange segments of DNA. Foreign DNA inserted into a cell can thus 
disrupt any gene that is at least partly homologous by the exchange of seg- 
ments (Figure 5.33). Specific genes can be targeted if their nucleotide 
sequences are known. 

For example, the gene-knockout approach has been applied to the genes
encoding gene-regulatory proteins (also called transcription factors) that
control the differentiation of muscle cells. When both copies of the gene for
the regulatory protein myogenin are disrupted, an animal dies at birth
because it lacks functional skeletal muscle. Microscopic inspection reveals
that the tissues from which muscle normally forms contain precursor cells
that have failed to differentiate fully (Figure 5.34). Heterozygous mice
containing one normal myogenin gene and one disrupted gene appear normal,
suggesting that a single functional copy of the gene provides enough
myogenin for normal muscle development. Analogous studies have probed the
function of many other genes to generate animal models for known human
genetic diseases.

Figure 5.33 Gene disruption by homologous recombination. (A) A mutated
version of the gene to be disrupted is constructed, maintaining some
regions of homology with the normal gene (red). When the foreign mutated
gene is introduced into an embryonic stem cell, (B) recombination takes
place at regions of homology and (C) the normal (targeted) gene is
replaced, or “knocked out,” by the foreign gene. The cell is inserted into
embryos, and mice lacking the gene (knockout mice) are produced.




Figure 5.34 Consequences of gene disruption. Sections of muscle from normal (A) and 
gene-disrupted (B) mice, as viewed under the light microscope. Muscles do not develop properly in 
mice having both myogenin genes disrupted. [From P. Hasty, A. Bradley, J. H. Morris, D. G. 
Edmondson, J. M. Venuti, E. N. Olson, and W. H. Klein, Nature 364:501–506, 1993.]


RNA interference provides an additional tool for disrupting 
gene expression 


An extremely powerful tool for disrupting gene expression was
serendipitously discovered in the course of studies that required the
introduction of RNA into a cell. The introduction of a specific
double-stranded RNA molecule into a cell was found to suppress the
expression of genes that contained sequences present in the double-stranded
RNA molecule. Thus, the introduction of a specific RNA molecule can
interfere with the expression of a specific gene.


Figure 5.35 RNA interference 
mechanism. A double-stranded RNA 
molecule is cleaved into 21-bp fragments by 
the enzyme Dicer to produce siRNAs. These 
siRNAs are incorporated into the RNA-induced 
silencing complex (RISC), where the single- 
stranded RNAs guide the cleavage of mRNAs 
that contain complementary sequences. 


The mechanism of RNA interference has been largely established
(Figure 5.35). When a double-stranded RNA molecule is introduced into an
appropriate cell, the RNA is cleaved by an enzyme referred to as Dicer into
fragments approximately 21 nucleotides in length. Each fragment, termed a
small interfering RNA (siRNA), consists of 19 bp of double-stranded RNA
and 2 bases of unpaired RNA on each 3′ end. The siRNA is loaded into an
assembly of several proteins referred to as the RNA-induced silencing
complex (RISC), which unwinds the RNA duplex and cleaves one of the strands,
the so-called passenger strand. The uncleaved single-stranded RNA segment,
the guide strand, remains incorporated into the enzyme. The fully
assembled RISC cleaves mRNA molecules that contain exact complements
of the guide-strand sequence. Thus, levels of such mRNA molecules are
dramatically reduced.
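
The steps just described can be pictured with a toy string model. The sketch below is a deliberate simplification and not a description of the real enzymology: it treats the sense strand of a double-stranded RNA as a text string, "dices" it into consecutive 21-nucleotide windows (the duplex and its overhangs are ignored), takes the reverse complement of each window as a hypothetical guide strand, and reports where an mRNA contains an exact complement of a guide. The function names (`dice`, `reverse_complement`, `find_cleavage_sites`) and all sequences are invented for illustration.

```python
from __future__ import annotations

# Toy model of RNA interference. All names and sequences are illustrative.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def dice(sense_strand: str, size: int = 21) -> list[str]:
    """Cut the sense strand into consecutive fragments of `size` nucleotides.

    Only fragment length is modeled. A trailing piece shorter than `size`
    is discarded.
    """
    return [
        sense_strand[i:i + size]
        for i in range(0, len(sense_strand) - size + 1, size)
    ]


def find_cleavage_sites(mrna: str, guides: list[str]) -> list[tuple[int, str]]:
    """Return (index, guide) for every exact complement of a guide in mrna."""
    sites = []
    for guide in guides:
        target = reverse_complement(guide)  # mRNA sequence the guide pairs with
        start = mrna.find(target)
        while start != -1:
            sites.append((start, guide))
            start = mrna.find(target, start + 1)
    return sorted(sites)


if __name__ == "__main__":
    # Hypothetical sense strand of an introduced double-stranded RNA.
    dsrna_sense = "AUGGCUAGCUAGGAUCCGAUCGGCUAAUCGGAUCCAUGCAUGCAU"
    guides = [reverse_complement(fragment) for fragment in dice(dsrna_sense)]
    # A made-up mRNA that contains the same sequence as the sense strand.
    mrna = "GGGAAA" + dsrna_sense + "CCCUUU"
    for position, guide in find_cleavage_sites(mrna, guides):
        print(f"guide {guide} pairs with the mRNA starting at index {position}")
```

An mRNA lacking the matching sequence yields an empty list, mirroring the requirement for an exact complement of the guide strand.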

The machinery necessary for RNA interference is found in many cells. 
In some organisms such as C. elegans, RNA interference is quite efficient. 
Indeed, RNA interference can be induced simply by feeding C. elegans 
strains of E. coli that have been engineered to produce appropriate double- 
stranded RNA molecules. Although not as efficient in mammalian cells, 
RNA interference has emerged as a powerful research tool for reducing the 
expression of specific genes. Moreover, initial clinical trials of therapies 
based on RNA interference are underway. 


Tumor-inducing plasmids can be used to introduce new genes into 
plant cells 


The common soil bacterium Agrobacterium tumefaciens infects plants and
introduces foreign genes into plant cells (Figure 5.36). A lump of tumor
tissue called a crown gall grows at the site of infection. Crown galls
synthesize opines, a group of amino acid derivatives that are metabolized by
the infecting bacteria. In essence, the metabolism of the plant cell is
diverted to satisfy the highly distinctive appetite of the intruder.
Tumor-inducing plasmids (Ti plasmids) that are carried by A. tumefaciens
carry instructions for the switch to the tumor state and the synthesis of
opines. A small part of the Ti plasmid becomes integrated into the genome of
infected plant cells; this 20-kb segment is called T-DNA (transferred DNA;
Figure 5.37).

Ti-plasmid derivatives can be used as vectors to deliver foreign genes
into plant cells. First, a segment of foreign DNA is inserted into the T-DNA
region of a small plasmid through the use of restriction enzymes and ligases.
This synthetic plasmid is added to A. tumefaciens colonies harboring
naturally occurring Ti plasmids. By recombination, Ti plasmids containing the
foreign gene are formed. These Ti vectors hold great promise as tools for
exploring the genomes of plant cells and modifying plants to improve their
agricultural value and crop yield. However, they are not suitable for
transforming all types of plants. Ti-plasmid transfer is effective with dicots
(broad-leaved plants such as grapes) and a few kinds of monocots but not as
effective with economically important cereal monocots.

Figure 5.36 Tumors in plants. Crown gall, a plant tumor, is caused by a
bacterium (Agrobacterium tumefaciens) that carries a tumor-inducing plasmid
(Ti plasmid). [From M. Escobar et al., PNAS 98:13437–13442, 2001.
Copyright 2001 National Academy of Sciences, U.S.A.]

Foreign DNA can be introduced into cereal monocots as well as dicots 
by applying intense electric fields, a technique called electroporation 
(Figure 5.38). First, the cellulose wall surrounding plant cells is removed by 
adding cellulase; this treatment produces protoplasts, plant cells with 
exposed plasma membranes. Electric pulses are then applied to a suspen- 
sion of protoplasts and plasmid DNA. Because high electric fields make 
membranes transiently permeable to large molecules, plasmid DNA mole- 
cules enter the cells. The cell wall is then allowed to reform, and the plant 
cells are again viable. Maize cells and carrot cells have been stably trans- 
formed in this way with the use of plasmid DNA that includes genes for 
resistance to antibiotics. Moreover, the transformed cells efficiently express 
the plasmid DNA. Electroporation is also an effective means of delivering 
foreign DNA into animal cells and bacterial cells. 

The most effective means of transforming plant cells is through the use 
of “gene guns,” or bombardment-mediated transformation. DNA is coated 
onto 1-µm-diameter tungsten pellets, and these microprojectiles are fired
at the target cells with a velocity greater than 400 m s⁻¹. Despite its apparent
crudeness, this technique is proving to be the most effective way of trans- 
forming plants, especially important crop species such as soybean, corn, 
wheat, and rice. The gene-gun technique affords an opportunity to develop 
genetically modified organisms (GMOs) with beneficial characteristics. 
Such characteristics could include the ability to grow in poor soils, resis- 
tance to natural climatic variation, resistance to pests, and nutritional forti- 
fication. These crops might be most useful in developing countries. The use 
of genetically modified organisms is highly controversial at this point 
because of fears of unexpected side effects. 

The first GMO to come to market was a tomato characterized by 
delayed ripening, rendering it ideal for shipment. Pectin is a polysaccha- 
ride that gives tomatoes their firmness and is naturally destroyed by the 
enzyme polygalacturonase. As pectin is destroyed, the tomatoes soften, 
making shipment difficult. DNA was introduced that disrupts the polyga- 
lacturonase gene. Less of the enzyme was produced, and the tomatoes 
stayed fresh longer. However, the tomato’s poor taste hindered its com- 
mercial success. 


Human gene therapy holds great promise for medicine 


The field of gene therapy attempts to express specific genes within the
human body in such a way that beneficial results are obtained. The
gene targeted for expression may be already present or specially introduced.
Alternatively, gene therapy may attempt to modify genes containing
sequence variations that have harmful consequences. A tremendous amount
of research remains to be done before gene therapy becomes practical.
Nonetheless, considerable progress has been made. For example, some
people lack functional genes for adenosine deaminase and succumb to
infections if exposed to a normal environment, a condition called severe
combined immunodeficiency (SCID). Functional genes for this enzyme have been
introduced by using gene-therapy vectors based on retroviruses. Although
these vectors have produced functional enzyme and reduced the clinical
symptoms, challenges remain. These challenges include increasing the
longevity of the effects and eliminating unwanted side effects. Future
research promises to transform gene therapy into an important tool for
clinical medicine.

Figure 5.37 Ti plasmids. Agrobacteria containing Ti plasmids can deliver
foreign genes into some plant cells. [After M. Chilton. A vector for
introducing new genes into plants. Copyright © 1983 by Scientific American,
Inc. All rights reserved.]

Figure 5.38 Electroporation. Foreign DNA can be introduced into plant cells
by electroporation, the application of intense electric fields to make their
plasma membranes transiently permeable.


Summary 


5.1 The Exploration of Genes Relies on Key Tools 

The recombinant DNA revolution in biology is rooted in the repertoire 
of enzymes that act on nucleic acids. Restriction enzymes are a key 
group among them. These endonucleases recognize specific base 
sequences in double-helical DNA and cleave both strands of the 
duplex, forming specific fragments of DNA. These restriction frag- 
ments can be separated and displayed by gel electrophoresis. The pat- 
tern of these fragments on the gel is a fingerprint of a DNA molecule. 
A DNA fragment containing a particular sequence can be identified by 
hybridizing it with a labeled single-stranded DNA probe (Southern 
blotting). 
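
As an illustration of how a restriction digest yields a characteristic set of fragments, the short sketch below cuts one strand of a linear DNA sequence, represented as a string, at every occurrence of a recognition site. It assumes the enzyme EcoRI, which recognizes GAATTC and cuts after the first G; the function name `digest` and the example sequence are made up for illustration, and only the top strand is modeled.

```python
from __future__ import annotations


def digest(dna: str, site: str = "GAATTC", cut_offset: int = 1) -> list[str]:
    """Cut a linear DNA string at each occurrence of `site`.

    `cut_offset` is the number of bases of the site that stay on the
    upstream fragment (1 for EcoRI, which cuts G^AATTC).
    """
    fragments = []
    start = 0
    pos = dna.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(dna[start:cut])
        start = cut
        pos = dna.find(site, pos + 1)
    fragments.append(dna[start:])  # remainder after the last cut
    return fragments


if __name__ == "__main__":
    # Hypothetical sequence containing two EcoRI sites.
    example = "AAGAATTCTTTTGAATTCGG"
    pieces = digest(example)
    # Sorting by length mimics the order of bands on a gel.
    for piece in sorted(pieces, key=len, reverse=True):
        print(len(piece), piece)
```

The list of fragment lengths plays the role of the band pattern on the gel: a different sequence, or a different enzyme, gives a different list.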

Rapid sequencing techniques have been developed to further the 
analysis of DNA molecules. DNA can be sequenced by controlled 
interruption of replication. The fragments produced are separated by 
gel electrophoresis and visualized by autoradiography of a ³²P label
at the 5’ end or by fluorescent tags. 

DNA probes for hybridization reactions, as well as new genes, can 
be synthesized by the automated solid-phase method. The technique is 
to add deoxyribonucleoside 3’-phosphoramidites to one another to form 
a growing chain that is linked to an insoluble support. DNA chains a 
hundred nucleotides long can be readily synthesized. The polymerase 
chain reaction makes it possible to greatly amplify specific segments of 
DNA in vitro. The region amplified is determined by the placement of 
a pair of primers that are added to the target DNA along with a thermo- 
stable DNA polymerase and deoxyribonucleoside triphosphates. The 
exquisite sensitivity of PCR makes it a choice technique in detecting 
pathogens and cancer markers, in genotyping, and in reading DNA 
from fossils that are many thousands of years old. 
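
To make concrete how the placement of the primer pair defines the amplified region, the following sketch performs a simple "in silico" PCR on a template string. It is an idealized model under stated assumptions: primers must match exactly, the forward primer matches the top strand as written, the reverse primer is the reverse complement of the top strand at the far end of the region, and only the first match of each primer is used. The names `reverse_complement` and `amplicon` and the sequences are hypothetical.

```python
from typing import Optional

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna))


def amplicon(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the top-strand segment bounded by the two primers, or None.

    The product runs from the start of the forward-primer site to the end
    of the site complementary to the reverse primer.
    """
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


if __name__ == "__main__":
    # Made-up template: flanking sequence, target region, flanking sequence.
    template = "TTTT" + "ATGGCC" + "AAAAAA" + "GGTTCA" + "TTTT"
    forward_primer = "ATGGCC"
    reverse_primer = reverse_complement("GGTTCA")
    print(amplicon(template, forward_primer, reverse_primer))
```

Moving either primer site changes the boundaries of the returned segment, which is the sense in which the primers determine the region amplified.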


5.2 Recombinant DNA Technology Has Revolutionized All Aspects of Biology

New genes can be constructed in the laboratory, introduced into host 
cells, and expressed. Novel DNA molecules are made by joining frag- 
ments that have complementary cohesive ends produced by the action 
of a restriction enzyme. DNA ligase seals breaks in DNA chains. 
Vectors for propagating the DNA include plasmids, λ phage, and bacterial
and yeast artificial chromosomes. Specific genes can be cloned from
a genomic library with the use of a DNA or RNA probe. Foreign DNA 
can be expressed after insertion into prokaryotic and eukaryotic cells by 
the appropriate vector. Specific mutations can be generated in vitro to 
engineer novel proteins. A mutant protein with a single amino acid
substitution can be produced by priming DNA replication with an
oligonucleotide encoding the new amino acid. Plasmids can be engineered to
permit the facile insertion of a DNA cassette containing any desired
mutation. The techniques of protein and nucleic acid chemistry are
highly synergistic. Investigators now move back and forth between gene
and protein with great facility.


5.3 Complete Genomes Have Been Sequenced and Analyzed

The sequences of many important genomes are known in their entirety.
More than 100 bacterial and archaeal genomes have been sequenced,
including those from key model organisms and important pathogens.
The sequence of the human genome has now been completed with
nearly full coverage and high precision. Only from 20,000 to 25,000
protein-encoding genes appear to be present in the human genome, a
substantially smaller number than earlier estimates. Comparative
genomics has become a powerful tool for analyzing individual genomes
and for exploring evolution. Genomewide gene-expression patterns can
be examined through the use of DNA microarrays.


5.4 Eukaryotic Genes Can Be Quantitated and Manipulated with Considerable Precision

Changes in gene expression can be readily determined by such tech- 
niques as quantitative PCR and hybridization to microarrays. The 
production of transgenic mice carrying mutations known to cause ALS 
in humans has been a source of considerable insight into the disease 
mechanism and its possible treatment. The functions of particular 
genes can be investigated by disruption. One method of disrupting the 
expression of a particular gene is through RNA interference, which 
depends on the introduction of specific double-stranded RNA mole- 
cules into eukaryotic cells. New DNA can be brought into plant cells by 
the soil bacterium Agrobacterium tumefaciens, which harbors Ti plas- 
mids. DNA can also be introduced into plant cells by applying intense 
electric fields, which render them transiently permeable to very large 
molecules, or by bombarding them with DNA-coated microparticles. 
Gene therapy holds great promise for clinical medicine, but many chal- 
lenges remain. 


Problems


1. Reading sequences. An autoradiogram of a sequencing gel
containing four lanes of DNA fragments is shown in
the adjoining illustration. (a) What is the sequence of the
DNA fragment? (b) Suppose that the Sanger dideoxy
method shows that the template strand sequence is
5′-TGCAATGGC-3′. Sketch the gel pattern that would
lead to this conclusion.

[Illustration: autoradiogram of a four-lane sequencing gel, labeled "Termination"]

2. The right template. Ovalbumin is the major protein of
egg white. The chicken ovalbumin gene contains eight
exons separated by seven introns. Should ovalbumin cDNA
or ovalbumin genomic DNA be used to form the protein in
E. coli? Why?


3. Handle with care. Ethidium bromide is a commonly used
stain for DNA molecules after separation by gel electrophoresis.
The chemical structure of ethidium bromide is shown
here. Based on this structure, suggest how this stain binds
to DNA.

[Structure of ethidium bromide]


4. Cleavage frequency. The restriction enzyme AluI cleaves
at the sequence 5’-AGCT-3’, and NotI cleaves at 
5’-GCGGCCGC-3’. What would be the average distance 
between cleavage sites for each enzyme on digestion of 
double-stranded DNA? Assume that the DNA contains 
equal proportions of A, G, C, and T. 


5. The right cuts. Suppose that a human genomic library is 
prepared by exhaustive digestion of human DNA with the 
EcoRI restriction enzyme. Fragments averaging about 4 kb 
in length would be generated. Is this procedure suitable for 
cloning large genes? Why or why not? 


6. A revealing cleavage. Sickle-cell anemia arises from a 
mutation in the gene for the β chain of human hemoglobin.
The change from GAG to GTG in the mutant eliminates a 
cleavage site for the restriction enzyme MstII, which recog- 
nizes the target sequence CCTGAGG. These findings 
form the basis of a diagnostic test for the sickle-cell gene. 
Propose a rapid procedure for distinguishing between the 
normal and the mutant gene. Would a positive result prove 
that the mutant contains GTG in place of GAG? 


7. Sticky ends? The restriction enzymes KpnI and Acc65I 
recognize and cleave the same 6-bp sequence. However, the 
sticky end formed from KpnI cleavage cannot be ligated 
directly to the sticky end formed from Acc65I cleavage. 
Explain why. 


KpnI:    5′-GGTAC↓C-3′        Acc65I:  5′-G↓GTACC-3′
         3′-C↑CATGG-5′                 3′-CCATG↑G-5′


8. Many melodies from one cassette. Suppose that you have 
isolated an enzyme that digests paper pulp and have obtained 
its cDNA. The goal is to produce a mutant that is effective 
at high temperature. You have engineered a pair of unique 
restriction sites in the cDNA that flank a 30-bp coding 
region. Propose a rapid technique for generating many dif- 
ferent mutations in this region. 


9. A blessing and a curse. The power of PCR can also create 
problems. Suppose someone claims to have isolated dino- 
saur DNA by using PCR. What questions might you ask to 
determine if it is indeed dinosaur DNA? 


10. Rich or poor? DNA sequences that are highly enriched 
in G-C base pairs typically have high melting tempera- 
tures. Moreover, once separated, single strands containing 
these regions can form rigid secondary structures. How 
might the presence of G–C-rich regions in a DNA template
affect PCR amplification? 


11. Questions of accuracy. The stringency of PCR amplifi- 
cation can be controlled by altering the temperature at 
which the primers and the target DNA undergo hybridiza- 
tion. How would altering the temperature of hybridization 
affect the amplification? Suppose that you have a particular 
yeast gene A and that you wish to see if it has a counterpart
in humans. How would controlling the stringency of the
hybridization help you?


12. Terra incognita. PCR is typically used to amplify 
DNA that lies between two known sequences. Suppose 
that you want to explore DNA on both sides of a single 
known sequence. Devise a variation of the usual PCR pro- 
tocol that would enable you to amplify entirely new 
genomic terrain. 


13. A puzzling ladder. A gel pattern displaying PCR prod- 
ucts shows four strong bands. The four pieces of DNA have 
lengths that are approximately in the ratio of 1:2:3:4. The 
largest band is cut out of the gel, and PCR is repeated with 
the same primers. Again, a ladder of four bands is evident 
in the gel. What does this result reveal about the structure 
of the encoded protein? 


14. Chromosome walking. Propose a method for isolating a 
DNA fragment that is adjacent in the genome to a previ- 
ously isolated DNA fragment. Assume that you have access 
to a complete library of DNA fragments in a BAC vector 
but that the sequence of the genome under study has not 
yet been determined. 


15. Probe design. Which of the following amino acid 
sequences would yield the best oligonucleotide
probe? 


Ala-Met-Ser-Leu-Pro-Trp 
Gly-Trp-Asp-Met-His-Lys 
Cys-Val-Trp-Asn-Lys-Ile 
Arg-Ser-Met-Leu-Gln-Asn 


16. Man’s best friend. Why might the genomic analysis 
of dogs be particularly useful for investigating the 
genes responsible for body size and other physical 
characteristics? 


17. Of mice and men. You have identified a gene that is 
located on human chromosome 20 and wish to identify its 
location within the mouse genome. On which chromosome 
would you be most likely to find the mouse counterpart of 
this gene? 


Chapter Integration Problems 


18. Designing primers I. A successful PCR experiment 
often depends on designing the correct primers. In particular,
the Tm for each primer should be approximately the
same. What is the basis of this requirement? 


19. Designing primers II. You wish to amplify a segment
of DNA from a plasmid template by PCR with the use of
the following primers: 5′-GGATCGATGCTCGCGA-3′
and 5′-AGGATCGGGTCGCGAG-3′. Despite repeated
attempts, you fail to observe a PCR product of the expected 
length after electrophoresis on an agarose gel. Instead, you 
observe a bright smear on the gel with an approximate 
length of 25 to 30 base pairs. Explain these results. 


Chapter Integration and Data Interpretation Problem 


20. Any direction but east. A series of people are found to 
have difficulty eliminating certain types of drugs from 
their bloodstreams. The problem has been linked to a gene 
X, which encodes an enzyme Y. Six people were tested 
with the use of various techniques of molecular biology. 
Person A is a normal control, person B is asymptomatic 
but some of his children have the metabolic problem, and 
persons C through F display the trait. Tissue samples from 
each person were obtained. Southern analysis was per- 
formed on the DNA after digestion with the restriction 
enzyme HindIII. Northern analysis of mRNA also was 
done. In both types of analysis, the gels were probed with 
labeled X cDNA. Finally, a western blot with an enzyme- 
linked monoclonal antibody was used to test for the pres- 
ence of protein Y. The results are shown here. Why is 
person B without symptoms? Suggest possible defects in 
the other people. 


[Figure: Southern, Northern, and western blots for persons A through F]

Data Interpretation Problems


21. DNA diagnostics. Representations of sequencing gels 
for variants of the α chain of human hemoglobin are shown
here. What is the nature of the amino acid change in each of 
the variants? The first triplet encodes valine. 


[Figure: sequencing gels with lanes G, A, T, and C for four hemoglobin
types: Normal, Chongqing, Karachi, and Swan River]


22. Two peaks. In the course of studying a gene and its possible
mutation in humans, you obtain genomic DNA samples
from a collection of persons and PCR amplify a region
of interest within this gene. For one of the samples, you
obtain the sequencing chromatogram shown here. Provide
an explanation for the appearance of these data at position
49 (indicated by the arrow):

[Chromatogram; base calls shown: ATTAG GNGGTATGTA, with position 50 marked]


Animated Techniques 

Visit www.whfreeman.com/Berg7e to see animations
of Dideoxy Sequencing of DNA, Polymerase Chain 
Reaction, Synthesizing an Oligonucleotide Array, 
Screening an Oligonucleotide Array for Patterns of Gene 
Expression, Plasmid Cloning, In Vitro Mutagenesis of 
Cloned Genes, Creating a Transgenic Mouse. [Courtesy of 
H. Lodish et al., Molecular Cell Biology, 5th ed. 
(W. H. Freeman and Company, 2004).]


CHAPTER 6

Exploring Evolution and Bioinformatics


Evolutionary relationships are manifest in protein sequences. 
The close kinship between human beings and chimpanzees, 
hinted at by the mutual interest shown by Jane Goodall and 
a chimpanzee in the photograph, is revealed in the amino 
acid sequences of myoglobin. The human sequence (red) 
differs from the chimpanzee sequence (blue) in only one 
amino acid in a protein chain of 153 residues. [(Left) 
Kennan Ward/Corbis.] 

Like members of a human family, members of molecular families often
have features in common. Such family resemblance is most easily detected 
by comparing three-dimensional structure, the aspect of a molecule most 
closely linked to function. Consider as an example ribonuclease from cows, 
which was introduced in our consideration of protein folding (Section 2.6). 
Comparing structures reveals that the three-dimensional structure of this 
protein and that of a human ribonuclease are quite similar (Figure 6.1). 
Although the degree of overlap between these two structures is not unex- 
pected, given their nearly identical biological functions, similarities revealed 
by other such comparisons are sometimes surprising. For example, angio- 
genin, a protein that stimulates the growth of new blood vessels, also turns 
out to be structurally similar to ribonuclease—so similar that both angio- 
genin and ribonuclease are clearly members of the same protein family 
(Figure 6.2). Angiogenin and ribonuclease must have had a common ances- 
tor at some earlier stage of evolution. 

Figure 6.1 Structures of ribonucleases from cows and human beings.
Structural similarity often follows functional similarity. [Drawn from
8RAT.pdb and 2RNF.pdb.]

Figure 6.2 Structure of angiogenin. The protein angiogenin, identified on
the basis of its ability to stimulate blood-vessel growth, is highly similar
in three-dimensional structure to ribonuclease. [Drawn from 2ANG.pdb.]

Three-dimensional structures have been determined for only a small
proportion of the total number of proteins. In contrast, gene sequences and
the corresponding amino acid sequences are available for a great number of
proteins, largely owing to the tremendous power of DNA cloning and
sequencing techniques including applications to complete-genome sequencing.
Evolutionary relationships also are manifest in amino acid sequences.
For example, 35% of the amino acids in corresponding positions are
identical in the sequences of bovine ribonuclease and angiogenin. Is this
level sufficiently high to ensure an evolutionary relationship? If not, what
level is required? In this chapter, we shall examine the methods that are
used to compare amino acid sequences and to deduce such evolutionary
relationships.
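
A figure such as "35% identical" comes from counting matching residues at corresponding positions of two aligned sequences. The sketch below shows that count for a pair of already aligned sequences of equal length. It is only an illustration: real comparisons must first construct the alignment, and conventions for handling gaps differ. Here, positions at which either sequence has a gap character (`-`) are skipped, which is an assumption, and the function name `percent_identity` and the short peptide strings are invented.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of compared positions at which two aligned sequences match.

    Positions where either sequence has a gap are not counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped positions (an assumed convention)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Two made-up aligned peptide fragments in one-letter code.
    print(percent_identity("ACDE", "ACDF"))
```

Whether a given percentage is high enough to imply common ancestry is a separate statistical question, which this count alone does not answer.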

Sequence-comparison methods have become powerful tools in modern bio- 
chemistry. Sequence databases can be probed for matches to a newly eluci- 
dated sequence to identify related molecules. This information can often be 
a source of considerable insight into the function and mechanism of the 
newly sequenced molecule. When three-dimensional structures are avail- 
able, they can be compared to confirm relationships suggested by sequence 
comparisons and to reveal others that are not readily detected at the level of 
sequence alone. 

By examining the footprints present in modern protein sequences, the 
biochemist can become a molecular archeologist able to learn about events 
in the evolutionary past. Sequence comparisons can often reveal pathways 
of evolutionary descent and estimated dates of specific evolutionary land- 
marks. This information can be used to construct evolutionary trees that 
trace the evolution of a particular protein or nucleic acid in many cases from 
Archaea and Bacteria through Eukarya, including human beings. Molecular 
evolution can also be studied experimentally. In some cases, DNA from 
fossils can be amplified by PCR methods and sequenced, giving a direct 
view into the past. In addition, investigators can observe molecular evolu- 
tion taking place in the laboratory, through experiments based on nucleic 
acid replication. The results of such studies are revealing more about how 
evolution proceeds. 


6.1 Homologs Are Descended from a Common Ancestor 


The exploration of biochemical evolution consists largely of an attempt to 
determine how proteins, other molecules, and biochemical pathways have 
been transformed through time. The most fundamental relationship between 
two entities is homology; two molecules are said to be homologous if they 
have been derived from a common ancestor. Homologous molecules, or 
homologs, can be divided into two classes (Figure 6.3). Paralogs are homologs
that are present within one species. Paralogs often differ in their detailed
biochemical functions. Orthologs are homologs that are present within
different species and have very similar or identical functions. Understanding
the homology between molecules can reveal the evolutionary history of the
molecules as well as information about their function; if a newly sequenced
protein is homologous to an already characterized protein, we have a strong
indication of the new protein’s biochemical function.

Figure 6.3 Two classes of homologs. Homologs that perform identical or very
similar functions in different species are called orthologs, whereas homologs
that perform different functions within one species are called paralogs.

How can we tell whether two human proteins are paralogs or whether a 
yeast protein is the ortholog of a human protein? As will be discussed in 
Section 6.2, homology is often detectable by significant similarity in nucleotide 
or amino acid sequence and almost always manifested in three-dimensional 
structure. 


6.2 Statistical Analysis of Sequence Alignments Can 
Detect Homology 


A significant sequence similarity between two molecules implies that they 
are likely to have the same evolutionary origin and, therefore, similar three- 
dimensional structures, functions, and mechanisms. Both nucleic acid and 
protein sequences can be compared to detect homology. However, the pos- 
sibility exists that the observed agreement between any two sequences is 
solely a product of chance. Because nucleic acids are composed of fewer 
building blocks than proteins (4 bases versus 20 amino acids), the likelihood 
of random agreement between two DNA or RNA sequences is significantly 
greater than that for protein sequences. For this reason, detection of homol- 
ogy between protein sequences is typically far more effective. 
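
The effect of alphabet size on chance agreement can be illustrated with a short calculation. The sketch below assumes, purely for illustration, that every building block is equally frequent and that positions are independent, so two unrelated sequences agree at any one position with probability 1/4 for nucleic acids and 1/20 for proteins. Real sequences have uneven compositions, so these numbers are only approximate; the function name is hypothetical.

```python
from fractions import Fraction

def chance_identity(alphabet_size: int) -> Fraction:
    """Probability that two randomly chosen residues are identical,
    assuming all residues are equally frequent (a simplification)."""
    return Fraction(1, alphabet_size)

# Expected number of chance identities over 100 compared positions.
length = 100
expected_nucleic = length * chance_identity(4)    # 4 bases
expected_protein = length * chance_identity(20)   # 20 amino acids

print(f"nucleic acid: {float(expected_nucleic):.0f} chance identities per {length} positions")
print(f"protein:      {float(expected_protein):.0f} chance identities per {length} positions")
```

Under these assumptions, a comparison of unrelated nucleic acid sequences yields about five times as many chance identities as a comparison of unrelated protein sequences of the same length.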

To illustrate sequence-comparison methods, let us consider a class 
of proteins called the globins. Myoglobin is a protein that binds oxygen 
in muscle, whereas hemoglobin is the oxygen-carrying protein in blood 
(Chapter 7). Both proteins cradle a heme group, an iron-containing organic 
molecule that binds the oxygen. Each human hemoglobin molecule is composed
of four heme-containing polypeptide chains, two identical α chains
and two identical β chains. Here, we consider only the α chain. To examine
the similarity between the amino acid sequence of the human α chain and
that of human myoglobin (Figure 6.4), we apply a method, referred to as a
sequence alignment, in which the two sequences are systematically aligned
with respect to each other to identify regions of significant overlap.

Figure 6.4 Amino acid sequences of human hemoglobin (α chain) and human
myoglobin. α-Hemoglobin is composed of 141 amino acids; myoglobin consists
of 153 amino acids. (One-letter abbreviations designating amino acids are
used; see Table 2.2.)

Human hemoglobin (α chain)

VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHG
SAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLS
HCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR

Human myoglobin

EDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVK

How can we tell where to align the two sequences? In the course of 
evolution, the sequences of two proteins that have an ancestor in common 
will have diverged in a variety of ways. Insertions and deletions may have 
occurred at the ends of the proteins or within the functional domains them- 
selves. Individual amino acids may have been mutated to other residues of 
varying degrees of similarity. To understand how the methods of sequence 
alignment take these potential sequence variations into account, let us first 
consider the simplest approach, where we slide one sequence past the other, 
one amino acid at a time, and count the number of matched residues, or 
sequence identities (Figure 6.5). For α-hemoglobin and myoglobin, the best
alignment reveals 23 sequence identities, spread throughout the central
parts of the sequences.

Figure 6.5 Comparing the amino acid sequences of α-hemoglobin and
myoglobin. (A) A comparison is made by sliding the sequences of the two
proteins past each other, one amino acid at a time, and counting the number
of amino acid identities between the proteins. (B) The two alignments with
the largest number of matches are shown above the graph, which plots the
matches as a function of alignment.
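
The sliding comparison just described can be sketched in a few lines of Python. This is a minimal illustration rather than the procedure used to produce Figure 6.5: the function names are hypothetical, no gaps are allowed, and the two short sequences are invented toy examples, not the globins.

```python
def count_identities(seq_a: str, seq_b: str, offset: int) -> int:
    """Count identical residues when seq_b is shifted right by `offset`
    positions relative to seq_a (negative offsets shift it left)."""
    matches = 0
    for i, residue in enumerate(seq_b):
        j = i + offset  # position in seq_a that faces seq_b[i]
        if 0 <= j < len(seq_a) and seq_a[j] == residue:
            matches += 1
    return matches

def sliding_comparison(seq_a: str, seq_b: str) -> dict[int, int]:
    """Identity counts for every offset at which the sequences overlap."""
    return {
        offset: count_identities(seq_a, seq_b, offset)
        for offset in range(-(len(seq_b) - 1), len(seq_a))
    }

# Hypothetical toy sequences chosen so that one offset clearly wins.
seq_a = "GKVGAHAGEY"
seq_b = "AHAGEYGK"
scores = sliding_comparison(seq_a, seq_b)
best_offset = max(scores, key=scores.get)
print(best_offset, scores[best_offset])
```

Plotting `scores` against the offset gives the kind of graph shown in Figure 6.5B, with one peak for each alignment that brings many identical residues into register.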

However, careful examination of all the possible alignments and 
their scores suggests that important information regarding the relationship 
between myoglobin and hemoglobin α has been lost with this method. In
particular, we see that another alignment, featuring 22 identities, is nearly 
as good. This alignment is shifted by six residues relative to the preceding 
alignment and yields identities that are concentrated toward the amino- 
terminal end of the sequences. By introducing a gap into one of the sequences, 
the identities found in both alignments will be represented (Figure 6.6). 
Insertion of gaps allows the alignment method to compensate for the inser- 
tions or deletions of nucleotides that may have taken place in the gene for 
one molecule but not the other in the course of evolution. 

The use of gaps substantially increases the complexity of sequence 
alignment because a vast number of possible gaps, varying in both position 
and length, must be considered throughout each sequence. Moreover, the 
introduction of an excessive number of gaps can yield an artificially high 
number of identities. Nevertheless, methods have been developed for the 
insertion of gaps in the automatic alignment of sequences. These methods 
use scoring systems to compare different alignments, including penalties for 
gaps to prevent the insertion of an unreasonable number of them. Here is an 
example of such a scoring system: each identity between aligned sequences 
is counted as +10 points, whereas each gap introduced, regardless of size,
counts for −25 points. For the alignment shown in Figure 6.6, there are
38 identities (38 × 10 = 380) and 1 gap (1 × −25 = −25), producing a
score of 380 + (−25) = 355. Overall, there are 38 matched amino acids in
an average length of 147 residues; so the sequences are 25.9% identical. Next, 
we must determine the significance of this score and level of identity. 
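
The arithmetic of this example scoring system can be checked with a short sketch. The point values (+10 per identity, −25 per gap) and the counts for Figure 6.6 come from the text; the function names and the toy gapped alignment at the end are hypothetical and serve only to show how identities and gaps might be counted.

```python
IDENTITY_SCORE = 10   # points per identity (value from the text)
GAP_PENALTY = -25     # points per gap, regardless of its size (value from the text)

def count_identities_and_gaps(aligned_a: str, aligned_b: str) -> tuple[int, int]:
    """Count identical pairs and gaps in two gapped strings of equal length.
    A run of consecutive '-' characters counts as a single gap."""
    identities = gaps = 0
    in_gap = False
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            if not in_gap:
                gaps += 1
            in_gap = True
        else:
            in_gap = False
            if x == y:
                identities += 1
    return identities, gaps

def alignment_score(identities: int, gaps: int) -> int:
    return identities * IDENTITY_SCORE + gaps * GAP_PENALTY

def percent_identity(identities: int, len_a: int, len_b: int) -> float:
    """Identities as a percentage of the average length of the two sequences."""
    return 100 * identities / ((len_a + len_b) / 2)

# Figures quoted in the text for the alignment in Figure 6.6.
score = alignment_score(identities=38, gaps=1)          # 380 - 25
identity = percent_identity(38, len_a=141, len_b=153)   # average length 147
print(score, round(identity, 1))

# Hypothetical toy alignment, to show the counting step.
toy = count_identities_and_gaps("GKV--AHAG", "GKVEYAHSG")
print(toy, alignment_score(*toy))
```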

The statistical significance of alignments can be estimated by shuffling


The similarities in sequence in Figure 6.5 appear striking, yet there remains 
the possibility that a grouping of sequence identities has occurred by chance 
alone. Because proteins are composed of the same set of 20 amino acid mono- 
mers, the alignment of any two unrelated proteins will yield some identities, 
particularly if we allow the introduction of gaps. Even if two proteins have 
identical amino acid composition, they may not be linked by evolution. It is 
the order of the residues within their sequences that implies a relationship 
between them. Hence, we can assess the significance of our alignment by
“shuffling,” or randomly rearranging, one of the sequences (Figure 6.7),
repeating the sequence alignment, and determining a new alignment score. This
process is repeated many times to yield a histogram showing, for each
score, the number of shuffled sequences that received that score
(Figure 6.8). If the original score is not appreciably different from the scores
from the shuffled alignments, then we cannot exclude the possibility that the
original alignment is merely a consequence of chance.

Figure 6.6 Alignment with gap insertion. The alignment of α-hemoglobin and
myoglobin after a gap has been inserted into the hemoglobin α sequence.

Figure 6.7 The generation of a shuffled sequence.


When this procedure is applied to the sequences of myoglobin and
α-hemoglobin, the authentic alignment clearly stands out (see Figure 6.8). Its
score is far above the mean for the alignment scores based
on shuffled sequences. The probability that such a deviation
occurred by chance alone is approximately 1 in 10²⁰.
Thus, we can comfortably conclude that the two sequences
are genuinely similar; the simplest explanation for this
similarity is that these sequences are homologous, that is,
that the two molecules have descended by divergence from
a common ancestor.

Figure 6.8 Statistical comparison of alignment scores. Alignment
scores are calculated for many shuffled sequences, and the number of
sequences generating a particular score is plotted against the score. The
resulting plot is a distribution of alignment scores occurring by chance.
The alignment score for unshuffled α-hemoglobin and myoglobin
(shown in red) is substantially greater than any of these scores, strongly
suggesting that the sequence similarity is significant.
[Plot axes: alignment score (horizontal) versus number of alignments (vertical).]
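
The shuffling test can be sketched as follows. This is a simplified illustration under stated assumptions, not the analysis behind Figure 6.8: the score is the best identity count over all ungapped offsets (no gaps, no substitution matrix), the function names and the two short sequences are hypothetical, and the number of trials is kept small.

```python
import random

def best_ungapped_identities(seq_a: str, seq_b: str) -> int:
    """Highest identity count over all ungapped offsets (simplified score)."""
    best = 0
    for offset in range(-(len(seq_b) - 1), len(seq_a)):
        matches = sum(
            1
            for i, residue in enumerate(seq_b)
            if 0 <= i + offset < len(seq_a) and seq_a[i + offset] == residue
        )
        best = max(best, matches)
    return best

def shuffle_test(seq_a: str, seq_b: str, trials: int = 200, seed: int = 0):
    """Score the authentic pair, then score seq_a against shuffled copies of seq_b.
    Returns (observed score, shuffled scores, fraction of shuffles scoring as high)."""
    rng = random.Random(seed)          # seeded so that runs are repeatable
    observed = best_ungapped_identities(seq_a, seq_b)
    residues = list(seq_b)
    shuffled_scores = []
    for _ in range(trials):
        rng.shuffle(residues)          # same composition, random order
        shuffled_scores.append(best_ungapped_identities(seq_a, "".join(residues)))
    as_good = sum(s >= observed for s in shuffled_scores)
    return observed, shuffled_scores, as_good / trials

# Hypothetical toy sequences that differ at two positions.
seq_a = "MKTAYIAKQRQISFVKSHFSRQ"
seq_b = "MKTAYIAKQRNISFVKSHFSRE"
observed, shuffled_scores, fraction = shuffle_test(seq_a, seq_b)
print(observed, max(shuffled_scores), fraction)
```

A fraction of this kind cannot resolve probabilities smaller than one divided by the number of trials. Very small probabilities are instead estimated from how far the authentic score lies from the mean of the shuffled distribution.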


Distant evolutionary relationships can be detected 
through the use of substitution matrices 


The scoring scheme heretofore described assigns points 
only to positions occupied by identical amino acids in the 
two sequences being compared. No credit is given for any 
pairing that is not an identity. However, as already dis- 
cussed, two proteins related by evolution undergo amino 
acid substitutions as they diverge. A scoring system based 
solely on amino acid identity cannot account for these 
changes. To add greater sensitivity to the detection of 
evolutionary relationships, methods have been developed 
to compare two amino acids and assess their degree of 
similarity. 


Not all substitutions are equivalent. For example, 
amino acid changes can be classified as structurally 
conservative or nonconservative. A conservative substitution replaces one 
amino acid with another that is similar in size and chemical properties. 
Conservative substitutions may have only minor effects on protein struc- 
ture and often can be tolerated without compromising protein function. In 
contrast, in a nonconservative substitution, an amino acid is replaced by 
one that is structurally dissimilar. Amino acid changes can also be classi- 
fied by the fewest number of nucleotide changes necessary to achieve the 
corresponding amino acid change. Some substitutions arise from the
replacement of only a single nucleotide in the gene sequence, whereas others
require two or three replacements. Conservative and single-nucleotide
substitutions are likely to be more common than are substitutions with 
more radical effects. 

How can we account for the type of substitution when comparing 
sequences? We can approach this problem by first examining the substitu- 
tions that have actually taken place in evolutionarily related proteins. From 
an examination of appropriately aligned sequences, substitution matrices 
have been deduced. A substitution matrix describes a scoring system for the 
replacement of any amino acid with each of the other 19 amino acids. In 
these matrices, a large positive score corresponds to a substitution that 
occurs relatively frequently, whereas a large negative score corresponds to a 
substitution that occurs only rarely. A commonly used substitution matrix, 
the Blosum-62 (for Blocks of amino acid substitution matrix), is illustrated
in Figure 6.9.

Figure 6.9 A graphic view of the Blosum-62. This substitution matrix was derived by examining
substitutions within aligned sequence blocks in related proteins. Amino acids are classified into four
groups (charged, red; polar, green; large and hydrophobic, blue; other, black). Substitutions that
require the change of only a single nucleotide are shaded. Identities are boxed. To find the score for
a substitution of, for instance, a Y for an H, you find the Y in the column having H at the top and
check the number at the left. In this case, the resulting score is 2.

In this depiction, each column represents one of the 20 amino acids,
whereas the position of the single-letter codes within each column
specifies the score for the corresponding substitution. Notice
that scores corresponding to identity (the boxed codes at the top of each 
column) are not the same for each residue, owing to the fact that less fre- 
quently occurring amino acids such as cysteine (C) and tryptophan (W) 
will align by chance less often than the more common residues align.
Furthermore, structurally conservative substitutions such as lysine (K) for
arginine (R) and isoleucine (I) for valine (V) have relatively high scores, whereas
nonconservative substitutions such as lysine for tryptophan result in nega- 
tive scores (Figure 6.10). When two sequences are compared, each pair of 
aligned residues is assigned a score based on the matrix. In addition, gap 
penalties are often assessed. For example, the introduction of a single- 
residue gap lowers the alignment score by 12 points and the extension of an 
existing gap costs 2 points per residue. With the use of this scoring system, 
the alignment shown in Figure 6.6 receives a score of 115. In many regions, 
most substitutions are conservative (defined as those substitutions with
scores greater than 0) and relatively few are strongly disfavored types
(Figure 6.11).

Figure 6.10 Scoring of conservative and nonconservative substitutions. The
Blosum-62 indicates that a conservative substitution (lysine for arginine)
receives a positive score, whereas a nonconservative substitution (lysine for
tryptophan) is scored negatively. The matrix is depicted as an abbreviated
form of Figure 6.9. [Panel labels: substitution of lysine for arginine
(conservative), score = +2; substitution of lysine for tryptophan
(nonconservative), score = −3.]

Figure 6.11 Alignment with conservative substitutions noted. The alignment
of α-hemoglobin and myoglobin with conservative substitutions indicated by
yellow shading and identities by orange.

Figure 6.12 Alignment of identities only versus the Blosum-62. Repeated
shuffling and scoring reveal the significance of sequence alignment for human
myoglobin versus lupine leghemoglobin with the use of either (A) the simple,
identity-based scoring system or (B) the Blosum-62. The scores for the
alignment of the authentic sequences are shown in red. Accounting for amino
acid similarity in addition to identity reveals a greater separation between
the authentic alignment and the population of shuffled alignments.
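
Scoring an alignment with a substitution matrix and gap penalties, as described above, can be sketched as follows. The gap costs (12 points for the first residue of a gap, 2 points for each additional residue) and the K/R and K/W scores come from the text and Figure 6.10. The remaining entries are recalled from the standard published Blosum-62 matrix and cover only the pairs used in the example. The function names and the toy alignment are hypothetical.

```python
# Partial Blosum-62 table: only the pairs needed for the example below.
BLOSUM62_SUBSET = {
    ("K", "R"): 2,    # conservative substitution (Figure 6.10)
    ("K", "W"): -3,   # nonconservative substitution (Figure 6.10)
    ("I", "V"): 3,    # conservative substitution
    ("K", "K"): 5, ("R", "R"): 5, ("I", "I"): 4, ("V", "V"): 4,
    ("W", "W"): 11,   # rare residues receive high identity scores
}
GAP_OPEN = 12     # cost of the first residue of a gap (from the text)
GAP_EXTEND = 2    # cost of each additional residue in the same gap (from the text)

def pair_score(a: str, b: str) -> int:
    """Look up a substitution score; the matrix is symmetric."""
    if (a, b) in BLOSUM62_SUBSET:
        return BLOSUM62_SUBSET[(a, b)]
    return BLOSUM62_SUBSET[(b, a)]

def score_alignment(aligned_a: str, aligned_b: str) -> int:
    """Sum substitution scores over aligned pairs and subtract gap penalties."""
    total = 0
    in_gap = False
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            total -= GAP_EXTEND if in_gap else GAP_OPEN
            in_gap = True
        else:
            total += pair_score(a, b)
            in_gap = False
    return total

# Hypothetical toy alignment containing one two-residue gap.
toy_score = score_alignment("KIWK--R", "RVKKIVR")
print(toy_score)   # 2 + 3 - 3 + 5 - 12 - 2 + 5
```

A complete implementation would load all 210 distinct residue pairs of the matrix rather than this subset.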


This scoring system detects homology between less obviously related 
sequences with greater sensitivity than would a comparison of identities only. 
Consider, for example, the protein leghemoglobin, an oxygen-binding pro- 
tein found in the roots of some plants. The amino acid sequence of leghemo- 
globin from the herb lupine can be aligned with that of human myoglobin and 
scored by using either the simple scoring scheme based on identities only or 
the Blosum-62 (see Figure 6.9). Repeated shuffling and scoring provides a 
distribution of alignment scores (Figure 6.12). Scoring based on identities 
only indicates that the probability of the alignment between myoglobin and 
leghemoglobin occurring by chance alone is 1 in 20. Thus, although the level 
of similarity suggests a relationship, there is a 5% chance that the similarity is 
accidental on the basis of this analysis. In contrast, users of the substitution 
matrix are able to incorporate the effects of conservative substitutions. From 
such an analysis, the odds of the alignment occurring by chance are calculated 
to be approximately 1 in 300. Thus, an analysis performed by using the sub- 
stitution matrix reaches a much firmer conclusion about the evolutionary 
relationship between these proteins (Figure 6.13).

Figure 6.13 Alignment of human myoglobin and lupine leghemoglobin. The use of
Blosum-62 yields the alignment shown between human myoglobin and lupine leghemoglobin,
illustrating identities (orange boxes) and conservative substitutions (yellow). These sequences are
23% identical.


Experience with sequence analysis has led to the development of simpler 
rules of thumb. For sequences longer than 100 amino acids, sequence identi- 
ties greater than 25% are almost certainly not the result of chance alone; such 
sequences are probably homologous. In contrast, if two sequences are less 
than 15% identical, their alignment alone is unlikely to indicate statistically 
significant similarity. For sequences that are between 15 and 25% identical, 
further analysis is necessary to determine the statistical significance of the 
alignment. It must be emphasized that the lack of a statistically significant 
degree of sequence similarity does not rule out homology. The sequences of many 
proteins that have descended from common ancestors have diverged to such 
an extent that the relationship between the proteins can no longer be detected 
from their sequences alone. As we will see, such homologous proteins can 
often be detected by examining three-dimensional structures. 
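
These rules of thumb can be written as a small decision function. The thresholds (100 residues, 15%, 25%) come from the text; the function name, the wording of the returned labels, and the length of 150 used in the second call are assumptions made for illustration.

```python
def interpret_identity(percent_identity: float, length: int) -> str:
    """Apply the rule of thumb for sequences longer than 100 residues."""
    if length <= 100:
        return "rule of thumb does not apply (100 residues or fewer)"
    if percent_identity > 25:
        return "probably homologous"
    if percent_identity < 15:
        return "alignment alone unlikely to show significant similarity"
    return "further statistical analysis needed"

# Human alpha-hemoglobin versus myoglobin: 25.9% over an average of 147 residues.
print(interpret_identity(25.9, 147))
# Human myoglobin versus lupine leghemoglobin: 23% identical (length assumed).
print(interpret_identity(23.0, 150))
```

As the text stresses, a result in the lowest category does not rule out homology; it indicates only that sequence identity by itself gives no statistical support.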


Databases can be searched to identify homologous sequences 


When the sequence of a protein is first determined, comparing it with all 
previously characterized sequences can be a source of tremendous insight 
into its evolutionary relatives and, hence, its structure and function. Indeed, 
an extensive sequence comparison is almost always the first analysis per- 
formed on a newly elucidated sequence. The sequence-alignment methods 
just described are used to compare an individual sequence with all members 
of a database of known sequences. 

Database searches for homologous sequences are most often accom- 
plished by using resources available on the Internet at the National Center 
for Biotechnology Information (www.ncbi.nih.gov). The procedure used is 
referred to as a BLAST (Basic Local Alignment Search Tool) search. An 
amino acid sequence is typed or pasted into the Web browser, and a search 
is performed, most often against a nonredundant database of all known 
sequences. At the end of 2009, this database included more than 10 million 
sequences. A BLAST search yields a list of sequence alignments, each 
accompanied by an estimate giving the likelihood that the alignment occurred 
by chance (Figure 6.14). 

In 1995, investigators reported the first complete sequence of the genome 
of a free-living organism, the bacterium Haemophilus influenzae. With the
sequences available, they performed a BLAST search with each deduced 
protein sequence. Of 1,743 identified protein-coding regions, also called open 
reading frames, 1,007 (58%) could be linked to some protein of known func- 
tion that had been previously characterized in another organism. An addi- 
tional 347 open reading frames could be linked to sequences in the database 
for which no function had yet been assigned (“hypothetical proteins”). The
remaining 389 sequences did not match any sequence present in the database 
at that time. Thus, investigators were able to identify likely functions for more 
than half the proteins within this organism solely by sequence comparisons.

Figure 6.14 BLAST search results. Part of the results from a BLAST search of the nonredundant
(nr) protein sequence database using the sequence of ribose 5-phosphate isomerase (also called
phosphopentose isomerase, Chapter 20) from E. coli as a query. Among the thousands of sequences
found is the orthologous sequence from human beings, and the alignment between these sequences
is shown (highlighted in yellow). The number of sequences with this level of similarity expected to be
in the database by chance is 1 × 10⁻²³ as shown by the E value (highlighted in red). Because this
value is much less than 1, the observed sequence alignment is highly significant.


6.3 Examination of Three-Dimensional Structure Enhances
Our Understanding of Evolutionary Relationships 


Sequence comparison is a powerful tool for extending our knowledge of 
protein function and kinship. However, biomolecules generally function as 
intricate three-dimensional structures rather than as linear polymers. 
Mutations occur at the level of sequence, but the effects of the mutations are 
at the level of function, and function is directly related to tertiary structure. 
Consequently, to gain a deeper understanding of evolutionary relationships
between proteins, we must examine three-dimensional structures, especially
in conjunction with sequence information. The techniques of structural
determination are presented in Chapter 3.


Tertiary structure is more conserved than primary structure 


Because three-dimensional structure is much more closely associated with 
function than is sequence, tertiary structure is more evolutionarily conserved 
than is primary structure. This conservation is apparent in the tertiary 
structures of the globins (Figure 6.15), which are extremely similar even 
though the similarity between human myoglobin and lupine leghemoglobin 
is just barely detectable at the sequence level and that between human 
α-hemoglobin and lupine leghemoglobin is not statistically significant
(15.6% identity). This structural similarity firmly establishes that the
framework that binds the heme group and facilitates the reversible binding
of oxygen has been conserved over a long evolutionary period.


Anyone aware of the similar biochemical functions of hemoglobin, myoglobin,
and leghemoglobin could expect the structural similarities. In a
growing number of other cases, however, a comparison of three-dimensional
structures has revealed striking similarities between proteins that were not
expected to be related, on the basis of their diverse functions. A case in point
is the pair formed by actin, a major component of the cytoskeleton
(Section 35.2), and heat shock protein 70 (Hsp-70), which assists protein
folding inside cells. These two proteins were found to be noticeably similar
in structure despite only 15.6% sequence identity (Figure 6.16). On the basis
of their three-dimensional structures, actin and Hsp-70 are paralogs. The
level of structural similarity strongly suggests that, despite their different
biological roles in modern organisms, these proteins descended from a common
ancestor. As the three-dimensional structures of more proteins are determined,
such unexpected kinships are being discovered with increasing frequency.
The search for such kinships relies ever more frequently on computer-based
searches that are able to compare the three-dimensional structure of any
protein with all other known structures.

Figure 6.15 Conservation of three-dimensional structure. The tertiary
structures of human hemoglobin (α chain), human myoglobin, and lupine
leghemoglobin are conserved. Each heme group contains an iron atom to which
oxygen binds. [Drawn from 1HBB.pdb, 1MBD.pdb, and 1GDJ.pdb.]

Figure 6.16 Structures of actin and a large fragment of heat shock protein
70 (Hsp-70). A comparison of the identically colored elements of secondary
structure reveals the overall similarity in structure despite the difference
in biochemical activities. [Drawn from 1ATN.pdb and 1ATR.pdb.]


Knowledge of three-dimensional structures can aid in the evaluation of 
sequence alignments 


The sequence-comparison methods described thus far treat all positions 
within a sequence equally. However, we know from examining families of 
homologous proteins for which at least one three-dimensional structure is 
known that regions and residues critical to protein function are more 
strongly conserved than are other residues. For example, each type of 
globin contains a bound heme group with an iron atom at its center. A 
histidine residue that interacts directly with this iron atom (residue 64 in 
human myoglobin) is conserved in all globins. After we have identified 
key residues or highly conserved sequences within a family of proteins, we 
can sometimes identify other family members even when the overall level 
of sequence similarity is below statistical significance. Thus it may be use- 
ful to generate a sequence template—a map of conserved residues that are 
structurally and functionally important and are characteristic of particular 
families of proteins, which makes it possible to recognize new family 
members that might be undetectable by other means. A variety of other 
methods for sequence classification that take advantage of known three- 
dimensional structures also are being developed. Still other methods are 
able to identify conserved residues within a family of homologous pro- 
teins, even without a known three-dimensional structure. These methods 
often use substitution matrices that differ at each position within a family 
of aligned sequences. Such methods can often detect quite distant evolu- 
tionary relationships. 
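
The idea of a sequence template can be illustrated with a minimal sketch. It assumes that the family members have already been aligned to equal length and that a candidate sequence has been placed in the same column frame. The function names and the three short sequences are hypothetical, and real methods use position-specific scores rather than the strict all-or-nothing conservation used here.

```python
def conserved_positions(aligned_family: list[str]) -> dict[int, str]:
    """Map column index to residue for columns identical in every sequence."""
    template = {}
    for column, residues in enumerate(zip(*aligned_family)):
        if len(set(residues)) == 1 and residues[0] != "-":
            template[column] = residues[0]
    return template

def matches_template(candidate: str, template: dict[int, str]) -> bool:
    """True if the candidate carries every conserved residue of the template."""
    return all(
        column < len(candidate) and candidate[column] == residue
        for column, residue in template.items()
    )

# Hypothetical pre-aligned family of three short sequences.
family = ["GHAKLHGT", "AHSKVHGS", "GHTKIHGA"]
template = conserved_positions(family)
print(template)
print(matches_template("PHEKMHGQ", template))   # shares all conserved residues
print(matches_template("PHEKMYGQ", template))   # lacks one conserved histidine
```

In this toy case the first candidate is identical to the family members at few positions overall, yet it is recognized because it retains every residue in the template.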


Repeated motifs can be detected by aligning sequences 
with themselves 


More than 10% of all proteins contain sets of two or more domains that are 
similar to one another. Sequence search methods can often detect inter- 
nally repeated sequences that have been characterized in other proteins. 
Often, however, repeated units do not correspond to previously identified 
domains. In these cases, their presence can be detected by attempting to 
align a given sequence with itself. The statistical significance of such 
repeats can be tested by aligning the regions in question as if these regions 
were sequences from separate proteins. For the TATA-box-binding pro- 
tein, a key protein in controlling gene transcription (Section 29.2), such an 
alignment is highly significant: 30% of the amino acids are identical over
90 residues (Figure 6.17A). The estimated probability of such an alignment
occurring by chance is 1 in 10¹³. The determination of the three-dimensional
structure of the TATA-box-binding protein confirmed the
presence of repeated structures; the protein is formed of two nearly identi- 
cal domains (Figure 6.17B). The evidence is convincing that the gene 
encoding this protein evolved by duplication of a gene encoding a single 
domain. 
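
Aligning a sequence with itself can be sketched by comparing the sequence with a copy shifted by every nonzero offset and looking for an offset with an unusually large number of identities. This is an ungapped simplification; the function name and the toy sequence, which contains two similar eight-residue segments, are hypothetical.

```python
def self_comparison(sequence: str) -> dict[int, int]:
    """Identity counts between a sequence and a copy of itself shifted by
    each offset. Offset 0 (the trivial perfect match) is skipped."""
    n = len(sequence)
    return {
        offset: sum(sequence[i] == sequence[i + offset] for i in range(n - offset))
        for offset in range(1, n)
    }

# Hypothetical toy sequence: two related segments separated by a short linker.
sequence = "M" + "AKVLTGEH" + "PQ" + "AKILTGDH" + "S"
counts = self_comparison(sequence)
repeat_offset = max(counts, key=counts.get)
print(repeat_offset, counts[repeat_offset])
```

The offset with the most identities gives the spacing between the candidate repeats. Their significance can then be tested as the text describes, by treating the two segments as separate sequences and applying the shuffling procedure.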

Convergent evolution illustrates common solutions to
biochemical challenges


Thus far, we have been exploring proteins derived from common ancestors— 
that is, through divergent evolution. Other cases have been found of proteins 
that are structurally similar in important ways but are not descended from a 
common ancestor. How might two unrelated proteins come to resemble each 
other structurally? Two proteins evolving independently may have converged 
on a similar structure to perform a similar biochemical activity. Perhaps that 
structure was an especially effective solution to a biochemical problem that 
organisms face. The process by which very different evolutionary pathways 
lead to the same solution is called convergent evolution. 

An example of convergent evolution is found among the serine proteases.
These enzymes, to be considered in more detail in Chapter 9, cleave peptide
bonds by hydrolysis. Figure 6.18 shows the structure of the active sites
(that is, the sites on the proteins at which the hydrolysis reaction takes
place) for two such enzymes, chymotrypsin and subtilisin. These active-site
structures are remarkably similar. In each case, a serine residue, a
histidine residue, and an aspartic acid residue are positioned in space in
nearly identical arrangements. As we will see, this conserved spatial
arrangement is critical for the activity of these enzymes and affords the
same mechanistic solution to the problem of peptide hydrolysis. At first
glance, this similarity might suggest that these proteins are homologous.
However, striking differences in the overall structures of these proteins
make an evolutionary relationship extremely unlikely (Figure 6.19). Whereas
chymotrypsin consists almost entirely of β sheets, subtilisin contains
extensive α-helical structure. Moreover, the key serine, histidine, and
aspartic acid residues do not occupy similar positions or even appear in the
same order within the two sequences. It is extremely unlikely that two
proteins evolving from a common ancestor could have retained similar
active-site structures while other aspects of the structure changed so
dramatically.

Figure 6.17 Sequence alignment of internal repeats. (A) An alignment of the
sequences of the two repeats of the TATA-box-binding protein. The
amino-terminal repeat is shown in green and the carboxyl-terminal repeat in
blue. (B) Structure of the TATA-box-binding protein. The amino-terminal
domain is shown in green and the carboxyl-terminal domain in blue. [Drawn
from 1VOK.pdb.]

Figure 6.18 Convergent evolution of protease active sites. The relative
positions of the three key residues shown are nearly identical in the active
sites of the serine proteases chymotrypsin and subtilisin.

Figure 6.19 Structures of mammalian chymotrypsin and bacterial subtilisin.
The overall structures are quite dissimilar, in stark contrast with the
active sites, shown at the top of each structure. The β strands are shown in
yellow and the α helices in blue. [Drawn from 1GCT.pdb and 1SUP.pdb.]


Comparison of RNA sequences can be a source of insight into RNA 
secondary structures 


Homologous RNA sequences can be compared in a manner similar to that 
already described for protein sequences. Such comparisons can be a source 
of important insights into evolutionary relationships; in addition, they pro- 
vide clues to the three-dimensional structure of the RNA itself. As noted in 
Chapter 4, single-stranded nucleic acid molecules fold back on themselves 
to form elaborate structures held together by Watson–Crick base-pairing
and other interactions. In a family of sequences that form similar base- 
paired structures, base sequences may vary, but base-pairing ability is 
conserved. Consider, for example, a region from a large RNA molecule 
present in the ribosomes of all organisms (Figure 6.20). In the region shown, 
the E. coli sequence has a guanine (G) residue in position 9 and a cytosine 
(C) residue in position 22, whereas the human sequence has uracil (U) in 
position 9 and adenine (A) in position 22. Examination of the six sequences 
shown in Figure 6.20 reveals that the bases in positions 9 and 22, as well as 
several of the neighboring positions, retain the ability to form Watson–Crick
base pairs even though the identities of the bases in these positions vary. We 
can deduce that two segments with paired mutations that maintain base- 
pairing ability are likely to form a double helix. Where sequences are known 
for several homologous RNA molecules, this type of sequence analysis can 
often suggest complete secondary structures as well as some additional 


Figure 6.20 Comparison of RNA sequences. (A) A comparison of sequences in a part of
ribosomal RNA taken from a variety of species. (B) The implied secondary structure. Green bars
indicate positions at which Watson–Crick base-pairing is completely conserved in the sequences
shown, whereas dots indicate positions at which Watson–Crick base-pairing is conserved in most
cases.

(A)
                                            9            22
BACTERIA  Escherichia coli          CACACGGCGGCUGCUAACGUCCGUCGUGAA
          Pseudomonas aeruginosa    ACCACGGCGGGUGCUAACGUCCGUCGUGAA
ARCHAEA   Halobacterium halobium    CCGGUGUGCGGGG-UAAGCCUGUGCACCGU
          Methanococcus vannielii   GAGGGCAUACGGG-UAAGCUGUAUGUCCGA
EUKARYA   Homo sapiens              EGECCCACUUUUGG-UAAGCAGAACUGECGEC
          Saccharomyces cerevisiae  GGGCCAUUUUUGG-UAAGCAGAACUGGCGA


interactions. For this particular ribosomal RNA, the subsequent determi- 
nation of its three-dimensional structure (Section 30.3) confirmed the pre- 
dicted secondary structure. 
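
The covariation argument can be sketched in a few lines of Python. The bases used are the ones quoted in the text for positions 9 and 22 in E. coli and in human rRNA. The function names are hypothetical, and the check considers only canonical Watson–Crick pairs, which is a simplification.

```python
# Canonical Watson-Crick pairs in RNA (wobble pairs such as G-U are ignored here).
WATSON_CRICK = {("G", "C"), ("C", "G"), ("A", "U"), ("U", "A")}

def is_watson_crick(base_a, base_b):
    """True if the two bases can form a canonical Watson-Crick pair."""
    return (base_a, base_b) in WATSON_CRICK

def covaries(pairs_by_species):
    """True if every species can pair at the two positions
    even though the identities of the bases differ between species."""
    all_can_pair = all(is_watson_crick(a, b) for a, b in pairs_by_species.values())
    identities_vary = len(set(pairs_by_species.values())) > 1
    return all_can_pair and identities_vary

# Bases at positions 9 and 22, as stated in the text.
observed = {
    "Escherichia coli": ("G", "C"),
    "Homo sapiens": ("U", "A"),
}

print(covaries(observed))
```

Two columns of an alignment that pass such a test in many species are good candidates for partners in a base-paired helix.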


6.4 Evolutionary Trees Can Be Constructed on the Basis of 
Sequence Information 


The observation that homology is often manifested as sequence similarity 
suggests that the evolutionary pathway relating the members of a family of 
proteins may be deduced by examination of sequence similarity. This 
approach is based on the notion that sequences that are more similar to one 
another have had less evolutionary time to diverge than have sequences that 
are less similar. This method can be illustrated by using the three globin 
sequences in Figures 6.11 and 6.13, as well as the sequence for the human 
hemoglobin β chain. These sequences can be aligned with the additional
constraint that gaps, if present, should be at the same positions in all of the 
proteins. These aligned sequences can be used to construct an evolutionary 
tree in which the length of the branch connecting each pair of proteins is 
proportional to the number of amino acid differences between the sequences 
(Figure 6.21). 
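
As a rough illustration of the first step in building such a tree, the sketch below counts the differences between every pair of aligned sequences. The three sequences are invented toy strings, not real globin sequences, and a gap opposite a residue is counted as one difference, which is a simplifying assumption.

```python
from itertools import combinations

def count_differences(seq_a, seq_b):
    """Count aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)

# Hypothetical toy alignment; "-" marks a gap placed at the same column in all rows.
aligned = {
    "protein_1": "MKVLSAGDK-A",
    "protein_2": "MKVLTAGDK-A",
    "protein_3": "MRILTPGEKQA",
}

# Pairwise difference counts: the raw material for assigning branch lengths.
distances = {
    (x, y): count_differences(aligned[x], aligned[y])
    for x, y in combinations(aligned, 2)
}

closest_pair = min(distances, key=distances.get)
print(distances)
print("most similar pair:", closest_pair)
```

The pair with the fewest differences would be joined first, with the more divergent sequence attached by a longer branch.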

is deduced by sequence comparison,
whereas the results of fossil studies
provided the overall time scale showing
when divergence occurred.

Such comparisons reveal only the relative divergence times; for example,
that myoglobin diverged from hemoglobin twice as long ago as the α
chain diverged from the β chain. How can we estimate the approximate
dates of gene duplications and other evolutionary events? Evolutionary 
trees can be calibrated by comparing the deduced branch points with diver- 
gence times determined from the fossil record. For example, the duplication 
leading to the two chains of hemoglobin appears to have occurred 350 mil- 
lion years ago. This estimate is supported by the observation that jawless 
fish such as the lamprey, which diverged from bony fish approximately 
400 million years ago, contain hemoglobin built from a single type of 
subunit (Figure 6.22). 

These methods can be applied to both relatively modern and very 
ancient molecules, such as the ribosomal RNAs that are found in all organ- 
isms. Indeed, such an RNA sequence analysis led to the realization that 
Archaea are a distinct group of organisms that diverged from Bacteria very 
early in evolutionary history. 


Figure 6.22 The lamprey. A jawless fish whose 
ancestors diverged from bony fish approximately 
400 million years ago, the lamprey contains 
hemoglobin molecules that contain only a single 
type of polypeptide chain. [Brent P. Kent] 


6.5 Modern Techniques Make the Experimental 
Exploration of Evolution Possible 


Two techniques of biochemistry have made it possible to examine the 
course of evolution more directly and not simply by inference. The poly- 
merase chain reaction (Chapter 5) allows the direct examination of ancient 
DNA sequences, releasing us, at least in some cases, from the constraints of 
being able to examine existing genomes from living organisms only. 
Molecular evolution may be investigated through the use of combinatorial 
chemistry, the process of producing large populations of molecules en masse 
and selecting for a biochemical property. This exciting process provides a 
glimpse into the types of molecules that may have existed very early in 
evolution. 


Ancient DNA can sometimes be amplified and sequenced 


The tremendous chemical stability of DNA makes the molecule well suited 
to its role as the storage site of genetic information. So stable is the molecule 
that samples of DNA have survived for many thousands of years under 
appropriate conditions. With the development of PCR and advanced 
DNA-sequencing methods, such ancient DNA can be amplified and 
sequenced. This approach has been applied to mitochondrial DNA from 
a Neanderthal fossil estimated at 38,000 years of age excavated from 
Vindija Cave, Croatia, in 1980. Remarkably, investigators have completely 
sequenced the mitochondrial genome from this specimen. Comparison of 


Figure 6.23 Placing Neanderthal on an 
evolutionary tree. Comparison of DNA 
sequences revealed that Neanderthal is not 
on the line of direct descent leading to Homo 
sapiens but, instead, branched off earlier and 
then became extinct. 


the Neanderthal mitochondrial sequence with those from Homo sapiens 
individuals revealed between 201 and 234 substitutions, considerably fewer 
than the approximately 1,500 differences between human beings and chim- 
panzees over the same region. Further analysis suggested that the common 
ancestor of modern human beings and Neanderthals lived approximately 
660,000 years ago. An evolutionary tree constructed from these data has 
revealed that the Neanderthal was not an intermediate between chimpan- 
zees and human beings but, instead, was an evolutionary “dead end” that 
became extinct (Figure 6.23). 

A few earlier studies claimed to determine the sequences of far more 
ancient DNA such as that found in insects trapped in amber, but these 
studies appear to have been flawed. The source of these sequences turned 
out to be contaminating modern DNA. Successful sequencing of ancient 
DNA requires sufficient DNA for reliable amplification and the rigorous 
exclusion of all sources of contamination. 


Molecular evolution can be examined experimentally 


Evolution requires three processes: (1) the generation of a diverse popula- 
tion, (2) the selection of members based on some criterion of fitness, and 
(3) reproduction to enrich the population in these more-fit members. 
Nucleic acid molecules are capable of undergoing all three processes in 
vitro under appropriate conditions. The results of such studies enable us 
to glimpse how evolutionary processes might have generated catalytic 
activities and specific binding abilities—important biochemical functions 
in all living systems. 

A diverse population of nucleic acid molecules can be synthesized in the 
laboratory by the process of combinatorial chemistry, which rapidly pro- 
duces large populations of a particular type of molecule such as a nucleic 
acid. A population of molecules of a given size can be generated randomly 
so that many or all possible sequences are present in the mixture. When an 
initial population has been generated, it is subjected to a selection process 
that isolates specific molecules with desired binding or reactivity properties. 
Finally, molecules that have survived the selection process are replicated 
through the use of PCR; primers are directed toward specific sequences 
included at the ends of each member of the population. Errors that occur 
naturally in the course of the replication process introduce additional varia- 
tion into the population in each “generation.” 
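
The cycle of diversity, selection, and error-prone replication can be mimicked in a toy simulation. This is only a sketch under loud assumptions: "fitness" is the number of positions matching an arbitrary, hypothetical target motif (a stand-in for binding to a column), and the pool size, survivor count, and error rate are invented defaults rather than experimental values.

```python
import random

BASES = "ACGU"

def random_pool(size, length, rng):
    """Step 1: generate a diverse starting population of random sequences."""
    return ["".join(rng.choice(BASES) for _ in range(length)) for _ in range(size)]

def fitness(seq, target):
    """Toy stand-in for binding: count positions matching a hypothetical motif."""
    return sum(1 for a, b in zip(seq, target) if a == b)

def select(pool, target, keep):
    """Step 2: retain only the best-scoring members of the population."""
    return sorted(pool, key=lambda s: fitness(s, target), reverse=True)[:keep]

def replicate(survivors, size, error_rate, rng):
    """Step 3: copy survivors back up to full size with occasional copying errors.
    An 'error' draws a random base, so it sometimes restores the original base."""
    offspring = []
    for _ in range(size):
        parent = rng.choice(survivors)
        child = "".join(
            rng.choice(BASES) if rng.random() < error_rate else base
            for base in parent
        )
        offspring.append(child)
    return offspring

def evolve(target, pool_size=200, keep=20, generations=8, error_rate=0.02, seed=0):
    """Run repeated rounds and record the best score found in each generation."""
    rng = random.Random(seed)
    pool = random_pool(pool_size, len(target), rng)
    best_per_generation = []
    for _ in range(generations):
        survivors = select(pool, target, keep)
        best_per_generation.append(fitness(survivors[0], target))
        pool = replicate(survivors, pool_size, error_rate, rng)
    return best_per_generation
```

Calling `evolve("GGAAGAAACUGC")` returns one best score per generation. In a real experiment the selection step is physical (binding or reactivity), not a comparison with a known answer.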

Let us consider an application of this approach. Early in evolution, 
before the emergence of proteins, RNA molecules may have played all 
major roles in biological catalysis. To understand the properties of potential 
RNA catalysts, researchers have used the methods heretofore described 
to create an RNA molecule capable of binding adenosine triphosphate 
and related nucleotides. An initial population of RNA molecules 169 
nucleotides long was created; 120 of the positions differed randomly, with 
equimolar mixtures of adenine, cytosine, guanine, and uracil. The initial 

Figure 6.25 A conserved secondary
structure. The secondary structure shown is
common to RNA molecules selected for ATP
binding.

Figure 6.24 Evolution in the laboratory. A collection of RNA molecules of random sequences is
synthesized by combinatorial chemistry. This collection is selected for the ability to bind ATP by
passing the RNA through an ATP affinity column (Section 3.1). The ATP-binding RNA molecules are
released from the column by washing with excess ATP and then replicated. The process of selection
and replication is then repeated several times. The final RNA products with significant ATP-binding
ability are isolated and characterized.


synthetic pool that was used contained approximately 10¹⁴ RNA molecules.
Note that this number is a very small fraction of the total possible pool of 
random 120-base sequences. From this pool, those molecules that bound 
to ATP, which had been immobilized on a column, were selected 
(Figure 6.24). 
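
A quick calculation shows how small that fraction is. The sketch assumes a pool of about 10¹⁴ molecules, each carrying a distinct sequence (an upper bound on coverage), and 120 randomized positions with four possible bases at each.

```python
from fractions import Fraction

POOL_SIZE = 10**14        # approximate number of molecules in the synthetic pool
RANDOM_POSITIONS = 120    # positions randomized in each molecule
BASES = 4                 # A, C, G, or U at each randomized position

# Total number of distinct sequences that could exist over the random region.
possible = BASES ** RANDOM_POSITIONS

# Upper bound on the fraction of sequence space sampled by the pool.
fraction_sampled = Fraction(POOL_SIZE, possible)

print(f"possible sequences: {possible:.3e}")
print(f"fraction sampled:   {float(fraction_sampled):.1e}")
```

The number of possible sequences is on the order of 10⁷², so the pool samples far less than a billionth of a billionth of the available sequence space.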

The collection of molecules that were bound well by the ATP affinity 
column were allowed to replicate by reverse transcription into DNA, 
amplification by PCR, and transcription back into RNA. The somewhat 
error-prone replication processes introduced additional mutations into the 
population in each cycle. The new population was subjected to additional 
rounds of selection for ATP-binding activity. After eight generations, 
members of the selected population were characterized by sequencing. 
Seventeen different sequences were obtained, 16 of which could form the 
structure shown in Figure 6.25. Each of these molecules bound ATP with 
dissociation constants less than 50 μM.

The folded structure of the ATP-binding region from one of these 
RNAs was determined by nuclear magnetic resonance (Section 3.6) methods 
(Figure 6.26). As expected, this 40-nucleotide molecule is composed of two 
Watson–Crick base-paired helical regions separated by an 11-nucleotide


loop. This loop folds back on itself in an intricate way to form a deep pocket 
into which the adenine ring can fit. Thus, a structure had evolved that was 
capable of a specific interaction. 

Figure 6.26 An evolved ATP-binding
RNA molecule. (A) The Watson–Crick
base-pairing pattern, (B) the folding pattern,
and (C) a surface representation of an RNA
molecule selected to bind adenosine
nucleotides. The bound ATP is shown in part
B, and the binding site is revealed as a deep
pocket in part C.


Summary


6.1 Homologs Are Descended from a Common Ancestor

Exploring evolution biochemically often means searching for homology
between molecules, because homologous molecules, or homologs,
evolved from a common ancestor. Paralogs are homologous molecules
that are found in one species and have acquired different functions
through evolutionary time. Orthologs are homologous molecules that
are found in different species and have similar or identical functions.


6.2 Statistical Analysis of Sequence Alignments Can Detect Homology

Protein and nucleic acid sequences are two of the primary languages of
biochemistry. Sequence-alignment methods are the most powerful tools
of the evolutionary detective. Sequences can be aligned to maximize
their similarity, and the significance of these alignments can be judged
by statistical tests. The detection of a statistically significant alignment
between two sequences strongly suggests that two sequences are related
by divergent evolution from a common ancestor. The use of substitution
matrices makes the detection of more-distant evolutionary
relationships possible. Any sequence can be used to probe sequence
databases to identify related sequences present in the same organism or
in other organisms.


6.3 Examination of Three-Dimensional Structure Enhances Our
Understanding of Evolutionary Relationships

The evolutionary kinship between proteins may be even more strikingly
evident in the conserved three-dimensional structures. The analysis
of three-dimensional structure in combination with the analysis of
especially conserved sequences has made it possible to determine
evolutionary relationships that cannot be detected by other means.
Sequence-comparison methods can also be used to detect imperfectly
repeated sequences within a protein, indicative of linked similar
domains.


6.4 Evolutionary Trees Can Be Constructed on the Basis of
Sequence Information

Evolutionary trees can be constructed with the assumption that the
number of sequence differences corresponds to the time since the
two sequences diverged. Construction of an evolutionary tree based
on sequence comparisons revealed approximate times for the
gene-duplication events separating myoglobin and hemoglobin as well as the
α and β subunits of hemoglobin. Evolutionary trees based on sequences
can be compared with those based on fossil records.


6.5 Modern Techniques Make the Experimental Exploration of
Evolution Possible

The exploration of evolution can also be a laboratory science. In favorable
cases, PCR amplification of well-preserved samples allows the
determination of nucleotide sequences from extinct organisms.
Sequences so determined can help authenticate parts of an evolutionary
tree constructed by other means. Molecular evolutionary experiments
performed in the test tube can examine how molecules such as
ligand-binding RNA molecules might have been generated.


Key Terms 


homolog (p. 174)
paralog (p. 174)
ortholog (p. 174)
sequence alignment (p. 176)
conservative substitution (p. 178)
substitution matrix (p. 178)
BLAST search (p. 181)
sequence template (p. 184)
divergent evolution (p. 185)
convergent evolution (p. 185)
evolutionary tree (p. 187)
combinatorial chemistry (p. 188)


Problems 


1. What’s the score? Using the identity-based scoring 
system (Section 6.2), calculate the score for the follow- 
ing alignment. Do you think the score is statistically 
significant? 

(1) WYLGKITRMDAEVLLKKPTVRDGHFLVTQCESSPGEF-
(2) WYFGKITRRESERLLLNPENPRGTFLVRESETTKGAY-

(1) SISVRFGDSVQ-----HFKVLRDQNGKYYLWAVK-FN-
(2) CLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFS-

(1) SLNELVAYHRTASVSRTHTILLSDMNV
(2) SSLQQLVAYYSKHADGLCHRLTNV


2. Sequence and structure. A comparison of the aligned amino 
acid sequences of two proteins each consisting of 150 amino 
acids reveals them to be only 8% identical. However, their 
three-dimensional structures are very similar. Are these 
two proteins related evolutionarily? Explain. 


3. It depends on how you count. Consider the following two 
sequence alignments: 


(a) A-SNLFDIRLIG 
GSNDFYEVKIMD 


(b) ASNLFDIRLI-G 
GSNDFYEVKIMD 


Which alignment has a higher score if the identity-based 
scoring system (Section 6.2) is used? Which alignment has 
a higher score if the Blosum-62 substitution matrix 
(Figure 6.9) is used? 


4. Discovering a new base pair. Examine the ribosomal
RNA sequences in Figure 6.20. In sequences that do not
contain Watson–Crick base pairs, what base tends to be
paired with G? Propose a structure for your new base pair.


5. Overwhelmed by numbers. Suppose that you wish to syn- 
thesize a pool of RNA molecules that contain all four bases 
at each of 40 positions. How much RNA must you have in 
grams if the pool is to have at least a single molecule of each 
sequence? The average molecular weight of a nucleotide is 
330 g mol⁻¹.


6. Form follows function. The three-dimensional structure 
of biomolecules is more conserved evolutionarily than is 
sequence. Why? 


7. Shuffling. Using the identity-based scoring system 
(Section 6.2), calculate the alignment score for the align- 
ment of the following two short sequences: 


(1) ASNFLDKAGK 
(2) ATDYLEKAGK 


Generate a shuffled version of sequence 2 by randomly 
reordering these 10 amino acids. Align your shuffled 
sequence with sequence 1 without allowing gaps, and cal- 
culate the alignment score between sequence 1 and your 
shuffled sequence. 


8. Interpreting the score. Suppose that the sequences of two
proteins each consisting of 200 amino acids are aligned and
that the percentage of identical residues has been calculated.
How would you interpret each of the following results in
regard to the possible divergence of the two proteins from a
common ancestor? (a) 80%, (b) 50%, (c) 20%, (d) 10%.


9. Particularly unique. Consider the Blosum-62 matrix in 
Figure 6.9. Replacement of which three amino acids never 
yields a positive score? What features of these residues 
might contribute to this observation? 


10. A set of three. The sequences of three proteins (A, B, 
and C) are compared with one another, yielding the follow- 
ing levels of identity: 


       A      B      C
A    100%    65%    15%
B     65%   100%    55%
C     15%    55%   100%


Assume that the sequence matches are distributed uni- 
formly along each aligned sequence pair. Would you expect 
protein A and protein C to have similar three-dimensional 
structures? Explain. 


11. RNA alignment. Sequences of an RNA fragment from 
five species have been determined and aligned. Propose a 
likely secondary structure for these fragments. 


(1) UUGGAGAUUCGGUAGAAUCUCCC 
(2) GCCGGGAAUCGACAGAUUCCCCG 
(3) CCCAAGUCCCGGCAGGGACUUAC
(4) CUCACCUGCCGAUAGGCAGGUCA 
(5) AAUACCACCCGGUAGGGUGGUUC 


12. The more the merrier. When RNA alignments are used
to determine secondary structure, it is advantageous to have
many sequences representing a wide variety of species.
Why?


13. To err is human. You have discovered a mutant form of 
a thermostable DNA polymerase with significantly reduced 
fidelity in adding the appropriate nucleotide to the growing 
DNA strand, compared with wild-type DNA polymerase. 
How might this mutant be useful in the molecular-evolution 
experiments described in Section 6.5? 


14. Generation to generation. When performing a molecular- 
evolution experiment, such as that described in Section 6.5, 
why is it important to repeat the selection and replication steps 
for several generations? 


15. BLAST away. Using the National Center for 
Biotechnology Information Web site (www.ncbi.nlm.nih.gov),
find the sequence of the enzyme triose phosphate
isomerase from E. coli. Use this sequence as the query for a 
protein-protein BLAST search. In the output, find the 
alignment with the sequence of triose phosphate isomerase 
from human beings (Homo sapiens). How many identities 
are observed in the alignment? 


Hemoglobin: Portrait of a
Protein in Action


Beta chain of hemoglobin 


The transition from anaerobic to aerobic life was a major step in evolution
because it uncovered a rich reservoir of energy. Fifteen times as much
energy is extracted from glucose in the presence of oxygen as in its
absence. For single-celled and other small organisms, oxygen can be
absorbed into actively metabolizing cells directly from the air or surround- 
ing water. Vertebrates evolved two principal mechanisms for supplying 
their cells with an adequate supply of oxygen. The first is a circulatory sys- 
tem that actively delivers oxygen to cells throughout the body. The second 
is the use of the oxygen-transport and oxygen-storage proteins, hemoglobin 
and myoglobin. Hemoglobin, which is contained in red blood cells, is a 
fascinating protein, efficiently carrying oxygen from the lungs to the tissues 
while also contributing to the transport of carbon dioxide and hydrogen ions 
back to the lungs. Myoglobin, located in muscle, provides a reserve supply 
of oxygen available in time of need. 

A comparison of myoglobin and hemoglobin illuminates some key 
aspects of protein structure and function. These two evolutionarily related 
proteins employ nearly identical structures for oxygen binding (Chapter 6). 
However, hemoglobin is a remarkably efficient oxygen carrier, able to use 
as much as 90% of its potential oxygen-carrying capacity effectively. Under 
similar conditions, myoglobin would be able to use only 7% of its potential 


CHAPTER 7


In the bloodstream, red cells carry oxygen from the lungs to the tissues, where demand 
is high. Hemoglobin, the protein that gives blood its red color, is responsible for the 
transport of oxygen via its four heme-bound subunits. Hemoglobin was one of the first 
proteins to have its structure determined; the folding of a single subunit is shown in this 
hand-drawn view. [Left, Dr. Dennis Kunkel/Visuals Unlimited.] 


OUTLINE 


7.1 Myoglobin and Hemoglobin Bind 
Oxygen at Iron Atoms in Heme 


7.2 Hemoglobin Binds Oxygen
Cooperatively


7.3 Hydrogen Ions and Carbon
Dioxide Promote the Release of 
Oxygen: The Bohr Effect 


7.4 Mutations in Genes Encoding 
Hemoglobin Subunits Can Result 
in Disease 


Myoglobin 


Figure 7.1 Structure of
myoglobin. Notice that myoglobin
consists of a single polypeptide chain,
formed of α helices connected by turns,
with one oxygen-binding site. [Drawn
from 1MBD.pdb.]


capacity. What accounts for this dramatic difference? Myoglobin exists as a 
single polypeptide, whereas hemoglobin comprises four polypeptide chains. 
The four chains in hemoglobin bind oxygen cooperatively, meaning that the 
binding of oxygen to a site in one chain increases the likelihood that the 
remaining chains will bind oxygen. Furthermore, the oxygen-binding prop- 
erties of hemoglobin are modulated by the binding of hydrogen ions and 
carbon dioxide in a manner that enhances oxygen-carrying capacity. Both 
cooperativity and the response to modulators are made possible by varia- 
tions in the quaternary structure of hemoglobin when different combina- 
tions of molecules are bound. 

Hemoglobin and myoglobin have played important roles in the history 
of biochemistry. They were the first proteins for which three-dimensional 
structures were determined by x-ray crystallography. Furthermore, the pos- 
sibility that variations in protein sequence could lead to disease was first 
proposed and demonstrated for sickle-cell anemia, a blood disease caused 
by mutation of a single amino acid in one hemoglobin chain. Hemoglobin 
has been and continues to be a valuable source of knowledge and insight, 
both in itself and as a prototype for many other proteins that we will 
encounter throughout our study of biochemistry. 


7.1 Myoglobin and Hemoglobin Bind Oxygen 
at Iron Atoms in Heme 


Sperm whale myoglobin was the first protein for which the three-dimensional 
structure was determined. X-ray crystallographic studies pioneered by 
John Kendrew revealed the structure of this protein in the 1950s (Figure 7.1). 
Myoglobin consists largely of α helices that are linked to one another by turns
to form a globular structure. 

Myoglobin can exist in an oxygen-free form called deoxymyoglobin or in 
a form with an oxygen molecule bound called oxymyoglobin. The ability of 
myoglobin and hemoglobin to bind oxygen depends on the presence of a 
bound prosthetic group called heme. 

The heme group gives muscle and blood their distinctive red color. It consists
of an organic component and a central iron atom. The organic component,
called protoporphyrin, is made up of four pyrrole rings linked by
methine bridges to form a tetrapyrrole ring. Four methyl groups, two vinyl
groups, and two propionate side chains are attached. 




The iron atom lies in the center of the protoporphyrin, bonded to 
the four pyrrole nitrogen atoms. Although the heme-bound iron can be in 
either the ferrous (Fe²⁺) or ferric (Fe³⁺) oxidation state, only the Fe²⁺ state
is capable of binding oxygen. The iron ion can form two additional bonds, 
one on each side of the heme plane. These binding sites are called the fifth 
and sixth coordination sites. In myoglobin, the fifth coordination site is 
occupied by the imidazole ring of a histidine residue from the protein. This 
histidine is referred to as the proximal histidine. 

Oxygen binding occurs at the sixth coordination site. In deoxymyo- 
globin, this site remains unoccupied. The iron ion is slightly too large to fit 
into the well-defined hole within the porphyrin ring; it lies approximately 
0.4 Å outside the porphyrin plane (Figure 7.2, left). Binding of the oxygen
molecule at the sixth coordination site substantially rearranges the electrons 
within the iron so that the ion becomes effectively smaller, allowing it to 
move within the plane of the porphyrin (Figure 7.2, right). Remarkably, the 
structural changes that take place on oxygen binding were predicted by 
Linus Pauling, on the basis of magnetic measurements in 1936, nearly 
25 years before the three-dimensional structures of myoglobin and hemoglo- 
bin were elucidated. 


Changes in heme electronic structure upon oxygen binding are the 
basis for functional imaging studies 


The change in electronic structure that occurs when the iron ion
moves into the plane of the porphyrin is paralleled by alterations in 
the magnetic properties of hemoglobin; these changes are the basis for func- 
tional magnetic resonance imaging (fMRI), one of the most powerful meth- 
ods for examining brain function. Nuclear magnetic resonance techniques 
detect signals that originate primarily from the protons in water molecules 
and are altered by the magnetic properties of hemoglobin. With the use of 
appropriate techniques, images can be generated that reveal differences in 
the relative amounts of deoxy- and oxyhemoglobin and thus the relative 
activity of various parts of the brain. When a specific part of the brain is 
active, blood vessels relax to allow more blood flow to that region. Thus, a 
more-active region of the brain will be richer in oxyhemoglobin. 

These noninvasive methods identify areas of the brain that process sen- 
sory information. For example, subjects have been imaged while breathing 
air that either does or does not contain odorants. When odorants are present,
fMRI detects an increase in the level of hemoglobin oxygenation (and,


Figure 7.2 Oxygen binding changes
the position of the iron ion. The iron
ion lies slightly outside the plane of the
porphyrin in deoxymyoglobin heme
(left), but moves into the plane of the
heme on oxygenation (right).

Figure 7.4 Iron-oxygen bonding. The
interaction between iron and oxygen in
myoglobin can be described as a combination
of resonance structures, one with Fe²⁺ and
dioxygen and another with Fe³⁺ and
superoxide ion.


Figure 7.5 Stabilizing bound oxygen. A 
hydrogen bond (dotted green line) donated 
by the distal histidine residue to the bound 
oxygen molecule helps stabilize oxymyoglobin. 


Figure 7.3 Functional magnetic resonance 
imaging of the brain. A functional magnetic 
resonance image reveals brain response to 
odorants. The light spots indicate regions of the 
brain activated by odorants. [From N. Sobel et al., 
J. Neurophysiol. 83(2000):537–551; courtesy of
Dr. Noam Sobel.] 


hence, of activity) in several regions of the brain (Figure 7.3). These regions 
are in the primary olfactory cortex, as well as in areas in which secondary 
processing of olfactory signals presumably takes place. Further analysis 
reveals the time course of activation of particular regions. Functional MRI 
shows tremendous potential for mapping regions and pathways engaged in 
processing sensory information obtained from all the senses. Thus, a seem- 
ingly incidental aspect of the biochemistry of hemoglobin has enabled 
observation of the brain in action. 


The structure of myoglobin prevents the release of reactive 
oxygen species 


Oxygen binding to iron in heme is accompanied by the partial transfer of an 
electron from the ferrous ion to oxygen. In many ways, the structure is best 
described as a complex between ferric ion (Fe³⁺) and superoxide anion
(O₂⁻), as illustrated in Figure 7.4. It is crucial that oxygen, when it is
released, leaves as dioxygen rather than superoxide, for two important 
reasons. First, superoxide and other species generated from it are reactive 
oxygen species that can be damaging to many biological materials. Second, 
release of superoxide would leave the iron ion in the ferric state. This spe- 
cies, termed metmyoglobin, does not bind oxygen. Thus, potential oxygen- 
storage capacity is lost. Features of myoglobin stabilize the oxygen complex 
such that superoxide is less likely to be released. In particular, the binding 
pocket of myoglobin includes an additional histidine residue (termed the 
distal histidine) that donates a hydrogen bond to the bound oxygen molecule 
(Figure 7.5). The superoxide character of the bound oxygen species 




strengthens this interaction. Thus, the protein component of myoglobin con- 
trols the intrinsic reactivity of heme, making it more suitable for reversible 
oxygen binding. 


Human hemoglobin is an assembly of four myoglobin-like subunits 


The three-dimensional structure of hemoglobin from horse heart was 
solved by Max Perutz shortly after the determination of the myoglobin 
structure. Since then, the structures of hemoglobins from other species 
including humans have been determined. Hemoglobin consists of four 
polypeptide chains, two identical α chains and two identical β chains
(Figure 7.6). Each of the subunits consists of a set of α helices in the same
arrangement as the α helices in myoglobin (see Figure 6.15 for a comparison
of the structures). The recurring structure is called a globin fold. Consistent
with this structural similarity, alignment of the amino acid sequences of the
α and β chains of human hemoglobin with those of sperm whale myoglobin
yields 25% and 24% identity, respectively, and good conservation of
key residues such as the proximal and distal histidines. Thus, the α and β
chains are related to each other and to myoglobin by divergent evolution
(Section 6.2). 


Figure 7.6 Quaternary structure of deoxyhemoglobin. Hemoglobin, which is composed
of two α chains and two β chains, functions as a pair of αβ dimers. (A) A ribbon diagram.
(B) A space-filling model. [Drawn from 1A3N.pdb.]


The hemoglobin tetramer, referred to as hemoglobin A (HbA), is best 
described as a pair of identical αβ dimers (α1β1 and α2β2) that associate to
form the tetramer. In deoxyhemoglobin, these αβ dimers are linked by an
extensive interface, which includes the carboxyl terminus of each chain. The
heme groups are well separated in the tetramer by iron-iron distances
ranging from 24 to 40 Å.


7.2 Hemoglobin Binds Oxygen Cooperatively


We can determine the oxygen-binding properties of each of these proteins 
by observing its oxygen-binding curve, a plot of the fractional saturation ver- 
sus the concentration of oxygen. The fractional saturation, Y, is defined as 
the fraction of possible binding sites that contain bound oxygen. The value 
of Y can range from 0 (all sites empty) to 1 (all sites filled). The concentration
of oxygen is most conveniently measured by its partial pressure, pO2. For
myoglobin, a binding curve indicating a simple chemical equilibrium is
observed (Figure 7.7). Notice that the curve rises sharply as pO2 increases
and then levels off. Half-saturation of the binding sites, referred to as P50
(for 50% saturated), is at the relatively low value of 2 torr (mm Hg), indicating
that oxygen binds with high affinity to myoglobin.

Torr
A unit of pressure equal to that exerted by a
column of mercury 1 mm high at 0°C and
standard gravity (1 mm Hg). Named after
Evangelista Torricelli (1608-1647), inventor
of the mercury barometer.

Figure 7.7 Oxygen binding by
myoglobin. Half the myoglobin molecules
have bound oxygen when the oxygen partial
pressure is 2 torr.

Figure 7.8 Oxygen binding by
hemoglobin. This curve, obtained for
hemoglobin in red blood cells, is shaped
somewhat like an “S,” indicating that distinct,
but interacting, oxygen-binding sites are
present in each hemoglobin molecule.
Half-saturation for hemoglobin is 26 torr. For
comparison, the binding curve for myoglobin
is shown as a dashed black curve.
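The simple equilibrium behind the myoglobin curve can be sketched numerically. For a single, noncooperative binding site, the fractional saturation takes the hyperbolic form Y = pO2 / (pO2 + P50). The function name below is hypothetical, and the only value taken from the text is P50 = 2 torr for myoglobin.

```python
def fractional_saturation_simple(po2_torr, p50_torr):
    """Fractional saturation Y for one-site, noncooperative binding."""
    if po2_torr < 0 or p50_torr <= 0:
        raise ValueError("pO2 must be >= 0 and P50 must be > 0")
    return po2_torr / (po2_torr + p50_torr)


MYOGLOBIN_P50_TORR = 2.0  # half-saturation pressure quoted in the text

if __name__ == "__main__":
    # The curve rises steeply at low pO2 and then levels off.
    for po2 in (0, 1, 2, 5, 20, 100):
        y = fractional_saturation_simple(po2, MYOGLOBIN_P50_TORR)
        print(f"pO2 = {po2:>3} torr  ->  Y = {y:.2f}")
```

With these inputs, Y is exactly one-half at 2 torr and is already above 0.9 at 20 torr, which is why the curve appears to level off so early.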

In contrast, the oxygen-binding curve for hemoglobin in red blood cells 
shows some remarkable features (Figure 7.8). It does not look like a simple 
binding curve such as that for myoglobin; instead, it resembles an “S.” Such
curves are referred to as sigmoid because of their S-like shape. In addition,
oxygen binding for hemoglobin (P50 = 26 torr) is significantly weaker than
that for myoglobin. Note that this binding curve is derived from hemo- 
globin in red blood cells. Inside red cells, hemoglobin interacts with 
2,3-bisphosphoglycerate, a molecule that significantly lowers hemoglobin’s 
oxygen affinity, as will be considered in detail shortly. 

A sigmoid binding curve indicates that a protein shows a special binding 
behavior. For hemoglobin, this shape suggests that the binding of oxygen at 
one site within the hemoglobin tetramer increases the likelihood that oxy- 
gen binds at the remaining unoccupied sites. Conversely, the unloading of 
oxygen at one heme facilitates the unloading of oxygen at the others. This 
sort of binding behavior is referred to as cooperative, because the binding 
reactions at individual sites in each hemoglobin molecule are not indepen- 
dent of one another. We will return to the mechanism of this cooperativity 
shortly. 

What is the physiological significance of the cooperative binding of oxy- 
gen by hemoglobin? Oxygen must be transported in the blood from the 
lungs, where the partial pressure of oxygen is relatively high (approximately 
100 torr), to the actively metabolizing tissues, where the partial pressure 
of oxygen is much lower (typically, 20 torr). Let us consider how the coop- 
erative behavior indicated by the sigmoid curve leads to efficient oxygen 
transport (Figure 7.9). In the lungs, hemoglobin becomes nearly saturated 
with oxygen such that 98% of the oxygen-binding sites are occupied. When 
hemoglobin moves to the tissues and releases O2, the saturation level drops
to 32%. Thus, a total of 98 - 32 = 66% of the potential oxygen-binding sites
contribute to oxygen transport. The cooperative release of oxygen favors a
more-complete unloading of oxygen in the tissues. If myoglobin were
employed for oxygen transport, it would be 98% saturated in the lungs, but
would remain 91% saturated in the tissues, and so only 98 - 91 = 7% of the
sites would contribute to oxygen transport; myoglobin binds oxygen too
tightly to be useful in oxygen transport. The situation might have been
improved without cooperativity by the evolution of a noncooperative oxygen
carrier with an optimized affinity for oxygen. For such a protein, the
most oxygen that could be transported from a region in which pO2 is 100 torr
to one in which it is 20 torr is 63 - 25 = 38%. Thus, the cooperative binding
and release of oxygen by hemoglobin enables it to deliver nearly 10 times as
much oxygen as could be delivered by myoglobin and more than 1.7 times
as much as could be delivered by any noncooperative protein.

Figure 7.9 Cooperativity enhances
oxygen delivery by hemoglobin. Because
of cooperativity between O2 binding sites,
hemoglobin delivers more O2 to tissues than
would myoglobin or any noncooperative
protein, even one with optimal O2 affinity.
[Plot of Y (fractional saturation) versus pO2 (torr), with tissue and lung values marked; curves for myoglobin, hemoglobin, and a hypothetical protein with no cooperativity.]
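The transport percentages quoted above can be approximated with a short calculation. The sketch below assumes that the hemoglobin curve follows the Hill equation with a Hill coefficient of 2.8; that coefficient and all function names are assumptions introduced for illustration, whereas the P50 values (26 torr and 2 torr) and the lung and tissue pressures (100 and 20 torr) come from the text. For a noncooperative carrier (n = 1), the delivered fraction is largest when P50 equals the geometric mean of the two pressures.

```python
import math


def hill_saturation(po2, p50, n=1.0):
    """Fractional saturation Y from the Hill equation; n = 1 is noncooperative."""
    ratio = (po2 / p50) ** n
    return ratio / (1.0 + ratio)


def delivered_fraction(p50, n, lungs=100.0, tissues=20.0):
    """Fraction of sites unloaded between lung and tissue pO2 (both in torr)."""
    return hill_saturation(lungs, p50, n) - hill_saturation(tissues, p50, n)


HB_P50 = 26.0      # torr, from the text
HB_HILL_N = 2.8    # assumed Hill coefficient (not stated in this section)
MB_P50 = 2.0       # torr, from the text

# For n = 1, delivery between two pressures peaks at their geometric mean.
BEST_NONCOOP_P50 = math.sqrt(100.0 * 20.0)

if __name__ == "__main__":
    hb = delivered_fraction(HB_P50, HB_HILL_N)
    mb = delivered_fraction(MB_P50, 1.0)
    best = delivered_fraction(BEST_NONCOOP_P50, 1.0)
    print(f"hemoglobin (Hill model):     {hb:.1%}")
    print(f"myoglobin:                   {mb:.1%}")
    print(f"best noncooperative carrier: {best:.1%}")
    print(f"hemoglobin / myoglobin:      {hb / mb:.1f}")
    print(f"hemoglobin / noncooperative: {hb / best:.1f}")
```

Under these assumptions the model delivers roughly 65% for hemoglobin, 7% for myoglobin, and 38% for the best noncooperative carrier, close to the rounded values in the text.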

Closer examination of oxygen concentrations in tissues at rest and dur- 
ing exercise underscores the effectiveness of hemoglobin as an oxygen car- 
rier (Figure 7.10). Under resting conditions, the oxygen concentration in 
muscle is approximately 40 torr, but during exercise the concentration is 
reduced to 20 torr. In the decrease from 100 torr in the lungs to 40 torr in 
resting muscle, the oxygen saturation of hemoglobin is reduced from 98% to 
77%, and so 98 - 77 = 21% of the oxygen is released over a drop of 60 torr.
In a decrease from 40 torr to 20 torr, the oxygen saturation is reduced from 
77% to 32%, corresponding to an oxygen release of 45% over a drop of 
20 torr. Thus, because the change in oxygen concentration from rest to exercise 
corresponds to the steepest part of the oxygen-binding curve, oxygen is 
effectively delivered to tissues where it is most needed. In Section 7.3, we 
shall examine other properties of hemoglobin that enhance its physiological 
responsiveness. 
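The comparison between rest and exercise is easier to see as oxygen released per torr of pressure drop. The sketch below uses only the saturation values quoted in the text; the table and function names are hypothetical.

```python
# Hemoglobin saturation (percent) at each pO2 (torr), as quoted in the text.
SATURATION_PERCENT = {100: 98, 40: 77, 20: 32}


def release_per_torr(po2_high, po2_low, table=SATURATION_PERCENT):
    """Average percentage of binding sites unloaded per torr of pO2 drop."""
    released = table[po2_high] - table[po2_low]
    return released / (po2_high - po2_low)


if __name__ == "__main__":
    lungs_to_rest = release_per_torr(100, 40)
    rest_to_exercise = release_per_torr(40, 20)
    print(f"lungs -> resting muscle:    {lungs_to_rest:.2f} % per torr")
    print(f"resting -> exercising:      {rest_to_exercise:.2f} % per torr")
    print(f"ratio of the two slopes:    {rest_to_exercise / lungs_to_rest:.1f}")
```

The average slope between 40 and 20 torr is several times the slope between 100 and 40 torr, which is the numerical meaning of "the steepest part of the oxygen-binding curve."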
Oxygen binding markedly changes the quaternary structure
of hemoglobin


The cooperative binding of oxygen by hemoglobin requires that the binding 
of oxygen at one site in the hemoglobin tetramer influence the oxygen- 
binding properties at the other sites. Given the large separation between the 
iron sites, direct interactions are not possible. Thus, indirect mechanisms
for coupling the sites must be at work. These mechanisms are intimately
related to the quaternary structure of hemoglobin. 

Hemoglobin undergoes substantial changes in quaternary structure on 
oxygen binding: the α1β1 and α2β2 dimers rotate approximately 15 degrees
with respect to one another (Figure 7.11). The dimers themselves are relatively
unchanged, although there are localized conformational shifts. Thus,
the interface between the α1β1 and α2β2 dimers is most affected by this
structural transition. In particular, the α1β1 and α2β2 dimers are freer to
move with respect to one another in the oxygenated state than they are in the 
deoxygenated state. 


Figure 7.11 Quaternary structural changes on oxygen binding by hemoglobin. Notice
that, on oxygenation, one αβ dimer shifts with respect to the other by a rotation of 15 degrees.
Panels show deoxyhemoglobin and oxyhemoglobin. [Drawn from 1A3N.pdb and 1LFQ.pdb.]


The quaternary structure observed in the deoxy form of hemoglobin, 
deoxyhemoglobin, is often referred to as the T (for tense) state because it is 
quite constrained by subunit-subunit interactions. The quaternary structure
of the fully oxygenated form, oxyhemoglobin, is referred to as the R (for
relaxed) state. In light of the observation that the R form of hemoglobin is 
less constrained, the tense and relaxed designations seem particularly apt. 
Importantly, in the R state, the oxygen-binding sites are free of strain and 
are capable of binding oxygen with higher affinity than are the sites in the 
T state. By triggering the shift of the hemoglobin tetramer from the T state to 
the R state, the binding of oxygen to one site increases the binding affinity of 
other sites. 


Hemoglobin cooperativity can be potentially explained by 
several models 


Two limiting models have been developed to explain the cooperative 
binding of ligands to a multisubunit assembly such as hemoglobin. In the 
concerted model, also known as the MWC model after Jacques Monod, 
Jeffries Wyman, and Jean-Pierre Changeux, who first proposed it, the over- 
all assembly can exist only in two forms: the T state and the R state. The 
binding of ligands simply shifts the equilibrium between these two states
(Figure 7.12). Thus, as a hemoglobin tetramer binds each oxygen molecule,
the probability that the tetramer is in the R state increases. Deoxyhemoglobin 
tetramers are almost exclusively in the T state. However, the binding of 
oxygen to one site in the molecule shifts the equilibrium toward the R state. 
If a molecule assumes the R quaternary structure, the oxygen affinity of 
its sites increases. Additional oxygen molecules are now more likely to bind 
to the three unoccupied sites. Thus, the binding curve is shallow at low 
oxygen concentrations when all of the molecules are in the T state, becomes 
steeper as the fraction of molecules in the R state increases, and flattens out 
again when all of the sites within the R-state molecules become filled 
(Figure 7.13). These events produce the sigmoid binding curve so impor- 
tant for efficient oxygen transport. 
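One way to make the concerted model concrete is to write the saturation as a weighted mix of a pure T-state curve and a pure R-state curve, with the weights set by the T/R equilibrium at each oxygen level. The sketch below uses the standard MWC expressions. All parameter values (the R-state dissociation constant k_r, the affinity ratio c, and the T/R ratio l0 in the absence of oxygen) are illustrative assumptions chosen only to give a sigmoid curve; they are not measurements from the text, and the function names are hypothetical.

```python
def mwc_state_fractions(po2, k_r, c, l0, n=4):
    """Return (fraction of molecules in R, fraction in T) at a given pO2."""
    alpha = po2 / k_r                        # oxygen level scaled by R-state affinity
    weight_r = (1.0 + alpha) ** n            # statistical weight of all R-state forms
    weight_t = l0 * (1.0 + c * alpha) ** n   # statistical weight of all T-state forms
    total = weight_r + weight_t
    return weight_r / total, weight_t / total


def mwc_saturation(po2, k_r, c, l0, n=4):
    """Fractional saturation Y as a weighted mix of pure R and pure T curves."""
    alpha = po2 / k_r
    y_r = alpha / (1.0 + alpha)              # curve if every molecule stayed in R
    y_t = c * alpha / (1.0 + c * alpha)      # curve if every molecule stayed in T
    frac_r, frac_t = mwc_state_fractions(po2, k_r, c, l0, n)
    return frac_r * y_r + frac_t * y_t


# Illustrative values only: k_r in torr, c = K_R / K_T, l0 = [T] / [R] with no O2.
ILLUSTRATIVE = dict(k_r=1.0, c=0.01, l0=1.0e5, n=4)

if __name__ == "__main__":
    for po2 in (0, 1, 5, 10, 20, 50, 100):
        frac_r, _ = mwc_state_fractions(po2, **ILLUSTRATIVE)
        y = mwc_saturation(po2, **ILLUSTRATIVE)
        print(f"pO2 = {po2:>3} torr   R fraction = {frac_r:.3f}   Y = {y:.3f}")
```

With these assumed parameters, nearly all molecules are in the T state without oxygen and nearly all are in the R state at high pO2. The saturation curve therefore starts along the shallow T-state curve and crosses over to the steep R-state curve, as the text describes.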

In the concerted model, each tetramer can exist in only two states, the T 
state and the R state. In an alternative model, the sequential model, the bind- 
ing of a ligand to one site in an assembly increases the binding affinity of 
neighboring sites without inducing a full conversion from the T into the R 
state (Figure 7.14). 

Is the cooperative binding of oxygen by hemoglobin better described 
by the concerted or the sequential model? Neither model in its pure form 
fully accounts for the behavior of hemoglobin. Instead, a combined model 
is required. Hemoglobin behavior is concerted in that the tetramer with 
three sites occupied by oxygen is almost always in the quaternary struc- 
ture associated with the R state. The remaining open binding site has an 
affinity for oxygen more than 20-fold greater than that of fully deoxygen- 
ated hemoglobin binding its first oxygen. However, the behavior is not 
fully concerted, because hemoglobin with oxygen bound to only one of 
four sites remains primarily in the T-state quaternary structure. Yet, this 
molecule binds oxygen three times as strongly as does fully deoxygenated 
hemoglobin, an observation consistent only with a sequential model. 
These results highlight the fact that the concerted and sequential models 
represent idealized limiting cases, which real systems may approach but 
rarely attain.

Figure 7.12 Concerted model. All
molecules exist either in the T state or in the
R state. At each level of oxygen loading, an
equilibrium exists between the T and R states.
The equilibrium shifts from strongly favoring
the T state with no oxygen bound to strongly
favoring the R state when the molecule is fully
loaded with oxygen. The R state has a greater
affinity for oxygen than does the T state.

Figure 7.13 T-to-R transition. The observed
binding curve for hemoglobin can be seen as
a combination of the binding curves that
would be observed if all molecules remained
in the T state or if all of the molecules were in
the R state. The sigmoidal curve is observed
because molecules convert from the T state
into the R state as oxygen molecules bind.

Figure 7.14 Sequential model. The binding of a ligand changes the conformation of the subunit
to which it binds. This conformational change induces changes in neighboring subunits that increase
their affinity for the ligand.

Figure 7.15 Conformational changes in
hemoglobin. The movement of the iron ion
on oxygenation brings the iron-associated
histidine residue toward the porphyrin ring.
The associated movement of the histidine-
containing α helix alters the interface between
the αβ dimers, instigating other structural
changes. For comparison, the
deoxyhemoglobin structure is shown in gray
behind the oxyhemoglobin structure in color.

Figure 7.16 Oxygen binding by pure
hemoglobin compared with hemoglobin
in red blood cells. Pure hemoglobin binds
oxygen more tightly than does hemoglobin in
red blood cells. This difference is due to the
presence of 2,3-bisphosphoglycerate (2,3-BPG)
in red blood cells.


Structural changes at the heme groups are transmitted to 
the α1β1-α2β2 interface


We now examine how oxygen binding at one site is able to shift the equi- 
librium between the T and R states of the entire hemoglobin tetramer. As 
in myoglobin, oxygen binding causes each iron atom in hemoglobin to move 
from outside the plane of the porphyrin into the plane. When the iron atom 
moves, the histidine residue bound in the fifth coordination site moves 
with it. This histidine residue is part of an α helix, which also moves (Figure
7.15). The carboxyl terminal end of this α helix lies in the interface
between the two αβ dimers. The change in position of the carboxyl terminal
end of the helix favors the T-to-R transition. Consequently, the structural 
transition at the iron ion in one subunit is directly transmitted to the other 
subunits. The rearrangement of the dimer interface provides a pathway for 
communication between subunits, enabling the cooperative binding of 
oxygen. 


2,3-Bisphosphoglycerate in red cells is crucial in determining the 
oxygen affinity of hemoglobin 


For hemoglobin to function efficiently, the T state must remain stable until 
the binding of sufficient oxygen has converted it into the R state. In fact, 
however, the T state of hemoglobin is highly unstable, pushing the equilib- 
rium so far toward the R state that little oxygen would be released in physi- 
ological conditions. Thus, an additional mechanism is needed to properly 
stabilize the T state. This mechanism was discovered by comparing the 
oxygen-binding properties of hemoglobin in red blood cells with fully puri- 
fied hemoglobin (Figure 7.16). Pure hemoglobin binds oxygen much more 
tightly than does hemoglobin in red blood cells. This dramatic difference is 
due to the presence within these cells of 2,3-bisphosphoglycerate (2,3-BPG; 
also known as 2,3-diphosphoglycerate or 2,3-DPG).

[Structure of 2,3-bisphosphoglycerate (2,3-BPG)]


This highly anionic compound is present in red blood cells at approximately 
the same concentration as that of hemoglobin (~2 mM). Without 2,3-BPG, 
hemoglobin would be an extremely inefficient oxygen transporter, releasing 
only 8% of its cargo in the tissues. 

How does 2,3-BPG lower the oxygen affinity of hemoglobin so signifi- 
cantly? Examination of the crystal structure of deoxyhemoglobin in the 
presence of 2,3-BPG reveals that a single molecule of 2,3-BPG binds in the 
center of the tetramer, in a pocket present only in the T form (Figure 7.17). 
On T-to-R transition, this pocket collapses and 2,3-BPG is released. Thus, 
in order for the structural transition from T to R to take place, the bonds 
between hemoglobin and 2,3-BPG must be broken. In the presence of 2,3- 
BPG, more oxygen-binding sites within the hemoglobin tetramer must be 
occupied in order to induce the T-to-R transition, and so hemoglobin 
remains in the lower-affinity T state until higher oxygen concentrations are 
reached. This mechanism of regulation is remarkable because 2,3-BPG 
does not in any way resemble oxygen, the molecule on which hemoglobin 
carries out its primary function. 2,3-BPG is referred to as an allosteric
effector (from the Greek allos, “other,” and stereos, “structure”). Regulation
by a molecule structurally unrelated to oxygen is possible because the allosteric
effector binds to a site that is completely distinct from that for oxygen.
We will encounter allosteric effects again when we consider enzyme regulation
in Chapter 10.

The binding of 2,3-BPG to hemoglobin has other crucial physiological
consequences. The globin gene expressed by human fetuses differs
from that expressed by adults; fetal hemoglobin tetramers include two α
chains and two γ chains. The γ chain, a result of a gene duplication, is 72%
identical in amino acid sequence with the β chain. One noteworthy change
is the substitution of a serine residue for His 143 in the β chain, part of the
2,3-BPG-binding site. This change removes two positive charges from
the 2,3-BPG-binding site (one from each chain) and reduces the affinity of
2,3-BPG for fetal hemoglobin. Consequently, the oxygen-binding affinity
of fetal hemoglobin is higher than that of maternal (adult) hemoglobin
(Figure 7.18). This difference in oxygen affinity allows oxygen to be effectively
transferred from maternal to fetal red blood cells. We have here an
example in which gene duplication and specialization produced a ready
solution to a biological challenge: in this case, the transport of oxygen from
mother to fetus.


Figure 7.18 Oxygen affinity of fetal red
blood cells. Fetal red blood cells have a
higher oxygen affinity than do maternal red
blood cells because fetal hemoglobin does
not bind 2,3-BPG as well as maternal
hemoglobin does.


Carbon monoxide can disrupt oxygen transport by hemoglobin


Carbon monoxide (CO) is a colorless, odorless gas that binds to
hemoglobin at the same site as oxygen, forming a complex termed
carboxyhemoglobin. Formation of carboxyhemoglobin exerts devastating
consequences on normal oxygen transport in two ways. First, carbon monoxide
binds to hemoglobin about 200-fold more tightly than does oxygen.
Even at low partial pressures in the blood, carbon monoxide will displace
oxygen from hemoglobin, preventing its delivery. Second, carbon monoxide
bound to one site in hemoglobin will shift the oxygen saturation curve of the
remaining sites to the left, forcing the tetramer into the R state. This results
in an increased affinity for oxygen, preventing its dissociation at tissues.
Exposure to carbon monoxide (from gas appliances and running automobiles,
for example) can cause carbon monoxide poisoning, in which
patients exhibit nausea, vomiting, lethargy, weakness, and disorientation.
Treatment with oxygen at elevated pressure
is referred to as hyperbaric oxygen therapy. With this therapy, the partial
pressure of oxygen in the blood becomes sufficiently high to increase substantially
the rate of carbon monoxide displacement from hemoglobin.
Exposure to high concentrations of carbon monoxide, however, can be rapidly
fatal: in the United States, about 2500 people die each year from carbon
monoxide poisoning, about 500 of them from accidental exposures and
nearly 2000 by suicide.
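The 200-fold preference for carbon monoxide explains why even small amounts are dangerous and why raising the oxygen pressure helps. The sketch below assumes simple competition for fully occupied heme sites, so that the ratio of CO-bound to O2-bound sites is 200 x pCO / pO2. It ignores cooperativity and empty sites. The function name and the example pressures are hypothetical; only the factor of 200 comes from the text.

```python
CO_AFFINITY_RATIO = 200.0  # CO binds about 200-fold more tightly than O2 (from the text)


def carboxyhemoglobin_fraction(pco_torr, po2_torr, ratio=CO_AFFINITY_RATIO):
    """Fraction of occupied heme sites holding CO, assuming simple competition."""
    if pco_torr < 0 or po2_torr <= 0:
        raise ValueError("pCO must be >= 0 and pO2 must be > 0")
    co_weight = ratio * pco_torr
    return co_weight / (co_weight + po2_torr)


if __name__ == "__main__":
    # Hypothetical exposure: the same pCO at normal and at raised oxygen pressure.
    for po2 in (100.0, 700.0, 2000.0):
        frac = carboxyhemoglobin_fraction(0.5, po2)
        print(f"pCO = 0.5 torr, pO2 = {po2:>6.0f} torr  ->  {frac:.1%} of sites hold CO")
```

In this simplified model, a CO pressure only 1/200 that of oxygen is enough to occupy half of the sites, and raising pO2 lowers the equilibrium share held by CO.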


7.3 Hydrogen Ions and Carbon Dioxide Promote the
Release of Oxygen: The Bohr Effect 


We have seen how cooperative release of oxygen from hemoglobin helps 
deliver oxygen to tissues where it is most needed, as revealed by their low 
oxygen partial pressures. This ability is enhanced by the facility of hemo- 
globin to respond to other cues in its physiological environment that signal 
the need for oxygen. Rapidly metabolizing tissues, such as contracting 
muscle, generate large amounts of hydrogen ions and carbon dioxide 
(Chapter 16). To release oxygen where the need is greatest, hemoglobin has 
evolved to respond to higher levels of these substances. Like 2,3-BPG, 
hydrogen ions and carbon dioxide are allosteric effectors of hemoglobin that 
bind to sites on the molecule that are distinct from the oxygen-binding sites. 
The regulation of oxygen binding by hydrogen ions and carbon dioxide is 
called the Bohr effect after Christian Bohr, who described this phenomenon 
in 1904. 

The oxygen affinity of hemoglobin decreases as pH decreases from a 
value of 7.4 (Figure 7.19). Consequently, as hemoglobin moves into a region 
of lower pH, its tendency to release oxygen increases. For example, transport 
from the lungs, with pH 7.4 and an oxygen partial pressure of 100 torr, to 
active muscle, with a pH of 7.2 and an oxygen partial pressure of 20 torr, 
results in a release of oxygen amounting to 77% of total carrying capacity. 
Only 66% of the oxygen would be released in the absence of any change in 
pH. Structural and chemical studies have revealed much about the chemical 
basis of the Bohr effect. At least two sets of chemical groups are important 
for sensing changes in pH: the α-amino groups at the amino termini of the
α chain and the side chains of histidines β146 and α122, all of which have
pKa values near pH 7. Consider histidine β146, the residue at the C terminus
of the β chain. In deoxyhemoglobin, the terminal carboxylate group
of β146 forms a salt bridge with a lysine residue in the α subunit of the
other αβ dimer. This interaction locks the side chain of histidine β146 in a
position from which it can participate in a salt bridge with negatively
charged aspartate β94 in the same chain, provided that the imidazole group
of the histidine residue is protonated (Figure 7.20).

Figure 7.19 Effect of pH on the oxygen
affinity of hemoglobin. Lowering the pH from
7.4 (red curve) to 7.2 (blue curve) results in the
release of O2 from oxyhemoglobin.
[Plot of Y (fractional saturation) versus pO2 (torr), with tissue and lung values marked; curves for pH 7.4 and pH 7.2.]

Figure 7.20 Chemical basis of the Bohr
effect. In deoxyhemoglobin, three amino acid
residues form two salt bridges that stabilize the
T quaternary structure. The formation of one of
the salt bridges depends on the presence of an
added proton on histidine β146. The proximity
of the negative charge on aspartate β94 in
deoxyhemoglobin favors protonation of this
histidine. Notice that the salt bridge between
histidine β146 and aspartate β94 is stabilized
by a hydrogen bond (green dashed line).
[Labeled in the figure: α2 Lys 40, β1 His 146 (C terminus, added proton), and β1 Asp 94.]

The other groups also participate in salt bridges in the T state. The for- 
mation of these salt bridges stabilizes the T state, leading to a greater tendency 
for oxygen to be released. For example, at high pH, the side chain of histidine
β146 is not protonated and the salt bridge does not form. As the pH drops,
however, the side chain of histidine β146 becomes protonated, the salt
bridge with aspartate β94 forms, and the T state is stabilized.
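The pH dependence of this protonation can be estimated with the Henderson-Hasselbalch relation. The sketch below assumes a single fixed pKa of exactly 7.0 for the histidine side chain. The text says only that the relevant pKa values are "near pH 7," and in the protein the pKa itself shifts with the local environment, so the numbers are illustrative. The function and constant names are hypothetical.

```python
def fraction_protonated(ph, pka):
    """Fraction of an ionizable group carrying its proton at a given pH."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


ASSUMED_HIS_PKA = 7.0  # assumption: the text says only "near pH 7"

if __name__ == "__main__":
    for ph in (7.4, 7.2):
        frac = fraction_protonated(ph, ASSUMED_HIS_PKA)
        print(f"pH {ph}: {frac:.0%} of histidine side chains protonated")
```

Under this assumption, dropping the pH from 7.4 to 7.2 raises the protonated fraction from roughly 28% to roughly 39%. A group with a pKa near physiological pH is therefore well placed to sense such a small change.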

Carbon dioxide, a neutral species, passes through the red-blood-cell 
membrane into the cell. This transport is also facilitated by membrane 
transporters including proteins associated with Rh blood types. Carbon 
dioxide stimulates oxygen release by two mechanisms. First, the presence of 
high concentrations of carbon dioxide leads to a drop in pH within the red 
blood cell (Figure 7.21). Carbon dioxide reacts with water to form carbonic
acid, H2CO3. This reaction is accelerated by carbonic anhydrase, an enzyme
abundant in red blood cells that will be considered extensively in Chapter 9.
H2CO3 is a moderately strong acid with a pKa of 3.5. Thus, once formed,
carbonic acid dissociates to form bicarbonate ion, HCO3-, and H+, resulting
in a drop in pH that stabilizes the T state by the mechanism discussed
previously. 
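The statement that carbonic acid dissociates once formed follows from how far its pKa lies below the pH of the cell. As a rough sketch, the equilibrium ratio of HCO3- to H2CO3 is 10^(pH - pKa). The pKa of 3.5 is from the text; the function names are hypothetical, and the calculation treats the acid dissociation in isolation from the hydration step.

```python
CARBONIC_ACID_PKA = 3.5  # from the text


def bicarbonate_to_acid_ratio(ph, pka=CARBONIC_ACID_PKA):
    """Equilibrium ratio [HCO3-] / [H2CO3] at a given pH."""
    return 10.0 ** (ph - pka)


def fraction_dissociated(ph, pka=CARBONIC_ACID_PKA):
    """Fraction of carbonic acid present as bicarbonate at a given pH."""
    ratio = bicarbonate_to_acid_ratio(ph, pka)
    return ratio / (1.0 + ratio)


if __name__ == "__main__":
    for ph in (7.4, 7.2):
        print(f"pH {ph}: [HCO3-]/[H2CO3] = {bicarbonate_to_acid_ratio(ph):.0f}")
```

At pH 7.2 the ratio is in the thousands, so essentially every molecule of carbonic acid that forms releases its proton.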


Figure 7.21 Carbon dioxide and pH. Carbon dioxide in the tissues diffuses into red blood cells.
Inside a red blood cell, carbon dioxide reacts with water to form carbonic acid, in a reaction
catalyzed by the enzyme carbonic anhydrase. Carbonic acid dissociates to form HCO3- and H+,
resulting in a drop in pH inside the red cell. [Panels: body tissue and blood capillary.]

Figure 7.22 Carbon dioxide effects. The
presence of carbon dioxide decreases the
affinity of hemoglobin for oxygen even
beyond the effect due to a decrease in pH,
resulting in even more efficient oxygen
transport from the lungs to the tissues.

Figure 7.23 Transport of CO2 from
tissues to lungs. Most carbon dioxide is
transported to the lungs in the form of
HCO3- produced in red blood cells and then
released into the blood plasma. A lesser
amount is transported by hemoglobin in the
form of an attached carbamate.


In the second mechanism, a direct chemical interaction between carbon 
dioxide and hemoglobin stimulates oxygen release. The effect of carbon 
dioxide on oxygen affinity can be seen by comparing oxygen-binding curves 
in the absence and in the presence of carbon dioxide at a constant pH 
(Figure 7.22). In the presence of carbon dioxide at a partial pressure of 
40 torr at pH 7.2, the amount of oxygen released approaches 90% of the 
maximum carrying capacity. Carbon dioxide stabilizes deoxyhemoglobin 
by reacting with the terminal amino groups to form carbamate groups, 
which are negatively charged, in contrast with the neutral or positive charges 
on the free amino groups.
The amino termini lie at the interface between the αβ dimers, and these
negatively charged carbamate groups participate in salt-bridge interactions 
that stabilize the T state, favoring the release of oxygen. 

Carbamate formation also provides a mechanism for carbon dioxide 
transport from tissues to the lungs, but it accounts for only about 14% of the 
total carbon dioxide transport. Most carbon dioxide released from red blood 
cells is transported to the lungs in the form of HCO3- produced from the
hydration of carbon dioxide inside the cell (Figure 7.23). Much of the
HCO3- that is formed leaves the cell through a specific membrane-transport
protein that exchanges HCO3- from one side of the membrane for Cl-
from the other side. Thus, the serum concentration of HCO3- increases. By
this means, a large amount of carbon dioxide is transported from tissues
to the lungs in the form of HCO3-. In the lungs, this process is
reversed: HCO3- is converted back into carbon dioxide and exhaled. Thus,
carbon dioxide generated by active tissues contributes to a decrease in red- 
blood-cell pH and, hence, to oxygen release and is converted into a form 
that can be transported in the serum and released in the lungs.


7.4 Mutations in Genes Encoding Hemoglobin
Subunits Can Result in Disease


In modern times, particularly after the sequencing of the human genome, to 
think of genetically encoded variations in protein sequence as a factor in 
specific diseases is routine. The notion that diseases might be caused by
molecular defects was proposed by Linus Pauling in 1949 (4 years before
Watson and Crick’s proposal of the DNA double helix) to explain the blood 
disease sickle-cell anemia. The name of the disorder comes from the abnor- 
mal sickle shape of red blood cells deprived of oxygen observed in people 
suffering from this disease (Figure 7.24). Pauling proposed that sickle-cell 
anemia might be caused by a specific variation in the amino acid sequence 
of one hemoglobin chain. Today, we know that this bold hypothesis is 
correct. In fact, approximately 7% of the world’s population are carriers of 
some disorder of hemoglobin caused by a variation in its amino acid 
sequence. In concluding this chapter, we will focus on the two most impor- 
tant of these disorders, sickle-cell anemia and thalassemia. 


Sickle-cell anemia results from the aggregation of mutated 
deoxyhemoglobin molecules 


People with sickled red blood cells experience a number of dangerous
symptoms. Examination of the contents of these red cells reveals 
that the hemoglobin molecules have formed large fibrous aggregates 
(Figure 7.25). These fibers extend across the red blood cells, distorting them 
so that they clog small capillaries and impair blood flow. The results may be 
painful swelling of the extremities and a higher risk of stroke or bacterial 
infection (due to poor circulation). The sickled red cells also do not remain 
in circulation as long as normal cells do, leading to anemia. 

What is the molecular defect associated with sickle-cell anemia? Using 
newly developed chromatographic techniques, Vernon Ingram demonstrated
in 1956 that a single amino acid substitution in the β chain of hemoglobin
is responsible: namely, the replacement of a glutamate residue with
valine in position 6. The mutated form is referred to as hemoglobin S
(HbS). In people with sickle-cell anemia, both alleles of the hemoglobin
β-chain gene (HbB) are mutated. The HbS substitution substantially
decreases the solubility of deoxyhemoglobin, although it does not markedly 
alter the properties of oxyhemoglobin. 

Examination of the structure of hemoglobin 5 reveals that the new valine 
residue lies on the surface of the T-state molecule (Figure 7.26). This new 
hydrophobic patch interacts with another hydrophobic patch formed by 
Phe 85 and Val 88 of the B chain of a neighboring molecule to initiate the 
aggregation process. More-detailed analysis reveals that a single hemoglo- 
bin 5 fiber is formed from 14 chains of multiple interlinked hemoglobin 
molecules. Why do these aggregates not form when hemoglobin 5 is 
oxygenated? Oxyhemoglobin 5 is in the R state, and residues Phe 85 and Val 
88 on the B chain are largely buried inside the hemoglobin assembly. 


Figure 7.24 Sickled red blood cells. A 
micrograph showing a sickled red blood cell 
adjacent to normally shaped red blood cells. 
[Eye of Science/Photo Researchers.] 


Figure 7.25 Sickle-cell hemoglobin 
fibers. An electron micrograph depicting a 
ruptured sickled red blood cell with fibers of 
sickle-cell hemoglobin emerging. [Courtesy of 
Robert Josephs and Thomas E. Wellems, 
University of Chicago.] 


Figure 7.26 Deoxygenated
hemoglobin S. The interaction between Val
6 (blue) on a β chain of one hemoglobin
molecule and a hydrophobic patch formed by
Phe 85 and Val 88 (gray) on a β chain of
another deoxygenated hemoglobin molecule
leads to hemoglobin aggregation. The
exposed Val 6 residues of other β chains
participate in other such interactions in
hemoglobin S fibers. [Drawn from 2HBS.pdb.]


Figure 7.27 Sickle-cell trait and malaria.
A significant correlation is observed between 
regions with a high frequency of the HbS 
allele and regions with a high prevalence of 
malaria. 

Without a partner with which to interact, the surface Val residue in position
6 is benign.

Approximately 1 in 100 West Africans suffer from sickle-cell anemia. 
Given the often devastating consequences of the disease, why is the HbS 
mutation so prevalent in Africa and in some other regions? Recall that both 
copies of the HbB gene are mutated in people with sickle-cell anemia. 
People with one copy of the HbB gene and one copy of HbS are relatively
unaffected. They are said to have sickle-cell trait because they can pass
the HbS gene to their offspring. However, people with sickle-cell trait are
resistant to malaria, a disease carried by a parasite, Plasmodium falciparum,
that lives within red blood cells at one stage in its life cycle. The dire effect
of malaria on health and reproductive likelihood in regions where malaria
has been historically endemic has favored people with sickle-cell trait,
increasing the prevalence of the HbS allele (Figure 7.27).


Thalassemia is caused by an imbalanced production of 
hemoglobin chains 


Sickle-cell anemia is caused by the substitution of a single specific
amino acid in one hemoglobin chain. Thalassemia, the other prevalent
inherited disorder of hemoglobin, is caused by the loss or substantial
reduction of a single hemoglobin chain. The result is low levels of functional
hemoglobin and a decreased production of red blood cells, which may
lead to anemia, fatigue, pale skin, and spleen and liver malfunction.
Thalassemia is a set of related diseases. In α-thalassemia, the α chain of
hemoglobin is not produced in sufficient quantity. Consequently, hemoglobin
tetramers form that contain only the β chain. These tetramers, referred
to as hemoglobin H (HbH), bind oxygen with high affinity and no
cooperativity. Thus, oxygen release in the tissues is poor. In β-thalassemia, the β
chain of hemoglobin is not produced in sufficient quantity. In the
absence of β chains, the α chains form insoluble aggregates that precipitate
inside immature red blood cells. The loss of red blood cells results in
anemia. The most severe form of β-thalassemia is called thalassemia major
or Cooley anemia.

Both α- and β-thalassemia are associated with many different genetic
variations and display a wide range of clinical severity. The most severe
forms of α-thalassemia are usually fatal shortly before or just after birth.
However, these forms are relatively rare. An examination of the repertoire
of hemoglobin genes in the human genome provides one explanation.
Normally, humans have not two but four alleles for the α chain, arranged
such that the two genes are located adjacent to each other on one end of each
chromosome 16. Thus, the complete loss of α-chain expression requires
the disruption of four alleles. β-Thalassemia is more common because
humans normally have only two alleles for the β chain, one on each copy of
chromosome 11.


The accumulation of free alpha-hemoglobin chains is prevented 


The presence of four genes expressing the α chain, compared with two for
the β chain, suggests that the α chain would be produced in excess (given the
overly simple assumption that protein expression from each gene is
comparable). If this is correct, why doesn't the excess α chain precipitate? One
mechanism for maintaining α chains in solution was revealed by the discovery
of an 11-kd protein in red blood cells called α-hemoglobin stabilizing
protein (AHSP). This protein forms a soluble complex specifically with
newly synthesized α-chain monomers. The crystal structure of a complex
between AHSP and α-hemoglobin reveals that AHSP binds to the same
face of α-hemoglobin as does β-hemoglobin (Figure 7.28). AHSP binds the
α chain in both the deoxygenated and oxygenated forms. In the complex
with oxygen bound, the distal histidine, rather than the proximal histidine,
binds the iron atom.

AHSP serves to bind and ensure the proper folding of α-hemoglobin
as it is produced. As β-hemoglobin is expressed, it displaces AHSP
because the α-hemoglobin–β-hemoglobin dimer is more stable than the
α-hemoglobin–AHSP complex. Thus, AHSP prevents the misfolding,
accumulation, and precipitation of free α-hemoglobin. Studies are under
way to determine if mutations in the gene encoding AHSP play a role in
modulating the severity of β-thalassemia.



Additional globins are encoded in the human genome 


In addition to the gene for myoglobin, the two genes for α-hemoglobin,
and the one for β-hemoglobin, the human haploid genome contains
other globin genes. We have already encountered fetal hemoglobin,
which contains the γ chain in place of the β chain. Several other genes
encode other hemoglobin subunits that are expressed during development,
including the δ chain, the ε chain, and the ζ chain.


Figure 7.28 Stabilizing free
α-hemoglobin. The structure of a complex
between AHSP and α-hemoglobin is shown.
In this complex, the iron atom is bound to
oxygen and to the distal histidine. Notice that
AHSP binds to the same surface of
α-hemoglobin as does β-hemoglobin. [Drawn
from 1Y01.pdb.]

Examination of the human genome sequence has revealed two additional
globins. Both of these proteins are monomeric proteins, more similar
to myoglobin than to hemoglobin. The first, neuroglobin, is expressed
primarily in the brain and at especially high levels in the retina. Neuroglobin
may play a role in protecting neural tissues from hypoxia (insufficient
oxygen). The second, cytoglobin, is expressed more widely throughout the
body. Structural and spectroscopic studies reveal that, in both neuroglobin
and cytoglobin, the proximal and the distal histidines are coordinated to the
iron atom in the deoxy form. Oxygen binding displaces the distal histidine.
Future studies should more completely elucidate the functions of these
members of the globin family.


Summary 


7.1 Myoglobin and Hemoglobin Bind Oxygen at Iron Atoms in Heme 
Myoglobin is a largely α-helical protein that binds the prosthetic group
heme. Heme consists of protoporphyrin, an organic component with
four linked pyrrole rings, and a central iron ion in the Fe²⁺ state. The
iron ion is coordinated to the side chain of a histidine residue in
myoglobin, referred to as the proximal histidine. One of the oxygen atoms
in O₂ binds to an open coordination site on the iron. Because of partial
electron transfer from the iron to the oxygen, the iron ion moves into the
plane of the porphyrin on oxygen binding. Hemoglobin consists of four
polypeptide chains, two α chains and two β chains. Each of these chains
is similar in amino acid sequence to myoglobin and folds into a very
similar three-dimensional structure. The hemoglobin tetramer is best
described as a pair of αβ dimers.


7.2 Hemoglobin Binds Oxygen Cooperatively 

The oxygen-binding curve for myoglobin reveals a simple equilibrium 
binding process. Myoglobin is half-saturated with oxygen at an oxygen 
concentration of approximately 2 torr. The oxygen-binding curve for 
hemoglobin has an "S"-like (sigmoid) shape, indicating that the oxygen
binding is cooperative. The binding of oxygen at one site within the 
hemoglobin tetramer affects the affinities of the other sites for oxygen. 
Cooperative oxygen binding and release significantly increase the effi- 
ciency of oxygen transport. The amount of the potential oxygen-carrying 
capacity utilized in transporting oxygen from the lungs (with a partial 
pressure of oxygen of 100 torr) to tissues (with a partial pressure of 
oxygen of 20 torr) is 66% compared with 7% if myoglobin had been used 
as the oxygen carrier. 

The quaternary structure of hemoglobin changes on oxygen binding.
The structure of deoxyhemoglobin is referred to as the T state. The
structure of oxyhemoglobin is referred to as the R state. The two αβ
dimers rotate by approximately 15 degrees with respect to one another
in the transition from the T to the R state. Cooperative binding can be
potentially explained by concerted and sequential models. In the concerted
model, each hemoglobin adopts either the T state or the R state;
the equilibrium between these two states is determined by the number
of occupied oxygen-binding sites. Sequential models allow intermediate
structures. Structural changes at the iron sites in response to oxygen
binding are transmitted to the interface between αβ dimers, influencing
the T-to-R equilibrium.


Red blood cells contain 2,3-bisphosphoglycerate in concentrations
approximately equal to that for hemoglobin. 2,3-BPG binds tightly to
the T state but not to the R state, stabilizing the T state and lowering the
oxygen affinity of hemoglobin. Fetal hemoglobin binds oxygen more
tightly than does adult hemoglobin owing to weaker 2,3-BPG binding.
This difference allows oxygen transfer from maternal to fetal blood.


7.3 Hydrogen Ions and Carbon Dioxide Promote the Release of Oxygen
The oxygen-binding properties of hemoglobin are markedly affected by
pH and by the presence of carbon dioxide, a phenomenon known as the
Bohr effect. Increasing the concentration of hydrogen ions (that is,
decreasing pH) decreases the oxygen affinity of hemoglobin, owing to
the protonation of the amino termini and certain histidine residues. The
protonated residues help stabilize the T state. Increasing concentrations
of carbon dioxide decrease the oxygen affinity of hemoglobin by two
mechanisms. First, carbon dioxide is converted into carbonic acid,
which lowers the oxygen affinity of hemoglobin by decreasing the pH
inside the red blood cell. Second, carbon dioxide adds to the amino
termini of hemoglobin to form carbamates. These negatively charged
groups stabilize deoxyhemoglobin through ionic interactions. Because
hydrogen ions and carbon dioxide are produced in rapidly metabolizing
tissues, the Bohr effect helps deliver oxygen to sites where it is most
needed.


7.4 Mutations in Genes Encoding Hemoglobin Subunits Can Result
in Disease

Sickle-cell disease is caused by a mutation in the β chain of hemoglobin
that substitutes a valine residue for a glutamate residue. As a result, a
hydrophobic patch forms on the surface of deoxy (T-state) hemoglobin
that leads to the formation of fibrous polymers. These fibers distort red
blood cells into sickle shapes. Sickle-cell disease was the first disease to
be associated with a change in the amino acid sequence of a protein.
Thalassemias are diseases caused by the reduced production of either
the α or the β chain, yielding hemoglobin tetramers that contain only
one type of hemoglobin chain. Such hemoglobin molecules are characterized
by poor oxygen release and low solubility, leading to the destruction
of red blood cells in the course of their development. Red-blood-cell
precursors normally produce a slight excess of hemoglobin α chains
compared with β chains. To prevent the aggregation of the excess α
chains, they produce α-hemoglobin stabilizing protein, which binds
specifically to newly synthesized α-chain monomers to form a soluble
complex.

APPENDIX: Binding Models Can Be Formulated in Quantitative Terms:
The Hill Plot and the Concerted Model


The Hill Plot 
A useful way of quantitatively describing cooperative 
binding processes such as that for hemoglobin was 
developed by Archibald Hill in 1913. Consider the 
hypothetical equilibrium for a protein X binding a 
ligand S: 

$$X + nS \rightleftharpoons X(S)_n \qquad (1)$$

where n is a variable that can take on both integral and
fractional values. The parameter n is a measure of the
degree of cooperativity in ligand binding, although it
does not have deeper significance because equation 1
does not represent an actual physical process. For X =
hemoglobin and S = O₂, the maximum value of n is 4.
The value of n = 4 would apply if oxygen binding by
hemoglobin were completely cooperative. If oxygen
binding were completely noncooperative, then n would
be 1.
Analysis of the equilibrium in equation 1 yields the
following expression for the fractional saturation, Y:

$$Y = \frac{[S]^n}{[S]^n + [S_{50}]^n}$$

where $[S_{50}]$ is the concentration at which X is
half-saturated. For hemoglobin, this expression becomes

$$Y = \frac{(\mathrm{pO_2})^n}{(\mathrm{pO_2})^n + P_{50}^{\,n}}$$

where $P_{50}$ is the partial pressure of oxygen at which
hemoglobin is half-saturated. This expression can be
rearranged to:

$$\frac{Y}{1 - Y} = \frac{(\mathrm{pO_2})^n}{P_{50}^{\,n}}$$

and so

$$\log\left(\frac{Y}{1 - Y}\right) = \log\left(\frac{(\mathrm{pO_2})^n}{P_{50}^{\,n}}\right) = n\log(\mathrm{pO_2}) - n\log(P_{50})$$

This equation predicts that a plot of log[Y/(1 − Y)] versus
log(pO₂), called a Hill plot, should be linear with a
slope of n.
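
As a minimal sketch of how a Hill plot is used, the Python below generates saturation values from the Hill equation and then recovers n as the least-squares slope of log[Y/(1 − Y)] against log(pO₂). The function names, the sample pressures, and the values n = 2.8 and P50 = 26 torr are illustrative assumptions, not measurements taken from this appendix.

```python
import math

def hill_saturation(po2, p50, n):
    """Fractional saturation Y predicted by the Hill equation."""
    return po2**n / (po2**n + p50**n)

def hill_slope(po2_values, y_values):
    """Least-squares slope of log(Y/(1-Y)) versus log(pO2)."""
    xs = [math.log10(p) for p in po2_values]
    ys = [math.log10(y / (1.0 - y)) for y in y_values]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Hypothetical oxygen partial pressures (torr) and assumed Hill parameters.
pressures = [5.0, 10.0, 20.0, 40.0, 80.0]
saturations = [hill_saturation(p, 26.0, 2.8) for p in pressures]
slope = hill_slope(pressures, saturations)  # recovers the assumed n
```

Because the synthetic data obey the Hill equation exactly, the fitted slope equals the assumed n; with real data the plot is only approximately linear, and the slope is usually read from the central region.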

Hill plots for myoglobin and hemoglobin are shown
in Figure 7.29. For myoglobin, the Hill plot is linear
with a slope of 1. For hemoglobin, the Hill plot is not
completely linear, because the equilibrium on which
the Hill plot is based is not entirely correct. However,
the plot is approximately linear in the center with a
slope of 2.8. The slope, often referred to as the Hill
coefficient, is a measure of the cooperativity of oxygen
binding. The utility of the Hill plot is that it provides a
simply derived quantitative assessment of the degree of
cooperativity in binding. With the use of the Hill equation
and the derived Hill coefficient, a binding curve
that closely resembles that for hemoglobin is produced
(Figure 7.30).
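
The Hill equation also gives a quick numerical check of the transport efficiencies quoted in the chapter summary (66% for hemoglobin versus 7% for myoglobin between 100 torr and 20 torr). The sketch below is illustrative only: the function names are hypothetical, and the hemoglobin half-saturation pressure of 26 torr is an assumption not stated in this appendix. The Hill coefficient of 2.8 and the myoglobin half-saturation pressure of 2 torr come from the text.

```python
def hill_saturation(po2, p50, n):
    """Fractional saturation Y predicted by the Hill equation."""
    return po2**n / (po2**n + p50**n)

def fraction_delivered(p_lungs, p_tissues, p50, n):
    """Share of total carrying capacity released between lungs and tissues."""
    return hill_saturation(p_lungs, p50, n) - hill_saturation(p_tissues, p50, n)

# Hemoglobin: n = 2.8 (from the text), P50 = 26 torr (assumed).
hemoglobin = fraction_delivered(100.0, 20.0, p50=26.0, n=2.8)

# Myoglobin: noncooperative (n = 1), half-saturated at about 2 torr.
myoglobin = fraction_delivered(100.0, 20.0, p50=2.0, n=1.0)
```

Under these assumptions the two values come out close to the 66% and 7% figures, which shows how much of the difference is attributable to cooperativity and the higher half-saturation pressure.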


The Concerted Model 

The concerted model can be formulated in quantitative 
terms. Only four parameters are required: (1) the num- 
ber of binding sites (assumed to be equivalent) in the 
protein, (2) the ratio of the concentrations of the T and 
R states in the absence of bound ligands, (3) the affinity 
of sites in proteins in the R state for ligand binding, and 
(4) a measure of how much more tightly subunits in 
proteins in the R state bind ligands compared with sub- 
units in the T state. The number of binding sites, n, is 
usually known from other information. For hemoglobin, 

Figure 7.30 Oxygen-binding curves for several Hill coefficients.
The curve labeled n = 2.8 closely resembles the curve for hemoglobin.


n = 4. The ratio of the concentrations of the T and R
states with no ligands bound is a constant:

$$L = [T_0]/[R_0]$$

where the subscript refers to the number of ligands
bound (in this case, zero). The affinity of subunits in
the R state is defined by the dissociation constant for a
ligand binding to a single site in the R state, $K_R$.
Similarly, the dissociation constant for a ligand binding
to a single site in the T state is $K_T$. We can define the
ratio of these two dissociation constants as

$$c = K_R/K_T$$

This is the measure of how much more tightly a subunit
of a protein in the R state binds a ligand compared
with a subunit of a protein in the T state. Note that
c < 1 because $K_R$ and $K_T$ are dissociation constants
and tight binding corresponds to a small dissociation
constant.

What is the ratio of the concentration of T-state
proteins with one ligand bound to the concentration of
R-state proteins with one ligand bound? The dissociation
constant for a single site in the R state is $K_R$. For a
protein with n sites, there are n possible sites for the first
ligand to bind. This statistical factor favors ligand binding
compared with a single-site protein. Thus,
$[R_1] = n[R_0][S]/K_R$. Similarly, $[T_1] = n[T_0][S]/K_T$. Thus,

$$\frac{[T_1]}{[R_1]} = \frac{n[T_0][S]/K_T}{n[R_0][S]/K_R} = \frac{[T_0]}{[R_0](K_T/K_R)} = cL$$

Similar analysis reveals that, for states with i ligands
bound, $[T_i]/[R_i] = c^iL$. In other words, the ratio of the
concentrations of the T state to the R state is reduced by
a factor of c for each ligand that binds.

Let us define a convenient scale for the concentration
of S:

$$\alpha = [S]/K_R$$

This definition is useful because it is the ratio of the
concentration of S to the dissociation constant that
determines the extent of binding. Using this definition,
we see that

$$[R_1] = \frac{n[R_0][S]}{K_R} = n[R_0]\alpha$$

Similarly,

$$[T_1] = \frac{n[T_0][S]}{K_T} = ncL[R_0]\alpha$$

What is the concentration of R-state molecules
with two ligands bound? Again, we must consider the
statistical factor, that is, the number of ways in which
a second ligand can bind to a molecule with one site
occupied. The number of ways is n − 1. However,
because which ligand is the "first" and which is the
"second" does not matter, we must divide by a factor of
2. Thus,

$$[R_2] = \frac{n-1}{2}\,\frac{[R_1][S]}{K_R} = \frac{n-1}{2}[R_1]\alpha = \frac{n(n-1)}{2}[R_0]\alpha^2$$

We can derive similar equations for the case with i
ligands bound and for T states.

We can now calculate the fractional saturation, Y.
This is the total concentration of sites with ligands
bound divided by the total concentration of potential
binding sites. Thus,

$$Y = \frac{([R_1] + [T_1]) + 2([R_2] + [T_2]) + \cdots + n([R_n] + [T_n])}{n([R_0] + [T_0] + [R_1] + [T_1] + \cdots + [R_n] + [T_n])}$$

Substituting into this equation, we find

$$Y = \frac{n[R_0]\alpha + nc[T_0]\alpha + 2\left(\frac{n(n-1)}{2}\right)[R_0]\alpha^2 + 2\left(\frac{n(n-1)}{2}\right)c^2[T_0]\alpha^2 + \cdots + n[R_0]\alpha^n + nc^n[T_0]\alpha^n}{n\left([R_0] + [T_0] + n[R_0]\alpha + nc[T_0]\alpha + \cdots + [R_0]\alpha^n + c^n[T_0]\alpha^n\right)}$$

Substituting $[T_0] = L[R_0]$ and summing these series
yields

$$Y = \frac{\alpha(1 + \alpha)^{n-1} + Lc\alpha(1 + c\alpha)^{n-1}}{(1 + \alpha)^n + L(1 + c\alpha)^n}$$

We can now use this equation to fit the observed data
for hemoglobin by varying the parameters L, c, and $K_R$
(with n = 4). An excellent fit is obtained with L = 9000,
c = 0.014, and $K_R$ = 2.5 torr (Figure 7.31).
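
The closed-form expression for Y is straightforward to evaluate numerically. The following Python is a minimal sketch, not the fitting procedure behind Figure 7.31: the function name and argument names are hypothetical, the defaults are the fitted values quoted above, and the scaled concentration is taken as the oxygen partial pressure divided by the R-state dissociation constant.

```python
def concerted_saturation(po2, n=4, L=9000.0, c=0.014, k_r=2.5):
    """Fractional saturation Y for the concerted model.

    po2 and k_r are in torr; L is the T/R ratio with no ligand bound;
    c is the ratio of R-state to T-state dissociation constants.
    """
    a = po2 / k_r  # scaled ligand concentration (alpha in the text)
    numerator = a * (1 + a) ** (n - 1) + L * c * a * (1 + c * a) ** (n - 1)
    denominator = (1 + a) ** n + L * (1 + c * a) ** n
    return numerator / denominator

# Saturation at a few oxygen partial pressures (torr).
curve = {p: concerted_saturation(p) for p in (0.0, 20.0, 26.0, 100.0)}
```

Setting L = 0 removes the T state entirely, and the expression collapses to the simple hyperbolic form α/(1 + α), a useful sanity check on any implementation.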

In addition to the fractional saturation, the concentrations
of the species T₀, T₁, T₂, R₂, R₃, and R₄ are
shown. The concentrations of all other species are very
low. The availability of these concentrations is a major
difference between the analysis using the Hill equation and
this analysis of the concerted model. The Hill equation
gives only the fractional saturation, whereas the
analysis of the concerted model yields concentrations
for all species. In the present case, this analysis yields
the expected ratio of T-state proteins to R-state proteins
at each stage of binding. This ratio changes from
9000 to 126 to 1.76 to 0.025 to 0.00035 with zero, one,
two, three, and four oxygen molecules bound. This
ratio provides a quantitative measure of the switching
of the population of hemoglobin molecules from the
T state to the R state.
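
The sequence of T-to-R ratios follows directly from the relation that the ratio falls by a factor of c for each ligand bound. The sketch below reproduces those ratios and, as an assumption-labeled extension, computes the fraction of molecules in each species by generalizing the statistical factors n and n(n − 1)/2 derived above to binomial coefficients. All function names are hypothetical, and `math.comb` requires Python 3.8 or later.

```python
from math import comb

def t_to_r_ratios(n=4, L=9000.0, c=0.014):
    """[T_i]/[R_i] = c**i * L for i = 0..n ligands bound."""
    return [L * c**i for i in range(n + 1)]

def species_fractions(po2, n=4, L=9000.0, c=0.014, k_r=2.5):
    """Fractions of molecules in T_0..T_n and R_0..R_n at a given pO2.

    Assumes [R_i] = C(n, i) [R_0] a**i and [T_i] = C(n, i) L [R_0] (c a)**i,
    with a = pO2 / K_R; [R_0] cancels when the fractions are normalized.
    """
    a = po2 / k_r
    r = [comb(n, i) * a**i for i in range(n + 1)]
    t = [L * comb(n, i) * (c * a) ** i for i in range(n + 1)]
    total = sum(r) + sum(t)
    return [x / total for x in t], [x / total for x in r]

ratios = t_to_r_ratios()
t_frac, r_frac = species_fractions(26.0)
```

With no oxygen present almost every molecule is T₀, whereas at high oxygen pressure R₄ dominates, which is the population switch the ratios describe.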

The sequential model can also be formulated in 
quantitative terms. However, the formulation entails 
many more parameters, and many different sets of 
parameters often yield similar fits to the experimental 
data. 


Figure 7.31 Modeling oxygen binding with the concerted model. The fractional saturation
(Y) as a function of pO₂: L = 9000, c = 0.014, and K_R = 2.5 torr. The fraction of molecules in the
T state with zero, one, and two oxygen molecules bound (T₀, T₁, and T₂) and the fraction of
molecules in the R state with two, three, and four oxygen molecules bound (R₂, R₃, and R₄) are
shown. The fractions of molecules in other forms are too low to be shown.

Problems


1. Screening the biosphere. The first protein to
have its structure determined was myoglobin from sperm
whale. Propose an explanation for the observation that
sperm whale muscle is a rich source of this protein.


2. Hemoglobin content. The average volume of a red blood
cell is 87 μm³. The mean concentration of hemoglobin in
red cells is 0.34 g ml⁻¹.


(a) What is the weight of the hemoglobin contained in an 
average red cell? 


(b) How many hemoglobin molecules are there in an aver- 
age red cell? Assume that the molecular weight of the 
human hemoglobin tetramer is 65 kd. 


(c) Could the hemoglobin concentration in red cells be
much higher than the observed value? (Hint: Suppose that
a red cell contained a crystalline array of hemoglobin
molecules in a cubic lattice with 65 Å sides.)


3. Iron content. How much iron is there in the hemoglobin
of a 70-kg adult? Assume that the blood volume is 70 ml
kg⁻¹ of body weight and that the hemoglobin content of
blood is 0.16 g ml⁻¹.


4. Oxygenating myoglobin. The myoglobin content of some
human muscles is about 8 g kg⁻¹. In sperm whale, the
myoglobin content of muscle is about 80 g kg⁻¹.


(a) How much O₂ is bound to myoglobin in human muscle
and in sperm whale muscle? Assume that the myoglobin is
saturated with O₂, and that the molecular weights of human
and sperm whale myoglobin are the same.


(b) The amount of oxygen dissolved in tissue water (in
equilibrium with venous blood) at 37°C is about 3.5 × 10⁻⁵ M.
What is the ratio of oxygen bound to myoglobin to that
directly dissolved in the water of sperm whale muscle?


5. Tuning proton affinity. The pKa of an acid depends partly
on its environment. Predict the effect of each of the following
environmental changes on the pKa of a glutamic acid
side chain.


(a) A lysine side chain is brought into proximity. 


(b) The terminal carboxyl group of the protein is brought 
into proximity. 


(c) The glutamic acid side chain is shifted from the outside 
of the protein to a nonpolar site inside. 


6. Saving grace. Hemoglobin A inhibits the formation of
the long fibers of hemoglobin S and the subsequent sickling
of the red cell on deoxygenation. Why does hemoglobin A
have this effect?


7. Carrying a load. Suppose that you are climbing a high 
mountain and the oxygen partial pressure in the air is 
reduced to 75 torr. Estimate the percentage of the oxygen- 
carrying capacity that will be utilized, assuming that the pH 
of both tissues and lungs is 7.4 and that the oxygen concen- 
tration in the tissues is 20 torr. 


8. High-altitude adaptation. After spending a day or more
at high altitude (with an oxygen partial pressure of 75 torr),
the concentration of 2,3-bisphosphoglycerate (2,3-BPG) in
red blood cells increases. What effect would an increased
concentration of 2,3-BPG have on the oxygen-binding
curve for hemoglobin? Why would this adaptation be
beneficial for functioning well at high altitude?


9. I'll take the lobster. Arthropods such as lobsters have
oxygen carriers quite different from hemoglobin. The
oxygen-binding sites do not contain heme but, instead, are
based on two copper(I) ions. The structural changes that
accompany oxygen binding are shown below. How might
these changes be used to facilitate cooperative oxygen
binding?

[Structural diagram of the two-copper oxygen-binding site, without and with bound oxygen, not reproduced.]

10. A disconnect. With the use of site-directed mutagenesis, 
hemoglobin has been prepared in which the proximal 
histidine residues in both the a and the B subunits have 
been replaced by glycine. The imidazole ring from the his- 
tidine residue can be replaced by adding free imidazole in 
solution. Would you expect this modified hemoglobin to 
show cooperativity in oxygen binding? Why or why not? 


[Structure of imidazole not reproduced.]


11. Successful substitution. Blood cells from some birds do
not contain 2,3-bisphosphoglycerate but, instead, contain
one of the compounds in parts a through d, which plays an
analogous functional role. Which compound do you
think is most likely to play this role? Explain briefly.

(a) Choline
(b) Spermine
(c) Inositol pentaphosphate
(d) Indole

[Structures of compounds a through d not reproduced.]


12. Theoretical curves. (a) Using the Hill equation, plot an
oxygen-binding curve for a hypothetical two-subunit hemoglobin
with n = 1.8 and P50 = 10 torr. (b) Repeat, using the
concerted model with n = 2, L = 1000, c = 0.01, and K_R =
1 torr.


13. Parasitic effect. When P. falciparum lives inside red 
blood cells, the metabolism of the parasite tends to release 
acid. What effect is the presence of acid likely to have on the 
oxygen-carrying capacity of the red blood cells? On the 
likelihood that these cells sickle? 


Data Interpretation Problems 


14. Primitive oxygen binding. Lampreys are primitive 
organisms whose ancestors diverged from the ancestors of 
fish and mammals approximately 400 million years ago. 
Lamprey blood contains a hemoglobin related to mamma- 
lian hemoglobin. However, lamprey hemoglobin is mono- 
meric in the oxygenated state. Oxygen-binding data for 
lamprey hemoglobin are as follows: 


pO₂     Y        pO₂     Y       pO₂     Y
0.1     .0060    2.0     .112    50.0    .889
0.2     .0124    3.0     .170    60.0    .905
0.3     .0190    4.0     .227    70.0    .917
0.4     .0245    5.0     .283    80.0    .927
0.5     .0307    7.5     .420    90.0    .935
0.6     .0380    10.0    .500    100     .941
0.7     .0430    15.0    .640    150     .960
0.8     .0481    20.0    .721    200     .970
0.9     .0530    30.0    .812
1.0     .0591    40.0    .865


(a) Plot these data to produce an oxygen-binding curve. 
At what oxygen partial pressure is this hemoglobin half- 
saturated? On the basis of the appearance of this curve, does 
oxygen binding seem to be cooperative? 


(b) Construct a Hill plot using these data. Does the Hill 
plot show any evidence for cooperativity? What is the Hill 
coefficient? 


(c) Further studies revealed that lamprey hemoglobin 
forms oligomers, primarily dimers, in the deoxygenated 
state. Propose a model to explain any observed cooperativ- 
ity in oxygen binding by lamprey hemoglobin. 


15. Leaning to the left or to the right. The illustration below
shows several oxygen-dissociation curves. Assume that curve
3 corresponds to hemoglobin with physiological concentrations
of CO₂ and 2,3-BPG at pH 7. Which curves represent
each of the following perturbations?

[Figure: several oxygen-dissociation curves, saturation (Y) plotted against pO₂; not reproduced.]

(a) Decrease in CO₂

(b) Increase in 2,3-BPG

(c) Increase in pH

(d) Loss of quaternary structure


Chapter Integration Problem 


16. Location is everything. 2,3-Bisphosphoglycerate lies in a
central cavity within the hemoglobin tetramer, stabilizing 
the T state. What would be the effect of mutations that 
placed the BPG-binding site on the surface of hemoglobin? 


CHAPTER 8 


Enzymes: Basic Concepts 
and Kinetics 
Enzymes, the catalysts of biological systems, are remarkable molecular
devices that determine the patterns of chemical transformations. They
also mediate the transformation of one form of energy into another. About
a quarter of the genes in the human genome encode enzymes, a testament to
their importance to life. The most striking characteristics of enzymes are
their catalytic power and specificity. Catalysis takes place at a particular site
on the enzyme called the active site.

Nearly all known enzymes are proteins. However, proteins do not have an
absolute monopoly on catalysis; the discovery of catalytically active RNA
molecules provides compelling evidence that RNA was a biocatalyst early in
evolution. Proteins as a class of macromolecules are highly effective catalysts for an
enormous diversity of chemical reactions because of their capacity to specifically
bind a very wide range of molecules. By utilizing the full repertoire of
intermolecular forces, enzymes bring substrates together in an optimal orientation, the
prelude to making and breaking chemical bonds. They catalyze reactions by
stabilizing transition states, the highest-energy species in reaction pathways. By
selectively stabilizing a transition state, an enzyme determines which one of
several potential chemical reactions actually takes place.


OUTLINE

8.1 Enzymes Are Powerful and Highly Specific Catalysts
8.2 Free Energy Is a Useful Thermodynamic Function for Understanding Enzymes
8.3 Enzymes Accelerate Reactions by Facilitating the Formation of the Transition State
8.4 The Michaelis-Menten Model Accounts for the Kinetic Properties of Many Enzymes
8.5 Enzymes Can Be Inhibited by Specific Molecules
8.6 Enzymes Can Be Studied One Molecule at a Time

Table 8.1 Rate enhancement by selected enzymes

                              Nonenzymatic       Uncatalyzed       Catalyzed         Rate enhancement
Enzyme                        half-life          rate (k_un s⁻¹)   rate (k_cat s⁻¹)  (k_cat s⁻¹/k_un s⁻¹)
OMP decarboxylase             78,000,000 years   2.8 × 10⁻¹⁶       39                1.4 × 10¹⁷
Staphylococcal nuclease       130,000 years      1.7 × 10⁻¹³       95                5.6 × 10¹⁴
AMP nucleosidase              69,000 years       1.0 × 10⁻¹¹       60                6.0 × 10¹²
Carboxypeptidase A            7.3 years          3.0 × 10⁻⁹        578               1.9 × 10¹¹
Ketosteroid isomerase         7 weeks            1.7 × 10⁻⁷        66,000            3.9 × 10¹¹
Triose phosphate isomerase    1.9 days           4.3 × 10⁻⁶        4,300             1.0 × 10⁹
Chorismate mutase             7.4 hours          2.6 × 10⁻⁵        50                1.9 × 10⁶
Carbonic anhydrase            5 seconds          1.3 × 10⁻¹        1 × 10⁶           7.7 × 10⁶

Abbreviations: OMP, orotidine monophosphate; AMP, adenosine monophosphate.
Source: After A. Radzicka and R. Wolfenden. Science 267:90–93, 1995.

8.1 Enzymes Are Powerful and Highly Specific Catalysts 


Enzymes accelerate reactions by factors of as much as a million or more
(Table 8.1). Indeed, most reactions in biological systems do not take place at
perceptible rates in the absence of enzymes. Even a reaction as simple as the
hydration of carbon dioxide is catalyzed by an enzyme, namely, carbonic
anhydrase (Section 9.2). The transfer of CO₂ from the tissues to the blood and
then to the air in the alveoli of the lungs would be less complete in the absence
of this enzyme. In fact, carbonic anhydrase is one of the fastest enzymes
known. Each enzyme molecule can hydrate 10⁶ molecules of CO₂ per second.
This catalyzed reaction is 10⁷ times as fast as the uncatalyzed one. We will
consider the mechanism of carbonic anhydrase catalysis in Chapter 9.
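
The columns of Table 8.1 are related by simple arithmetic, sketched below in Python. The rate enhancement is the catalyzed rate constant divided by the uncatalyzed one, and, assuming first-order kinetics for the uncatalyzed reaction, the nonenzymatic half-life is ln 2 divided by the uncatalyzed rate constant. The function names are hypothetical, and the rate constants used (2.8 × 10⁻¹⁶ s⁻¹ and 39 s⁻¹ for OMP decarboxylase; 1.3 × 10⁻¹ s⁻¹ and 1 × 10⁶ s⁻¹ for carbonic anhydrase) are the values consistent with the tabulated half-lives and enhancements.

```python
import math

SECONDS_PER_YEAR = 365.25 * 24 * 3600

def rate_enhancement(k_cat, k_un):
    """Ratio of catalyzed to uncatalyzed rate constants (both in s^-1)."""
    return k_cat / k_un

def uncatalyzed_half_life_s(k_un):
    """Half-life in seconds of a first-order reaction: ln 2 / k."""
    return math.log(2) / k_un

# OMP decarboxylase
omp_enhancement = rate_enhancement(39.0, 2.8e-16)
omp_half_life_years = uncatalyzed_half_life_s(2.8e-16) / SECONDS_PER_YEAR

# Carbonic anhydrase
ca_enhancement = rate_enhancement(1e6, 1.3e-1)
ca_half_life_s = uncatalyzed_half_life_s(1.3e-1)
```

The results land near 78 million years and about 5 seconds for the two half-lives, and near 1.4 × 10¹⁷ and 7.7 × 10⁶ for the enhancements, matching the corresponding rows of the table.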

Enzymes are highly specific both in the reactions that they catalyze and 
in their choice of reactants, which are called substrates. An enzyme usually 
catalyzes a single chemical reaction or a set of closely related reactions. Let 
us consider proteolytic enzymes as an example. In vivo, these enzymes cata- 
lyze proteolysis, the hydrolysis of a peptide bond. 


R₁–CO–NH–R₂  +  H₂O  ⇌  R₁–COO⁻  +  ⁺H₃N–R₂

Peptide                  Carboxyl       Amino
                         component      component

Most proteolytic enzymes also catalyze a different but related reaction in 
vitro—namely, the hydrolysis of an ester bond. Such reactions are more eas- 
ily monitored than is proteolysis and are useful in experimental investiga- 
tions of these enzymes. 


R₁–CO–O–R₂  +  H₂O  ⇌  R₁–COO⁻  +  HO–R₂  +  H⁺

Ester                   Acid        Alcohol


Proteolytic enzymes differ markedly in their degree of substrate specificity.
Papain, which is found in papaya plants, is quite undiscriminating: it will
cleave any peptide bond with little regard to the identity of the adjacent side
chains. This lack of specificity accounts for its use in meat-tenderizing
sauces. The digestive enzyme trypsin, on the other hand, is quite specific and
catalyzes the splitting of peptide bonds only on the carboxyl side of lysine
and arginine residues (Figure 8.1A). Thrombin, an enzyme that participates
in blood clotting, is even more specific than trypsin. It catalyzes the
hydrolysis of Arg–Gly bonds in particular peptide sequences only (Figure 8.1B).
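
The cleavage rule stated for trypsin can be expressed as a short procedure. The Python below is a simplified sketch under stated assumptions: the function name and the example sequence are hypothetical, residues are given in one-letter code, and the rule applied is only the one described above (cleavage on the carboxyl side of every lysine, K, or arginine, R), ignoring any sequence-context exceptions.

```python
def trypsin_digest(sequence):
    """Split a one-letter peptide sequence after every Lys (K) or Arg (R)."""
    fragments = []
    current = ""
    for residue in sequence:
        current += residue
        if residue in ("K", "R"):
            # Cleave on the carboxyl side of this residue.
            fragments.append(current)
            current = ""
    if current:
        # Residues after the last cleavage site form the C-terminal fragment.
        fragments.append(current)
    return fragments

peptide = "AGKLMRTQKW"  # hypothetical example sequence
fragments = trypsin_digest(peptide)
```

A rule for papain would split after nearly every residue, and one for thrombin would need to match a longer sequence pattern around an Arg–Gly bond, which is one way to picture the range of specificities described here.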

DNA polymerase I, a template-directed enzyme (Section 28.3), is 
another highly specific catalyst. To a DNA strand that is being synthesized, 
it adds nucleotides in a sequence determined by the sequence of nucleotides 
in another DNA strand that serves as a template. DNA polymerase I is 
remarkably precise in carrying out the instructions given by the template. It 
inserts the wrong nucleotide into a new DNA strand less than one in a thousand 
times. The specificity of an enzyme is due to the precise interaction of the 
substrate with the enzyme. This precision is a result of the intricate three- 
dimensional structure of the enzyme protein. 


Many enzymes require cofactors for activity 


The catalytic activity of many enzymes depends on the presence of small 
molecules termed cofactors, although the precise role varies with the cofac- 
tor and the enzyme. Generally, these cofactors are able to execute chemical 
reactions that cannot be performed by the standard set of twenty amino 
acids. An enzyme without its cofactor is referred to as an apoenzyme; the 
complete, catalytically active enzyme is called a holoenzyme. 


Apoenzyme + cofactor = holoenzyme 


Cofactors can be subdivided into two groups: (1) metals and (2) small 
organic molecules called coenzymes (Table 8.2). Often derived from vita- 
mins, coenzymes can be either tightly or loosely bound to the enzyme. 
Tightly bound coenzymes are called prosthetic groups. Loosely associated 
coenzymes are more like cosubstrates because, like substrates and prod- 
ucts, they bind to the enzyme and are released from it. The use of the same 
coenzyme by a variety of enzymes sets coenzymes apart from normal sub- 
strates, however, as does their source in vitamins (Section 15.4). Enzymes 
that use the same coenzyme usually perform catalysis by similar mecha- 
nisms. In Chapter 9, we will examine the importance of 
metals to enzyme activity and, throughout the book, we 
will see how coenzymes and their enzyme partners operate 
in their biochemical context. 


Figure 8.1 Enzyme specificity. (A) Trypsin 
cleaves on the carboxyl side of arginine and 
lysine residues, whereas (B) thrombin cleaves 
Arg-Gly bonds in particular sequences only. 


Table 8.2 Enzyme cofactors 

Cofactor                              Enzyme 

Coenzyme 
Thiamine pyrophosphate                Pyruvate dehydrogenase 
Flavin adenine nucleotide             Monoamine oxidase 
Nicotinamide adenine dinucleotide     Lactate dehydrogenase 
Pyridoxal phosphate                   Glycogen phosphorylase 
Coenzyme A (CoA)                      Acetyl CoA carboxylase 
Biotin                                Pyruvate carboxylase 
5'-Deoxyadenosyl cobalamin            Methylmalonyl mutase 
Tetrahydrofolate                      Thymidylate synthase 

Metal 
Zn²⁺                                  Carbonic anhydrase 
Zn²⁺                                  Carboxypeptidase 
Mg²⁺                                  EcoRV 
Mg²⁺                                  Hexokinase 
Ni²⁺                                  Urease 
Mo                                    Nitrate reductase 
Se                                    Glutathione peroxidase 
Mn                                    Superoxide dismutase 
K⁺                                    Propionyl CoA carboxylase 


Enzymes can transform energy from one form into 
another 


A key activity in all living systems is the ability to convert 
one form of energy into another. For example, in photosynthesis, 
light energy is converted into chemical-bond 
energy. In cellular respiration, which takes place in mitochondria, 
the free energy contained in small molecules 
derived from food is converted first into the free energy of 
an ion gradient and then into a different currency, the 
free energy of adenosine triphosphate. Given their centrality 
to life, it should come as no surprise that enzymes play 
vital roles in energy transformation. As we will see, 
enzymes play fundamental roles in photosynthesis and 
cellular respiration. Other enzymes can then use the 
chemical-bond energy of ATP in diverse ways. For 
instance, the enzyme myosin converts the energy of 
ATP into the mechanical energy of contracting muscles 
(Chapter 35). Pumps in the membranes of cells and organelles, which can 
be thought of as enzymes that move substrates rather than chemically alter 
them, use the energy of ATP to transport molecules and ions across the 
membrane (Chapter 13). The chemical and electrical gradients resulting 
from the unequal distribution of these molecules and ions are themselves 
forms of energy that can be used for a variety of purposes, such as sending 
nerve impulses. 

The molecular mechanisms of these energy-transducing enzymes are 
being unraveled. We will see in subsequent chapters how unidirectional 
cycles of discrete steps—binding, chemical transformation, and release— 
lead to the conversion of one form of energy into another. 


8.2 Free Energy Is a Useful Thermodynamic Function for 
Understanding Enzymes 


Enzymes speed up the rate of chemical reactions, but the properties of the 
reaction—whether it can take place at all and the degree to which the 
enzyme accelerates the reaction—depend on energy differences between 
reactants and products. Free energy (G), which was touched on in Chapter 1, 
is a thermodynamic property that is a measure of useful energy, or the 
energy that is capable of doing work. To understand how enzymes operate, 
we need to consider only two thermodynamic properties of the reaction: 
(1) the free-energy difference (ΔG) between the products and reactants and 
(2) the energy required to initiate the conversion of reactants into products. 
The former determines whether the reaction will take place spontaneously, 
whereas the latter determines the rate of the reaction. Enzymes affect only 
the latter. Let us review some of the principles of thermodynamics as they 
apply to enzymes. 


The free-energy change provides information about the spontaneity 
but not the rate of a reaction 


As discussed in Chapter 1, the free-energy change of a reaction (ΔG) tells us 
if the reaction can take place spontaneously: 


1. A reaction can take place spontaneously only if ΔG is negative. Such reactions 
are said to be exergonic. 


2. A system is at equilibrium and no net change can take place if ΔG is 
zero. 


3. A reaction cannot take place spontaneously if ΔG is positive. An input 
of free energy is required to drive such a reaction. These reactions are 
termed endergonic. 


4. The ΔG of a reaction depends only on the free energy of the products 
(the final state) minus the free energy of the reactants (the initial state). The 
ΔG of a reaction is independent of the path (or molecular mechanism) of the 
transformation. The mechanism of a reaction has no effect on ΔG. For 
example, the ΔG for the oxidation of glucose to CO₂ and H₂O is the same 
whether it takes place by combustion or by a series of enzyme-catalyzed 
steps in a cell. 


5. The ΔG provides no information about the rate of a reaction. A negative 
ΔG indicates that a reaction can take place spontaneously, but it does not 
signify whether it will proceed at a perceptible rate. As will be discussed 
shortly (Section 8.3), the rate of a reaction depends on the free energy of 
activation (ΔG‡), which is largely unrelated to the ΔG of the reaction. 


The standard free-energy change of a reaction is related 
to the equilibrium constant 


As for any reaction, we need to be able to determine ΔG for an enzyme-catalyzed 
reaction to know whether the reaction is spontaneous or an input 
of energy is required. To determine this important thermodynamic param- 
eter, we need to take into account the nature of both the reactants and the 
products as well as their concentrations. 

Consider the reaction 

$$A + B \rightleftharpoons C + D$$

The ΔG of this reaction is given by 

$$\Delta G = \Delta G^{\circ} + RT \ln \frac{[C][D]}{[A][B]} \qquad (1)$$

in which ΔG° is the standard free-energy change, R is the gas constant, T is 
the absolute temperature, and [A], [B], [C], and [D] are the molar concentrations 
(more precisely, the activities) of the reactants. ΔG° is the free-energy 
change for this reaction under standard conditions, that is, when 
each of the reactants A, B, C, and D is present at a concentration of 1.0 M 
(for a gas, the standard state is usually chosen to be 1 atmosphere). Thus, 
the ΔG of a reaction depends on the nature of the reactants (expressed in the 
ΔG° term of equation 1) and on their concentrations (expressed in the logarithmic 
term of equation 1). 
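
As a rough illustration of equation 1, the following Python sketch evaluates ΔG from a standard free-energy change and a set of concentrations. The function name `delta_g`, its parameters, and the example value of −10 kJ mol⁻¹ are assumptions made for illustration only; the gas constant is the value used later in this chapter.

```python
import math

R_KJ = 8.315e-3  # gas constant in kJ mol^-1 K^-1 (value used in this chapter)

def delta_g(dg_standard, products, reactants, temperature=298.0):
    """Equation 1: dG = dG_standard + RT ln([products] / [reactants]).

    dg_standard is in kJ mol^-1; concentrations are molar and are used
    here in place of activities.
    """
    ratio = math.prod(products) / math.prod(reactants)
    return dg_standard + R_KJ * temperature * math.log(ratio)

# Hypothetical reaction with a standard free-energy change of -10 kJ mol^-1.
# With every species at 1.0 M the logarithmic term is zero.
print(delta_g(-10.0, products=[1.0, 1.0], reactants=[1.0, 1.0]))

# Lowering the product concentrations makes dG more negative.
print(delta_g(-10.0, products=[0.01, 0.01], reactants=[1.0, 1.0]))
```

The sketch separates the two contributions named in the text: the nature of the reactants enters through `dg_standard`, and their concentrations enter through the logarithmic term.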

A convention has been adopted to simplify free-energy calculations for 
biochemical reactions. The standard state is defined as having a pH of 7. 
Consequently, when H⁺ is a reactant, its activity has the value 1 (corresponding 
to a pH of 7) in equations 1 and 3 (below). The activity of water 
also is taken to be 1 in these equations. The standard free-energy change at 
pH 7, denoted by the symbol ΔG°′, will be used throughout this book. The 
kilojoule (abbreviated kJ) and the kilocalorie (kcal) will be used as the units 
of energy. One kilojoule is equivalent to 0.239 kilocalorie. 

A simple way to determine ΔG°′ is to measure the concentrations of 
reactants and products when the reaction has reached equilibrium. At 
equilibrium, there is no net change in reactants and products; in essence, 
the reaction has stopped and ΔG = 0. At equilibrium, equation 1 then 
becomes 

$$0 = \Delta G^{\circ\prime} + RT \ln \frac{[C][D]}{[A][B]} \qquad (2)$$

and so 

$$\Delta G^{\circ\prime} = -RT \ln \frac{[C][D]}{[A][B]} \qquad (3)$$

The equilibrium constant under standard conditions, K′eq, is defined as 

$$K'_{eq} = \frac{[C][D]}{[A][B]} \qquad (4)$$

Substituting equation 4 into equation 3 gives 

$$\Delta G^{\circ\prime} = -RT \ln K'_{eq} \qquad (5)$$

which can be rearranged to give 

$$K'_{eq} = e^{-\Delta G^{\circ\prime}/RT} \qquad (6)$$
Units of energy 


A kilojoule (kJ) is equal to 1000 J. 

A joule (J) is the amount of energy needed 
to apply a 1-newton force over a distance 
of 1 meter. 

A kilocalorie (kcal) is equal to 1000 cal. 

A calorie (cal) is equivalent to the amount 
of heat required to raise the temperature of 
1 gram of water from 14.5°C to 15.5°C. 

1 kJ = 0.239 kcal. 


Table 8.3 Relation between ΔG°′ and K′eq (at 25°C) 

                       ΔG°′ 
K′eq       kJ mol⁻¹     kcal mol⁻¹ 

10⁻⁵        28.53         6.82 
10⁻⁴        22.84         5.46 
10⁻³        17.11         4.09 
10⁻²        11.42         2.73 
10⁻¹         5.69         1.36 
1            0.00         0.00 
10          −5.69        −1.36 
10²        −11.42        −2.73 
10³        −17.11        −4.09 
10⁴        −22.84        −5.46 
10⁵        −28.53        −6.82 


[Margin figure: structures of dihydroxyacetone phosphate (DHAP) and 
glyceraldehyde 3-phosphate (GAP).] 


Substituting R = 8.315 × 10⁻³ kJ mol⁻¹ deg⁻¹ and T = 298 K (corresponding 
to 25°C) gives 

$$K'_{eq} = e^{-\Delta G^{\circ\prime}/2.47} \qquad (7)$$

where ΔG°′ is here expressed in kilojoules per mole because of the choice 
of the units for R in equation 7. Thus, the standard free energy and the 
equilibrium constant of a reaction are related by a simple expression. For 
example, an equilibrium constant of 10 gives a standard free-energy 
change of −5.69 kJ mol⁻¹ (−1.36 kcal mol⁻¹) at 25°C (Table 8.3). Note 
that, for each 10-fold change in the equilibrium constant, the ΔG°′ changes 
by 5.69 kJ mol⁻¹ (1.36 kcal mol⁻¹). 
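
The relation between the equilibrium constant and the standard free-energy change can be checked numerically. The Python sketch below is only an illustration: the function names are hypothetical, and it uses the values of R and T quoted in the text together with the conversion 1 kJ = 0.239 kcal. Because the tabulated values are rounded, the computed numbers may differ from Table 8.3 by a few hundredths of a kilojoule per mole.

```python
import math

R_KJ = 8.315e-3      # kJ mol^-1 K^-1, as quoted in the text
T = 298.0            # K, corresponding to 25 degrees C
KJ_TO_KCAL = 0.239   # 1 kJ = 0.239 kcal

def standard_free_energy(k_eq):
    """Equation 5: standard free-energy change (kJ mol^-1) from K'eq."""
    return -R_KJ * T * math.log(k_eq)

def equilibrium_constant(dg_standard):
    """Equation 6: K'eq from the standard free-energy change (kJ mol^-1)."""
    return math.exp(-dg_standard / (R_KJ * T))

# Tabulate the same range of equilibrium constants as Table 8.3.
for exponent in range(-5, 6):
    dg = standard_free_energy(10.0 ** exponent)
    print(f"K'eq = 1e{exponent:+d}   {dg:8.2f} kJ/mol   {dg * KJ_TO_KCAL:7.2f} kcal/mol")
```

The two functions are inverses of each other, which is the content of the rearrangement from equation 5 to equation 6.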

As an example, let us calculate ΔG°′ and ΔG for the isomerization of 
dihydroxyacetone phosphate (DHAP) to glyceraldehyde 3-phosphate (GAP). 
This reaction takes place in glycolysis (Chapter 16). At equilibrium, the 
ratio of GAP to DHAP is 0.0475 at 25°C (298 K) and pH 7. Hence, K′eq = 
0.0475. The standard free-energy change for this reaction is then calculated 
from equation 5: 

$$\begin{aligned}
\Delta G^{\circ\prime} &= -RT \ln K'_{eq} \\
&= -8.315 \times 10^{-3} \times 298 \times \ln(0.0475) \\
&= +7.53\ \text{kJ mol}^{-1}\ (+1.80\ \text{kcal mol}^{-1})
\end{aligned}$$

Under these conditions, the reaction is endergonic. DHAP will not spontaneously 
convert into GAP. 

Now let us calculate ΔG for this reaction when the initial concentration 
of DHAP is 2 × 10⁻⁴ M and the initial concentration of GAP is 
3 × 10⁻⁶ M. Substituting these values into equation 1 gives 

$$\begin{aligned}
\Delta G &= 7.53\ \text{kJ mol}^{-1} + RT \ln \frac{3 \times 10^{-6}\ \text{M}}{2 \times 10^{-4}\ \text{M}} \\
&= 7.53\ \text{kJ mol}^{-1} - 10.42\ \text{kJ mol}^{-1} \\
&= -2.89\ \text{kJ mol}^{-1}\ (-0.69\ \text{kcal mol}^{-1})
\end{aligned}$$

This negative value for the ΔG indicates that the isomerization of DHAP 
to GAP is exergonic and can take place spontaneously when these species 
are present at the preceding concentrations. Note that ΔG for this reaction 
is negative, although ΔG°′ is positive. It is important to stress that whether 
the ΔG for a reaction is larger, smaller, or the same as ΔG°′ depends on the 
concentrations of the reactants and products. The criterion of spontaneity for 
a reaction is ΔG, not ΔG°′. This point is important because reactions 
that are not spontaneous based on ΔG°′ can be made spontaneous by 
adjusting the concentrations of reactants and products. This principle is 
the basis of the coupling of reactions to form metabolic pathways 
(Chapter 15). 
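
The DHAP-to-GAP calculation can be repeated in a few lines of Python. This is a minimal sketch using the equilibrium ratio and concentrations given in the text; the variable names are illustrative assumptions. Because the sketch carries more digits than the worked example, its results may differ from the printed values in the second decimal place.

```python
import math

R_KJ = 8.315e-3   # kJ mol^-1 K^-1
T = 298.0         # K

k_eq = 0.0475     # [GAP]/[DHAP] at equilibrium, from the text
dg_standard = -R_KJ * T * math.log(k_eq)            # equation 5

dhap = 2e-4       # M, initial DHAP concentration from the text
gap = 3e-6        # M, initial GAP concentration from the text
dg = dg_standard + R_KJ * T * math.log(gap / dhap)  # equation 1

print(f"standard free-energy change: {dg_standard:+.2f} kJ/mol")
print(f"free-energy change at the stated concentrations: {dg:+.2f} kJ/mol")
print("spontaneous as written" if dg < 0 else "not spontaneous as written")
```

The sign of `dg`, not of `dg_standard`, is what decides spontaneity under the stated concentrations.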


Enzymes alter only the reaction rate and not the reaction 
equilibrium 


Because enzymes are such superb catalysts, it is tempting to ascribe 
to them powers that they do not have. An enzyme cannot alter the laws 
of thermodynamics and consequently cannot alter the equilibrium of a 
chemical reaction. Consider an enzyme-catalyzed reaction, the conversion 
of substrate, S, into product, P. Figure 8.2 shows the rate of product 
formation with time in the presence and absence of enzyme. Note that 
the amount of product formed is the same whether or not the enzyme 
is present but, in the present example, the amount of product formed in 
seconds when the enzyme is present might take hours (or centuries, see 
Table 8.1) to form if the enzyme were absent. 


Figure 8.2 Enzymes accelerate the reaction rate. 
The same equilibrium point is reached but much 
more quickly in the presence of an enzyme. 
[Plot of product versus time: seconds with enzyme, hours with no enzyme.] 

Why does the rate of product formation level off with time? The reaction 
has reached equilibrium. Substrate S is still being converted into product P, 
but P is being converted into S at a rate such that the amount of P present 
stays the same. 

Let us examine the equilibrium in a more quantitative way. Suppose 
that, in the absence of enzyme, the forward rate constant (kF) for the conversion 
of S into P is 10⁻⁴ s⁻¹ and the reverse rate constant (kR) for the 
conversion of P into S is 10⁻⁶ s⁻¹. The equilibrium constant K is given by 
the ratio of these rate constants: 

$$K = \frac{[P]}{[S]} = \frac{k_F}{k_R} = \frac{10^{-4}\ \text{s}^{-1}}{10^{-6}\ \text{s}^{-1}} = 100$$

The equilibrium concentration of P is 100 times that of S, whether or not 
enzyme is present. However, it might take a very long time to approach this 
equilibrium without enzyme, whereas equilibrium would be attained rap- 
idly in the presence of a suitable enzyme (see Table 8.1). Enzymes accelerate 
the attainment of equilibria but do not shift their positions. The equilibrium 
position is a function only of the free-energy difference between reactants and 
products. 
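
A short numerical sketch can make this point concrete. The Python below uses the rate constants from the text for the uncatalyzed reaction and assumes, purely for illustration, a hypothetical enzyme that multiplies both the forward and the reverse rate constant by 10⁶. The time to approach equilibrium uses the standard relaxation result for a reversible first-order reaction S ⇌ P starting from pure S, [P](t) = [P]eq (1 − e^(−(kF + kR)t)); the function names are assumptions.

```python
import math

def equilibrium_constant(k_forward, k_reverse):
    """K = [P]/[S] at equilibrium = kF / kR."""
    return k_forward / k_reverse

def time_to_reach(fraction, k_forward, k_reverse):
    """Time (s) for [P] to reach the given fraction of its equilibrium value,
    starting from pure S, for the reversible first-order reaction S <-> P."""
    return -math.log(1.0 - fraction) / (k_forward + k_reverse)

K_F, K_R = 1e-4, 1e-6   # s^-1, uncatalyzed rate constants from the text
ACCELERATION = 1e6      # hypothetical rate enhancement, applied to both directions

for label, factor in (("no enzyme", 1.0), ("with enzyme", ACCELERATION)):
    kf, kr = K_F * factor, K_R * factor
    print(f"{label:12s} K = {equilibrium_constant(kf, kr):.1f}   "
          f"time to 99% of equilibrium = {time_to_reach(0.99, kf, kr):.3g} s")
```

Scaling both rate constants by the same factor leaves their ratio, and hence K, unchanged, while the time needed to approach equilibrium shrinks by that factor.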


8.3 Enzymes Accelerate Reactions by Facilitating the 
Formation of the Transition State 


The free-energy difference between reactants and products accounts for the 
equilibrium of the reaction, but enzymes accelerate how quickly this equi- 
librium is attained. How can we explain the rate enhancement in terms of 
thermodynamics? To do so, we have to consider not the end points of the 
reaction but the chemical pathway between the end points. 

A chemical reaction of substrate S to form product P goes through a 
transition state X‡ that has a higher free energy than does either S or P. 

$$S \longrightarrow X^{\ddagger} \longrightarrow P$$


The double dagger denotes the transition state. The transition state is a 
transitory molecular structure that is no longer the substrate but is not yet 
the product. The transition state is the least-stable and most-seldom- 
occupied species along the reaction pathway because it is the one with the 
highest free energy. The difference in free energy between the transition 
state and the substrate is called the Gibbs free energy of activation or simply 
the activation energy, symbolized by ΔG‡ (Figure 8.3). 

$$\Delta G^{\ddagger} = G_{X^{\ddagger}} - G_S$$

Note that the energy of activation, or ΔG‡, does not enter into the final ΔG 
calculation for the reaction, because the energy required to generate the 
transition state is released when the transition state forms the product. The 
activation-energy barrier immediately suggests how an enzyme enhances 
the reaction rate without altering ΔG of the reaction: enzymes function to 
lower the activation energy, or, in other words, enzymes facilitate the forma- 
tion of the transition state. 
Figure 8.3 Enzymes decrease the 
activation energy. Enzymes accelerate 
reactions by decreasing ΔG‡, the free energy 
of activation. 
“I think that enzymes are molecules that are 
complementary in structure to the activated 
complexes of the reactions that they catalyze, 
that is, to the molecular configuration that 
is intermediate between the reacting 
substances and the products of reaction for 
these catalyzed processes. The attraction of 
the enzyme molecule for the activated 
complex would thus lead to a decrease in 
its energy and hence to a decrease in the 
energy of activation of the reaction and to an 
increase in the rate of reaction.” 

—Linus Pauling 
Nature 161:707, 1948 


Figure 8.4 Reaction velocity versus 
substrate concentration in an enzyme-catalyzed 
reaction. An enzyme-catalyzed 
reaction approaches a maximal velocity. 
[Plot of reaction velocity versus substrate concentration, leveling off at the maximal velocity.] 


One approach to understanding the increase in reaction rates achieved 
by enzymes is to assume that the transition state (X‡) and the substrate (S) 
are in equilibrium. 

$$S \overset{K^{\ddagger}}{\rightleftharpoons} X^{\ddagger} \overset{v}{\longrightarrow} P$$

in which K‡ is the equilibrium constant for the formation of X‡ and v is the 
rate of formation of product from X‡. The rate of the reaction v is proportional 
to the concentration of X‡, 

$$v \propto [X^{\ddagger}],$$

because only X‡ can be converted into product. The concentration of X‡ at 
equilibrium is in turn related to the free-energy difference ΔG‡ between X‡ 
and S; the greater the difference in free energy between these two states, the 
smaller the amount of X‡. Thus, the overall rate of reaction V depends on 
ΔG‡. Specifically, 

$$V = v[X^{\ddagger}] = \frac{kT}{h}[S]\,e^{-\Delta G^{\ddagger}/RT}$$

In this equation, k is Boltzmann’s constant, and h is Planck’s constant. The 
value of kT/h at 25°C is 6.2 × 10¹² s⁻¹. Suppose that the free energy of 
activation is 28.53 kJ mol⁻¹ (6.82 kcal mol⁻¹). Substituting this 
value of ΔG into equation 7 (as shown in Table 8.3) shows that this free-energy 
difference results when the ratio [X‡]/[S] is 10⁻⁵. If we assume for simplicity’s 
sake that [S] = 1 M, then the reaction rate V is 6.2 × 10⁷ s⁻¹. If ΔG‡ were 
lowered by 5.69 kJ mol⁻¹ (1.36 kcal mol⁻¹), the ratio [X‡]/[S] would then 
be 10⁻⁴, and the reaction rate would be 6.2 × 10⁸ s⁻¹. A decrease of 5.69 kJ 
mol⁻¹ in ΔG‡ yields a 10-fold larger V. A relatively small decrease in 
ΔG‡ (20% in this particular reaction) results in a much greater increase in V. 
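
The arithmetic in this paragraph can be reproduced with a short Python sketch of the rate expression V = (kT/h)[S]e^(−ΔG‡/RT). The physical constants are the standard SI values; the function name and the choice to keep [S] dimensionless at 1 (as the text does for simplicity) are assumptions made for illustration.

```python
import math

BOLTZMANN = 1.380649e-23    # J K^-1
PLANCK = 6.62607015e-34     # J s
R_KJ = 8.315e-3             # kJ mol^-1 K^-1, as used in this chapter
T = 298.0                   # K

def reaction_rate(dg_activation_kj, substrate=1.0):
    """V = (kT/h) [S] exp(-dG_activation / RT), with dG in kJ mol^-1."""
    prefactor = BOLTZMANN * T / PLANCK          # kT/h in s^-1
    return prefactor * substrate * math.exp(-dg_activation_kj / (R_KJ * T))

v_before = reaction_rate(28.53)                 # activation energy from the text
v_after = reaction_rate(28.53 - 5.69)           # barrier lowered by 5.69 kJ mol^-1

print(f"kT/h             = {BOLTZMANN * T / PLANCK:.3g} s^-1")
print(f"V before         = {v_before:.3g}")
print(f"V after lowering = {v_after:.3g}")
print(f"rate enhancement = {v_after / v_before:.2f}-fold")
```

Because the activation energy sits in an exponent, each further reduction of 5.69 kJ mol⁻¹ multiplies the rate by roughly another factor of 10.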

Thus, we see the key to how enzymes operate: enzymes accelerate reactions 
by decreasing ΔG‡, the activation energy. The combination of substrate and 
enzyme creates a reaction pathway whose transition-state energy is lower than 
that of the reaction in the absence of enzyme (see Figure 8.3). Because the 
activation energy is lower, more molecules have the energy required to reach 
the transition state. Decreasing the activation barrier is analogous to lowering 
the height of a high-jump bar; more athletes will be able to clear the bar. The 
essence of catalysis is stabilization of the transition state. 


The formation of an enzyme-substrate complex is the 
first step in enzymatic catalysis 


Much of the catalytic power of enzymes comes from their bringing sub- 
strates together in favorable orientations to promote the formation of the 
transition states. Enzymes bring together substrates in enzyme—substrate 
(ES) complexes. The substrates are bound to a specific region of the enzyme 
called the active site. Most enzymes are highly selective in the substrates 
that they bind. Indeed, the catalytic specificity of enzymes depends in part 
on the specificity of binding. 

What is the evidence for the existence of an enzyme-substrate complex? 


1. The first clue was the observation that, at a constant concentration of 
enzyme, the reaction rate increases with increasing substrate concentration 
until a maximal velocity is reached (Figure 8.4). In contrast, uncatalyzed 
reactions do not show this saturation effect. The fact that an enzyme-cata- 
lyzed reaction has a maximal velocity suggests the formation of a discrete ES 
complex. At a sufficiently high substrate concentration, all the catalytic sites 
are filled, or saturated, and so the reaction rate cannot increase. Although 
indirect, the ability to saturate an enzyme with substrate is the most general 
evidence for the existence of ES complexes. 


Figure 8.5 Structure of an enzyme-substrate 
complex. (Left) The enzyme 
cytochrome P450 is illustrated bound to its 
substrate camphor. (Right) Notice that, in the 
active site, the substrate is surrounded by 
residues from the enzyme. Note also the 
presence of a heme cofactor. [Drawn from 
2CPP.pdb.] 


2. X-ray crystallography has provided high-resolution images of sub- 
strates and substrate analogs bound to the active sites of many enzymes 
(Figure 8.5). In Chapter 9, we will take a close look at several of these 
complexes. 


3. The spectroscopic characteristics of many enzymes and substrates change 
on the formation of an ES complex. These changes are particularly striking 
if the enzyme contains a colored prosthetic group (see Problem 31). 


The active sites of enzymes have some common features 


The active site of an enzyme is the region that binds the substrates (and the 
cofactor, if any). It also contains the residues that directly participate in the 
making and breaking of bonds. These residues are called the catalytic groups. 
In essence, the interaction of the enzyme and substrate at the active site pro- 
motes the formation of the transition state. The active site is the region of the 
enzyme that most directly lowers the ΔG‡ of the reaction, thus providing 
the rate-enhancement characteristic of enzyme action. Although enzymes 
differ widely in structure, specificity, and mode of catalysis, a 
number of generalizations concerning their active sites can be 
stated: 

1. The active site is a three-dimensional cleft, or crevice, formed 
by groups that come from different parts of the amino acid 
sequence: indeed, residues far apart in the amino acid sequence 
may interact more strongly than adjacent residues in the sequence, 
which may be sterically constrained from interacting with one 
another. In lysozyme, an enzyme that degrades the cell walls of 
some bacteria, the important groups in the active site are contrib- 
uted by residues numbered 35, 52, 62, 63, 101, and 108 in the 
sequence of 129 amino acids (Figure 8.6). 


2. The active site takes up a small part of the total volume of an 
enzyme. Most of the amino acid residues in an enzyme are not in 
contact with the substrate, which raises the intriguing question of 
why enzymes are so big. Nearly all enzymes are made up of more 
than 100 amino acid residues, which gives them a mass greater 
than 10 kd and a diameter of more than 25 Å. The “extra” 
amino acids serve as a scaffold to create the three-dimensional 
active site. In many proteins, the remaining amino acids also 
constitute regulatory sites, sites of interaction with other proteins, or channels 
to bring the substrates to the active sites. 


Figure 8.6 Active sites may include distant residues. 
(A) Ribbon diagram of the enzyme lysozyme with several 
components of the active site shown in color. (B) A schematic 
representation of the primary structure of lysozyme shows that 
the active site is composed of residues that come from different 
parts of the polypeptide chain. [Drawn from 6LYZ.pdb.] 


Figure 8.7 Hydrogen bonds between an 
enzyme and substrate. The enzyme 
ribonuclease forms hydrogen bonds with the 
uridine component of the substrate. 
[After F. M. Richards, H. W. Wyckoff, and N. 
Allewell. In The Neurosciences: Second Study 
Program, F. O. Schmidt, Ed. (Rockefeller 
University Press, 1970), p. 970.] 


Figure 8.8 Lock-and-key model of enzyme-substrate binding. In 
this model, the active site of the unbound enzyme is complementary in 
shape to the substrate. 


3. Active sites are unique microenvironments. In all enzymes of known struc- 
ture, active sites are shaped like a cleft, or crevice, to which the substrates 
bind. Water is usually excluded unless it is a reactant. The nonpolar micro- 
environment of the cleft enhances the binding of substrates as well as 
catalysis. Nevertheless, the cleft may also contain polar residues. In the 
nonpolar microenvironment of the active site, certain of these polar residues 
acquire special properties essential for substrate binding or catalysis. The 
internal positions of these polar residues are biologically crucial exceptions 
to the general rule that polar residues are exposed to water. 


4. Substrates are bound to enzymes by multiple weak attractions. The nonco- 
valent interactions in ES complexes are much weaker than covalent bonds, 
which have energies between −210 and −460 kJ mol⁻¹ (between −50 and 
−110 kcal mol⁻¹). In contrast, ES complexes usually have equilibrium constants 
that range from 10⁻² to 10⁻⁸ M, corresponding to free energies of 
interaction ranging from about −13 to −50 kJ mol⁻¹ (from −3 to −12 kcal 
mol⁻¹). As discussed in Section 1.3, these weak reversible interactions 
are mediated by electrostatic interactions, hydrogen bonds, and van der 
Waals forces. Van der Waals forces become significant in binding only 
when numerous substrate atoms simultaneously come close to many 
enzyme atoms through the hydrophobic effect. Hence, the enzyme and 
substrate should have complementary shapes. The directional character 
of hydrogen bonds between enzyme and substrate often enforces a high 
degree of specificity, as seen in the RNA-degrading enzyme ribonu- 
clease (Figure 8.7). 


5. The specificity of binding depends on the precisely defined arrangement of 
atoms in an active site. Because the enzyme and the substrate interact by 
means of short-range forces that require close contact, a substrate must have 
a matching shape to fit into the site. Emil Fischer proposed the lock-and- 
key analogy in 1890 (Figure 8.8), which was the model for enzyme-substrate 
interaction for several decades. However, we now know that enzymes are 
flexible and that the shapes of the active sites can be markedly modified by 
the binding of substrate, as was postulated by Daniel E. Koshland, Jr., in 
1958. The active site of some enzymes assumes a shape that is complemen- 
tary to that of the substrate only after the substrate has been bound. This 
process of dynamic recognition is called induced fit (Figure 8.9). 


Figure 8.9 Induced-fit model of enzyme-substrate binding. In this 
model, the enzyme changes shape on substrate binding. The active site 
forms a shape complementary to the substrate only after the substrate 
has been bound. [Schematic: substrate + enzyme → ES complex.] 


The binding energy between enzyme and substrate 
is important for catalysis 


Enzymes lower the activation energy, but where does the energy to lower the 
activation energy come from? Free energy is released by the formation of a large 
number of weak interactions between a complementary enzyme and its sub- 
strate. The free energy released on binding is called the binding energy. Only the 
correct substrate can participate in most or all of the interactions with the 
enzyme and thus maximize binding energy, accounting for the exquisite sub- 
strate specificity exhibited by many enzymes. Furthermore, the full complement 
of such interactions is formed only when the substrate is converted into the transition 
state. Thus, the maximal binding energy is released when the enzyme facilitates 
the formation of the transition state. The energy released by the interactions 
between the enzyme and the substrate can be thought of as lowering the activa- 
tion energy. Paradoxically, the most-stable interaction (maximum binding 
energy) takes place between the enzyme and the transition state, the least-stable 
reaction intermediate. However, the transition state is too unstable to exist for 
long. It collapses to either substrate or product, but which of the two accumu- 
lates is determined only by the energy difference between the substrate and the 
product, that is, by the ΔG of the reaction. 


8.4 The Michaelis-Menten Equation Describes 
the Kinetic Properties of Many Enzymes 


The study of the rates of chemical reactions is called kinetics, and the study 
of the rates of enzyme-catalyzed reactions is called enzyme kinetics. A kinetic 
description of enzyme activity will help us understand how enzymes func- 
tion. We begin by briefly examining some of the basic principles of reaction 
kinetics. 


Kinetics is the study of reaction rates 


What do we mean when we say the “rate” of a chemical reaction? Consider 
a simple reaction: 

$$A \longrightarrow P$$

The rate V is the quantity of A that disappears in a specified unit of time. It 
is equal to the rate of the appearance of P, or the quantity of P that appears 
in a specified unit of time. 

$$V = -\Delta A/\Delta T = \Delta P/\Delta T \qquad (8)$$

If A is yellow and P is colorless, we can follow the decrease in the concentration 
of A by measuring the decrease in the intensity of yellow color with 
time. Consider only the change in the concentration of A for now. The rate 
of the reaction is directly related to the concentration of A by a proportionality 
constant, k, called the rate constant. 

$$V = k[A] \qquad (9)$$

Reactions whose rates are directly proportional to the reactant concentration are 
called first-order reactions. First-order rate constants have the units of s⁻¹. 

Many important biochemical reactions include two reactants. For 
example, 

$$2\,A \longrightarrow P$$

or 

$$A + B \longrightarrow P$$

They are called bimolecular reactions and the corresponding rate equations 
often take the form 

$$V = k[A]^2 \qquad (10)$$

and 

$$V = k[A][B] \qquad (11)$$

The rate constants, called second-order rate constants, have the units 
M⁻¹ s⁻¹. 


Figure 8.10 Determining the relation 
between initial velocity and substrate 
concentration. (A) The amount of product 
formed at different substrate concentrations is 
plotted as a function of time. The initial 
velocity (V₀) for each substrate concentration 
is determined from the slope of the curve at 
the beginning of a reaction, when the reverse 
reaction is insignificant. (B) The values for 
initial velocity determined in part A are then 
plotted against substrate concentration. 


Sometimes, second-order reactions can appear to be first-order reac- 
tions. For instance, in reaction 11, if B is present in excess and A is present 
at low concentrations, the reaction rate will be first order with respect to A 
and will not appear to depend on the concentration of B. These reactions are 
called pseudo-first-order reactions, and we will see them a number of times in 
our study of biochemistry. 
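
To see how a second-order reaction can look first order, the Python sketch below compares the exact integrated rate law for A + B → P (rate = k[A][B], with unequal starting concentrations) against the pseudo-first-order approximation in which [B] is treated as constant. All numerical values and function names are hypothetical and chosen only for illustration.

```python
import math

def second_order_a(t, a0, b0, k):
    """Exact [A](t) for A + B -> P with rate k[A][B], valid when b0 != a0."""
    d = b0 - a0
    return a0 * d / (b0 * math.exp(d * k * t) - a0)

def pseudo_first_order_a(t, a0, b0, k):
    """Approximation that holds [B] fixed at b0, giving k' = k * b0."""
    return a0 * math.exp(-k * b0 * t)

# Hypothetical values chosen only for illustration.
A0 = 1e-6   # M
B0 = 1e-2   # M, a 10,000-fold excess over A
K2 = 1e3    # M^-1 s^-1, second-order rate constant

for t in (0.0, 0.05, 0.1, 0.2, 0.5):
    exact = second_order_a(t, A0, B0, K2)
    approx = pseudo_first_order_a(t, A0, B0, K2)
    print(f"t = {t:4.2f} s   exact = {exact:.4e} M   pseudo-first-order = {approx:.4e} M")
```

With B in large excess its concentration barely changes as A is consumed, so the product k[B] behaves like a first-order rate constant with units of s⁻¹.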

Interestingly enough, under some conditions, a reaction can be zero 
order. In these cases, the rate is independent of reactant concentrations. 
Enzyme-catalyzed reactions can approximate zero-order reactions under 
some circumstances (p. 232). 


The steady-state assumption facilitates a description of enzyme kinetics 


The simplest way to investigate the reaction rate is to follow the increase in 
reaction product as a function of time. The extent of product formation is 
determined as a function of time for a series of substrate concentrations 
(Figure 8.10A). As expected, in each case, the amount of product formed 
increases with time, although eventually a time is reached when there is no net 
change in the concentration of S or P. The enzyme is still actively converting 
substrate into product and vice versa, but the reaction equilibrium has been
attained. However, enzyme kinetics is more readily comprehended if we
consider only the forward reaction. We can define the rate of catalysis V_0 as
the number of moles of product formed per second when the reaction is just
beginning, that is, when t ≈ 0 (see Figure 8.10A). On the time scale of
enzyme-catalyzed reactions, the amount of enzyme present is constant. When
we plot V_0 versus the substrate concentration [S], assuming a constant amount
of enzyme, many enzymes yield the results shown in Figure 8.10B. The rate 
of catalysis rises linearly as substrate concentration increases and then begins 
to level off and approach a maximum at higher substrate concentrations. 
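
As a minimal sketch of how an initial velocity might be read from a progress curve, the code below fits a straight line to the earliest time points only, where the curve is still linear. The data are invented for illustration, and the choice of how many early points to use (`n_early`) is an assumption that would have to be judged for each real experiment.

```python
def initial_velocity(times, product, n_early=3):
    """Least-squares slope of product versus time over the first n_early points."""
    t = times[:n_early]
    p = product[:n_early]
    n = len(t)
    t_mean = sum(t) / n
    p_mean = sum(p) / n
    sxy = sum((ti - t_mean) * (pi - p_mean) for ti, pi in zip(t, p))
    sxx = sum((ti - t_mean) ** 2 for ti in t)
    return sxy / sxx


# Hypothetical progress curve: linear at first, then leveling off.
times = [0.0, 1.0, 2.0, 3.0, 10.0, 20.0]      # seconds
product = [0.0, 2.0, 4.0, 6.0, 14.0, 16.0]    # micromolar

v0 = initial_velocity(times, product, n_early=4)
print(f"V0 = {v0:.2f} uM/s")
```

Repeating this for each substrate concentration gives the set of V_0 values that are plotted against [S].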

In 1913, Leonor Michaelis and Maud Menten proposed a simple model 
to account for these kinetic characteristics. The critical feature in their 
treatment is that a specific ES complex is a necessary intermediate in cataly- 
sis. The model proposed is 


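$$\mathrm{E} + \mathrm{S} \underset{k_{-1}}{\overset{k_1}{\rightleftharpoons}} \mathrm{ES} \underset{k_{-2}}{\overset{k_2}{\rightleftharpoons}} \mathrm{E} + \mathrm{P}$$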

An enzyme E combines with substrate S to form an ES complex, with a rate
constant k_1. The ES complex has two possible fates. It can dissociate to E
and S, with a rate constant k_-1, or it can proceed to form product P, with a
rate constant k_2. The ES complex can also be re-formed from E and P by the
reverse reaction with a rate constant k_-2. However, as before, we can
simplify these reactions by considering the rate of reaction at times close to
zero (hence, V_0) when there is negligible product formation and thus no back
reaction (k_-2[E][P] ≈ 0).


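$$\mathrm{E} + \mathrm{S} \underset{k_{-1}}{\overset{k_1}{\rightleftharpoons}} \mathrm{ES} \overset{k_2}{\longrightarrow} \mathrm{E} + \mathrm{P} \tag{12}$$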


Thus, for the graph in Figure 8.11, V_0 is determined for each substrate
concentration by measuring the rate of product formation at early times before
P accumulates (see Figure 8.10A).

We want an expression that relates the rate of catalysis to the concentra- 
tions of substrate and enzyme and the rates of the individual steps. Our 
starting point is that the catalytic rate is equal to the product of the
concentration of the ES complex and k_2.


$$V_0 = k_2[\mathrm{ES}] \tag{13}$$


Now we need to express [ES] in terms of known quantities. The rates of 
formation and breakdown of ES are given by 


$$\text{Rate of formation of ES} = k_1[\mathrm{E}][\mathrm{S}] \tag{14}$$
$$\text{Rate of breakdown of ES} = (k_{-1} + k_2)[\mathrm{ES}] \tag{15}$$

To simplify matters, George Briggs and John Haldane suggested the
steady-state assumption in 1924. In a steady state, the concentrations of
intermediates (in this case, [ES]) stay the same even if the concentrations
of starting materials and products are changing. This steady state is reached
when the rates of formation and breakdown of the ES complex are equal.
Setting the right-hand sides of equations 14 and 15 equal gives


Figure 8.11 Michaelis-Menten kinetics. A plot of the reaction velocity (V_0)
as a function of the substrate concentration [S] for an enzyme that obeys
Michaelis-Menten kinetics shows that the maximal velocity (V_max) is
approached asymptotically. The Michaelis constant (K_M) is the substrate
concentration yielding a velocity of V_max/2.


$$k_1[\mathrm{E}][\mathrm{S}] = (k_{-1} + k_2)[\mathrm{ES}] \tag{16}$$
By rearranging equation 16, we obtain
$$[\mathrm{E}][\mathrm{S}]/[\mathrm{ES}] = (k_{-1} + k_2)/k_1 \tag{17}$$


Equation 17 can be simplified by defining a new constant, K_M, called the
Michaelis constant:

$$K_M = \frac{k_{-1} + k_2}{k_1} \tag{18}$$

Note that K_M has the units of concentration and is independent of enzyme
and substrate concentrations. As will be explained, K_M is an important
characteristic of enzyme-substrate interactions.
Inserting equation 18 into equation 17 and solving for [ES] yields

$$[\mathrm{ES}] = \frac{[\mathrm{E}][\mathrm{S}]}{K_M} \tag{19}$$


Now let us examine the numerator of equation 19. Because the substrate is
usually present at a much higher concentration than that of the enzyme, the
concentration of uncombined substrate [S] is very nearly equal to the total
substrate concentration. The concentration of uncombined enzyme [E] is
equal to the total enzyme concentration [E]_T minus the concentration of the
ES complex:


$$[\mathrm{E}] = [\mathrm{E}]_T - [\mathrm{ES}] \tag{20}$$
Substituting this expression for [E] in equation 19 gives

$$[\mathrm{ES}] = \frac{([\mathrm{E}]_T - [\mathrm{ES}])[\mathrm{S}]}{K_M} \tag{21}$$
Solving equation 21 for [ES] gives
$$[\mathrm{ES}] = \frac{[\mathrm{E}]_T[\mathrm{S}]/K_M}{1 + [\mathrm{S}]/K_M} \tag{22}$$
or

$$[\mathrm{ES}] = [\mathrm{E}]_T \frac{[\mathrm{S}]}{[\mathrm{S}] + K_M} \tag{23}$$
By substituting this expression for [ES] into equation 13, we obtain

$$V_0 = k_2[\mathrm{E}]_T \frac{[\mathrm{S}]}{[\mathrm{S}] + K_M} \tag{24}$$
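
The step from equation 21 to equation 23 takes only a few lines of algebra. Written out with the symbols already defined, and with no further assumptions, it runs as follows:

```latex
\begin{align*}
[\mathrm{ES}] &= \frac{([\mathrm{E}]_T - [\mathrm{ES}])[\mathrm{S}]}{K_M} \\
K_M[\mathrm{ES}] &= [\mathrm{E}]_T[\mathrm{S}] - [\mathrm{ES}][\mathrm{S}] \\
[\mathrm{ES}]\,(K_M + [\mathrm{S}]) &= [\mathrm{E}]_T[\mathrm{S}] \\
[\mathrm{ES}] &= [\mathrm{E}]_T \frac{[\mathrm{S}]}{[\mathrm{S}] + K_M}
\end{align*}
```

Dividing the numerator and denominator of the last line by K_M recovers the form given in equation 22, and multiplying it by k_2 gives equation 24.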
The maximal rate, V_max, is attained when the catalytic sites on the
enzyme are saturated with substrate, that is, when [ES] = [E]_T. Thus,

$$V_{\max} = k_2[\mathrm{E}]_T \tag{25}$$

Substituting equation 25 into equation 24 yields the Michaelis-Menten
equation:

$$V_0 = V_{\max} \frac{[\mathrm{S}]}{[\mathrm{S}] + K_M} \tag{26}$$
This equation accounts for the kinetic data given in Figure 8.11. At very low
substrate concentration, when [S] is much less than K_M, V_0 = (V_max/K_M)[S];
that is, the reaction is first order with the rate directly proportional to the
substrate concentration. At high substrate concentration, when [S] is much
greater than K_M, V_0 = V_max; that is, the rate is maximal. The reaction is
zero order, independent of substrate concentration.

The meaning of K_M is evident from equation 26. When [S] = K_M, then
V_0 = V_max/2. Thus, K_M is equal to the substrate concentration at which the
reaction rate is half its maximal value. K_M is an important characteristic of
an enzyme-catalyzed reaction and is significant for its biological function.
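
The three regimes of equation 26 are easy to confirm numerically. The sketch below evaluates the Michaelis-Menten equation at low, intermediate, and high substrate concentrations; the values of V_max and K_M are hypothetical and the function name is our own.

```python
def michaelis_menten(s, vmax, km):
    """Initial velocity V0 from the Michaelis-Menten equation (equation 26)."""
    return vmax * s / (s + km)


vmax = 100.0   # hypothetical maximal rate, uM/s
km = 5.0       # hypothetical Michaelis constant, uM

low = michaelis_menten(0.01 * km, vmax, km)     # [S] << KM
half = michaelis_menten(km, vmax, km)           # [S] = KM
high = michaelis_menten(1000.0 * km, vmax, km)  # [S] >> KM

print(f"[S] << KM: V0 = {low:.3f}  (first-order estimate {(vmax / km) * 0.01 * km:.3f})")
print(f"[S] =  KM: V0 = {half:.1f}  (Vmax/2 = {vmax / 2:.1f})")
print(f"[S] >> KM: V0 = {high:.2f} (Vmax = {vmax:.1f})")
```

At [S] = K_M the computed velocity is exactly V_max/2, whereas at the two extremes the full equation agrees closely with the first-order and zero-order approximations.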


Variations in K_M can have physiological consequences


The physiological consequence of K_M is illustrated by the sensitivity

of some persons to ethanol. Such persons exhibit facial flushing and 
rapid heart rate (tachycardia) after ingesting even small amounts of alcohol. 
In the liver, alcohol dehydrogenase converts ethanol into acetaldehyde. 


$$\mathrm{CH_3CH_2OH} + \mathrm{NAD^+} \overset{\text{Alcohol dehydrogenase}}{\rightleftharpoons} \mathrm{CH_3CHO} + \mathrm{NADH} + \mathrm{H^+}$$


Normally, the acetaldehyde, which is the cause of the symptoms when present 
at high concentrations, is processed to acetate by aldehyde dehydrogenase. 


$$\mathrm{CH_3CHO} + \mathrm{NAD^+} + \mathrm{H_2O} \overset{\text{Aldehyde dehydrogenase}}{\rightleftharpoons} \mathrm{CH_3COO^-} + \mathrm{NADH} + 2\,\mathrm{H^+}$$


Most people have two forms of the aldehyde dehydrogenase, a low K_M
mitochondrial form and a high K_M cytoplasmic form. In susceptible
persons, the mitochondrial enzyme is less active owing to the substitution of
a single amino acid, and acetaldehyde is processed only by the cytoplasmic
enzyme. Because this enzyme has a high K_M, it achieves a high rate of
catalysis only at very high concentrations of acetaldehyde. Consequently,
less acetaldehyde is converted into acetate; excess acetaldehyde escapes into
the blood and accounts for the physiological effects.


K_M and V_max values can be determined by several means


K_M is equal to the substrate concentration that yields V_max/2; however,
V_max, like perfection, is only approached but never attained. How, then, can
we experimentally determine K_M and V_max, and how do these parameters
enhance our understanding of enzyme-catalyzed reactions? The Michaelis
constant, K_M, and the maximal rate, V_max, can be readily derived from rates
of catalysis measured at a variety of substrate concentrations if an enzyme
operates according to the simple scheme given in equation 26. The derivation
of K_M and V_max is most commonly achieved with the use of curve-fitting
programs on a computer. However, an older method is a source of further
insight into the meaning of K_M and V_max, although it is rarely used because
the data points at high and low concentrations are weighted differently and
the result is thus sensitive to errors.

Before the availability of computers, the determination of K_M and V_max
values required algebraic manipulation of the basic Michaelis-Menten
equation. Because V_max is approached asymptotically (see Figure 8.11), it is
impossible to obtain a definitive value from a Michaelis-Menten curve.
Because K_M is the concentration of substrate at V_max/2, it is likewise
impossible to determine an accurate value of K_M. However, V_max can be
accurately determined if the Michaelis-Menten equation is transformed into
one that gives a straight-line plot. Taking the reciprocal of both sides of
equation 26 gives

$$\frac{1}{V_0} = \frac{K_M}{V_{\max}} \cdot \frac{1}{[\mathrm{S}]} + \frac{1}{V_{\max}} \tag{27}$$

A plot of 1/V_0 versus 1/[S], called a Lineweaver-Burk or double-reciprocal
plot, yields a straight line with a y-intercept of 1/V_max and a slope of
K_M/V_max (Figure 8.12). The intercept on the x-axis is -1/K_M.
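
As an illustration of the double-reciprocal method, the sketch below fits a straight line to 1/V_0 versus 1/[S] by ordinary least squares and reads K_M and V_max from the slope and intercept of equation 27. The data are generated from equation 26 with hypothetical parameters and contain no noise; with real measurements, the uneven weighting of points makes this transformation less reliable than direct curve fitting.

```python
def lineweaver_burk(substrate, velocity):
    """Estimate (KM, Vmax) from a least-squares line through 1/V0 versus 1/[S]."""
    x = [1.0 / s for s in substrate]
    y = [1.0 / v for v in velocity]
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    slope = sxy / sxx                      # KM / Vmax
    intercept = y_mean - slope * x_mean    # 1 / Vmax
    vmax = 1.0 / intercept
    km = slope * vmax
    return km, vmax


# Noise-free data from hypothetical parameters Vmax = 100 uM/s, KM = 5 uM.
true_vmax, true_km = 100.0, 5.0
substrate = [1.0, 2.0, 5.0, 10.0, 20.0]    # uM
velocity = [true_vmax * s / (s + true_km) for s in substrate]

km, vmax = lineweaver_burk(substrate, velocity)
print(f"KM = {km:.3f} uM, Vmax = {vmax:.3f} uM/s")
```

Because the synthetic data obey equation 26 exactly, the fit returns the parameters used to generate them.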


K_M and V_max values are important enzyme characteristics


The K_M values of enzymes range widely (Table 8.4). For most enzymes, K_M
lies between 10^-1 and 10^-7 M. The K_M value for an enzyme depends on the
particular substrate and on environmental conditions such as pH,
temperature, and ionic strength. The Michaelis constant, K_M, has two
meanings. First, K_M is the concentration of substrate at which half the
active sites are filled. Thus, K_M provides a measure of the substrate
concentration required for significant catalysis to take place. For many
enzymes, experimental evidence suggests that K_M provides an approximation of
substrate concentration in vivo.

Second, K_M is related to the rate constants of the individual steps in the
catalytic scheme given in equation 12. In equation 18, K_M is defined as
(k_-1 + k_2)/k_1. Consider a case in which k_-1 is much greater than k_2.
Under such circumstances, the ES complex dissociates to E and S much more
rapidly than product is formed. Under these conditions (k_-1 >> k_2),

$$K_M \approx \frac{k_{-1}}{k_1} \tag{28}$$

Equation 28 describes the dissociation constant of the ES complex.

$$K_{\mathrm{ES}} = \frac{[\mathrm{E}][\mathrm{S}]}{[\mathrm{ES}]} = \frac{k_{-1}}{k_1} \tag{29}$$

In other words, K_M is equal to the dissociation constant of the ES complex
if k_2 is much smaller than k_-1. When this condition is met, K_M is a
measure of the strength of the ES complex: a high K_M indicates weak binding;
a low K_M indicates strong binding. It must be stressed that K_M indicates
the affinity of the ES complex only when k_-1 is much greater than k_2.


Figure 8.12 A double-reciprocal or Lineweaver-Burk plot. A double-reciprocal
plot of enzyme kinetics is generated by plotting 1/V_0 as a function of
1/[S]. The slope is K_M/V_max, the intercept on the vertical axis is
1/V_max, and the intercept on the horizontal axis is -1/K_M.


Table 8.4 K_M values of some enzymes

Enzyme                  Substrate                   K_M (µM)
Chymotrypsin            Acetyl-L-tryptophanamide    5000
Lysozyme                Hexa-N-acetylglucosamine    6
β-Galactosidase         Lactose                     4000
Threonine deaminase     Threonine                   5000
Carbonic anhydrase      CO2                         8000
Penicillinase           Benzylpenicillin            50
Pyruvate carboxylase    Pyruvate                    400
                        HCO3-                       1000
...


Table 8.5 Turnover numbers of some enzymes

Enzyme                      Turnover number (per second)
Carbonic anhydrase          600,000
3-Ketosteroid isomerase     280,000
Acetylcholinesterase        25,000
Penicillinase               2,000
Lactate dehydrogenase       1,000
Chymotrypsin                100
DNA polymerase I            15
Tryptophan synthetase       2
Lysozyme                    0.5

The maximal rate, V_max, reveals the turnover number of an enzyme,
which is the number of substrate molecules converted into product by an enzyme
molecule in a unit time when the enzyme is fully saturated with substrate. The
turnover number is equal to the rate constant k_2, which is also called k_cat.
It can be calculated from V_max if the concentration of active sites [E]_T is
known, because

$$V_{\max} = k_2[\mathrm{E}]_T \tag{30}$$
and thus

$$k_2 = V_{\max}/[\mathrm{E}]_T \tag{31}$$

For example, a 10^-6 M solution of carbonic anhydrase catalyzes the
formation of 0.6 M H2CO3 per second when the enzyme is fully saturated with
substrate. Hence, k_2 is 6 × 10^5 s^-1. This turnover number is one of the
largest known. Each catalyzed reaction takes place in a time equal to, on
average, 1/k_2, which is 1.7 µs for carbonic anhydrase. The turnover numbers
of most enzymes with their physiological substrates range from 1 to 10^4 per
second (Table 8.5).
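
The carbonic anhydrase example can be reproduced directly from equation 31. The sketch below uses the two numbers given in the text; the function name is our own.

```python
def turnover_number(vmax, e_total):
    """Turnover number k2 (kcat) = Vmax / [E]T, as in equation 31."""
    return vmax / e_total


vmax = 0.6        # M of H2CO3 formed per second at saturation
e_total = 1e-6    # M of carbonic anhydrase active sites

kcat = turnover_number(vmax, e_total)   # s^-1
time_per_reaction = 1.0 / kcat          # seconds per catalytic cycle

print(f"kcat = {kcat:.1e} s^-1")
print(f"time per reaction = {time_per_reaction * 1e6:.1f} microseconds")
```

The same two-line calculation applies to any enzyme for which V_max and the concentration of active sites are known.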

K_M and V_max also permit the determination of f_ES, the fraction of active
sites filled. This relation of f_ES to K_M and V_max is given by the following
equation:

$$f_{\mathrm{ES}} = \frac{V}{V_{\max}} = \frac{[\mathrm{S}]}{[\mathrm{S}] + K_M} \tag{32}$$


k_cat/K_M is a measure of catalytic efficiency


When the substrate concentration is much greater than K_M, the rate of
catalysis is equal to V_max, which is a function of k_cat, the turnover
number, as already described. However, most enzymes are not normally saturated
with substrate. Under physiological conditions, the [S]/K_M ratio is
typically between 0.01 and 1.0. When [S] << K_M, the enzymatic rate is much
less than k_cat because most of the active sites are unoccupied. Is there a
number that characterizes the kinetics of an enzyme under these more typical
cellular conditions? Indeed there is, as can be shown by combining
equations 13 and 19 to give

$$V_0 = \frac{k_{cat}}{K_M}[\mathrm{E}][\mathrm{S}] \tag{33}$$

When [S] << K_M, the concentration of free enzyme [E] is nearly equal
to the total concentration of enzyme [E]_T; so

$$V_0 = \frac{k_{cat}}{K_M}[\mathrm{S}][\mathrm{E}]_T \tag{34}$$

Thus, when [S] << K_M, the enzymatic velocity depends on the values of
k_cat/K_M, [S], and [E]_T. Under these conditions, k_cat/K_M is the rate
constant for the interaction of S and E. The rate constant k_cat/K_M is a
measure of catalytic efficiency because it takes into account both the rate of
catalysis with a particular substrate (k_cat) and the strength of the
enzyme-substrate interaction (K_M). For instance, by using k_cat/K_M values,
we can compare an enzyme's preference for different substrates. Table 8.6
shows the k_cat/K_M values for several different substrates of chymotrypsin.


Table 8.6 Substrate preferences of chymotrypsin

Amino acid in ester    Amino acid side chain    k_cat/K_M (s^-1 M^-1)
Glycine                -H                       1.3 × 10^-1
Valine                 -CH(CH3)2                2.0
Norvaline              -CH2CH2CH3               3.6 × 10^2
Norleucine             -CH2CH2CH2CH3            3.0 × 10^3
Phenylalanine          -CH2-C6H5                1.0 × 10^5

Source: After A. Fersht, Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and
Protein Folding (W. H. Freeman and Company, 1999), Table 7.3.


Chymotrypsin clearly has a preference for cleaving next to bulky, hydro- 
phobic side chains. 

How efficient can an enzyme be? We can approach this question by
determining whether there are any physical limits on the value of k_cat/K_M.
Note that the k_cat/K_M ratio depends on k_1, k_-1, and k_cat, as can be
shown by substituting for K_M.

$$\frac{k_{cat}}{K_M} = \frac{k_{cat}k_1}{k_{-1} + k_{cat}} = \left(\frac{k_{cat}}{k_{-1} + k_{cat}}\right)k_1 < k_1 \tag{35}$$

Suppose that the rate of formation of product (k_cat) is much faster than the
rate of dissociation of the ES complex (k_-1). The value of k_cat/K_M then
approaches k_1. Thus, the ultimate limit on the value of k_cat/K_M is set by
k_1, the rate of formation of the ES complex. This rate cannot be faster than
the diffusion-controlled encounter of an enzyme and its substrate. Diffusion
limits the value of k_1, and so it cannot be higher than between 10^8 and
10^9 s^-1 M^-1. Hence, the upper limit on k_cat/K_M is between 10^8 and
10^9 s^-1 M^-1.
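
The bound in equation 35 can be seen by holding k_1 and k_-1 fixed and letting k_cat grow. The rate constants below are hypothetical round numbers chosen only to show the trend; the function name is our own.

```python
def catalytic_efficiency(k1, k_minus1, kcat):
    """kcat/KM with KM = (k_-1 + kcat)/k1, as in equations 18 and 35."""
    km = (k_minus1 + kcat) / k1
    return kcat / km


k1 = 1e8         # hypothetical association rate constant, M^-1 s^-1
k_minus1 = 1e4   # hypothetical dissociation rate constant, s^-1

# As kcat becomes large relative to k_-1, kcat/KM approaches but never exceeds k1.
for kcat in (1e2, 1e4, 1e6, 1e8):
    eff = catalytic_efficiency(k1, k_minus1, kcat)
    print(f"kcat = {kcat:.0e} s^-1   kcat/KM = {eff:.3e} M^-1 s^-1   fraction of k1 = {eff / k1:.4f}")
```

When k_cat equals k_-1, the efficiency is exactly half of k_1; only when k_cat greatly exceeds k_-1 does nearly every encounter lead to product.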

The k_cat/K_M ratios of the enzymes superoxide dismutase,
acetylcholinesterase, and triose phosphate isomerase are between 10^8 and
10^9 s^-1 M^-1. Enzymes that have k_cat/K_M ratios at the upper limits have
attained kinetic
perfection. Their catalytic velocity is restricted only by the rate at which they 
encounter substrate in the solution (Table 8.7). Any further gain in catalytic 
rate can come only by decreasing the time for diffusion of the substrate into 
the enzyme’s immediate environment. Remember that the active site is only 
a small part of the total enzyme structure. Yet, for catalytically perfect 
enzymes, every encounter between enzyme and substrate is productive. In 
these cases, there may be attractive electrostatic forces on the enzyme that 
entice the substrate to the active site. These forces are sometimes referred to 
poetically as Circe effects. 

The diffusion of a substrate throughout a solution can also be partly 
overcome by confining substrates and products in the limited volume of a 
multienzyme complex. Indeed, some series of enzymes are organized into 
complexes so that the product of one enzyme is very rapidly found by the 
next enzyme. In effect, products are channeled from one enzyme to the 
next, much as in an assembly line. 


Most biochemical reactions include multiple substrates 


Most reactions in biological systems start with two substrates and yield two 
products. They can be represented by the bisubstrate reaction: 


$$\mathrm{A} + \mathrm{B} \rightleftharpoons \mathrm{P} + \mathrm{Q}$$

Table 8.7 Enzymes for which k_cat/K_M is close to the diffusion-controlled
rate of encounter

Enzyme                        k_cat/K_M (s^-1 M^-1)
Acetylcholinesterase          1.6 × 10^8
Carbonic anhydrase            8.3 × 10^7
Catalase                      4 × 10^7
Crotonase                     2.8 × 10^8
Fumarase                      1.6 × 10^8
Triose phosphate isomerase    2.4 × 10^8
β-Lactamase                   1 × 10^8
Superoxide dismutase          7 × 10^9

Source: After A. Fersht, Structure and Mechanism in Protein Science: A Guide
to Enzyme Catalysis and Protein Folding (W. H. Freeman and Company, 1999),
Table 4.5.


Circe effect 
The utilization of attractive forces to lure a 
substrate into a site in which it undergoes a 
transformation of structure, as defined by 
William P. Jencks, an enzymologist, who 
coined the term. 

A goddess of Greek mythology, Circe 
lured Odysseus’s men to her house and 
then transformed them into pigs. 

Many such reactions transfer a functional group, such as a phosphoryl or an
ammonium group, from one substrate to the other. Those that are
oxidation-reduction reactions transfer electrons between substrates.
Multiple-substrate reactions can be divided into two classes: sequential
reactions and double-displacement reactions.


Sequential reactions. In sequential reactions, all substrates must bind to the 
enzyme before any product is released. Consequently, in a bisubstrate reac- 
tion, a ternary complex of the enzyme and both substrates forms. Sequential 
mechanisms are of two types: ordered, in which the substrates bind the 
enzyme in a defined sequence, and random. 

Many enzymes that have NAD* or NADH as a substrate exhibit the 
ordered sequential mechanism. Consider lactate dehydrogenase, an impor- 
tant enzyme in glucose metabolism (Section 16.1). This enzyme reduces 
pyruvate to lactate while oxidizing NADH to NAD+.


$$\text{Pyruvate} + \mathrm{NADH} + \mathrm{H^+} \rightleftharpoons \text{Lactate} + \mathrm{NAD^+}$$


In the ordered sequential mechanism, the coenzyme always binds first and 
the lactate is always released first. This sequence can be represented by 
using a notation developed by W. Wallace Cleland: 


          NADH     Pyruvate                    Lactate     NAD+
            |         |                           ^          ^
            v         v                           |          |
Enzyme --------------------------------------------------------- Enzyme
              E (NADH) (pyruvate)  <=>  E (lactate) (NAD+)


The enzyme exists as a ternary complex consisting of, first, the enzyme and 
substrates and, after catalysis, the enzyme and products. 

In the random sequential mechanism, the order of the addition of sub- 
strates and the release of products is random. An example of a random 
sequential reaction is the formation of phosphocreatine and ADP from 
ATP and creatine, which is catalyzed by creatine kinase (p. 8). 


$$\text{Creatine} + \mathrm{ATP} \rightleftharpoons \text{Phosphocreatine} + \mathrm{ADP}$$


Either creatine or ATP may bind first, and either phosphocreatine or ADP 
may be released first. Phosphocreatine is an important energy source in 
muscle. Sequential random reactions also can be depicted in the Cleland 
notation. 


Upper path:   ATP, then creatine                 phosphocreatine, then ADP
Enzyme  -->  E (creatine) (ATP)  <=>  E (phosphocreatine) (ADP)  -->  Enzyme
Lower path:   creatine, then ATP                 ADP, then phosphocreatine


Although the order of certain events is random, the reaction still passes through 
the ternary complexes including, first, substrates and, then, products. 


Double-displacement (ping-pong) reactions. In double-displacement, or
ping-pong, reactions, one or more products are released before all substrates
bind the enzyme. The defining feature of double-displacement reactions is
the existence of a substituted enzyme intermediate, in which the enzyme is
temporarily modified. Reactions that shuttle amino groups between amino
acids and α-ketoacids are classic examples of double-displacement
mechanisms. The enzyme aspartate aminotransferase catalyzes the transfer of an
amino group from aspartate to α-ketoglutarate.


$$\text{Aspartate} + \alpha\text{-Ketoglutarate} \rightleftharpoons \text{Oxaloacetate} + \text{Glutamate}$$


The sequence of events can be portrayed as the following Cleland notation:

        Aspartate      Oxaloacetate      α-Ketoglutarate      Glutamate
            |               ^                   |                 ^
            v               |                   v                 |
Enzyme ------------------------------------------------------------------ Enzyme
  E (aspartate) <=> (E-NH3) (oxaloacetate)   (E-NH3)   (E-NH3) (α-ketoglutarate) <=> E (glutamate)


After aspartate binds to the enzyme, the enzyme accepts aspartate’s amino 
group to form the substituted enzyme intermediate. The first product, 
oxaloacetate, subsequently departs. The second substrate, α-ketoglutarate,
binds to the enzyme, accepts the amino group from the modified enzyme, 
and is then released as the final product, glutamate. In the Cleland notation, 
the substrates appear to bounce on and off the enzyme much as a Ping-Pong 
ball bounces on a table. 


Allosteric enzymes do not obey Michaelis-Menten kinetics 


The Michaelis-Menten model has greatly assisted the development of
enzymology. Its virtues are simplicity and broad applicability. However, the
Michaelis-Menten model cannot account for the kinetic properties of many
enzymes. An important group of enzymes that do not obey Michaelis-Menten
kinetics are the allosteric enzymes. These enzymes consist of multiple
subunits and multiple active sites.

Allosteric enzymes often display sigmoidal plots (Figure 8.13) of the
reaction velocity V_0 versus substrate concentration [S], rather than the
hyperbolic plots predicted by the Michaelis-Menten equation (equation 26).
In allosteric enzymes, the binding of substrate to one active site can alter
the properties of other active sites in the same enzyme molecule. A possible
outcome of this interaction between subunits is that the binding of substrate
becomes cooperative; that is, the binding of substrate to one active site
facilitates the binding of substrate to the other active sites. As it does for
hemoglobin (Chapter 7), such cooperativity results in a sigmoidal plot of V_0
versus [S]. In addition, the activity of an allosteric enzyme may be altered
by regulatory molecules that are reversibly bound to specific sites other than
the catalytic sites. The catalytic properties of allosteric enzymes can thus
be adjusted to meet the immediate needs of a cell (Chapter 10). For this
reason, allosteric enzymes are key regulators of metabolic pathways.


Figure 8.13 Kinetics for an allosteric enzyme. Allosteric enzymes display a
sigmoidal dependence of reaction velocity (V_0) on substrate concentration
([S]).

Figure 8.14 Distinction between reversible inhibitors. (A) Enzyme-substrate
complex; (B) a competitive inhibitor binds at the active site and thus
prevents the substrate from binding; (C) an uncompetitive inhibitor binds only
to the enzyme-substrate complex; (D) a noncompetitive inhibitor does not
prevent the substrate from binding.

Figure 8.15 Enzyme inhibitors. The substrate dihydrofolate and its structural
analog methotrexate. Regions with structural differences are shown in red.

8.5 Enzymes Can Be Inhibited by Specific Molecules


The activity of many enzymes can be inhibited by the binding of specific small 
molecules and ions. This means of inhibiting enzyme activity serves as a major 
control mechanism in biological systems, typified by the regulation of allosteric 
enzymes. In addition, many drugs and toxic agents act by inhibiting enzymes 
(Chapter 36). Inhibition can be a source of insight into the mechanism of 
enzyme action: specific inhibitors can often be used to identify residues critical 
for catalysis. Transition-state analogs are especially potent inhibitors. 

Enzyme inhibition can be either irreversible or reversible. An irreversible 
inhibitor dissociates very slowly from its target enzyme because it has 
become tightly bound to the enzyme, either covalently or noncovalently. 
Some irreversible inhibitors are important drugs. Penicillin acts by cova- 
lently modifying the enzyme transpeptidase, thereby preventing the syn- 
thesis of bacterial cell walls and thus killing the bacteria (p. 244). Aspirin 
acts by covalently modifying the enzyme cyclooxygenase, reducing the 
synthesis of signaling molecules in inflammation.

Reversible inhibition, in contrast with irreversible inhibition, is charac- 
terized by a rapid dissociation of the enzyme—inhibitor complex. In the type 
of reversible inhibition called competitive inhibition, an enzyme can bind 
substrate (forming an ES complex) or inhibitor (EI) but not both (ESI, 
enzyme-substrate-inhibitor complex). The competitive inhibitor often 
resembles the substrate and binds to the active site of the enzyme (Figure 
8.14). The substrate is thereby prevented from binding to the same active 
site. A competitive inhibitor diminishes the rate of catalysis by reducing the 
proportion of enzyme molecules bound to a substrate. At any given inhibitor 
concentration, competitive inhibition can be relieved by increasing the sub- 
strate concentration. Under these conditions, the substrate successfully 
competes with the inhibitor for the active site. Methotrexate is an especially 
potent competitive inhibitor of the enzyme dihydrofolate reductase, which 
plays a role in the biosynthesis of purines and pyrimidines. Methotrexate is 
a structural analog of dihydrofolate, a substrate for dihydrofolate reductase 
(Figure 8.15). What makes it such a potent competitive inhibitor is that it 
binds to the enzyme 1000 times as tightly as the natural substrate binds, and 
it inhibits nucleotide base synthesis. It is used to treat 
cancer. 

Uncompetitive inhibition is distinguished by the 
fact that the inhibitor binds only to the enzyme— 
substrate complex. The uncompetitive inhibitor’s 
binding site is created only on interaction of the 
enzyme and substrate (see Figure 8.14C). 
Uncompetitive inhibition cannot be overcome by the 
addition of more substrate. 

In noncompetitive inhibition, the inhibitor and sub- 
strate can bind simultaneously to an enzyme molecule 
at different binding sites (see Figure 8.14D). Unlike 
uncompetitive inhibition, a noncompetitive inhibitor 
can bind free enzyme or the enzyme-substrate com- 
plex. A noncompetitive inhibitor acts by decreasing 
the concentration of functional enzyme rather than by 
diminishing the proportion of enzyme molecules that 
are bound to substrate. The net effect is to decrease 
the turnover number. Noncompetitive inhibition, like 
uncompetitive inhibition, cannot be overcome by 
increasing the substrate concentration. A more com- 
plex pattern, called mixed inhibition, is produced 


when a single inhibitor both hinders the binding of substrate and decreases 
the turnover number of the enzyme. 


Reversible inhibitors are kinetically distinguishable 


How can we determine whether a reversible inhibitor acts by competitive, 
uncompetitive, or noncompetitive inhibition? Let us consider only enzymes 
that exhibit Michaelis-Menten kinetics. Measurements of the rates of 
catalysis at different concentrations of substrate and inhibitor serve to dis- 
tinguish the three types of inhibition. In competitive inhibition, the inhibitor 
competes with the substrate for the active site. The dissociation constant for 
the inhibitor is given by 


$$K_i = [\mathrm{E}][\mathrm{I}]/[\mathrm{EI}]$$


The smaller the K_i, the more potent the inhibition. The hallmark of
competitive inhibition is that it can be overcome by a sufficiently high
concentration of substrate (Figure 8.16). The effect of a competitive
inhibitor is to increase the apparent value of K_M, meaning that more
substrate is needed to obtain the same reaction rate. This new value of K_M,
called K_M^app, is numerically equal to

$$K_M^{\mathrm{app}} = K_M(1 + [\mathrm{I}]/K_i)$$

where [I] is the concentration of inhibitor and K_i is the dissociation
constant for the enzyme-inhibitor complex. In the presence of a competitive
inhibitor, an enzyme will have the same V_max as in the absence of an
inhibitor. At a sufficiently high substrate concentration, virtually all the
active sites are filled by substrate, and the enzyme is fully operative.
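
To make the effect of a competitive inhibitor concrete, the sketch below combines the expression for the apparent K_M with the Michaelis-Menten equation (equation 26). All parameter values are hypothetical and the function names are our own.

```python
def apparent_km_competitive(km, i, ki):
    """Apparent KM in the presence of a competitive inhibitor: KM(1 + [I]/Ki)."""
    return km * (1.0 + i / ki)


def competitive_velocity(s, i, vmax, km, ki):
    """Michaelis-Menten velocity with KM replaced by its apparent value."""
    return vmax * s / (s + apparent_km_competitive(km, i, ki))


vmax, km, ki = 100.0, 5.0, 1.0   # hypothetical values (uM/s, uM, uM)
inhibitor = 1.0                  # [I] = Ki, so the apparent KM doubles

for s in (5.0, 10.0, 1000.0, 100000.0):
    v_free = competitive_velocity(s, 0.0, vmax, km, ki)
    v_inh = competitive_velocity(s, inhibitor, vmax, km, ki)
    print(f"[S] = {s:>8.0f} uM   no inhibitor: {v_free:6.2f}   with inhibitor: {v_inh:6.2f}")
```

With [I] equal to K_i, half-maximal velocity is reached at twice the original K_M, yet at very high [S] the inhibited and uninhibited rates converge on the same V_max.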

Competitive inhibitors are commonly used as drugs. Drugs such as ibuprofen
are competitive inhibitors of enzymes that participate in signaling
pathways in the inflammatory response. Statins are drugs that reduce high
cholesterol levels by competitively inhibiting a key enzyme in cholesterol
biosynthesis.

In uncompetitive inhibition, the inhibitor binds only to the ES complex. This
enzyme-substrate-inhibitor complex, ESI, does not go on to form any product.
Because some unproductive ESI complex will always be present, V_max will be
lower in the presence of inhibitor than in its absence (Figure 8.17). The
uncompetitive inhibitor lowers the apparent value of K_M because the inhibitor
binds to ES to form ESI, depleting ES. To maintain the equilibrium between E
and ES, more S binds to E. Thus, a lower concentration of S is required to
form half of the maximal concentration of ES and the apparent value of K_M is
reduced. The herbicide glyphosate, also known as Roundup, is an uncompetitive
inhibitor of an enzyme in the biosynthetic pathway for aromatic amino acids.

In noncompetitive inhibition (Figure 8.18), substrate can still bind to
the enzyme-inhibitor complex. However, the enzyme-inhibitor-substrate
complex does not proceed to form product. The value of V_max is decreased
to a new value called V_max^app, whereas the value of K_M is unchanged. The
maximal velocity in the presence of a pure noncompetitive inhibitor,
V_max^app, is given by

$$V_{\max}^{\mathrm{app}} = \frac{V_{\max}}{1 + [\mathrm{I}]/K_i} \tag{36}$$


Figure 8.16 Kinetics of a competitive inhibitor. As the concentration of a
competitive inhibitor increases, higher concentrations of substrate are
required to attain a particular reaction velocity. The reaction pathway
suggests how sufficiently high concentrations of substrate can completely
relieve competitive inhibition.

Figure 8.17 Kinetics of an uncompetitive inhibitor. The reaction pathway
shows that the inhibitor binds only to the enzyme-substrate complex.
Consequently, V_max cannot be attained, even at high substrate
concentrations. The apparent value for K_M is lowered, becoming smaller as
more inhibitor is added.

Figure 8.18 Kinetics of a noncompetitive inhibitor. The reaction pathway
shows that the inhibitor binds both to free enzyme and to an enzyme-substrate
complex. Consequently, as with uncompetitive inhibition, V_max cannot be
attained. K_M remains unchanged, and so the reaction rate increases more
slowly at low substrate concentrations than is the case for uncompetitive
inhibition.

Figure 8.19 Competitive inhibition illustrated on a double-reciprocal plot.
A double-reciprocal plot of enzyme kinetics in the presence and absence of a
competitive inhibitor illustrates that the inhibitor has no effect on V_max
but increases K_M.

Figure 8.20 Uncompetitive inhibition illustrated by a double-reciprocal plot.
An uncompetitive inhibitor does not affect the slope of the double-reciprocal
plot. V_max and K_M are reduced by equivalent amounts.

Figure 8.21 Noncompetitive inhibition illustrated on a double-reciprocal
plot. A double-reciprocal plot of enzyme kinetics in the presence and absence
of a noncompetitive inhibitor shows that K_M is unaltered and V_max is
decreased.


Why is Vinax lowered though Ky, remains unchanged? In essence, the 
inhibitor simply lowers the concentration of functional enzyme. The result- 
ing solution behaves asa more dilute solution ofenzyme does. Noncompetitive 
inhibition cannot be overcome by increasing the substrate concentration. 
Deoxycycline, an antibiotic, functions at low concentrations as a noncom- 
petitive inhibitor of a proteolytic enzyme (collagenase). It is used to treat 
periodontal disease. Some of the toxic effects of lead poisoning may be due 
to lead’s ability to act as a noncompetitive inhibitor of a host of enzymes. 
Lead reacts with crucial sulfhydryl groups in these enzymes. 
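
The "more dilute enzyme" picture of pure noncompetitive inhibition can be checked numerically. The following is a minimal Python sketch, not part of the original text: the function names and all numerical values are illustrative assumptions, chosen only so that [I] = Ki halves the apparent maximal velocity.

```python
def vmax_app_noncompetitive(vmax, inhibitor, ki):
    """Apparent Vmax under pure noncompetitive inhibition (equation 36)."""
    return vmax / (1.0 + inhibitor / ki)


def michaelis_menten(s, vmax, km):
    """Initial velocity V0 from the Michaelis-Menten equation."""
    return vmax * s / (s + km)


# Assumed, illustrative values (arbitrary but consistent units).
VMAX, KM, KI = 100.0, 1e-4, 1e-3
INHIBITOR = 1e-3  # [I] = Ki, so half of the enzyme remains functional

vmax_app = vmax_app_noncompetitive(VMAX, INHIBITOR, KI)

for s in (1e-5, 1e-4, 1e-2):
    uninhibited = michaelis_menten(s, VMAX, KM)
    inhibited = michaelis_menten(s, vmax_app, KM)  # KM is unchanged
    # The ratio is the same at every [S]: raising [S] does not relieve
    # the inhibition, just as diluting the enzyme would not.
    print(f"[S]={s:.0e}  V0={uninhibited:.2f}  inhibited V0={inhibited:.2f}  "
          f"ratio={inhibited / uninhibited:.2f}")
```

Because the inhibited and uninhibited velocities differ by the same factor at every substrate concentration, the substrate concentration giving half-maximal velocity stays at KM.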

Double-reciprocal plots are especially useful for distinguishing between
competitive, uncompetitive, and noncompetitive inhibitors. In competitive
inhibition, the intercept on the y-axis of the plot of 1/V0 versus 1/[S] is
the same in the presence and in the absence of inhibitor, although the slope
is increased (Figure 8.19). The intercept is unchanged because a competitive
inhibitor does not alter Vmax. The increase in the slope of the 1/V0 versus
1/[S] plot indicates the strength of binding of a competitive inhibitor. In
the presence of a competitive inhibitor, equation 27 is replaced by

$$\frac{1}{V_0} = \frac{1}{V_{\max}} + \frac{K_M}{V_{\max}}\left(1 + \frac{[\mathrm{I}]}{K_i}\right)\left(\frac{1}{[\mathrm{S}]}\right) \tag{37}$$

In other words, the slope of the plot is increased by the factor
(1 + [I]/Ki) in the presence of a competitive inhibitor. Consider an enzyme
with a KM of 10⁻⁴ M. In the absence of inhibitor, V0 = Vmax/2 when
[S] = 10⁻⁴ M. In the presence of a 2 × 10⁻³ M competitive inhibitor that is
bound to the enzyme with a Ki of 10⁻³ M, the apparent KM (KM^app) will be
equal to KM(1 + [I]/Ki), or 3 × 10⁻⁴ M. Substitution of these values into
equation 37 gives V0 = Vmax/4 when [S] = 10⁻⁴ M. The presence of the
competitive inhibitor thus cuts the reaction rate in half at this substrate
concentration.
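
The worked example above can be reproduced in a few lines. This is a minimal Python sketch under stated assumptions: the function name is hypothetical, Vmax is set to 1 so that velocities read as fractions of Vmax, and the inhibitor concentration and Ki are taken as 2 × 10⁻³ M and 10⁻³ M, so that [I]/Ki = 2, as the stated apparent KM of 3 × 10⁻⁴ M requires.

```python
def v0_competitive(s, vmax, km, inhibitor=0.0, ki=1.0):
    """V0 with a competitive inhibitor: only the apparent KM changes."""
    km_app = km * (1.0 + inhibitor / ki)
    return vmax * s / (s + km_app)


KM = 1e-4         # M
KI = 1e-3         # M (assumed value; only the ratio [I]/Ki matters here)
INHIBITOR = 2e-3  # M (assumed value)
S = 1e-4          # M
VMAX = 1.0        # velocities are then fractions of Vmax

without_inhibitor = v0_competitive(S, VMAX, KM)
with_inhibitor = v0_competitive(S, VMAX, KM, INHIBITOR, KI)

print(f"V0/Vmax without inhibitor: {without_inhibitor:.2f}")
print(f"V0/Vmax with inhibitor:    {with_inhibitor:.2f}")

# At very high [S] the inhibition is relieved: V0 approaches Vmax again.
print(f"V0/Vmax at [S] = 1 M:      {v0_competitive(1.0, VMAX, KM, INHIBITOR, KI):.4f}")
```

The last line illustrates why competitive inhibition, unlike the other two reversible types, can be overcome by raising the substrate concentration.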

In uncompetitive inhibition (Figure 8.20), the inhibitor combines only
with the enzyme-substrate complex. The equation that describes the
double-reciprocal plot for an uncompetitive inhibitor is

$$\frac{1}{V_0} = \frac{K_M}{V_{\max}}\,\frac{1}{[\mathrm{S}]} + \frac{1}{V_{\max}}\left(1 + \frac{[\mathrm{I}]}{K_i}\right) \tag{38}$$

The slope of the line, KM/Vmax, is the same as that for the uninhibited
enzyme, but the intercept on the y-axis will be increased by 1 + [I]/Ki.
Consequently, the lines in double-reciprocal plots will be parallel.

In noncompetitive inhibition (Figure 8.21), the inhibitor can combine
with either the enzyme or the enzyme-substrate complex. In pure
noncompetitive inhibition, the values of the dissociation constants of the
inhibitor and enzyme and of the inhibitor and enzyme-substrate complex are
equal. The value of Vmax is decreased to the new value Vmax^app, and so the
intercept on the vertical axis is increased. The new slope, which is equal
to KM/Vmax^app, is larger by the same factor. In contrast with Vmax, KM is
not affected by pure noncompetitive inhibition.
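
The three double-reciprocal patterns described above can be summarized by how the factor (1 + [I]/Ki) enters the slope and the y-intercept. The sketch below is illustrative only; the function name, the `mode` strings, and the numbers are assumptions rather than anything defined in the text.

```python
def double_reciprocal_line(vmax, km, inhibitor=0.0, ki=1.0, mode="none"):
    """Return (slope, y_intercept) of the 1/V0 versus 1/[S] line."""
    factor = 1.0 + inhibitor / ki
    slope, intercept = km / vmax, 1.0 / vmax
    if mode == "competitive":        # slope up, intercept unchanged
        slope *= factor
    elif mode == "uncompetitive":    # parallel lines, intercept up
        intercept *= factor
    elif mode == "noncompetitive":   # slope and intercept up by the same factor
        slope *= factor
        intercept *= factor
    elif mode != "none":
        raise ValueError(f"unknown inhibition mode: {mode}")
    return slope, intercept


# Assumed, illustrative values with [I] = Ki (factor of 2).
for mode in ("none", "competitive", "uncompetitive", "noncompetitive"):
    slope, intercept = double_reciprocal_line(2.0, 4.0, 1.0, 1.0, mode)
    x_intercept = -intercept / slope  # equals -1/KM(apparent)
    print(f"{mode:15s} slope={slope:.2f}  y-int={intercept:.2f}  x-int={x_intercept:.3f}")
```

The x-intercept, which equals the negative reciprocal of the apparent KM, is unchanged only in the noncompetitive case, which is one way to read "KM is not affected" directly off the plot.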

In Chapter 9, we will examine the chemical details of how enzymes function.
The first step in obtaining the chemical mechanism of an enzyme is to deter- 
mine what functional groups are required for enzyme activity. How can we 
ascertain what these functional groups are? X-ray crystallography of the 
enzyme bound to its substrate or substrate analog provides one approach. 
Irreversible inhibitors that covalently bond to the enzyme provide an alter- 
native and often complementary approach: the inhibitors modify the func- 
tional groups, which can then be identified. Irreversible inhibitors can be 
divided into three categories: group-specific reagents, reactive substrate ana- 
logs (also called affinity labels), and suicide inhibitors. 

Group-specific reagents react with specific side chains of amino acids. An
example of a group-specific reagent is diisopropylphosphofluoridate (DIPF).
DIPF modifies only 1 of the 28 serine residues in the proteolytic enzyme
chymotrypsin, implying that this serine residue is especially reactive. We
will see in Chapter 9 that this serine residue is indeed located at the
active site. DIPF also revealed a reactive serine residue in
acetylcholinesterase, an enzyme important in the transmission of nerve
impulses (Figure 8.22). Thus, DIPF and similar compounds that bind and
inactivate acetylcholinesterase are potent nerve gases. Many group-specific
reagents do not display the exquisite specificity shown by DIPF.
Consequently, more specific means of modifying the active site are required.

Figure 8.22 Enzyme inhibition by diisopropylphosphofluoridate (DIPF), a
group-specific reagent. DIPF can inhibit an enzyme by covalently modifying a
crucial serine residue.
[Scheme labels: Acetylcholinesterase; DIPF; Inactivated enzyme]

Affinity labels, or reactive substrate analogs, are molecules that are
structurally similar to the substrate for an enzyme and that covalently
bind to active-site residues. They are thus more specific for the enzyme's
active site than are group-specific reagents. Tosyl-L-phenylalanine
chloromethyl ketone (TPCK) is a substrate analog for chymotrypsin
(Figure 8.23). TPCK binds at the active site and then reacts irreversibly
with a histidine residue at that site, inhibiting the enzyme. The compound
3-bromoacetol phosphate is an affinity label for the enzyme triose
phosphate isomerase (TPI). It mimics the normal substrate,
dihydroxyacetone phosphate, by binding at the active site; then it
covalently modifies the enzyme such that the enzyme is irreversibly
inhibited (Figure 8.24).

Figure 8.23 Affinity labeling. (A) Tosyl-L-phenylalanine chloromethyl
ketone (TPCK) is a reactive analog of the normal substrate for the enzyme
chymotrypsin. (B) TPCK binds at the active site of chymotrypsin and modifies
an essential histidine residue.

Figure 8.24 Bromoacetol phosphate, an affinity label for triose phosphate
isomerase (TPI). Bromoacetol phosphate, an analog of dihydroxyacetone
phosphate, binds at the active site of the enzyme and covalently modifies a
glutamic acid residue required for enzyme activity.
[Scheme labels: Triose phosphate isomerase (TPI); Bromoacetol phosphate;
Inactivated enzyme]

Suicide inhibitors, or mechanism-based inhib- 
itors, are modified substrates that provide the 
most specific means for modifying an enzyme’s active site. The inhibitor 
binds to the enzyme as a substrate and is initially processed by the normal 
catalytic mechanism. The mechanism of catalysis then generates a chemi- 
cally reactive intermediate that inactivates the enzyme through covalent 
modification. The fact that the enzyme participates in its own irreversible 
inhibition strongly suggests that the covalently modified group on the 
enzyme is vital for catalysis. One example of such an inhibitor is N,N- 
dimethylpropargylamine, an inhibitor of the enzyme monoamine oxidase 
(MAO). A flavin prosthetic group of monoamine oxidase oxidizes the
N,N-dimethylpropargylamine, which in turn inactivates the enzyme by binding
to N-5 of the flavin prosthetic group (Figure 8.25). Monoamine oxidase 
deaminates neurotransmitters such as dopamine and serotonin, lowering 
their levels in the brain. Parkinson disease is associated with low levels of 
dopamine, and depression is associated with low levels of serotonin. The 
drug (−)deprenyl, which is used to treat Parkinson disease and depression,
is a suicide inhibitor of monoamine oxidase. 


Figure 8.25 Mechanism-based (suicide) inhibition. Monoamine oxidase, an
enzyme that deaminates neurotransmitters, requires the cofactor FAD (flavin
adenine dinucleotide). N,N-Dimethylpropargylamine inhibits monoamine oxidase
by covalently modifying the flavin prosthetic group only after the inhibitor
has been oxidized. The N-5 flavin adduct is stabilized by the addition of a
proton. R represents the remainder of the flavin prosthetic group.
[Scheme labels: Flavin prosthetic group; N,N-Dimethylpropargylamine;
Oxidation; Alkylation; Stably modified flavin of inactivated enzyme]


Transition-state analogs are potent inhibitors of enzymes 


We turn now to compounds that provide the most intimate views of the 
catalytic process itself. Linus Pauling proposed in 1948 that compounds 
resembling the transition state of a catalyzed reaction should be very effec- 
tive inhibitors of enzymes. These mimics are called transition-state analogs. 
The inhibition of proline racemase is an instructive example. The
racemization of proline proceeds through a transition state in which the
tetrahedral α-carbon atom has become trigonal (Figure 8.26). In the trigonal
form, all three bonds are in the same plane; Cα also carries a net negative
charge.

Figure 8.26 Inhibition by transition-state analogs. (A) The isomerization of
L-proline to D-proline by proline racemase, a bacterial enzyme, proceeds
through a planar transition state in which the α-carbon atom is trigonal
rather than tetrahedral. (B) Pyrrole 2-carboxylic acid, a transition-state
analog because of its trigonal geometry, is a potent inhibitor of proline
racemase.
[Structure labels: L-Proline; Planar transition state; D-Proline; Pyrrole
2-carboxylic acid (transition-state analog)]


This symmetric carbanion can be reprotonated on one side to give the L 
isomer or on the other side to give the D isomer. This picture is supported 
by the finding that the inhibitor pyrrole 2-carboxylate binds to the
racemase 160 times as tightly as does proline. The α-carbon atom of this
inhibitor, like that of the transition state, is trigonal. An analog that
also carries a negative charge on Cα would be expected to bind even more
tightly. In general, highly potent and specific inhibitors of enzymes can be
produced by
synthesizing compounds that more closely resemble the transition state 
than the substrate itself. The inhibitory power of transition-state analogs 
underscores the essence of catalysis: selective binding of the transition state. 
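
To give a feel for what "160 times as tightly" means energetically, the ratio can be converted into a difference in binding free energy with ΔΔG = RT ln(ratio). The sketch below is an illustration rather than part of the original argument; it assumes that the 160-fold figure is a ratio of dissociation constants and that the temperature is 25 °C, and the function name is hypothetical.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1


def binding_energy_difference_kj(ratio, temperature_k=298.15):
    """Extra binding free energy (kJ/mol) implied by a ratio of affinities."""
    return R * temperature_k * math.log(ratio) / 1000.0


# Pyrrole 2-carboxylate versus proline, assuming a 160-fold ratio of
# dissociation constants at 25 degrees C.
extra = binding_energy_difference_kj(160)
print(f"Extra binding energy: {extra:.1f} kJ/mol")
```

Under these assumptions the analog is bound more favorably than the substrate by roughly 12 to 13 kJ/mol, which is on the order of a single good noncovalent interaction.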


Catalytic antibodies demonstrate the importance of selective binding 
of the transition state to enzymatic activity 


Antibodies that recognize transition states should function as catalysts, if 
our understanding of the importance of the transition state to catalysis is 
correct. The preparation of an antibody that catalyzes the insertion of a 
metal ion into a porphyrin nicely illustrates the validity of this approach. 
Ferrochelatase, the final enzyme in the biosynthetic pathway for the
production of heme, catalyzes the insertion of Fe²⁺ into protoporphyrin IX.
The nearly planar porphyrin must be bent for iron to enter. 

The challenge was to find a transition-state analog for this metallation 
reaction that could be used as an antigen (immunogen) to generate an anti- 
body. The solution came from studies showing that an alkylated porphyrin, 
N-methylmesoporphyrin, is a potent inhibitor of ferrochelatase. This
compound resembles the transition state because N-alkylation forces the
porphyrin to be bent. Moreover, N-alkylporphyrins were known to chelate
metal ions 10⁴ times as fast as their unalkylated counterparts do. Bending
increases
the exposure of the pyrrole nitrogen lone pairs of electrons to solvent, which 
enables the binding of the iron ion. 

Figure 8.27 N-Methylmesoporphyrin is a transition-state analog used to
generate catalytic antibodies. The insertion of a metal ion into a porphyrin
by ferrochelatase proceeds through a transition state in which the porphyrin
is bent. N-Methylmesoporphyrin, a bent porphyrin that resembles the
transition state of the ferrochelatase-catalyzed reaction, was used to
generate an antibody that also catalyzes the insertion of a metal ion into a
porphyrin ring.

Figure 8.28 The reactive site of penicillin is the peptide bond of its
β-lactam ring. (A) Structural formula of penicillin. (B) Representation of
benzylpenicillin.

Figure 8.29 Schematic representation of the peptidoglycan in Staphylococcus
aureus. The sugars are shown in yellow, the tetrapeptides in red, and the
pentaglycine bridges in blue. The cell wall is a single, enormous,
bag-shaped macromolecule because of extensive cross-linking.

An antibody catalyst was produced with the use of an N-alkylporphyrin
as the antigen. The resulting antibody presumably distorts a planar
porphyrin to facilitate the entry of a metal ion (Figure 8.27). On average,
an antibody molecule metallated 80 porphyrin molecules per hour, a rate only
10-fold less than that of ferrochelatase, and 2500-fold faster than the
uncatalyzed reaction. Catalytic antibodies (abzymes) can indeed be produced
by
using transition-state analogs as antigens. Antibodies catalyzing many other 
kinds of chemical reactions—exemplified by ester and amide hydrolysis, 
amide-bond formation, transesterification, photoinduced cleavage, photo- 
induced dimerization, decarboxylation, and oxidization—have been pro- 
duced with the use of similar strategies. Studies with transition-state ana- 
logs provide strong evidence that enzymes can function by assuming a 
conformation in the active site that is complementary in structure to the 
transition state. The power of transition-state analogs is now evident: (1) they 
are sources of insight into catalytic mechanisms, (2) they can serve as potent 
and specific inhibitors of enzymes, and (3) they can be used as immunogens to 
generate a wide range of novel catalysts. 
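
The rate comparison for the catalytic antibody can be made concrete with a little arithmetic. The sketch below is only an illustration: it treats the "10-fold" and "2500-fold" figures quoted above as exact ratios of per-molecule rates, which is an assumption, and the variable names are invented for the example.

```python
ABZYME_RATE_PER_HOUR = 80.0      # porphyrins metallated per antibody per hour
FOLD_BELOW_ENZYME = 10.0         # antibody is 10-fold slower than ferrochelatase
FOLD_ABOVE_UNCATALYZED = 2500.0  # antibody is 2500-fold faster than no catalyst

# Rates implied by the two ratios (assumed exact for this illustration).
ferrochelatase_rate = ABZYME_RATE_PER_HOUR * FOLD_BELOW_ENZYME
uncatalyzed_rate = ABZYME_RATE_PER_HOUR / FOLD_ABOVE_UNCATALYZED

print(f"Implied ferrochelatase rate: {ferrochelatase_rate:.0f} per hour")
print(f"Implied uncatalyzed rate:    {uncatalyzed_rate:.3f} per hour")
print(f"Enzyme versus uncatalyzed:   {ferrochelatase_rate / uncatalyzed_rate:.0f}-fold")
```

On these assumptions the antibody recovers most of the rate enhancement achieved by the natural enzyme, which is the point of the comparison.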


Penicillin irreversibly inactivates a key enzyme in 
bacterial cell-wall synthesis 


Penicillin, the first antibiotic discovered, provides us with an example
of a clinically useful suicide inhibitor. Penicillin consists of a
thiazolidine ring fused to a β-lactam ring to which a variable R group is
attached by a peptide bond (Figure 8.28A). In benzylpenicillin, for example,
R is a benzyl group (Figure 8.28B). This structure can undergo a variety of
rearrangements, and, in particular, the β-lactam ring is very labile.
Indeed, this instability is
closely tied to the antibiotic action of penicillin, as will be evident shortly. 
How does penicillin inhibit bacterial growth? Let us consider 
Staphylococcus aureus, the most common cause of staph infections. Penicillin 
works by interfering with the synthesis of the S. aureus cell walls. The 
S. aureus cell wall is made up of a macromolecule, called a peptidoglycan 
(Figure 8.29), which consists of linear polysaccharide chains that are cross- 
linked by short peptides (pentaglycines and tetrapeptides). The enormous 
bag-shaped peptidoglycan confers mechanical support and prevents bacte- 
ria from bursting in response to their high internal osmotic pressure. 

Figure 8.30 Formation of cross-links in S. aureus peptidoglycan. The
terminal amino group of the pentaglycine bridge in the cell wall attacks the
peptide bond between two D-alanine residues to form a cross-link.

make the peptidoglycan so stable (Figure 8.30). Bacterial cell walls are
unique in containing D amino acids, which form cross-links by a mechanism
different from that used to synthesize proteins.
Penicillin inhibits the cross-linking transpeptidase by the Trojan horse 
stratagem. The transpeptidase normally forms an acyl intermediate with the 
penultimate D-alanine residue of the D-Ala-D-Ala peptide (Figure 8.31). 
This covalent acyl-enzyme intermediate then reacts with the amino group 
of the terminal glycine in another peptide to form the cross-link. Penicillin 
is welcomed into the active site of the transpeptidase because it mimics the 
D-Ala-D-Ala moiety of the normal substrate (Figure 8.32). Bound penicillin 
then forms a covalent bond with a serine residue at the active site of the 
enzyme. This penicilloyl-enzyme does not react further. Hence, the transpep- 
tidase is irreversibly inhibited and cell-wall synthesis cannot take place. 


Figure 8.32 Conformations of penicillin and a normal substrate. The
conformation of penicillin in the vicinity of its reactive peptide bond (A)
resembles the postulated conformation of the transition state of
R-D-Ala-D-Ala (B) in the transpeptidation reaction. Yellow bonds highlight
similar conformation. [After B. Lee. J. Mol. Biol. 61:463-469, 1971.]
[Panel labels: (A) Penicillin; (B) R-D-Ala-D-Ala peptide]


Why is penicillin such an effective inhibitor of the transpeptidase? The 
highly strained, four-membered β-lactam ring of penicillin makes it
especially reactive. On binding to the transpeptidase, the serine residue at
the
active site attacks the carbonyl carbon atom of the lactam ring to form the 
penicilloyl-serine derivative (Figure 8.33). Because the peptidase partici- 
pates in its own inactivation, penicillin acts as a suicide inhibitor. 

Figure 8.33 Formation of a penicilloyl-enzyme complex. Penicillin reacts
with the transpeptidase to form an inactive complex, which is indefinitely
stable.

Figure 8.34 Single-molecule studies can reveal molecular heterogeneity.
(A) Complex biomolecules, such as enzymes, display molecular heterogeneity.
(B) When measuring an enzyme property using ensemble methods, an average
value of all of the enzymes present is the result. (C) Single-enzyme studies
reveal molecular heterogeneity, with the various forms showing different
properties.


8.6 Enzymes Can Be Studied 
One Molecule at a Time 


Most experiments that are performed to determine an enzyme characteristic
require an enzyme preparation in a buffered solution. Even a few
microliters of such a solution will contain millions of enzyme molecules. 
Much that we have learned about enzymes thus far has come from such 
experiments, called ensemble studies. A basic assumption of ensemble 
studies is that all of the enzymes are the same or very similar. When we 
determine an enzyme property such as the value of KM in ensemble studies,
that value is of necessity an average value of all of the enzymes present.
However, we know that molecular heterogeneity, the ability of a molecule, 
over time, to assume several different structures that differ slightly in 
stability, is an inherent property of all large biomolecules. Recall that pri- 
ons can exist in two different structures, one of which is prone to aggrega- 
tion (pp. 55-56). How can we tell if this molecular heterogeneity affects 
enzyme activity? 

By way of example, consider a hypothetical situation. A Martian visits 
Earth to learn about higher education. The spacecraft hovers high above 
a university, and our Martian meticulously records how the student 
population moves about campus. Much information can be gathered 
from such studies: where students are likely to be at certain times on 
certain days, which buildings are used when and by how many. Now, 
suppose our visitor developed a high-magnification camera that could 
follow one student throughout the day. Such data would provide a much 
different perspective on college life: What does this student eat? To 
whom does she talk? How much time does she spend studying? This new 
in singulo method, examining one individual at a time, yields a host of 
new information but also illustrates a potential pitfall of studying indi- 
viduals, be they students or enzymes: How can we be certain that the 
student or molecule is representative and not an outlier? This pitfall can 
be overcome by studying enough individuals to satisfy statistical analysis 
for validity. 

Let us leave our Martian to his observations, and consider a more bio- 
chemical situation. Figure 8.34A shows an enzyme that displays molecular 
heterogeneity, with three active forms that catalyze the same reaction but 
at different rates. These forms have slightly different stabilities, but ther- 
mal noise is sufficient to interconvert the forms. Each form is present as a 
fraction of the total enzyme population as indicated. If we were to perform 
an experiment to determine enzyme activity under a particular set of con- 
ditions with the use of ensemble methods, we would get a single value, 
which would represent the average of the heterogeneous assembly (Fig- 
ure 8.34B). However, were we to perform a sufficient number of single- 
molecule experiments, we would discover that the enzyme has three 
different molecular forms with very different activities (Figure 8.34C). 
Moreover, these different forms would most likely correspond to impor- 
tant biochemical differences. 
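
The contrast between an ensemble measurement and a series of single-molecule measurements can be sketched in a few lines of Python. Everything numerical here is hypothetical: the three forms, their population fractions, and their activities are invented for illustration and are not taken from Figure 8.34.

```python
import random
from collections import Counter

# Hypothetical forms: name -> (fraction of population, activity in arbitrary units).
FORMS = {"A": (0.5, 10.0), "B": (0.3, 40.0), "C": (0.2, 100.0)}


def ensemble_activity(forms):
    """An ensemble assay reports one population-weighted average."""
    return sum(fraction * activity for fraction, activity in forms.values())


def single_molecule_activities(forms, n, seed=0):
    """Sample n individual molecules and report each molecule's activity."""
    rng = random.Random(seed)
    names = list(forms)
    weights = [forms[name][0] for name in names]
    picks = rng.choices(names, weights=weights, k=n)
    return [forms[name][1] for name in picks]


print(f"Ensemble value: {ensemble_activity(FORMS):.1f}")

# With enough molecules sampled, the distinct activities become visible.
observed = Counter(single_molecule_activities(FORMS, 1000))
for activity, count in sorted(observed.items()):
    print(f"activity {activity:6.1f}: {count} molecules")
```

Note that in this made-up example no individual molecule has the activity that the ensemble assay reports, and sampling many molecules is what guards against mistaking an outlier for the typical case.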

The development of powerful techniques—such as patch-clamp 
recording, single-molecule fluorescence, and optical tweezers—has 
enabled biochemists to look into the workings of individual molecules. We 
will examine single-molecule studies of membrane channels with the use 
of patch-clamp recording (Section 13.4), ATP-synthesizing complexes 
with the use of single-molecule fluorescence and molecular motors with 
the use of an optical trap (Section 34.2). Single-molecule studies open 
a new vista on the function of enzymes in particular and on all large 
biomolecules in general. 


Summary 


8.1 Enzymes Are Powerful and Highly Specific Catalysts

Most catalysts in biological systems are enzymes, and nearly all enzymes
are proteins. Enzymes are highly specific and have great catalytic
power. They can enhance reaction rates by factors of 10⁶ or more. Many
enzymes require cofactors for activity. Such cofactors can be metal ions
or small, vitamin-derived organic molecules called coenzymes.


8.2 Free Energy Is a Useful Thermodynamic Function for Understanding Enzymes

Free energy (G) is the most valuable thermodynamic function for
determining whether a reaction can take place and for understanding the
energetics of catalysis. A reaction can take place spontaneously only if
the change in free energy (ΔG) is negative. The free-energy change of a
reaction that takes place when reactants and products are at unit
activity is called the standard free-energy change (ΔG°). Biochemists
usually use ΔG°′, the standard free-energy change at pH 7. Enzymes do not
alter reaction equilibria; rather, they increase reaction rates.


8.3 Enzymes Accelerate Reactions by Facilitating the Formation of the Transition State

Enzymes serve as catalysts by decreasing the free energy of activation of
chemical reactions. Enzymes accelerate reactions by providing a reaction
pathway in which the transition state (the highest-energy species)
has a lower free energy and hence is more rapidly formed than in the
uncatalyzed reaction.

The first step in catalysis is the formation of an enzyme-substrate
complex. Substrates are bound to enzymes at active-site clefts from
which water is largely excluded when the substrate is bound. The
specificity of enzyme-substrate interactions arises mainly from hydrogen
bonding, which is directional, and from the shape of the active site,
which rejects molecules that do not have a sufficiently complementary
shape. The recognition of substrates by enzymes is often accompanied
by conformational changes at active sites, and such changes facilitate
the formation of the transition state.


8.4 The Michaelis-Menten Model Accounts for the Kinetic Properties of Many Enzymes

The kinetic properties of many enzymes are described by the
Michaelis-Menten model. In this model, an enzyme (E) combines with a
substrate (S) to form an enzyme-substrate (ES) complex, which can proceed to
form a product (P) or to dissociate into E and S.

$$\mathrm{E} + \mathrm{S} \underset{k_{-1}}{\overset{k_1}{\rightleftharpoons}} \mathrm{ES} \xrightarrow{k_2} \mathrm{E} + \mathrm{P}$$

The rate V0 of formation of product is given by the Michaelis-Menten
equation:

$$V_0 = V_{\max}\frac{[\mathrm{S}]}{[\mathrm{S}] + K_M}$$

in which Vmax is the reaction rate when the enzyme is fully saturated
with substrate and KM, the Michaelis constant, is the substrate
concentration at which the reaction rate is half maximal. The maximal rate,
Vmax, is equal to the product of k2, or kcat, and the total concentration
of enzyme. The kinetic constant kcat, called the turnover number, is
the number of substrate molecules converted into product per unit time
at a single catalytic site when the enzyme is fully saturated with
substrate. Turnover numbers for most enzymes are between 1 and 10⁴ per
second. The ratio of kcat/KM provides a penetrating probe into enzyme
efficiency.
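
The relations summarized above can be written out as a short Python sketch. It is a hedged illustration only: the function names and all numerical values (kcat, KM, total enzyme concentration) are assumptions chosen for convenience, not data from the chapter.

```python
def vmax_from_kcat(kcat, enzyme_total):
    """Vmax is the turnover number times the total enzyme concentration."""
    return kcat * enzyme_total


def michaelis_menten(s, kcat, enzyme_total, km):
    """Initial velocity V0 from the Michaelis-Menten equation."""
    return vmax_from_kcat(kcat, enzyme_total) * s / (s + km)


def catalytic_efficiency(kcat, km):
    """The kcat/KM ratio, in M^-1 s^-1 when kcat is in s^-1 and KM in M."""
    return kcat / km


# Assumed, illustrative values.
KCAT = 100.0     # s^-1
KM = 1e-4        # M
E_TOTAL = 1e-9   # M

print(f"Vmax          = {vmax_from_kcat(KCAT, E_TOTAL):.2e} M/s")
print(f"V0 at [S]=KM  = {michaelis_menten(KM, KCAT, E_TOTAL, KM):.2e} M/s")
print(f"kcat/KM       = {catalytic_efficiency(KCAT, KM):.2e} M^-1 s^-1")

# When [S] is far below KM, V0 is close to (kcat/KM)[E]total[S].
s_low = 1e-8
approx = catalytic_efficiency(KCAT, KM) * E_TOTAL * s_low
print(f"V0 at low [S] = {michaelis_menten(s_low, KCAT, E_TOTAL, KM):.3e} M/s "
      f"(approximation {approx:.3e})")
```

The last comparison shows why kcat/KM is a useful measure of efficiency: at substrate concentrations well below KM it behaves as the effective rate constant for the reaction of free enzyme with substrate.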

Allosteric enzymes constitute an important class of enzymes whose 
catalytic activity can be regulated. These enzymes, which do not con- 
form to Michaelis-Menten kinetics, have multiple active sites. These 
active sites display cooperativity, as evidenced by a sigmoidal depen- 
dence of reaction velocity on substrate concentration. 


8.5 Enzymes Can Be Inhibited by Specific Molecules

Specific small molecules or ions can inhibit even nonallosteric enzymes. 
In irreversible inhibition, the inhibitor is covalently linked to the 
enzyme or bound so tightly that its dissociation from the enzyme is very 
slow. Covalent inhibitors provide a means of mapping the enzyme’s 
active site. In contrast, reversible inhibition is characterized by a more 
rapid equilibrium between enzyme and inhibitor. A competitive inhib- 
itor prevents the substrate from binding to the active site. It reduces the 
reaction velocity by diminishing the proportion of enzyme molecules 
that are bound to substrate. Competitive inhibition can be overcome by 
raising the substrate concentration. In uncompetitive inhibition, the 
inhibitor combines only with the enzyme—substrate complex. In non- 
competitive inhibition, the inhibitor decreases the turnover number. 
Uncompetitive and noncompetitive inhibition cannot be overcome by 
raising the substrate concentration. 

The essence of catalysis is selective stabilization of the transition 
state. Hence, an enzyme binds the transition state more tightly than it 
binds the substrate. Transition-state analogs are stable compounds that 
mimic key features of this highest-energy species. They are potent and 
specific inhibitors of enzymes. Proof that transition-state stabilization 
is a key aspect of enzyme activity comes from the generation of catalytic 
antibodies. Transition-state analogs are used as antigens, or immuno- 
gens, in generating catalytic antibodies. 


8.6 Enzymes Can Be Studied One Molecule at a Time

Many enzymes are now being studied in singulo, at the level of a single 
molecule. Such studies are important because they yield information 
that is difficult to obtain in studies of populations of molecules. Single- 
molecule methods reveal a distribution of enzyme characteristics rather 
than an average value as is acquired with the use of ensemble methods. 


APPENDIX: Enzymes Are Classified on the Basis of the Types of Reactions That They Catalyze


Many enzymes have common names that provide little 
information about the reactions that they catalyze. For 
example, a proteolytic enzyme secreted by the pancreas 
is called trypsin. Most other enzymes are named for 
their substrates and for the reactions that they catalyze, 
with the suffix “ase” added. Thus, a peptide hydrolase 
is an enzyme that hydrolyzes peptide bonds, whereas 
ATP synthase is an enzyme that synthesizes ATP. 

To bring some consistency to the classification of 
enzymes, in 1964 the International Union of Biochemistry
established an Enzyme Commission to develop a nomenclature
for enzymes. Reactions were divided into six
major groups numbered 1 through 6 (Table 8.8). These 
groups were subdivided and further subdivided so that a 
four-digit number preceded by the letters EC for Enzyme 
Commission could precisely identify all enzymes. 

Consider as an example nucleoside monophosphate 
(NMP) kinase, an enzyme that we will examine in detail 
in Section 9.4. It catalyzes the following reaction: 


ATP + NMP ⇌ ADP + NDP

Table 8.8 Six major classes of enzymes

Class               Type of reaction                                               Example                                       Chapter
1. Oxidoreductases  Oxidation-reduction                                            Lactate dehydrogenase                         16
2. Transferases     Group transfer                                                 Nucleoside monophosphate kinase (NMP kinase)  9
3. Hydrolases       Hydrolysis reactions (transfer of functional groups to water)  Chymotrypsin                                  9
4. Lyases           Addition or removal of groups to form double bonds             Fumarase                                      17
5. Isomerases       Isomerization (intramolecular group transfer)                  Triose phosphate isomerase                    16
6. Ligases          Ligation of two substrates at the expense of ATP hydrolysis    Aminoacyl-tRNA synthetase                     30


NMP kinase transfers a phosphoryl group from ATP 
to NMP to form a nucleoside diphosphate (NDP) and 
ADP. Consequently, it is a transferase, or member of 
group 2. Many groups other than phosphoryl groups, 
such as sugars and single-carbon units, can be trans- 
ferred. Transferases that shift a phosphoryl group are 
designated 2.7. Various functional groups can accept 
the phosphoryl group. If a phosphate is the acceptor,
the transferase is designated 2.7.4. The final number
designates the acceptor more precisely. In regard to 
NMP kinase, a nucleoside monophosphate is the 
acceptor, and the enzyme’s designation is EC 2.7.4.4. 
Although the common names are used routinely, the 
classification number is used when the precise identity 
of the enzyme might be ambiguous. 
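
The hierarchical structure of an EC number lends itself to a small parsing example. The following Python sketch is illustrative only: the function names are hypothetical, and the lookup table contains just the six major classes listed in Table 8.8, with no attempt to decode the subclass digits.

```python
# The six major classes as listed in Table 8.8.
EC_CLASSES = {
    1: "Oxidoreductases",
    2: "Transferases",
    3: "Hydrolases",
    4: "Lyases",
    5: "Isomerases",
    6: "Ligases",
}


def parse_ec(ec):
    """Split a string such as 'EC 2.7.4.4' into a tuple of four integers."""
    text = ec.strip()
    if text.upper().startswith("EC"):
        text = text[2:].strip()
    parts = text.split(".")
    if len(parts) != 4:
        raise ValueError(f"expected four dot-separated numbers, got {ec!r}")
    numbers = tuple(int(part) for part in parts)
    if numbers[0] not in EC_CLASSES:
        raise ValueError(f"unknown major class in {ec!r}")
    return numbers


def describe(ec):
    """Name the major class to which an EC number belongs."""
    numbers = parse_ec(ec)
    return f"EC {'.'.join(str(n) for n in numbers)}: {EC_CLASSES[numbers[0]]}"


# NMP kinase: class 2 (transferase), 2.7 (phosphoryl group transferred),
# 2.7.4 (phosphate acceptor), final digit specifies the acceptor.
print(describe("EC 2.7.4.4"))
```

Reading the number from left to right reproduces the narrowing described in the text, from major class down to the specific acceptor.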


Key Terms 


enzyme (p. 220)
substrate (p. 220)
cofactor (p. 221)
apoenzyme (p. 221)
holoenzyme (p. 221)
coenzyme (p. 221)
prosthetic group (p. 221)
free energy (p. 222)
free energy of activation (p. 222)
transition state (p. 225)
active site (p. 227)
induced fit (p. 228)
KM (the Michaelis constant) (p. 231)
Vmax (maximal rate) (p. 232)
Michaelis-Menten equation (p. 232)
Lineweaver-Burk equation (double-reciprocal plot) (p. 233)
turnover number (p. 234)
kcat/KM ratio (p. 235)
sequential reaction (p. 236)
double-displacement (ping-pong) reaction (p. 237)
allosteric enzyme (p. 237)
competitive inhibition (p. 238)
uncompetitive inhibition (p. 238)
noncompetitive inhibition (p. 238)
group-specific reagent (p. 241)
affinity label (reactive substrate analog) (p. 241)
mechanism-based (suicide) inhibition (p. 242)
transition-state analog (p. 243)
catalytic antibody (abzyme) (p. 244)


Problems 


1. Raisons d'être. What are the two properties of enzymes
that make them especially useful catalysts? 


2. Partners. What does an apoenzyme require to become a 
holoenzyme? 


3. Different partners. What are the two main types of 
cofactors? 


4. One a day. Why are vitamins necessary for good health? 


5. A function of state. What is the fundamental mechanism 
by which enzymes enhance the rate of chemical reactions? 


6. Nooks and crannies. What is the structural basis for enzyme 
specificity? 


7. Give with one hand, take with the other. Why does the
activation energy of a reaction not appear in the final ΔG of
the reaction?


8. Mountain climbing. Proteins are thermodynamically
unstable. The ΔG of the hydrolysis of proteins is quite
negative, yet proteins can be quite stable. Explain this
apparent paradox. What does it tell you about protein
synthesis?


9. Protection. Suggest why the enzyme lysozyme, which
degrades cell walls of some bacteria, is present in tears.


10. Stability matters. Transition-state analogs, which can 
be used as enzyme inhibitors and to generate catalytic 
antibodies, are often difficult to synthesize. Suggest a 
reason. 


11. Match'em. Match the K′eq values with the appropriate
ΔG°′ values.


      K′eq      ΔG°′ (kJ mol⁻¹)
(a)   1          28.53
(b)   10⁻⁵      −11.42
(c)   10⁴        5.69
(d)   10²        0
(e)   10⁻¹      −22.84
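The relation that links an equilibrium constant to a standard free-energy change, ΔG°′ = −RT ln K′eq, can be sketched in Python as follows. This is a minimal illustration, assuming 25°C (298 K) and R = 8.315 × 10⁻³ kJ mol⁻¹ K⁻¹; the function names are hypothetical.

```python
import math

R = 8.315e-3   # gas constant in kJ mol^-1 K^-1
T = 298.0      # assumed temperature: 25 degrees C, in kelvin


def delta_g_standard(k_eq: float, temperature: float = T) -> float:
    """Standard free-energy change (kJ/mol) from an equilibrium constant."""
    if k_eq <= 0:
        raise ValueError("an equilibrium constant must be positive")
    return -R * temperature * math.log(k_eq)


def k_eq_from_delta_g(delta_g: float, temperature: float = T) -> float:
    """Equilibrium constant from a standard free-energy change (kJ/mol)."""
    return math.exp(-delta_g / (R * temperature))


# Each tenfold change in K'eq shifts the free-energy change by the same increment.
for k in (0.1, 1.0, 10.0):
    print(f"K'eq = {k:g}: {delta_g_standard(k):+.2f} kJ/mol")
```

Because the dependence is logarithmic, a useful check on any answer is that powers of ten in K′eq map onto equally spaced values of ΔG°′.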


12. Free energy! Assume that you have a solution of 0.1 M 
glucose 6-phosphate. To this solution, you add the enzyme 
phosphoglucomutase, which catalyzes the following reaction: 


Glucose 6-phosphate ⇌ glucose 1-phosphate (catalyzed by phosphoglucomutase)


The ΔG°′ for the reaction is +7.5 kJ mol⁻¹ (+1.8 kcal mol⁻¹).


(a) Does the reaction proceed as written? If so, what are the 
final concentrations of glucose 6-phosphate and glucose 
1-phosphate? 


(b) Under what cellular conditions could you produce
glucose 1-phosphate at a high rate?


13. Free energy, too! Consider the following reaction:
Glucose 1-phosphate ⇌ glucose 6-phosphate


After reactant and product were mixed and allowed to reach 
equilibrium at 25°C, the concentration of each compound 
was measured: 


[Glucose 1-phosphate]eq = 0.01 M
[Glucose 6-phosphate]eq = 0.19 M


Calculate K′eq and ΔG°′.


14. Keeping busy. Many isolated enzymes, if incubated at
37°C, will be denatured. However, if the enzymes are incubated
at 37°C in the presence of substrate, the enzymes are
catalytically active. Explain this apparent paradox.


15. Active yet responsive. What is the biochemical advantage
of having a K_M approximately equal to the substrate
concentration normally available to an enzyme?


16. Angry biochemists. Many biochemists go bananas, and
justifiably, when they see a Michaelis-Menten plot like the
one shown below. To see why, determine the V₀ as a
fraction of V_max when the substrate concentration is equal
to 10 K_M and 20 K_M. Please control your outrage.


[Plot: V₀ versus [S], with V_max marked on the vertical axis]
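How slowly a Michaelis-Menten curve approaches V_max can be explored with a small Python sketch. It simply evaluates V₀/V_max = [S]/([S] + K_M) with [S] expressed in multiples of K_M; the function name is a hypothetical choice.

```python
def fraction_of_vmax(s_over_km: float) -> float:
    """V0/Vmax for a Michaelis-Menten enzyme, with [S] given in units of K_M."""
    if s_over_km < 0:
        raise ValueError("substrate concentration cannot be negative")
    return s_over_km / (1.0 + s_over_km)


# Tabulate the degree of saturation at increasing multiples of K_M.
for multiple in (1, 2, 5, 10, 20, 100):
    print(f"[S] = {multiple:>3} K_M -> V0 = {fraction_of_vmax(multiple):.3f} V_max")
```

The curve is a rectangular hyperbola, so the last few percent of V_max are reached only at substrate concentrations far above K_M.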


17. Hydrolytic driving force. The hydrolysis of pyrophosphate
to orthophosphate is important in driving forward
biosynthetic reactions such as the synthesis of DNA. This
hydrolytic reaction is catalyzed in Escherichia coli by a
pyrophosphatase that has a mass of 120 kd and consists of six
identical subunits. For this enzyme, a unit of activity is
defined as the amount of enzyme that hydrolyzes 10 μmol
of pyrophosphate in 15 minutes at 37°C under standard
assay conditions. The purified enzyme has a V_max of 2800
units per milligram of enzyme.


(a) How many moles of substrate are hydrolyzed per second
per milligram of enzyme when the substrate concentration
is much greater than K_M?


(b) How many moles of active sites are there in 1 mg of
enzyme? Assume that each subunit has one active site.


(c) What is the turnover number of the enzyme? Compare 
this value with others mentioned in this chapter. 


18. Destroying the Trojan horse. Penicillin is hydrolyzed and
thereby rendered inactive by penicillinase (also known as
β-lactamase), an enzyme present in some penicillin-resistant
bacteria. The mass of this enzyme in Staphylococcus aureus
is 29.6 kd. The amount of penicillin hydrolyzed in 1 minute
in a 10-ml solution containing 10⁻⁹ g of purified penicillinase
was measured as a function of the concentration of
penicillin. Assume that the concentration of penicillin does
not change appreciably during the assay.


[Penicillin] (μM)   Amount hydrolyzed (nmol)
1                   0.11
3                   0.25
5                   0.34
10                  0.45
30                  0.58
50                  0.61


(a) Plot V₀ versus [S] and 1/V₀ versus 1/[S] for these data.
Does penicillinase appear to obey Michaelis-Menten
kinetics? If so, what is the value of K_M?


(b) What is the value of V_max?


(c) What is the turnover number of penicillinase under 
these experimental conditions? Assume one active site per 
enzyme molecule. 
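A double-reciprocal analysis of data such as these can be sketched in Python. The sketch below fits a least-squares line to 1/V₀ versus 1/[S] and reads K_M and V_max from the slope and intercept. The function name is hypothetical, and an unweighted fit is an assumption; it gives the greatest leverage to the lowest substrate concentrations, so the estimates are approximate.

```python
def lineweaver_burk_fit(substrate, velocity):
    """Fit 1/V0 = (K_M/V_max)(1/[S]) + 1/V_max; return (K_M, V_max)."""
    x = [1.0 / s for s in substrate]
    y = [1.0 / v for v in velocity]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx                      # equals K_M / V_max
    intercept = mean_y - slope * mean_x    # equals 1 / V_max
    v_max = 1.0 / intercept
    k_m = slope * v_max
    return k_m, v_max


# Data from the penicillinase problem: [penicillin] in micromolar and the
# amount hydrolyzed in 1 minute in nanomoles (so V0 is in nmol per minute).
penicillin_uM = [1, 3, 5, 10, 30, 50]
hydrolyzed_nmol = [0.11, 0.25, 0.34, 0.45, 0.58, 0.61]

k_m, v_max = lineweaver_burk_fit(penicillin_uM, hydrolyzed_nmol)
print(f"K_M is about {k_m:.1f} uM; V_max is about {v_max:.2f} nmol per minute")
```

If the points fall close to a straight line in the double-reciprocal plot, the enzyme is behaving as expected for Michaelis-Menten kinetics.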


19. Counterpoint. Penicillinase (β-lactamase) hydrolyzes
penicillin. Compare penicillinase with glycopeptide
transpeptidase.


20. A different mode. The kinetics of an enzyme are measured
as a function of substrate concentration in the presence and
absence of 100 μM inhibitor.


(a) What are the values of V_max and K_M in the presence
of this inhibitor?


(b) What type of inhibition is it? 
(c) What is the dissociation constant of this inhibitor? 


           Velocity (μmol minute⁻¹)
[S] (μM)   No inhibitor   Inhibitor
3          10.4           2.1
5          14.5           2.9
10         22.5           4.5
30         33.8           6.8
90         40.5           8.1


(d) If [S] = 30 μM, what fraction of the enzyme molecules
have a bound substrate in the presence and in the absence of
100 μM inhibitor?


21. A fresh view. The plot of 1/V₀ versus 1/[S] is sometimes
called a Lineweaver-Burk plot. Another way of
expressing the kinetic data is to plot V₀ versus V₀/[S],
which is known as an Eadie-Hofstee plot.


(a) Rearrange the Michaelis-Menten equation to give V₀
as a function of V₀/[S].


(b) What is the significance of the slope, the vertical intercept,
and the horizontal intercept in a plot of V₀ versus
V₀/[S]?


(c) Sketch a plot of V₀ versus V₀/[S] in the absence of an
inhibitor, in the presence of a competitive inhibitor, and in
the presence of a noncompetitive inhibitor.


22. Competing substrates. Suppose that two substrates, A
and B, compete for an enzyme. Derive an expression relating
the ratio of the rates of utilization of A and B, V_A/V_B,
to the concentrations of these substrates and their values of
k_cat and K_M. (Hint: Express V_A as a function of k_cat/K_M for
substrate A, and do the same for V_B.) Is specificity determined
by K_M alone?


23. A tenacious mutant. Suppose that a mutant enzyme 
binds a substrate 100 times as tightly as does the native 
enzyme. What is the effect of this mutation on catalytic rate 
if the binding of the transition state is unaffected? 


24. More Michaelis-Menten. For an enzyme that follows
simple Michaelis-Menten kinetics, what is the value of
V_max if V₀ is equal to 1 μmol minute⁻¹ at 10 K_M?


25. Controlled paralysis. Succinylcholine is a fast-acting, 
short-duration muscle relaxant that is used when a tube is 
inserted into a patient’s trachea or when a bronchoscope is 
used to examine the trachea and bronchi for signs of cancer. 
Within seconds of the administration of succinylcholine, 
the patient experiences muscle paralysis and is placed on a 
respirator while the examination proceeds. Succinylcholine 
is a competitive inhibitor of acetylcholinesterase, a nervous 
system enzyme, and this inhibition causes paralysis. 
However, succinylcholine is hydrolyzed by blood-serum
cholinesterase, which shows a broader substrate specificity
than does the nervous system enzyme. Paralysis lasts until
the succinylcholine is hydrolyzed by the serum cholinesterase,
usually several minutes later.


(a) As a safety measure, serum cholinesterase is measured
before the examination takes place. Explain why this
measurement is a good idea.


(b) What would happen to the patient if the serum cholinesterase
activity were only 10 units of activity per liter
rather than the normal activity of about 80 units?


(c) Some patients have a mutant form of the serum cholinesterase
that displays a K_M of 10 mM, rather than the normal
1.4 mM. What will be the effect of this mutation on the
patient?


Data Interpretation Problems 


26. Varying the enzyme. For a one-substrate, enzyme-catalyzed
reaction, double-reciprocal plots were determined
for three different enzyme concentrations. Which of the
following three families of curves would you expect to be
obtained? Explain.


[Three candidate plots of 1/V₀ versus 1/[S], each showing a family of lines]


27. Too much of a good thing. A simple Michaelis-Menten
enzyme, in the absence of any inhibitor, displayed the following
kinetic behavior. The expected value of V_max is
shown on the y-axis in the accompanying graph.


(a) Draw a double-reciprocal plot that corresponds to the
velocity-versus-substrate curve.


(b) Explain the kinetic results. 


28. Rate-limiting step. In the conversion of A into D in the
following biochemical pathway, enzymes E_A, E_B, and E_C
have the K_M values indicated under each enzyme. If all of
the substrates and products are present at a concentration of
10⁻⁴ M and the enzymes have approximately the same
V_max, which step will be rate limiting and why?


Eq Ep Ec 
Km= 107M 10*M 107M 


29. Colored luminosity. Tryptophan synthetase, a bacterial
enzyme that contains a pyridoxal phosphate (PLP) prosthetic
group, catalyzes the synthesis of L-tryptophan from
L-serine and an indole derivative. The addition of L-serine
to the enzyme produces a marked increase in the fluorescence
of the PLP group, as the adjoining graph shows. The
subsequent addition of indole, the second substrate, reduces
this fluorescence to a level even lower than that produced
by the enzyme alone. How do these changes in fluorescence
support the notion that the enzyme interacts directly with
its substrates?


[Graph: fluorescence intensity versus wavelength (450 to 550 nm) for the enzyme alone, + serine, and + serine and indole]


Chapter Integration Problems 


30. Titration experiment. The effect of pH on the activity of
an enzyme was examined. At its active site, the enzyme has
an ionizable group that must be negatively charged for substrate
binding and catalysis to take place. The ionizable
group has a pK_a of 6.0. The substrate is positively charged
throughout the pH range of the experiment.


E⁻ + S⁺ ⇌ E⁻S⁺ → E⁻ + P⁺
E⁻ + H⁺ ⇌ EH


(a) Draw the V₀-versus-pH curve when the substrate concentration
is much greater than the enzyme K_M.


(b) Draw the V₀-versus-pH curve when the substrate concentration
is much less than the enzyme K_M.


(c) At which pH will the velocity equal one-half of the
maximal velocity attainable under these conditions?


31. A question of stability. Pyridoxal phosphate (PLP) is a 
coenzyme for the enzyme ornithine aminotransferase. The 
enzyme was purified from cells grown in PLP-deficient 
media as well as from cells grown in media that contained 
pyridoxal phosphate. The stability of the enzyme was then 
measured by incubating the enzyme at 37°C and assaying 
for the amount of enzyme activity remaining. The following 
results were obtained. 


[Graph: enzyme activity remaining (0% to 100%) versus time for enzyme from cells grown with PLP (+PLP) and without PLP (−PLP)]


(a) Why does the amount of active enzyme decrease with 
the time of incubation? 


(b) Why does the amount of enzyme from the PLP- 
deficient cells decline more rapidly? 


CHAPTER 9


Catalytic Strategies


Chess and enzymes have in common the use of strategy, consciously thought out in 
the game of chess and selected by evolution for the action of an enzyme. The three 
amino acid residues at the right, denoted by the white bonds, constitute a catalytic 
triad found in the active site of a class of enzymes that cleave peptide bonds. The 
substrate, represented by the molecule with the black bonds, is as hopelessly trapped 
as the king in the photograph of a chess match at the left and is sure to be cleaved. 
[Photograph courtesy of Wendie Berg.] 


What are the sources of the catalytic power and specificity of enzymes?
This chapter presents the catalytic strategies used by four classes of
enzymes: serine proteases, carbonic anhydrases, restriction endonucleases,
and myosins. Each class catalyzes reactions that require the addition of
water to a substrate. The mechanisms of these enzymes have been revealed
through the use of incisive experimental probes, including the techniques of
protein structure determination (Chapter 3) and site-directed mutagenesis
(Chapter 5). The mechanisms illustrate many important principles of
catalysis. We shall see how these enzymes facilitate the formation of the
transition state through the use of binding energy and induced fit as well as
several classes of specific catalytic strategies.

Each of the four classes of enzymes in this chapter illustrates the use
of such strategies to solve a different problem. For serine proteases,
exemplified by chymotrypsin, the challenge is to promote a reaction that is
almost immeasurably slow at neutral pH in the absence of a catalyst. For
carbonic anhydrases, the challenge is to achieve a high absolute rate of
reaction, suitable for integration with other rapid physiological processes. For
restriction endonucleases such as EcoRV, the challenge is to attain a high
degree of specificity. Finally, for myosins, the challenge is to utilize the free
energy associated with the hydrolysis of adenosine triphosphate (ATP) to
drive other processes. Each of the examples selected is a member of a large
protein class. For each of these classes, comparison between class members
reveals how enzyme active sites have evolved and been refined. Structural
and mechanistic comparisons of enzyme action are thus the sources of
insight into the evolutionary history of enzymes. In addition, our knowledge
of catalytic strategies has been used to develop practical applications,
including potent drugs and specific enzyme inhibitors. Finally, although we
shall not consider catalytic RNA molecules explicitly in this chapter, the
principles also apply to these catalysts.


A few basic catalytic principles are used by many enzymes 


In Chapter 8, we learned that enzymatic catalysis begins with substrate 
binding. The binding energy is the free energy released in the formation of a 
large number of weak interactions between the enzyme and the substrate. 
We can envision this binding energy as serving two purposes: it establishes 
substrate specificity and increases catalytic efficiency. Only the correct 
substrate can participate in most or all of the interactions with the enzyme 
and thus maximize binding energy, accounting for the exquisite substrate 
specificity exhibited by many enzymes. Furthermore, the full complement 
of such interactions is formed only when the combination of enzyme and 
substrate is in the transition state. Thus, interactions between the enzyme 
and the substrate stabilize the transition state, thereby lowering the free 
energy of activation. The binding energy can also promote structural 
changes in both the enzyme and the substrate that facilitate catalysis, a 
process referred to as induced fit. 

Enzymes commonly employ one or more of the following strategies to 
catalyze specific reactions: 


1. Covalent Catalysis. In covalent catalysis, the active site contains a 
reactive group, usually a powerful nucleophile, that becomes temporarily 
covalently attached to a part of the substrate in the course of catalysis. The 
proteolytic enzyme chymotrypsin provides an excellent example of this 
strategy (Section 9.1). 


2. General Acid-Base Catalysis. In general acid-base catalysis, a molecule
other than water plays the role of a proton donor or acceptor. Chymotrypsin
uses a histidine residue as a base catalyst to enhance the nucleophilic
power of serine (Section 9.1), whereas a histidine residue in carbonic
anhydrase facilitates the removal of a hydrogen ion from a zinc-bound water
molecule to generate hydroxide ion (Section 9.2). For myosins, a phosphate
group of the ATP substrate serves as a base to promote its own hydrolysis
(Section 9.4).


3. Catalysis by Approximation. Many reactions, including those catalyzed by
all four classes of hydrolases considered in detail in this chapter, include
two distinct substrates. In such cases, the reaction rate may be considerably
enhanced by bringing the two substrates together along a single binding
surface on an enzyme. For example, carbonic anhydrase binds carbon dioxide
and water in adjacent sites to facilitate their reaction (Section 9.2).


4. Metal Ion Catalysis. Metal ions can function catalytically in several
ways. For instance, a metal ion may facilitate the formation of nucleophiles
such as hydroxide ion by direct coordination. A zinc(II) ion serves this
purpose in catalysis by carbonic anhydrase (Section 9.2). Alternatively, a metal
ion may serve as an electrophile, stabilizing a negative charge on a reaction
intermediate. A magnesium(II) ion plays this role in EcoRV (Section 9.3).
Finally, a metal ion may serve as a bridge between enzyme and substrate,
increasing the binding energy and holding the substrate in a conformation
appropriate for catalysis. This strategy is used by myosins (Section 9.4) and,
indeed, by almost all enzymes that utilize ATP as a substrate.


9.1 Proteases Facilitate a Fundamentally Difficult Reaction 


Protein turnover is an important process in living systems (Chapter 23).
Proteins that have served their purpose must be degraded so that their
constituent amino acids can be recycled for the synthesis of new proteins.
Proteins ingested in the diet must be broken down into small peptides and
amino acids for absorption in the gut. Furthermore, as described in detail in
Chapter 10, proteolytic reactions are important in regulating the activity of
certain enzymes and other proteins.

Proteases cleave proteins by a hydrolysis reaction—the addition of a 
molecule of water to a peptide bond: 


R₁–C(=O)–NH–R₂ + H₂O ⇌ R₁–COO⁻ + R₂–NH₃⁺

Although the hydrolysis of peptide bonds is thermodynamically favored, 
such hydrolysis reactions are extremely slow. In the absence of a catalyst, 
the half-life for the hydrolysis of a typical peptide at neutral pH is estimated 
to be between 10 and 1000 years. Yet, peptide bonds must be hydrolyzed 
within milliseconds in some biochemical processes. 
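The size of the kinetic problem can be made concrete with a short Python sketch that converts half-lives into first-order rate constants (k = ln 2 / t½) and compares them. The 1-millisecond catalyzed half-life is a hypothetical stand-in for "within milliseconds", and the function names are illustrative.

```python
import math

SECONDS_PER_YEAR = 365.25 * 24 * 3600  # assumed year length for the conversion


def first_order_rate_constant(half_life_s: float) -> float:
    """k = ln 2 / t_half for a first-order process (units: per second)."""
    if half_life_s <= 0:
        raise ValueError("half-life must be positive")
    return math.log(2) / half_life_s


def rate_enhancement(uncatalyzed_half_life_s: float, catalyzed_half_life_s: float) -> float:
    """Ratio of the catalyzed to the uncatalyzed first-order rate constant."""
    return (first_order_rate_constant(catalyzed_half_life_s)
            / first_order_rate_constant(uncatalyzed_half_life_s))


catalyzed_half_life = 1e-3  # hypothetical: 1 millisecond
for years in (10, 1000):
    uncatalyzed = years * SECONDS_PER_YEAR
    k = first_order_rate_constant(uncatalyzed)
    factor = rate_enhancement(uncatalyzed, catalyzed_half_life)
    print(f"t_half = {years} years: k = {k:.1e} per second; "
          f"enhancement needed = {factor:.1e}")
```

Under these assumptions the required acceleration spans many orders of magnitude, which is the sense in which proteases facilitate a fundamentally difficult reaction.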

The chemical bonding in peptide bonds is responsible for their kinetic
stability. Specifically, the resonance structure that accounts for the planarity
of a peptide bond (Section 2.2) also makes such bonds resistant to hydrolysis.
This resonance structure endows the peptide bond with partial double-bond
character:


–C(=O)–NH– ↔ –C(–O⁻)=N⁺H–


The carbon-nitrogen bond is strengthened by its double-bond character.
Furthermore, the carbonyl carbon atom is less electrophilic and less
susceptible to nucleophilic attack than are the carbonyl carbon atoms in
more reactive compounds such as carboxylate esters. Consequently, to
promote peptide-bond cleavage, an enzyme must facilitate nucleophilic
attack at a normally unreactive carbonyl group.


Chymotrypsin possesses a highly reactive serine residue 


A number of proteolytic enzymes participate in the breakdown of proteins
in the digestive systems of mammals and other organisms. One such
enzyme, chymotrypsin, cleaves peptide bonds selectively on the
carboxyl-terminal side of the large hydrophobic amino acids such as tryptophan,
tyrosine, phenylalanine, and methionine (Figure 9.1). Chymotrypsin is a
good example of the use of covalent catalysis. The enzyme employs a powerful
nucleophile to attack the unreactive carbonyl carbon atom of the substrate.
This nucleophile becomes covalently attached to the substrate
briefly in the course of catalysis.


Figure 9.1 Specificity of chymotrypsin.
Chymotrypsin cleaves proteins on the carboxyl
side of aromatic or large hydrophobic amino
acids (shaded orange). The likely bonds
cleaved by chymotrypsin are indicated in red.


Figure 9.2 An unusually reactive serine 
residue in chymotrypsin. Chymotrypsin is 
inactivated by treatment with 
diisopropylphosphofluoridate (DIPF), which 
reacts only with serine 195 among 28 
possible serine residues.


What is the nucleophile that chymotrypsin employs to attack the substrate
carbonyl carbon atom? A clue came from the fact that chymotrypsin
contains an extraordinarily reactive serine residue. Chymotrypsin molecules
treated with organofluorophosphates such as diisopropylphosphofluoridate
(DIPF) lost all activity irreversibly (Figure 9.2). Only a single residue,
serine 195, was modified. This chemical modification reaction suggested that
this unusually reactive serine residue plays a central role in the catalytic
mechanism of chymotrypsin.


Chymotrypsin action proceeds in two steps linked by a covalently bound intermediate


A study of the enzyme's kinetics provided a second clue to chymotrypsin's
catalytic mechanism. The kinetics of enzyme action are often easily monitored
by having the enzyme act on a substrate analog that forms a colored
product. For chymotrypsin, such a chromogenic substrate is N-acetyl-L-phenylalanine
p-nitrophenyl ester. This substrate is an ester rather than an
amide, but many proteases will also hydrolyze esters. One of the products
formed by chymotrypsin's cleavage of this substrate is p-nitrophenolate,
which has a yellow color (Figure 9.3). Measurements of the absorbance of
light revealed the amount of p-nitrophenolate being produced.

Under steady-state conditions, the cleavage of this substrate obeys
Michaelis-Menten kinetics with a K_M of 20 μM and a k_cat of 77 s⁻¹. The
initial phase of the reaction was examined by using the stopped-flow
method, which makes it possible to mix enzyme and substrate and monitor
the results within a millisecond. This method revealed an initial rapid burst
of colored product, followed by its slower formation as the reaction reached
the steady state (Figure 9.4). These results suggest that hydrolysis proceeds
in two phases. In the first reaction cycle that takes place immediately after
mixing, only the first phase must take place before the colored product is
released. In subsequent reaction cycles, both phases must take place. Note
that the burst is observed because the first phase is substantially more rapid
than the second phase for this substrate.


Figure 9.3 Chromogenic substrate. N-Acetyl-L-phenylalanine p-nitrophenyl ester yields a yellow
product, p-nitrophenolate, on cleavage by chymotrypsin. p-Nitrophenolate forms by deprotonation
of p-nitrophenol at pH 7.

The two phases are explained by the formation of a covalently bound
enzyme-substrate intermediate (Figure 9.5). First, the acyl group of the
substrate becomes covalently attached to the enzyme as p-nitrophenolate
(or an amine if the substrate is an amide rather than an ester) is released. The
enzyme-acyl group complex is called the acyl-enzyme intermediate. Second,
the acyl-enzyme intermediate is hydrolyzed to release the carboxylic acid
component of the substrate and regenerate the free enzyme. Thus, one molecule
of p-nitrophenolate is produced rapidly from each enzyme molecule as
the acyl-enzyme intermediate is formed. However, it takes longer for the
enzyme to be "reset" by the hydrolysis of the acyl-enzyme intermediate,
and both phases are required for enzyme turnover.


Figure 9.4 Kinetics of chymotrypsin catalysis. Two phases are evident in the
cleaving of N-acetyl-L-phenylalanine p-nitrophenyl ester by chymotrypsin: a rapid
burst phase (pre-steady-state) and a steady-state phase.


Figure 9.5 Covalent catalysis. Hydrolysis by chymotrypsin takes place in two phases:
(A) acylation to form the acyl-enzyme intermediate followed by (B) deacylation to regenerate
the free enzyme.

Serine is part of a catalytic triad that also includes 
histidine and aspartate 


The three-dimensional structure of chymotrypsin was solved by David
Blow in 1967. Overall, chymotrypsin is roughly spherical and comprises
three polypeptide chains, linked by disulfide bonds. It is synthesized as a
single polypeptide, termed chymotrypsinogen, which is activated by the
proteolytic cleavage of the polypeptide to yield the three chains (Section 10.4).
The active site of chymotrypsin, marked by serine 195, lies in a cleft on
the surface of the enzyme (Figure 9.6). The structure of the active
site explained the special reactivity of serine 195 (Figure 9.7). The side
chain of serine 195 is hydrogen bonded to the imidazole ring of histidine 57.
The —NH group of this imidazole ring is, in turn, hydrogen bonded to the
carboxylate group of aspartate 102. This constellation of residues is referred
to as the catalytic triad. How does this arrangement of residues lead to the
high reactivity of serine 195? The histidine residue serves to position the
serine side chain and to polarize its hydroxyl group so that it is poised for
deprotonation. In the presence of the substrate, the histidine residue accepts
the proton from the serine 195 hydroxyl group. In doing so, the residue acts
as a general base catalyst. The withdrawal of the proton from the hydroxyl
group generates an alkoxide ion, which is a much more powerful nucleophile
than is an alcohol. The aspartate residue helps orient the histidine
residue and make it a better proton acceptor through hydrogen bonding and
electrostatic effects.


Figure 9.6 Location of the active site
in chymotrypsin. Chymotrypsin consists of
three chains, shown in ribbon form in orange,
blue, and green. The side chains of the
catalytic triad residues are shown as ball-and-stick
representations. Notice these side chains,
including serine 195, lining the active site in
the upper half of the structure. Also notice
two intrastrand and two interstrand disulfide
bonds in various locations throughout the
molecule. [Drawn from 1GCT.pdb.]


Figure 9.7 The catalytic triad. The catalytic triad, shown on the left, converts serine 195 into a
potent nucleophile, as illustrated on the right.


These observations suggest a mechanism for peptide hydrolysis
(Figure 9.8). After substrate binding (step 1), the reaction begins with the
oxygen atom of the side chain of serine 195 making a nucleophilic attack on
the carbonyl carbon atom of the target peptide bond (step 2). There are now
four atoms bonded to the carbonyl carbon, arranged as a tetrahedron,
instead of three atoms in a planar arrangement. This inherently unstable
tetrahedral intermediate bears a formal negative charge on the oxygen atom
derived from the carbonyl group. This charge is stabilized by interactions
with NH groups from the protein in a site termed the oxyanion hole
(Figure 9.9). These interactions also help stabilize the transition state that
precedes the formation of the tetrahedral intermediate. This tetrahedral
intermediate collapses to generate the acyl-enzyme (step 3). This step is
facilitated by the transfer of the proton being held by the positively charged
histidine residue to the amino group formed by cleavage of the peptide
bond. The amine component is now free to depart from the enzyme (step 4),
completing the first stage of the hydrolytic reaction: acylation of the
enzyme.


Figure 9.8 Peptide hydrolysis by chymotrypsin. The mechanism of peptide hydrolysis
illustrates the principles of covalent and acid-base catalysis. The reaction proceeds in eight steps:
(1) substrate binding, (2) nucleophilic attack of serine on the peptide carbonyl group, (3) collapse
of the tetrahedral intermediate, (4) release of the amine component, (5) water binding, (6)
nucleophilic attack of water on the acyl-enzyme intermediate, (7) collapse of the tetrahedral
intermediate, and (8) release of the carboxylic acid component. The dashed green lines represent
hydrogen bonds.

The next stage, deacylation, begins when a water molecule takes the
place occupied earlier by the amine component of the substrate (step 5). The
ester group of the acyl-enzyme is now hydrolyzed by a process that
essentially repeats steps 2 through 4. Now acting as a general base catalyst,
histidine 57 draws a proton away from the water molecule. The resulting
OH⁻ ion attacks the carbonyl carbon atom of the acyl group, forming a
tetrahedral intermediate (step 6). This structure breaks down to form the
carboxylic acid product (step 7). Finally, the release of the carboxylic acid
product (step 8) readies the enzyme for another round of catalysis.

This mechanism accounts for all characteristics of chymotrypsin action
except the observed preference for cleaving the peptide bonds just past
residues with large, hydrophobic side chains. Examination of the
three-dimensional structure of chymotrypsin with substrate analogs and enzyme
inhibitors revealed the presence of a deep hydrophobic pocket, called the S₁
pocket, into which the long, uncharged side chains of residues such as
phenylalanine and tryptophan can fit. The binding of an appropriate side chain
into this pocket positions the adjacent peptide bond into the active site for
cleavage (Figure 9.10). The specificity of chymotrypsin depends almost entirely
on which amino acid is directly on the amino-terminal side of the peptide
bond to be cleaved. Other proteases have more-complex specificity patterns.
Such enzymes have additional pockets on their surfaces for the recognition
of other residues in the substrate. Residues on the amino-terminal side of
the scissile bond (the bond to be cleaved) are labeled P₁, P₂, P₃, and so forth,
heading away from the scissile bond (Figure 9.11). Likewise, residues on the
carboxyl side of the scissile bond are labeled P₁′, P₂′, P₃′, and so forth. The
corresponding sites on the enzyme are referred to as S₁, S₂ or S₁′, S₂′, and
so forth.


Figure 9.9 The oxyanion hole. The
structure stabilizes the tetrahedral intermediate
of the chymotrypsin reaction. Notice that
hydrogen bonds (shown in green) link
peptide NH groups and the negatively
charged oxygen atom of the intermediate.


Figure 9.10 Specificity pocket of
chymotrypsin. Notice that this pocket is
lined with hydrophobic residues and is
deep, favoring the binding of residues
with long hydrophobic side chains such
as phenylalanine (shown in green). Also
notice that the active-site serine residue
(serine 195) is positioned to cleave the
peptide backbone between the residue
bound in the pocket and the next residue
in the sequence. The key amino acids
that constitute the binding site are
identified.


Figure 9.11 Specificity nomenclature for
protease-substrate interactions. The
potential sites of interaction of the substrate
with the enzyme are designated P (shown in
red), and corresponding binding sites on the
enzyme are designated S. The scissile bond
(also shown in red) is the reference point.


Catalytic triads are found in other hydrolytic enzymes


Many other peptide-cleaving proteins have subsequently been found
to contain catalytic triads similar to that discovered in chymotrypsin.
Some, such as trypsin and elastase, are obvious homologs of chymotrypsin.
The sequences of these proteins are approximately 40% identical with that
of chymotrypsin, and their overall structures are quite similar (Figure 9.12).
These proteins operate by mechanisms identical with that of chymotrypsin.


However, the three enzymes differ markedly in substrate specificity. 
Chymotrypsin cleaves at the peptide bond after residues with an aromatic 
or long nonpolar side chain. Trypsin cleaves at the peptide bond after 
residues with long, positively charged side chains—namely, arginine and 
lysine. Elastase cleaves at the peptide bond after amino acids with small side 
chains, such as alanine and serine. Comparison of the S₁ pockets of these
enzymes reveals that these different specificities are due to small structural
differences. In trypsin, an aspartate residue (Asp 189) is present at the bottom
of the S₁ pocket in place of a serine residue in chymotrypsin. The aspartate
residue attracts and stabilizes a positively charged arginine or lysine
residue in the substrate. In elastase, two residues at the top of the pocket in
chymotrypsin and trypsin are replaced by much bulkier valine residues 
(Val 190 and Val 216). These residues close off the mouth of the pocket so 
that only small side chains can enter (Figure 9.13). 
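The three specificities can be compared with a small digestion sketch. The residue sets below are assumptions chosen to match the description in the text (aromatic or long nonpolar side chains for chymotrypsin, arginine and lysine for trypsin, alanine and serine for elastase); real enzymes show additional preferences and exceptions that this simplified model ignores.

```python
# Assumed P1 preferences (one-letter codes); illustrative, not exhaustive.
SPECIFICITY = {
    "chymotrypsin": set("FWYLM"),  # aromatic or long nonpolar side chains
    "trypsin": set("KR"),          # long, positively charged side chains
    "elastase": set("AS"),         # small side chains
}


def cleavage_sites(sequence: str, protease: str) -> list[int]:
    """Return positions k such that the bond after residue k (1-based) is cut.

    Cleavage is on the carboxyl side of a matching P1 residue. The final
    residue is skipped because no peptide bond follows it.
    """
    p1_residues = SPECIFICITY[protease]
    return [i + 1 for i, res in enumerate(sequence[:-1]) if res in p1_residues]


def digest(sequence: str, protease: str) -> list[str]:
    """Split the sequence into the fragments expected from complete digestion."""
    fragments, start = [], 0
    for site in cleavage_sites(sequence, protease):
        fragments.append(sequence[start:site])
        start = site
    fragments.append(sequence[start:])
    return fragments


peptide = "GAKFLRSW"  # hypothetical test peptide
for name in SPECIFICITY:
    print(name, digest(peptide, name))
```

The same peptide yields different fragments with each enzyme, which is the practical consequence of the small structural differences in the S₁ pockets.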


Figure 9.13 The S₁ pockets of chymotrypsin, trypsin, and elastase. Certain residues play key
roles in determining the specificity of these enzymes. The side chains of these residues, as well as
those of the active-site serine residues, are shown in color. [Panels, left to right: chymotrypsin,
trypsin, elastase; labeled residues: Val 190, Val 216.]


Other members of the chymotrypsin family include a collection of pro- 
teins that take part in blood clotting, to be discussed in Chapter 10, as well 
as the tumor marker protein prostate-specific antigen (PSA). In addition, a 
wide range of proteases found in bacteria, viruses, and plants belong to 
this clan. 

Other enzymes that are not homologs of chymotrypsin have been found 
to contain very similar active sites. As noted in Chapter 6, the presence of 
very similar active sites in these different protein families is a consequence 
of convergent evolution. Subtilisin, a protease in bacteria such as Bacillus 
amyloliquefaciens, is a particularly well characterized example. The active 
site of this enzyme includes both the catalytic triad and the oxyanion hole. 
However, one of the NH groups that forms the oxyanion hole comes from 
the side chain of an asparagine residue rather than from the peptide back- 
bone (Figure 9.14). Subtilisin is the founding member of another large 
family of proteases that includes representatives from Archaea, Bacteria, 
and Eukarya. 

Finally, other proteases have been discovered that contain an active-site 
serine or threonine residue that is activated not by a histidine—aspartate pair 


Figure 9.12 Structural similarity of
trypsin and chymotrypsin. An overlay of
the structure of chymotrypsin (red) on that of
trypsin (blue) is shown. Notice the high
degree of similarity. Only α-carbon-atom
positions are shown. The mean deviation in
position between corresponding α-carbon
atoms is 1.7 Å. [Drawn from 5PTP.pdb and
1GCT.pdb.]
Figure 9.14 The catalytic triad and
oxyanion hole of subtilisin. Notice the two 
enzyme NH groups (both in the backbone 
and in the side chain of Asn 155) located in 
the oxyanion hole. The NH groups will 
stabilize a negative charge that develops on 
the peptide bond attacked by nucleophilic 
serine 221 of the catalytic triad. 
but by a primary amino group from the side chain of lysine or by the
N-terminal amino group of the polypeptide chain. 

Thus, the catalytic triad in proteases has emerged at least three times in 
the course of evolution. We can conclude that this catalytic strategy must be 
an especially effective approach to the hydrolysis of peptides and related 
bonds. 


The catalytic triad has been dissected by site-directed mutagenesis 


How can we be sure that the mechanism proposed for the catalytic triad is 
correct? One way is to test the contribution of individual amino acid 
residues to the catalytic power of a protease by using site-directed mutagen- 
esis (Section 5.2). Subtilisin has been extensively studied by this method. 
Each of the residues within the catalytic triad, consisting of aspartic acid 32, 
histidine 64, and serine 221, has been individually converted into alanine, 
and the ability of each mutant enzyme to cleave a model substrate has been 
examined (Figure 9.15). 

As expected, the conversion of active-site serine 221 into alanine
dramatically reduced catalytic power; the value of $k_\mathrm{cat}$ fell to less than
one-millionth of its value for the wild-type enzyme. The value of $K_\mathrm{M}$ was
essentially unchanged; its increase by no more than a factor of two
indicated that substrate continued to bind normally. The mutation of
histidine 64 to alanine reduced catalytic power to a similar degree. The
conversion of aspartate 32 into alanine reduced catalytic power by less,
although the value of $k_\mathrm{cat}$ still fell to less than 0.005% of its wild-type
value. The simultaneous conversion of all three residues into alanine was
no more deleterious than the conversion of serine or histidine alone. These
observations support the notion that the catalytic triad and, particularly,
the serine-histidine pair act together to generate a nucleophile of
sufficient power to attack the carbonyl carbon atom of a peptide bond.
Despite the reduction in their catalytic power, the mutated enzymes still
hydrolyze peptides a thousand times as fast as buffer at pH 8.6.

Site-directed mutagenesis also offered a way to probe the importance of the
oxyanion hole for catalysis. The mutation of asparagine 155 to glycine
eliminated the side-chain NH group from the oxyanion hole of subtilisin.
The elimination of the NH group reduced the value of $k_\mathrm{cat}$ to 0.2% of its
wild-type value but increased the value of $K_\mathrm{M}$ by only a factor of two.
These observations demonstrate that the NH group of the asparagine residue
plays a significant role in stabilizing the tetrahedral intermediate and the
transition state leading to it.

Figure 9.15 Site-directed mutagenesis of subtilisin. Residues of
the catalytic triad were mutated to alanine, and the activity of the
mutated enzyme was measured. Mutations in any component of the
catalytic triad cause a dramatic loss of enzyme activity. Note that the
activity is displayed on a logarithmic scale. The mutations are
identified as follows: the first letter is the one-letter abbreviation for the
amino acid being altered; the number identifies the position of the
residue in the primary structure; and the second letter is the one-letter
abbreviation for the amino acid replacing the original one. Uncat. refers
to the estimated rate for the uncatalyzed reaction. [Bar labels recovered
from the figure: S221A, H64A, D32A, Uncat.]


Cysteine, aspartyl, and metalloproteases are other major classes of 
peptide-cleaving enzymes 


Not all proteases utilize strategies based on activated serine residues.
Classes of proteins have been discovered that employ three alternative
approaches to peptide-bond hydrolysis (Figure 9.16). These classes are the 
(1) cysteine proteases, (2) aspartyl proteases, and (3) metalloproteases. In 
each case, the strategy is to generate a nucleophile that attacks the peptide 
carbonyl group (Figure 9.17). 

The strategy used by the cysteine proteases is most similar to that used by 
the chymotrypsin family. In these enzymes, a cysteine residue, activated by 
a histidine residue, plays the role of the nucleophile that attacks the peptide 
bond (see Figure 9.17) in a manner quite analogous to that of the serine 
residue in serine proteases. Because the sulfur atom in cysteine is inherently 
a better nucleophile than is the oxygen atom in serine, cysteine proteases 
appear to require only this histidine residue in addition to cysteine and not 
the full catalytic triad. A well-studied example of these proteins is papain, 


Figure 9.16 Three classes of
proteases and their active sites. These
examples of a cysteine protease, an aspartyl
protease, and a metalloprotease use a
histidine-activated cysteine residue, an
aspartate-activated water molecule, and a
metal-activated water molecule, respectively,
as the nucleophile. The two halves of renin
are in blue and red to highlight the
approximate twofold symmetry of aspartyl
proteases. Notice how different these active
sites are despite the similarity in the reactions
they catalyze. [Drawn from 1PPN.pdb; 1HRN.pdb;
1LND.pdb.]




Figure 9.17 The activation strategies for 
three classes of proteases. The peptide 
carbonyl group is attacked by (A) a histidine- 
activated cysteine in the cysteine proteases, 
(B) an aspartate-activated water molecule in 
the aspartyl proteases, and (C) a metal- 
activated water molecule in the 
metalloproteases. For the metalloproteases, 
the letter B represents a base (often 
glutamate) that helps deprotonate the metal- 
bound water. 
an enzyme purified from the fruit of the papaya. Mammalian proteases
homologous to papain have been discovered, most notably the cathepsins, 
proteins having a role in the immune system and other systems. The 
cysteine-based active site arose independently at least twice in the course 
of evolution; the caspases, enzymes that play a major role in apoptosis, 
have active sites similar to that of papain, but their overall structures are 
unrelated. 

The second class comprises the aspartyl proteases. The central feature of 
the active sites is a pair of aspartic acid residues that act together to allow a 
water molecule to attack the peptide bond. One aspartic acid residue (in its 
deprotonated form) activates the attacking water molecule by poising it for 
deprotonation. The other aspartic acid residue (in its protonated form) 
polarizes the peptide carbonyl group so that it is more susceptible to attack 
(see Figure 9.17). Members of this class include renin, an enzyme having a 
role in the regulation of blood pressure, and the digestive enzyme pepsin. 
These proteins possess approximate twofold symmetry. A likely scenario 
is that two copies of a gene for the ancestral enzyme fused to form a single 
gene that encoded a single-chain enzyme. Each copy of the gene would have 
contributed an aspartate residue to the active site. In contrast, the aspartyl
proteases present in human immunodeficiency virus (HIV) and other retroviruses
are dimers of identical chains rather than single chains (Figure 9.18).
This observation is consistent with the idea that the enzyme may have 
originally existed as separate subunits. 

The metalloproteases constitute the final major class of peptide-cleaving 
enzymes. The active site of such a protein contains a bound metal ion, 
almost always zinc, that activates a water molecule to act as a nucleophile to 
attack the peptide carbonyl group. The bacterial enzyme thermolysin and 
the digestive enzyme carboxypeptidase A are classic examples of the zinc 
proteases. Thermolysin, but not carboxypeptidase A, is a member of a large 
and diverse family of homologous zinc proteases that includes the matrix 
metalloproteases, enzymes that catalyze the reactions in tissue remodeling 
and degradation. 

In each of these three classes of enzymes, the active site includes features 
that act to (1) activate a water molecule or another nucleophile, (2) polarize 
the peptide carbonyl group, and (3) stabilize a tetrahedral intermediate 
(see Figure 9.17). 


Protease inhibitors are important drugs 


Several important drugs are protease inhibitors. For example, captopril, 
used to regulate blood pressure, is an inhibitor of the angiotensin- 
converting enzyme (ACE), a metalloprotease. Indinavir (Crixivan), retro- 
vir, and more than 20 other compounds used in the treatment of AIDS are 
inhibitors of HIV protease, which is an aspartyl protease. HIV protease 


cleaves multidomain viral proteins into their active forms; blocking this 
process completely prevents the virus from being infectious (see Fig- 
ure 9.18). HIV protease inhibitors, in combination with inhibitors of other 
key HIV enzymes, dramatically reduced deaths due to AIDS in circum- 
stances where these drugs can be used (see Figure 36.21). 

Indinavir resembles the peptide substrate of the HIV protease. Indinavir 
is constructed around an alcohol that mimics the tetrahedral intermediate; 
other groups are present to bind into the S₂, S₁, S₁′, and S₂′ recognition sites
on the enzyme (Figure 9.19). X-ray crystallographic studies revealed that, 
in the active site, indinavir adopts a conformation that approximates the 
twofold symmetry of the enzyme (Figure 9.20). The active site of HIV pro- 
tease is covered by two flexible flaps that fold down on top of the bound 
inhibitor. The OH group of the central alcohol interacts with the two 
aspartate residues of the active site. In addition, two carbonyl groups of 
the inhibitor are hydrogen bonded to a water molecule (not shown in 
Figure 9.20), which, in turn, is hydrogen bonded to a peptide NH group in 
each of the flaps. This interaction of the inhibitor with water and the enzyme 
is not possible within cellular aspartyl proteases such as renin. Thus the 
interaction may contribute to the specificity of indinavir for HIV protease. 
Figure 9.18 HIV protease, a dimeric
aspartyl protease. The protease is a dimer 
of identical subunits, shown in blue and 
yellow, consisting of 99 amino acids each. 
Notice the placement of active-site aspartic 
acid residues, one from each chain, which are 
shown as ball-and-stick structures. The flaps 
will close down on the binding pocket after 
substrate has been bound. [Drawn from 
3PHV.pdb.] 


Figure 9.19 Indinavir, an HIV protease 
inhibitor. The structure of indinavir (Crixivan) 
is shown in comparison with that of a peptide 
substrate of HIV protease. The scissile bond in 
the substrate is highlighted in red. 
Figure 9.20 HIV protease-indinavir complex. (Left) The HIV protease is shown with the
inhibitor indinavir bound at the active site. Notice the twofold symmetry of the enzyme structure. 
(Right) The drug has been rotated to reveal its approximately twofold symmetric conformation. 
[Drawn from 1HSH.pdb.] 


Protease inhibitors used as drugs must be specific for one enzyme without 
inhibiting other proteins within the body to prevent side effects. 


9.2 Carbonic Anhydrases Make a Fast Reaction Faster 


Carbon dioxide is a major end product of aerobic metabolism. In mammals, 
this carbon dioxide is released into the blood and transported to the lungs 
for exhalation. While in the red blood cells, carbon dioxide reacts with water 
(Section 7.3). The product of this reaction is a moderately strong acid,
carbonic acid ($\mathrm{p}K_\mathrm{a} = 3.5$), which is converted into bicarbonate ion (HCO₃⁻) on
the loss of a proton.

Even in the absence of a catalyst, this hydration reaction proceeds at a
moderately fast pace. At 37°C near neutral pH, the second-order rate
constant $k_1$ is $0.0027\ \mathrm{M^{-1}\,s^{-1}}$. This value corresponds to an effective
first-order rate constant of $0.15\ \mathrm{s^{-1}}$ in water ([H₂O] = 55.5 M). The reverse
reaction, the dehydration of HCO₃⁻, is even more rapid, with a rate constant of
$k_{-1} = 50\ \mathrm{s^{-1}}$. These rate constants correspond to an equilibrium constant of
$K_1 = 5.4 \times 10^{-5}$ and a ratio of [CO₂] to [H₂CO₃] of 340:1 at equilibrium.
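The arithmetic behind these values can be checked with a few lines of code. This is a sketch that uses only the constants quoted above; the variable names are our own. The computed ratio comes out near 330, slightly below the quoted 340:1, presumably because the quoted inputs are rounded.

```python
# Constants quoted in the text for uncatalyzed CO2 hydration at 37 degrees C.
k1 = 0.0027        # second-order rate constant, M^-1 s^-1
water = 55.5       # concentration of water, M
k_minus1 = 50.0    # reverse (dehydration) rate constant, s^-1

# Treating [H2O] as constant gives an effective first-order rate constant.
k1_effective = k1 * water

# Equilibrium constant from the ratio of forward and reverse rate constants.
K1 = k1 / k_minus1

# At equilibrium, forward and reverse rates are equal, so the ratio of CO2
# to H2CO3 equals k_minus1 divided by the effective forward rate constant.
co2_to_h2co3 = k_minus1 / k1_effective

print(f"effective first-order rate constant: {k1_effective:.2f} s^-1")
print(f"K1: {K1:.1e} M^-1")
print(f"[CO2]:[H2CO3] at equilibrium: about {co2_to_h2co3:.0f}:1")
```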


Carbon dioxide hydration and HCO₃⁻ dehydration are often coupled
to rapid processes, particularly transport processes. Thus, almost all
organisms contain enzymes, referred to as carbonic anhydrases, that increase
the rate of reaction beyond the already reasonable spontaneous rate. For
example, carbonic anhydrases dehydrate HCO₃⁻ in the blood to form CO₂
for exhalation as the blood passes through the lungs. Conversely, they convert
CO₂ into HCO₃⁻ to generate the aqueous humor of the eye and other
secretions. Furthermore, both CO₂ and HCO₃⁻ are substrates and products


for a variety of enzymes, and the rapid interconversion of these species may 
be necessary to ensure appropriate substrate levels. So important are these 
enzymes in human beings that mutations in some carbonic anhydrases 
have been found to be associated with osteopetrosis (excessive formation of 
dense bones accompanied by anemia) and mental retardation. 

Carbonic anhydrases accelerate CO₂ hydration dramatically. The most-active
enzymes hydrate CO₂ at rates as high as $k_\mathrm{cat} = 10^{6}\ \mathrm{s^{-1}}$, or a million
times a second per enzyme molecule. Fundamental physical processes such as
diffusion and proton transfer ordinarily limit the rate of hydration, and so the 
enzymes employ special strategies to attain such prodigious rates. 


Carbonic anhydrase contains a bound zinc ion 
essential for catalytic activity 


Less than 10 years after the discovery of carbonic anhydrase in 1932, this 
enzyme was found to contain a bound zinc ion. Moreover, the zinc ion 
appeared to be necessary for catalytic activity. This discovery, remarkable at 
the time, made carbonic anhydrase the first known zinc-containing enzyme. 
At present, hundreds of enzymes are known to contain zinc. In fact, more 
than one-third of all enzymes either contain bound metal ions or require the 
addition of such ions for activity. Metal ions have several properties that 
increase chemical reactivity: their positive charges, their ability to form 
strong yet kinetically labile bonds, and, in some cases, their capacity to be 
stable in more than one oxidation state. The chemical reactivity of metal 
ions explains why catalytic strategies that employ metal ions have been 
adopted throughout evolution. 

X-ray crystallographic studies have supplied the most-detailed and 
direct information about the zinc site in carbonic anhydrase. At least seven 
carbonic anhydrases, each with its own gene, are present in human 
beings. They are all clearly homologous, as revealed by substantial sequence 
identity. Carbonic anhydrase II, a major protein component of red blood 
cells, has been the most extensively studied (Figure 9.21). It is also one of 
the most active carbonic anhydrases. 

Zinc is found only in the +2 state in biological systems. A zinc atom is 
essentially always bound to four or more ligands; in carbonic anhydrase, 
three coordination sites are occupied by the imidazole rings of three histi- 
dine residues and an additional coordination site is occupied by a water 
Figure 9.21 The structure of
human carbonic anhydrase II 
and its zinc site. (Left) Notice that 
the zinc ion is bound to the 
imidazole rings of three histidine 
residues as well as to a water 
molecule. (Right) Notice the 
location of the zinc site in a cleft 
near the center of the enzyme. 
[Drawn from 1CA2.pdb.] 
Figure 9.22 Effect of pH on carbonic
anhydrase activity. Changes in pH alter the 
rate of carbon dioxide hydration catalyzed by 
carbonic anhydrase II. The enzyme is 
maximally active at high pH. 


molecule (or hydroxide ion, depending on pH). Because the molecules 
occupying the coordination sites are neutral, the overall charge on the 
Zn(His)3 unit remains +2. 


Catalysis entails zinc activation of a water molecule 


How does this zinc complex facilitate carbon dioxide hydration? A major 
clue comes from the pH profile of enzymatically catalyzed carbon dioxide 
hydration (Figure 9.22). 

At pH 8, the reaction proceeds near its maximal rate. As the pH decreas- 
es, the rate of the reaction drops. The midpoint of this transition is near pH 
7, suggesting that a group that loses a proton at pH 7 ($\mathrm{p}K_\mathrm{a} = 7$) plays an
important role in the activity of carbonic anhydrase. Moreover, the curve 
suggests that the deprotonated (high pH) form of this group participates 
more effectively in catalysis. Although some amino acids, notably histidine, 
have $\mathrm{p}K_\mathrm{a}$ values near 7, a variety of evidence suggests that the group responsible
for this transition is not an amino acid but is the zinc-bound water molecule. 
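The shape of such a pH profile follows from the fraction of the titratable group that is deprotonated at each pH. The sketch below assumes a single group obeying the Henderson-Hasselbalch relation and assumes that activity is simply proportional to the deprotonated fraction; the function name is hypothetical.

```python
def fraction_deprotonated(pH: float, pKa: float = 7.0) -> float:
    """Fraction of a single titratable group in its deprotonated form.

    From the Henderson-Hasselbalch relation:
    [A-] / ([HA] + [A-]) = 1 / (1 + 10 ** (pKa - pH)).
    """
    return 1.0 / (1.0 + 10.0 ** (pKa - pH))


# Relative activity expected if only the deprotonated form is active.
for pH in (5.0, 6.0, 7.0, 8.0, 9.0):
    print(f"pH {pH:.0f}: {fraction_deprotonated(pH):.3f}")
```

Under these assumptions the activity is half-maximal at pH 7 and above 90% of its maximum at pH 8, in line with the profile described above.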

The binding of a water molecule to the positively charged zinc center 
reduces the $\mathrm{p}K_\mathrm{a}$ of the water molecule from 15.7 to 7 (Figure 9.23).

Figure 9.23 The $\mathrm{p}K_\mathrm{a}$ of zinc-bound water. Binding to zinc lowers the $\mathrm{p}K_\mathrm{a}$ of water from 15.7 to 7.


With the $\mathrm{p}K_\mathrm{a}$ lowered, many water molecules lose a proton at neutral pH,
generating a substantial concentration of hydroxide ion (bound to the zinc
atom). A zinc-bound hydroxide ion (OH⁻) is a potent nucleophile able to
attack carbon dioxide much more readily than water does. Adjacent to the 
zinc site, carbonic anhydrase also possesses a hydrophobic patch that serves 
as a binding site for carbon dioxide (Figure 9.24). Based on these obser- 
vations, a simple mechanism for carbon dioxide hydration can be proposed 
(Figure 9.25): 


1. The zinc ion facilitates the release of a proton from a water molecule, 
which generates a hydroxide ion. 


2. The carbon dioxide substrate binds to the enzyme’s active site and is 
positioned to react with the hydroxide ion. 


Figure 9.24 Carbon dioxide binding site. 
Crystals of carbonic anhydrase were exposed 
to carbon dioxide gas at high pressure and 
low temperature and x-ray diffraction data 
were collected. The electron density for 
carbon dioxide, clearly visible adjacent to the 
zinc and its bound water, reveals the carbon 
dioxide binding site. [After J. F. Domsic, B. S. 
Avvaru, C. U. Kim, S. M. Gruner, M. Agbandje-
McKenna, D. N. Silverman, and R. McKenna.
J. Biol. Chem. 283:30766-30771, 2008.] 


3. The hydroxide ion attacks the carbon dioxide, converting it into
bicarbonate ion, HCO₃⁻.


4. The catalytic site is regenerated with the release of HCO₃⁻ and the
binding of another molecule of water.


Thus, the binding of a water molecule to the zinc ion favors the formation 
of the transition state by facilitating proton release and by positioning the 
water molecule to be in close proximity to the other reactant. 

Studies of a synthetic analog model system provide evidence for the mech- 
anism’s plausibility. A simple synthetic ligand binds zinc through four 
nitrogen atoms (compared with three histidine nitrogen atoms in the 
enzyme), as shown in Figure 9.26. One water molecule remains bound to 
the zinc ion in the complex. Direct measurements reveal that this water 
molecule has a $\mathrm{p}K_\mathrm{a}$ value of 8.7, not as low as the value for the water
molecule in carbonic anhydrase but substantially lower than the value for free
water. At pH 9.2, this complex accelerates the hydration of carbon dioxide 
more than 100-fold. Although its rate of catalysis is much less efficient than 
catalysis by carbonic anhydrase, the model system strongly suggests that 
the zinc-bound hydroxide mechanism is likely to be correct. Carbonic 
anhydrases have evolved to employ the reactivity intrinsic to a zinc-bound 
hydroxide ion as a potent catalyst. 




A proton shuttle facilitates rapid regeneration of the 
active form of the enzyme 


As noted earlier, some carbonic anhydrases can hydrate carbon dioxide at 
rates as high as a million times a second ($10^{6}\ \mathrm{s^{-1}}$). The magnitude of this rate
can be understood from the following observations. In the first step of a 
carbon dioxide hydration reaction, the zinc-bound water molecule must 
lose a proton to regenerate the active form of the enzyme (Figure 9.27). The 
rate of the reverse reaction, the protonation of the zinc-bound hydroxide 
ion, is limited by the rate of proton diffusion. Protons diffuse very rapidly 
with second-order rate constants near $10^{11}\ \mathrm{M^{-1}\,s^{-1}}$. Thus, the backward
Figure 9.25 Mechanism of carbonic
anhydrase. The zinc-bound hydroxide 
mechanism for the hydration of carbon 
dioxide reveals one aspect of metal ion 
catalysis. The reaction proceeds in four steps: 
(1) water deprotonation; (2) carbon dioxide 
binding; (3) nucleophilic attack by hydroxide 
on carbon dioxide; and (4) displacement of 
bicarbonate ion by water. 


Figure 9.26 A synthetic analog model 
system for carbonic anhydrase. (A) An 
organic compound, capable of binding zinc, 
was synthesized as a model for carbonic 
anhydrase. The zinc complex of this ligand 
accelerates the hydration of carbon dioxide 
more than 100-fold under appropriate 
conditions. (B) The structure of the presumed 
active complex showing zinc bound to the 
ligand and to one water molecule. 


Figure 9.27 Kinetics of water 
deprotonation. The kinetics of deprotonation 
and protonation of the zinc-bound water 
molecule in carbonic anhydrase. 
Figure 9.28 The effect of buffer on
deprotonation. The deprotonation of the 
zinc-bound water molecule in carbonic 
anhydrase is aided by buffer component B. 
rate constant $k_{-1}$ must be less than $10^{11}\ \mathrm{M^{-1}\,s^{-1}}$. Because the equilibrium
constant $K$ is equal to $k_1/k_{-1}$, the forward rate constant is given by
$k_1 = K \cdot k_{-1}$. Thus, if $k_{-1} \le 10^{11}\ \mathrm{M^{-1}\,s^{-1}}$ and $K = 10^{-7}\ \mathrm{M}$ (because $\mathrm{p}K_\mathrm{a} = 7$), then
$k_1$ must be less than or equal to $10^{4}\ \mathrm{s^{-1}}$. In other words, the rate of proton
diffusion limits the rate of proton release to less than $10^{4}\ \mathrm{s^{-1}}$ for a group
with $\mathrm{p}K_\mathrm{a} = 7$. However, if carbon dioxide is hydrated at a rate of $10^{6}\ \mathrm{s^{-1}}$,
then every step in the mechanism (see Figure 9.25) must take place at least 
this fast. How is this apparent paradox resolved? 
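The size of the discrepancy is easy to verify numerically. The sketch below uses only the values quoted in the text; the variable names are our own.

```python
# Upper bound on the protonation (backward) rate constant, set by proton diffusion.
k_minus1_max = 1e11          # M^-1 s^-1

# Acid dissociation constant of the zinc-bound water (pKa = 7).
pKa = 7.0
K = 10.0 ** (-pKa)           # M

# Because K = k1 / k_minus1, the bound on k_minus1 caps the forward rate constant.
k1_max = K * k_minus1_max    # s^-1

# Turnover rate observed for the most active carbonic anhydrases.
required_rate = 1e6          # s^-1

# Factor by which direct proton release to water falls short.
shortfall = required_rate / k1_max

print(f"maximum rate of proton release: {k1_max:.0e} s^-1")
print(f"required rate exceeds this limit by a factor of about {shortfall:.0f}")
```

Direct release of the proton to water is therefore about a hundredfold too slow to account for the observed turnover.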

The answer became clear with the realization that the highest rates of
carbon dioxide hydration require the presence of buffer, suggesting that the
buffer components participate in the reaction. The buffer can bind or release
protons. The advantage is that, whereas the concentrations of protons and
hydroxide ions are limited to $10^{-7}\ \mathrm{M}$ at neutral pH, the concentration of
buffer components can be much higher, of the order of several millimolar. If
the buffer component BH⁺ has a $\mathrm{p}K_\mathrm{a}$ of 7 (matching that for the zinc-bound
water molecule), then the equilibrium constant for the reaction in Figure 9.28
is 1. The rate of proton abstraction is given by $k_1' \cdot [\mathrm{B}]$. The second-order
rate constants $k_1'$ and $k_{-1}'$ will be limited by buffer diffusion to values less
than approximately $10^{9}\ \mathrm{M^{-1}\,s^{-1}}$. Thus, buffer concentrations greater than
$[\mathrm{B}] = 10^{-3}\ \mathrm{M}$ (1 mM) may be high enough to support carbon dioxide hydration
rates of $10^{6}\ \mathrm{s^{-1}}$ because
$k_1' \cdot [\mathrm{B}] = (10^{9}\ \mathrm{M^{-1}\,s^{-1}}) \cdot (10^{-3}\ \mathrm{M}) = 10^{6}\ \mathrm{s^{-1}}$.
This prediction is confirmed experimentally (Figure 9.29).

Figure 9.29 The effect of buffer concentration on the rate
of carbon dioxide hydration. The rate of carbon dioxide
hydration increases with the concentration of the buffer
1,2-dimethylbenzimidazole. The buffer enables the
enzyme to achieve its high catalytic rates.
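The dependence of the maximal proton-transfer rate on buffer concentration can be tabulated with a short sketch. It assumes the diffusion-limited value of 10⁹ M⁻¹ s⁻¹ quoted in the text for the buffer rate constant; the function names and the chosen concentrations are illustrative.

```python
# Approximate diffusion limit for buffer-mediated proton transfer, from the text.
K_BUFFER_DIFFUSION_LIMIT = 1e9   # M^-1 s^-1


def proton_abstraction_rate(buffer_conc: float,
                            k1_prime: float = K_BUFFER_DIFFUSION_LIMIT) -> float:
    """Pseudo-first-order rate (s^-1) of proton abstraction by buffer at buffer_conc (M)."""
    return k1_prime * buffer_conc


def min_buffer_conc(target_rate: float,
                    k1_prime: float = K_BUFFER_DIFFUSION_LIMIT) -> float:
    """Lowest buffer concentration (M) able to support target_rate (s^-1)."""
    return target_rate / k1_prime


for conc in (1e-5, 1e-4, 1e-3, 1e-2):
    rate = proton_abstraction_rate(conc)
    print(f"[B] = {conc:.0e} M -> up to {rate:.0e} s^-1")

print(f"buffer needed for 1e6 s^-1: {min_buffer_conc(1e6):.0e} M")
```

Only at millimolar buffer concentrations does the proton-transfer step keep pace with a turnover rate of a million per second.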


Figure 9.30 Histidine proton shuttle. 

(1) Histidine 64 abstracts a proton from the 
zinc-bound water molecule, generating a 
nucleophilic hydroxide ion and a protonated 
histidine. (2) The buffer (B) removes a proton 
from the histidine, regenerating the 
unprotonated form. 
The molecular components of many buffers are too
large to reach the active site of carbonic anhydrase. Carbonic 
anhydrase II has evolved a proton shuttle to allow buffer 
components to participate in the reaction from solution. 
The primary component of this shuttle is histidine 64. 

This residue transfers protons from the zinc-bound water molecule to the 
protein surface and then to the buffer (Figure 9.30). Thus, catalytic function 
has been enhanced through the evolution of an apparatus for controlling 
proton transfer from and to the active site. Because protons participate in 
many biochemical reactions, the manipulation of the proton inventory 
within active sites is crucial to the function of many enzymes and explains 
the prominence of acid—base catalysis. 
Convergent evolution has generated zinc-based active
sites in different carbonic anhydrases 


Carbonic anhydrases homologous to the human enzymes, referred to
as α-carbonic anhydrases, are common in animals and in some bacteria
and algae. In addition, two other families of carbonic anhydrases have been
discovered. Proteins in these families contain the zinc ion required for
catalytic activity but are not significantly similar in sequence to the α-carbonic
anhydrases. The β-carbonic anhydrases are found in higher plants and in
many bacterial species, including E. coli. Spectroscopic and structural studies
reveal that the zinc ion is bound by one histidine residue and two cysteine
residues. Moreover, the overall enzyme structures are unrelated to those of
the α-carbonic anhydrases. In plants, these enzymes facilitate the accumulation
of carbon dioxide, crucial for the Calvin cycle in photosynthesis. A
third family, the γ-carbonic anhydrases, was initially identified in the
archaeon Methanosarcina thermophila. The crystal structure of this enzyme
reveals three zinc sites extremely similar to the zinc site in the α-carbonic
anhydrases. In this case, however, the three zinc sites lie at the interfaces
between the three subunits of a trimeric enzyme (Figure 9.31). The very
striking left-handed β-helical structure (a β strand twisted into a left-handed
helix) present in this enzyme is, again, different from any structure
present in the α- and β-carbonic anhydrases. Thus, convergent evolution
has generated carbonic anhydrases that rely on coordinated zinc ions at least
three times.
Figure 9.31 γ-Carbonic anhydrase.
(Left) The zinc site of γ-carbonic anhydrase.
Notice that the water-binding zinc ion is
bound to three histidine residues. (Middle)
The trimeric structure of the protein (individual
chains are labeled A, B, and C). Each chain
consists primarily of a left-handed β helix.
(Right) The protein is rotated to show a top- 
down view that highlights its threefold 
symmetry. Notice the position of the zinc sites 
(green) at the interfaces between subunits. 
[Drawn from 1THJ.pdb.] 


9.3 Restriction Enzymes Catalyze Highly Specific 
DNA-Cleavage Reactions 


We next consider a hydrolytic reaction that results in the cleavage of DNA. 
Bacteria and archaea have evolved mechanisms to protect themselves from 
viral infections. Many viruses inject their DNA genomes into cells; once 
inside, the viral DNA hijacks the cell’s machinery to drive the production of
viral proteins and, eventually, of progeny virus. Often, a viral infection 
results in the death of the host cell. A major protective strategy for the host 
is to use restriction endonucleases (restriction enzymes) to degrade the viral 
DNA on its introduction into a cell. These enzymes recognize particular 
base sequences, called recognition sequences or recognition sites, in their target 
DNA and cleave that DNA at defined positions. We have already considered 
the utility of these important enzymes for dissecting genes and genomes 
(Section 5.2). The most well studied class of restriction enzymes comprises 
the type II restriction enzymes, which cleave DNA within their recognition 
sequences. Other types of restriction enzymes cleave DNA at positions 
somewhat distant from their recognition sites. 

Restriction endonucleases must show tremendous specificity at two 
levels. First, they must not degrade host DNA containing the recognition 
sequences. Second, they must cleave only DNA molecules that contain rec- 
ognition sites (hereafter referred to as cognate DNA) without cleaving DNA 
molecules that lack these sites. How do these enzymes manage to degrade 
viral DNA while sparing their own? In E. coli, the restriction endonuclease 
EcoRV cleaves double-stranded viral DNA molecules that contain the 
sequence 5'-GATATC-3’ but leaves intact host DNA containing hundreds 
of such sequences. We shall return to the strategy by which host cells protect 
their own DNA at the end of this section. 

Restriction enzymes must cleave DNA only at recognition sites, without 
cleaving at other sites. Suppose that a recognition sequence is six base pairs 
long. Because there are 4⁶, or 4096, sequences having six base pairs, the concentration
of sites that must not be cleaved will be approximately 4000-fold
higher than the concentration of sites that should be cleaved. Thus, to keep 
from damaging host-cell DNA, restriction enzymes must cleave cognate 
DNA molecules much more than 4000 times as efficiently as they cleave 
nonspecific sites. We shall return to the mechanism used to achieve the 
necessary high specificity after considering the chemistry of the cleavage 
process. 
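The count behind this estimate is easy to verify. The following is a minimal sketch, assuming that all four bases are equally likely and independent at every position (a simplification that real genomes do not obey exactly); the function name is hypothetical and chosen only for illustration.

```python
from itertools import product

BASES = "ACGT"

def count_sequences(length: int) -> int:
    """Number of distinct DNA sequences of the given length (4 ** length)."""
    return len(BASES) ** length

# Enumerate every six-base sequence explicitly to confirm the closed-form count.
all_hexamers = ["".join(p) for p in product(BASES, repeat=6)]

n_total = count_sequences(6)   # all possible six-base sequences
n_noncognate = n_total - 1     # every hexamer other than the one recognition sequence

print(n_total, n_noncognate)
```

Under this assumption, a six-base-pair recognition sequence is outnumbered by roughly 4000 other hexamers, which is the source of the "4000-fold" figure used above.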


Cleavage is by in-line displacement of 3’-oxygen from 
phosphorus by magnesium-activated water 


A restriction endonuclease catalyzes the hydrolysis of the phosphodiester 
backbone of DNA. Specifically, the bond between the 3’-oxygen atom and 
the phosphorus atom is broken. The products of this reaction are DNA 
strands with a free 3'-hydroxyl group and a 5’-phosphoryl group at the 
cleavage site (Figure 9.32). This reaction proceeds by nucleophilic attack 
at the phosphorus atom. We will consider two alternative mechanisms, 
suggested by analogy with the proteases. The restriction endonuclease 
might cleave DNA by mechanism 1 through a covalent intermediate, 
employing a potent nucleophile (Nu), or by mechanism 2 through direct 
hydrolysis:

Figure 9.32 Hydrolysis of a phosphodiester bond. All restriction enzymes catalyze the
hydrolysis of DNA phosphodiester bonds, leaving a phosphoryl group attached to the 5’ end. The 
bond that is cleaved is shown in red. 


Mechanism 1 (covalent intermediate)

Each mechanism postulates a different nucleophile to attack the phosphorus
atom. In either case, each reaction takes place by in-line displacement.

The incoming nucleophile attacks the phosphorus atom, and a pentacoordinate
transition state is formed. This species has a trigonal bipyramidal
geometry centered at the phosphorus atom, with the incoming nucleophile 
at one apex of the two pyramids and the group that is displaced (the leaving 
group, L) at the other apex. Note that the displacement inverts the stereochemical
configuration at the tetrahedral phosphorus atom, analogous to
the interconversion of the R and S configurations around a tetrahedral
carbon center (Section 2.1).

The two mechanisms differ in the number of times that the displace- 
ment takes place in the course of the reaction. In the first type of mecha- 
nism, a nucleophile in the enzyme (analogous to serine 195 in chymotrypsin) 
attacks the phosphate group to form a covalent intermediate. In a second 
step, this intermediate is hydrolyzed to produce the final products. In this 
case, two displacement reactions take place at the phosphorus atom. 
Consequently, the stereochemical configuration at the phosphorus atom 
would be inverted and then inverted again, and the overall configuration 
would be retained. In the second type of mechanism, analogous to that used 
by the aspartyl- and metalloproteases, an activated water molecule attacks 
the phosphorus atom directly. In this mechanism, a single displacement 
reaction takes place at the phosphorus atom. Hence, the stereochemical 
configuration at the phosphorus atom is inverted after cleavage. To deter- 
mine which mechanism is correct, we examine the stereochemistry at the 
phosphorus atom after cleavage. 
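The logic of this test reduces to counting displacements: each in-line displacement inverts the configuration at phosphorus, so an odd number of displacements gives net inversion and an even number gives net retention. The sketch below simply encodes that parity argument; the function and dictionary names are hypothetical.

```python
def stereochemical_outcome(n_displacements: int) -> str:
    """Net configuration at phosphorus after n in-line displacements.

    Each in-line displacement inverts the configuration, so only the
    parity of the count matters.
    """
    if n_displacements < 1:
        raise ValueError("at least one displacement is required for cleavage")
    return "inverted" if n_displacements % 2 == 1 else "retained"

# Number of displacements at phosphorus in each candidate mechanism.
MECHANISMS = {
    "mechanism 1 (covalent intermediate)": 2,  # enzyme nucleophile, then water
    "mechanism 2 (direct hydrolysis)": 1,      # activated water only
}

outcomes = {name: stereochemical_outcome(n) for name, n in MECHANISMS.items()}
for name, result in outcomes.items():
    print(f"{name}: configuration {result}")
```

An experimentally observed inversion is therefore the signature of a single displacement, and retention would point to a two-step pathway.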

A difficulty is that the stereochemistry is not easily observed, because 
two of the groups bound to the phosphorus atom are simple oxygen atoms, 
identical with each other. This difficulty can be circumvented by replacing 
one oxygen atom with sulfur (producing a species called a phosphorothioate).
Let us consider EcoRV endonuclease. This enzyme cleaves the phosphodiester
bond between the T and the A at the center of the recognition
sequence 5'-GATATC-3'. The first step is to synthesize
an appropriate substrate for EcoRV containing
phosphorothioates at the sites of cleavage (Figure 9.33).
The reaction is then performed in water that has
been greatly enriched in ¹⁸O to allow the incoming
oxygen atom to be marked. The location of the ¹⁸O
label with respect to the sulfur atom indicates whether
the reaction proceeds with inversion or retention of stereochemistry.
The analysis revealed that the stereochemical
configuration at the phosphorus atom was inverted
only once with cleavage. This result is consistent with a
direct attack by water at the phosphorus atom and rules
out the formation of any covalently bound intermediate
(Figure 9.34).

Figure 9.33 Labeling with phosphorothioates. Phosphorothioate
groups, in which one of the nonbridging oxygen atoms is replaced by a
sulfur atom, can be used to label specific sites in the DNA backbone to
determine the overall stereochemical course of a displacement reaction.
Here, a phosphorothioate is placed at sites that can be cleaved by
EcoRV endonuclease.

Figure 9.34 Stereochemistry of cleaved
DNA. Cleavage of DNA by EcoRV
endonuclease results in overall inversion of
the stereochemical configuration at the
phosphorus atom, as indicated by the
stereochemistry of the phosphorus atom
bound to one bridging oxygen atom, one ¹⁶O,
one ¹⁸O, and one sulfur atom. This
configuration strongly suggests that the
hydrolysis takes place by water's direct attack
at the phosphorus atom.


Restriction enzymes require magnesium for 
catalytic activity 


Many enzymes that act on phosphate-containing substrates
require Mg²⁺ or some other similar divalent
cation for activity. One or more Mg²⁺ (or similar) cations
are essential to the function of restriction endonucleases.
What are the functions of these metal ions?

Direct visualization of the complex between EcoRV endonuclease and 
cognate DNA molecules in the presence of Mg²⁺ by crystallization has not
been possible, because the enzyme cleaves the substrate under these circumstances.
Nonetheless, metal ion complexes can be visualized through
several approaches. In one approach, crystals of EcoRV endonuclease are
prepared bound to oligonucleotides that contain the enzyme's recognition
sequence. These crystals are grown in the absence of magnesium to prevent
cleavage; after their preparation, the crystals are soaked in solutions containing
the metal. Alternatively, crystals have been grown with the use of a
mutated form of the enzyme that is less active. Finally, Mg²⁺ can be
replaced by metal ions such as Ca²⁺ that bind but do not result in much
catalytic activity. In all cases, no cleavage takes place, and so the locations of
the metal ion-binding sites are readily determined. 

As many as three metal ions have been found to be present per active 
site. The roles of these multiple metal ions are still under investigation. One
ion-binding site is occupied in essentially all structures. This metal ion is 
coordinated to the protein through two aspartate residues and to one of the 
phosphate-group oxygen atoms near the site of cleavage. This metal ion 
binds the water molecule that attacks the phosphorus atom, helping to position
and activate it in a manner similar to that for the Zn²⁺ ion of carbonic
anhydrase (Figure 9.35).


The complete catalytic apparatus is assembled only within
complexes of cognate DNA molecules, ensuring specificity 


We now return to the question of specificity, the defining feature of restric- 
tion enzymes. The recognition sequences for most restriction endonucleases 
are inverted repeats. This arrangement gives the three-dimensional structure 
of the recognition site a twofold rotational symmetry (Figure 9.36). 
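An inverted repeat in double-stranded DNA is a sequence that reads the same 5' to 3' on both strands, that is, one that equals its own reverse complement. The following sketch checks this property for the EcoRV site; the helper names are hypothetical, and only the four standard bases are handled.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))

def is_inverted_repeat(seq: str) -> bool:
    """True if both strands read identically 5' to 3' (twofold symmetry)."""
    return seq.upper() == reverse_complement(seq)

ecorv_site = "GATATC"
print(reverse_complement(ecorv_site))   # opposite strand, 5' to 3'
print(is_inverted_repeat(ecorv_site))   # symmetric site
print(is_inverted_repeat("GATTTC"))     # a single change breaks the symmetry
```

Because the two strands of the site are identical when each is read 5' to 3', rotating the site by 180 degrees about its center superimposes it on itself, which is the twofold symmetry shown in Figure 9.36.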

The restriction enzymes display a corresponding symmetry: they are 
dimers whose two subunits are related by twofold rotational symmetry. The 
matching symmetry of the recognition sequence and the enzyme facilitates 
the recognition of cognate DNA by the enzyme. This similarity in structure 
has been confirmed by the determination of the structure of the complex 
between EcoRV endonuclease and DNA fragments containing its recogni- 
tion sequence (Figure 9.37). The enzyme surrounds the DNA in a tight 
embrace. 

An enzyme’s binding affinity for substrates often determines specificity. 
Surprisingly, however, binding studies performed in the absence of magne- 
sium have demonstrated that the EcoRV endonuclease binds to all sequences, 
both cognate and noncognate, with approximately equal affinity. Why, 
then, does the enzyme cleave only cognate sequences? The answer lies in a 
unique set of interactions between the enzyme and a cognate DNA 
sequence. 

Within the 5’-GATATC-3’ sequence, the G and A bases at the 5’ end 
of each strand and their Watson-Crick partners directly contact the enzyme
by hydrogen bonding with residues that are located in two loops, one

Figure 9.35 A magnesium ion-binding
site in EcoRV endonuclease. The 
magnesium ion helps to activate a water 
molecule and positions it so that it can attack 
the phosphorus atom. 


Figure 9.36 Structure of the recognition 
site of EcoRV endonuclease. (A) The 
sequence of the recognition site, which is 
symmetric around the axis of rotation 
designated in green. (B) The inverted repeat 
within the recognition sequence of EcoRV 
(and most other restriction endonucleases) 
endows the DNA site with twofold rotational 
symmetry. 


The structures of complexes formed with noncognate DNA fragments 
are strikingly different from those formed with cognate DNA: the noncog- 
nate DNA conformation is not substantially distorted (Figure 9.39). This 
lack of distortion has important consequences with regard to catalysis. No phos- 
phate is positioned sufficiently close to the active-site aspartate residues to 
complete a magnesium ion-binding site (see Figure 9.35). Hence, the nonspe- 
cific complexes do not bind the magnesium ions and the complete catalytic 
apparatus is never assembled. The distortion of the substrate and the subse- 
quent binding of the magnesium ion account for the catalytic specificity of 
more than 1,000,000-fold that is observed for EcoRV endonuclease. Thus,
enzyme specificity may be determined by the specificity of enzyme action rather 
than the specificity of substrate binding. 

We can now see the role of binding energy in this strategy for attaining 
catalytic specificity. The distorted DNA makes additional contacts with the 
enzyme, increasing the binding energy. However, the increase in binding 
energy is canceled by the energetic cost of distorting the DNA from its 
relaxed conformation (Figure 9.40). Thus, for EcoRV endonuclease, there is 
little difference in binding affinity for cognate and nonspecific DNA frag- 
ments. However, the distortion in the cognate complex dramatically affects 
catalysis by completing the magnesium ion-binding site. This example 
illustrates how enzymes can utilize available binding energy to deform sub- 
strates and poise them for chemical transformation. Interactions that take 
place within the distorted substrate complex stabilize the transition state 
leading to DNA hydrolysis.


Host-cell DNA is protected by the addition of methyl
groups to specific bases 


How does a host cell harboring a restriction enzyme protect its own DNA? 
The host DNA is methylated on specific adenine bases within host recognition
sequences by other enzymes called methylases (Figure 9.41). An endonuclease
will not cleave DNA if its recognition sequence is methylated. For
each restriction endonuclease, the host cell produces a corresponding methylase
that marks the host DNA at the appropriate methylation site. These
pairs of enzymes are referred to as restriction-modification systems. 
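The behavior of a restriction-modification system can be sketched as a toy model. The example below is only an illustration under strong simplifying assumptions: a single strand stands in for the duplex, methylation is reduced to a yes-or-no flag for each site, and EcoRV is taken to cut between the central T and A of GATATC, as described earlier in this section. All function names and the test sequence are hypothetical.

```python
RECOGNITION_SITE = "GATATC"  # EcoRV recognition sequence
CUT_OFFSET = 3               # cleavage between the central T and A: GAT|ATC

def find_sites(dna: str) -> list[int]:
    """Start index of every occurrence of the recognition sequence."""
    dna = dna.upper()
    width = len(RECOGNITION_SITE)
    return [i for i in range(len(dna) - width + 1)
            if dna[i:i + width] == RECOGNITION_SITE]

def digest(dna: str, methylated_sites: frozenset = frozenset()) -> list[str]:
    """Fragments produced by cutting at every unmethylated site.

    methylated_sites holds the start indices of protected sites.
    """
    dna = dna.upper()
    cuts = [i + CUT_OFFSET for i in find_sites(dna) if i not in methylated_sites]
    fragments, start = [], 0
    for cut in cuts:
        fragments.append(dna[start:cut])
        start = cut
    fragments.append(dna[start:])
    return fragments

sequence = "AAGATATCGGCCGATATCTT"   # made-up sequence with two sites
sites = find_sites(sequence)

viral_fragments = digest(sequence)                  # unmethylated: cut at both sites
host_fragments = digest(sequence, frozenset(sites)) # fully methylated: left intact
print(viral_fragments)
print(host_fragments)
```

In this model the same sequence is cleaved into three fragments when unmethylated and survives intact when both sites are marked, mirroring the way the methylase protects host DNA from its partner endonuclease.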

The distortion in the DNA explains how methylation blocks catalysis
and protects host-cell DNA. The host E. coli adds a methyl group to the
amino group of the adenine nucleotide at the 5' end of the recognition
sequence. The presence of the methyl group blocks the formation of a
hydrogen bond between the amino group and the side-chain carbonyl group
of asparagine 185 (Figure 9.42). This asparagine residue is closely linked to
the other amino acids that form specific contacts with the DNA. The
absence of the hydrogen bond disrupts other interactions between
the enzyme and the DNA substrate, and the distortion necessary for
cleavage will not take place.

Figure 9.40 Greater binding energy of
EcoRV endonuclease bound to cognate
versus noncognate DNA. The additional
interactions between EcoRV endonuclease
and cognate DNA increase the binding energy,
which can be used to drive DNA distortions
necessary for forming a catalytically competent
complex.

Figure 9.41 Protection by methylation.
The recognition sequence for EcoRV
endonuclease (left) and the sites of
methylation (right) in DNA protected from the
catalytic action of the enzyme.

Figure 9.42 Methylation of adenine. The
methylation of adenine blocks the formation
of hydrogen bonds between EcoRV
endonuclease and cognate DNA molecules
and prevents their hydrolysis.

Figure 9.43 A conserved structural
core in type II restriction enzymes. Four
conserved structural elements, including the
active-site region (in blue), are highlighted in
color in these models of a single monomer
from each dimeric enzyme. Notice that these
elements adopt similar structures in each
enzyme. The positions of the amino acid
sequences that form these elements within
each overall sequence are represented
schematically below each structure. [Drawn
from 1RVB.pdb; 1ERI.pdb; 1BHM.pdb.]


Type II restriction enzymes have a catalytic core in common and are 
probably related by horizontal gene transfer 


Type II restriction enzymes are prevalent in Archaea and Bacteria.
What can we tell of the evolutionary history of these enzymes? 
Comparison of the amino acid sequences of a variety of type II restriction 
endonucleases did not reveal significant sequence similarity between most 
pairs of enzymes. However, a careful examination of three-dimensional 
structures, taking into account the location of the active sites, revealed the 
presence of a core structure conserved in the different enzymes. This structure
includes β strands that contain the aspartate (or, in some cases, glutamate)
residues forming the magnesium ion-binding sites (Figure 9.43).
These observations indicate that many type II restriction enzymes are 
indeed evolutionarily related. Analyses of the sequences in greater detail 
suggest that bacteria may have obtained genes encoding these enzymes from 
other species by horizontal gene transfer, the passing between species of 
pieces of DNA (such as plasmids) that provide a selective advantage in a 
particular environment. For example, EcoRI (from E. coli) and RsrI (from 
Rhodobacter sphaeroides) are 50% identical in sequence over 266 amino 
acids, clearly indicative of a close evolutionary relationship. However, these 
species of bacteria are not closely related. Thus, these species appear to 
have obtained the gene for these restriction endonucleases from a common source 
more recently than the time of their evolutionary divergence. Moreover, the 
codons used by the gene encoding EcoRI endonuclease to specify given
amino acids are strikingly different from the codons used by most E. coli
genes, which suggests that the gene did not originate in E. coli.

Horizontal gene transfer may be a common event. For example, genes
that inactivate antibiotics are often transferred, leading to the transmission
of antibiotic resistance from one species to another. For restriction-modification
systems, protection against viral infections may have favored
horizontal gene transfer.


9.4 Myosins Harness Changes in Enzyme Conformation to 
Couple ATP Hydrolysis to Mechanical Work 


The final enzymes that we will consider are the myosins. These enzymes 
catalyze the hydrolysis of adenosine triphosphate (ATP) to form adenosine 
diphosphate (ADP) and inorganic phosphate (Pᵢ) and use the energy associated
with this thermodynamically favorable reaction to drive the motion of
molecules within cells.

For example, when we lift a book, the energy required comes from ATP
hydrolysis catalyzed by myosin in our muscles. Myosins are found in all 
eukaryotes and the human genome encodes more than 40 different myosins. 
Myosins generally have elongated structures with globular domains that 
actually carry out ATP hydrolysis (Figure 9.44). In this chapter, we will 
focus on the globular ATPase domains, particularly the strategies that allow
myosins to hydrolyze ATP in a controlled manner and to use the free energy 
associated with this reaction to promote substantial conformational changes 
within the myosin molecule. These conformational changes are amplified 
by other structures in the elongated myosin molecules to transport proteins 
or other cargo substantial distances within cells. In Chapter 35, we will 
examine the action of myosins and other molecular-motor proteins in much 
more detail. 

As will be discussed in Chapter 15, ATP is used as the major currency 
of energy inside cells. Many enzymes use ATP hydrolysis to drive other 
reactions and processes. In almost all cases, an enzyme that hydrolyzed 
ATP without any such coupled processes would simply drain the energy 
reserves of a cell without benefit. 


ATP hydrolysis proceeds by the attack of water on 
the gamma phosphoryl group 


In our examination of the mechanism of restriction enzymes, we learned
that an activated water molecule performs a nucleophilic attack on phosphorus
to cleave the phosphodiester backbone of DNA. The cleavage of
ATP by myosins follows an analogous mechanism. To understand the myosin
mechanism in more detail, we must first examine the structure of the
myosin ATPase domain.

The structures of the ATPase domains of several different myosins
have been examined. One such domain, that from the soil-living amoeba
Dictyostelium discoideum, an organism that has been
extremely useful for studying cell movement and molecular-motor
proteins, has been studied in great detail. The
crystal structure of this protein fragment in the absence of
nucleotides revealed a single globular domain comprising
approximately 750 amino acids. A water-filled pocket is
present toward the center of the structure, suggesting a
possible nucleotide-binding site. Crystals of this protein
were soaked in a solution containing ATP and the structure
was examined again. Remarkably, this structure
revealed intact ATP bound in the active site with very little
change in the overall structure and without evidence of
significant hydrolysis (Figure 9.45). The ATP is also
bound to a Mg²⁺ ion.

Figure 9.44 Elongated structure of
muscle myosin. An electron micrograph
showing myosin from mammalian muscle.
This dimeric protein has an elongated
structure with two globular ATPase domains
per dimer. [Courtesy of Dr. Paula Flicker,
Dr. Theo Walliman, and Dr. Peter Vibert.]

Figure 9.45 Myosin-ATP complex structure. An overlay of
the structures of the ATPase domain from Dictyostelium discoideum
myosin with no ligands bound (blue) and the complex of this protein
with ATP and magnesium bound (red). Notice that the two structures
are extremely similar to one another. [Drawn from 1FMV.pdb and
1FMW.pdb.]

Kinetic studies of myosins, as well as many other
enzymes having ATP or other nucleoside triphosphates as
a substrate, reveal that these enzymes are essentially inactive
in the absence of divalent metal ions such as magnesium
(Mg²⁺) or manganese (Mn²⁺) but acquire activity on
the addition of these ions. In contrast with the enzymes
discussed so far, the metal is not a component of the active site. Rather,
nucleotides such as ATP bind these ions, and it is the metal ion-nucleotide
complex that is the true substrate for the enzymes. The dissociation constant
for the ATP-Mg²⁺ complex is approximately 0.1 mM, and thus,
given that intracellular Mg²⁺ concentrations are typically in the millimolar
range, essentially all nucleoside triphosphates are present as NTP-Mg²⁺
complexes. Magnesium or manganese complexes of nucleoside triphosphates
are the true substrates for essentially all NTP-dependent enzymes.
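The claim about the predominance of the magnesium complex follows from the dissociation constant. The sketch below estimates the fraction of ATP present as the complex, assuming simple 1:1 binding, a known free (not total) metal ion concentration, and no competing ligands; these assumptions and the function name are ours, not part of the measurements quoted above.

```python
def fraction_bound(free_mg_mM: float, kd_mM: float = 0.1) -> float:
    """Fraction of nucleotide present as the metal complex for 1:1 binding.

    fraction = [Mg] / ([Mg] + Kd), with both values in millimolar.
    The default Kd of 0.1 mM is the approximate value quoted in the text.
    """
    if free_mg_mM < 0 or kd_mM <= 0:
        raise ValueError("concentrations must be non-negative and Kd positive")
    return free_mg_mM / (free_mg_mM + kd_mM)

# Illustrative free metal ion concentrations in the millimolar range.
for mg in (0.5, 1.0, 5.0):
    print(f"{mg:.1f} mM free metal ion -> {fraction_bound(mg):.1%} complexed")
```

With 1 mM free metal ion, roughly 90% of the nucleotide is complexed under this model, and the fraction rises further at higher concentrations, consistent with the complex being the predominant species in the cell.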

The nucleophilic attack by a water molecule on the γ-phosphoryl group
requires some mechanism such as a basic residue or a bound metal ion to
activate the water. Examination of the myosin-ATP complex structure
shows no basic residue in an appropriate position and reveals that the
bound Mg²⁺ ion is too far away from the phosphoryl group to play this
role. These observations suggest why this ATP complex is relatively stable:
the enzyme is not in a conformation that is competent to catalyze the reaction.
It follows that the domain must undergo a conformational
change to catalyze the ATP-hydrolysis reaction.


Formation of the transition state for ATP hydrolysis is associated 
with a substantial conformational change 


The catalytically competent conformation of the myosin ATPase domain 
must bind and stabilize the transition state of the reaction. In analogy with 
restriction enzymes, we expect that ATP hydrolysis includes a pentacoordinate
transition state.

Figure 9.46 Myosin ATPase Transition-State Analog. The structure of the transition-state
analog formed by treating the myosin ATPase domain with ADP and vanadate (VO₄³⁻) in the
presence of magnesium is shown. Notice that the vanadium ion is coordinated to five oxygen
atoms including one from ADP. The positions of two residues that bind magnesium as well as
Ser 236, a residue that appears to play a direct role in catalysis, are shown. [Drawn from 1VOM.pdb.]


Such pentacoordinate structures based on phosphorus are too unstable to be 
readily observed. However, transition-state analogs in which other atoms 
replace phosphorus are more stable. The transition metal vanadium, in 
particular, forms similar structures. The myosin ATPase domain was crystallized
in the presence of ADP and vanadate, VO₄³⁻. The result was the
formation of a complex that closely matches the expected transition-state
structure (Figure 9.46). As expected, the vanadium atom is coordinated to
five oxygen atoms, including one oxygen atom from ADP diametrically
opposite an oxygen atom that is analogous to the attacking water molecule
in the transition state. The Mg²⁺ ion is coordinated to one oxygen atom
from the vanadate, one oxygen atom from the ADP, two hydroxyl groups 
from the enzyme, and two water molecules. In this position, this ion does 
not appear to play any direct role in activating the attacking water. However, 
an additional residue from the enzyme, Ser 236, is well positioned to play a 
role in catalysis (see Figure 9.46). In the proposed mechanism of ATP 
hydrolysis based on this structure, the water molecule attacks the γ-phosphoryl
group, with the hydroxyl group of Ser 236 facilitating the transfer of
a proton from the attacking water to the hydroxyl group of Ser 236, which,
in turn, is deprotonated by one of the oxygen atoms of the γ-phosphoryl
group (Figure 9.47). Thus, in effect, the ATP serves as a base to promote its
own hydrolysis. 

Comparison of the overall structures of the myosin ATPase domain
complexed with ATP and with the ADP-vanadate reveals some remarkable
differences. Around the active site, some residues shift slightly. In particular,
a stretch of amino acids moves closer to the nucleotide by approximately
2 Å and interacts with the oxygen atom that corresponds to the
attacking water molecule. These changes help facilitate the hydrolysis
reaction by stabilizing the transition state. However, examination of the
overall structure shows even more striking changes.

Figure 9.47 Facilitating Water Attack. The
water molecule attacking the γ-phosphoryl
group of ATP is deprotonated by the hydroxyl
group of Ser 236, which, in turn, is
deprotonated by one of the oxygen atoms of
the γ-phosphoryl group forming the HPO₄²⁻
product.

Figure 9.48 Myosin conformational changes. A comparison of the overall structures of the
myosin ATPase domain with ATP bound (shown in red) and that with the transition-state analog
ADP-vanadate (shown in blue). Notice the large conformational change of a region at the carboxyl-terminus
of the domain, some parts of which move as much as 25 Å. [Drawn from 1FMW.pdb and 1VOM.pdb.]

A region comprising approximately 60 amino acids at the carboxyl-terminus
of the domain adopts a different configuration in the ADP-vanadate
complex, displaced by as much as 25 Å from its position in the
ATP complex (Figure 9.48). This displacement tremendously amplifies the
relatively subtle changes that take place in the active site. The effect of this
motion is amplified even more as this carboxyl-terminal domain is connected
to other structures within the elongated structures typical of myosin molecules
(see Figure 9.44). Thus, the conformation that is capable of promoting
the ATP hydrolysis reaction is itself substantially different from other
conformations adopted in the course of the catalytic cycle.


The altered conformation of myosin persists for a 
substantial period of time 


Myosins are slow enzymes, typically turning over approximately once per 
second. What steps limit the rate of turnover? In an experiment that was 
particularly revealing, the hydrolysis of ATP was catalyzed by the myosin 
ATPase domain from mammalian muscle. The reaction took place in water 
labeled with ¹⁸O to track the incorporation of solvent oxygen into the reaction
products. The fraction of water-derived oxygen in the phosphate product was
analyzed. In the simplest case, the phosphate would be expected to contain one
oxygen atom derived from water and three initially present in the terminal 
phosphoryl group of ATP.

Instead, between two and three of the oxygen atoms in the phosphate were
found, on average, to be derived from water. These observations indicate
that the ATP hydrolysis reaction within the enzyme active site is reversible.
Each molecule of ATP is cleaved to ADP and Pᵢ and then re-formed from
these products several times before the products are released from the
enzyme (Figure 9.49). At first glance, this observation is startling because
ATP hydrolysis is a very favorable reaction with an equilibrium constant of
approximately 140,000. However, this equilibrium constant applies to the
molecules free in solution, not within the active site of an enzyme. Indeed,
more-extensive analysis suggests that this equilibrium constant on the
enzyme is approximately 10, indicative of a general strategy used by
enzymes. Enzymes catalyze reactions by stabilizing the transition state. The
structure of this transition state is intermediate between the enzyme-bound
reactants and the enzyme-bound products. Many of the interactions that
stabilize the transition state will help equalize the stabilities of the reactants
and the products. Thus, the equilibrium constant between enzyme-bound reactants
and products is often close to 1, regardless of the equilibrium constant for
the reactants and products free in solution.

Figure 9.49 Reversible hydrolysis of ATP
within the myosin active site. For myosin,
more than one atom of oxygen from water is
incorporated in inorganic phosphate. The
oxygen atoms are incorporated in cycles of
hydrolysis of ATP to ADP and inorganic
phosphate, phosphate rotation within the
active site, and reformation of ATP now
containing oxygen from water.
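The two equilibrium constants can be put on an energy scale with the standard relation ΔG° = −RT ln K. The sketch below assumes a temperature of 298 K, which is not stated in the text, and uses hypothetical names; it is meant only to show how large the difference between the free and enzyme-bound reactions is.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1

def standard_free_energy_kj(k_eq: float, temperature_k: float = 298.0) -> float:
    """Standard free-energy change (kJ/mol) from an equilibrium constant.

    Uses delta G = -R * T * ln(K). The 298 K default is an assumption.
    """
    if k_eq <= 0:
        raise ValueError("equilibrium constant must be positive")
    return -R * temperature_k * math.log(k_eq) / 1000.0

dg_solution = standard_free_energy_kj(140_000)  # ATP hydrolysis free in solution
dg_enzyme = standard_free_energy_kj(10)         # same reaction on the enzyme

print(f"free in solution: {dg_solution:.1f} kJ/mol")
print(f"enzyme-bound:     {dg_enzyme:.1f} kJ/mol")
```

Under these assumptions the reaction in solution is favorable by roughly 29 kJ/mol, whereas on the enzyme the difference between bound reactants and bound products is only about 6 kJ/mol, small enough for the reaction to run backward and forward many times before the products leave.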

These observations reveal that the hydrolysis of ATP to ADP and Pᵢ is
not the rate-limiting step for the reaction catalyzed by myosin. Instead, the
release of the products, particularly Pᵢ, from the enzyme is rate limiting. The
fact that a conformation of myosin with ATP hydrolyzed but still bound to 
the enzyme persists for a significant period of time is critical for coupling 
conformational changes that take place in the course of the reaction to other 
processes. 


Myosins are a family of enzymes containing P-loop structures 


X-ray crystallography has yielded the three-dimensional structures
of a number of different enzymes that share key structural characteristics
and, almost certainly, an evolutionary history with myosin. In particular,
a conserved NTP-binding core domain is present. This domain
consists of a central β sheet, surrounded on both sides by α helices
(Figure 9.50). A characteristic feature of this domain is a loop between the first
β strand and the first helix. This loop typically has several glycine residues
that are often conserved between more closely related members of this
large and diverse family. The loop is often referred to as the P-loop because
it interacts with phosphoryl groups on the bound nucleotide. P-loop
NTPase domains are present in a remarkably wide array of proteins, many
of which participate in essential biochemical processes. Examples include
ATP synthase, the key enzyme responsible for ATP generation; signal-transduction
proteins such as G proteins; proteins essential for translating
mRNA into proteins, such as elongation factor Tu; and DNA and RNA
unwinding helicases. The wide utility of P-loop NTPase domains is perhaps
best explained by their ability to undergo substantial conformational
changes on nucleoside triphosphate binding and hydrolysis. We shall
encounter these domains throughout the book and shall observe how they
function as springs, motors, and clocks. To allow easy recognition of these
domains in the book, they will be depicted with the inner surfaces of the
ribbons in a ribbon diagram shown in purple and the P-loop shown in
green (Figure 9.51).

Figure 9.50 The core domain of NMP
kinases. Notice the P-loop shown in green. The
dashed lines represent the remainder of the protein
structure. [Drawn from 1GKY.pdb.]


Figure 9.51 Three proteins containing P-loop NTPase domains. Notice the conserved
domains shown with the inner surfaces of the ribbons in purple and the P-loops in green.
Panels: adenylate kinase, α subunit of transducin, β subunit of ATP synthase. [Drawn
from 4AKE.pdb; 1TND.pdb; 1BMF.pdb.]


Summary 




Enzymes adopt conformations that are structurally and chemically 
complementary to the transition states of the reactions that they 
catalyze. Sets of interacting amino acid residues make up sites with 
the special structural and chemical properties necessary to stabilize 
the transition state. Enzymes use five basic strategies to form and stabilize
the transition state: (1) the use of binding energy, (2) covalent catalysis,
(3) general acid-base catalysis, (4) metal ion catalysis, and
(5) catalysis by approximation. The four classes of enzymes examined
in this chapter catalyze the addition of water to their substrates but have 
different requirements for catalytic speed, specificity, and coupling to 
other processes. 


9.1 Proteases Facilitate a Fundamentally Difficult Reaction

The cleavage of peptide bonds by chymotrypsin is initiated by the 
attack by a serine residue on the peptide carbonyl group. The attacking 
hydroxyl group is activated by interaction with the imidazole group of 
a histidine residue, which is, in turn, linked to an aspartate residue. 
This Ser-His-Asp catalytic triad generates a powerful nucleophile. The 
product of this initial reaction is a covalent intermediate formed by 
the enzyme and an acyl group derived from the bound substrate. The 
hydrolysis of this acyl-enzyme intermediate completes the cleavage 
process. The tetrahedral intermediates for these reactions have a nega- 
tive charge on the peptide carbonyl oxygen atom. This negative charge 
is stabilized by interactions with peptide NH groups in a region on the 
enzyme termed the oxyanion hole. 

Other proteases employ the same catalytic strategy. Some of these 
proteases, such as trypsin and elastase, are homologs of chymotrypsin. 
Other proteases, such as subtilisin, contain a very similar catalytic triad 
that has arisen by convergent evolution. Active-site structures that 
differ from the catalytic triad are present in a number of other classes 
of proteases. These classes employ a range of catalytic strategies but, 
in each case, a nucleophile is generated that is sufficiently powerful to 
attack the peptide carbonyl group. In some enzymes, the nucleophile is 
derived from a side chain whereas, in others, an activated water mol- 
ecule attacks the peptide carbonyl directly. 


9.2 Carbonic Anhydrases Make a Fast Reaction Faster

Carbonic anhydrases catalyze the reaction of water with carbon dioxide 
to generate carbonic acid. The catalysis can be extremely fast: some 
carbonic anhydrases hydrate carbon dioxide at rates as high as 1 million 
times per second. A tightly bound zinc ion is a crucial component of the 
active sites of these enzymes. Each zinc ion binds a water molecule and 
promotes its deprotonation to generate a hydroxide ion at neutral pH. 
This hydroxide ion attacks carbon dioxide to form bicarbonate ion,
HCO₃⁻. Because of the physiological roles of carbon dioxide and bicarbonate
ions, speed is of the essence for this enzyme. To overcome limitations
imposed by the rate of proton transfer from the zinc-bound
water molecule, the most-active carbonic anhydrases have evolved a 
proton shuttle to transfer protons to a buffer. 


9.3 Restriction Enzymes Catalyze Highly Specific DNA-Cleavage Reactions

A high level of substrate specificity is often the key to biological function.
Restriction endonucleases that cleave DNA at specific recognition
sequences discriminate between molecules that contain these recognition
sequences and those that do not. Within the enzyme-substrate complex,
the DNA substrate is distorted in a manner that generates a magnesium
ion-binding site between the enzyme and DNA. The magnesium ion
binds and activates a water molecule, which attacks the phosphodiester
backbone.

Some enzymes discriminate between potential substrates by binding 
them with different affinities. Others may bind many potential sub- 
strates but promote chemical reactions efficiently only on specific mol- 
ecules. Restriction endonucleases such as EcoRV endonuclease employ 
the latter mechanism. Only molecules containing the proper recognition 
sequence are distorted in a manner that allows magnesium ion binding 
and, hence, catalysis. Restriction enzymes are prevented from acting on 
the DNA of a host cell by the methylation of key sites within its recog- 
nition sequences. The added methyl groups block specific interactions 
between the enzymes and the DNA such that the distortion necessary 
for cleavage does not take place. 


9.4 Myosins Harness Changes in Enzyme Conformation to
Couple ATP Hydrolysis to Mechanical Work

Finally, myosins catalyze the hydrolysis of adenosine triphosphate
(ATP) to form adenosine diphosphate (ADP) and inorganic phosphate
(Pᵢ). The conformations of myosin ATPase domains free of bound
nucleotides and with bound ATP are quite similar. Through the use of
ADP and vanadate (VO₄³⁻), an excellent mimic of the transition state
for ATP hydrolysis bound to the myosin ATPase domain can be produced.
The structure of this complex reveals that dramatic conformational
changes take place on formation of this species from the ATP
complex. These conformational changes are used to drive substantial
motions in molecular motors. The rate of ATP hydrolysis by myosin is
relatively low and is limited by the rate of product release from the
enzyme. The hydrolysis of ATP to ADP and Pᵢ within the enzyme is
reversible with an equilibrium constant of approximately 10, compared
with an equilibrium constant of 140,000 for these species free in solution.
Myosins are examples of P-loop NTPase enzymes, a large collection
of protein families that play key roles in a range of biological processes
by virtue of the conformational changes that they undergo with
various nucleotides bound.
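
As a rough illustration of what these two equilibrium constants mean energetically, the sketch below converts each one into a standard free-energy change with the relation ΔG°′ = −RT ln K. The temperature of 298 K and the function name are assumptions made for this example, and the constants are treated as dimensionless numbers, as they are quoted in the text.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ mol^-1 K^-1


def delta_g_standard(k_eq, temperature_k=298.0):
    """Standard free-energy change (kJ/mol) for an equilibrium constant."""
    return -R_KJ * temperature_k * math.log(k_eq)


# Equilibrium constants quoted in the summary above.
dg_on_enzyme = delta_g_standard(10)          # hydrolysis within the myosin active site
dg_in_solution = delta_g_standard(140_000)   # same species free in solution

print(f"on enzyme:   {dg_on_enzyme:.1f} kJ/mol")
print(f"in solution: {dg_in_solution:.1f} kJ/mol")
print(f"difference:  {dg_on_enzyme - dg_in_solution:.1f} kJ/mol")
```

Under these assumptions the hydrolysis step is much less favorable on the enzyme than in solution, which fits the statement that the enzyme-bound reaction is readily reversible.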
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Problems 

1. No burst. Examination of the cleavage of the amide substrate,
A, by chymotrypsin with the use of stopped-flow
kinetic methods reveals no burst. The reaction is monitored
by noting the color produced by the release of the amino
part of the substrate (highlighted in orange). Why is no
burst observed?


2. Contributing to your own demise. Consider the subtilisin 
substrates A and B. 


A: Phe-Ala-Gln-Phe-X        B: Phe-Ala-His-Phe-X


These substrates are cleaved (between Phe and X) by native 
subtilisin at essentially the same rate. However, the His 
64-to-Ala mutant of subtilisin cleaves substrate B more 
than 1000-fold as rapidly as it cleaves substrate A. Propose 
an explanation. 


3. 1 + 1 ≠ 2. Consider the following argument. In subtilisin,
mutation of Ser 221 to Ala results in a 10⁶-fold decrease
in activity. Mutation of His 64 to Ala results in a similar
10⁶-fold decrease. Therefore, simultaneous mutation of
Ser 221 to Ala and His 64 to Ala should result in a 10⁶ ×
10⁶ = 10¹²-fold reduction in activity. Is this reduction
correct? Why or why not?


4. Adding a charge. In chymotrypsin, a mutant was constructed
with Ser 189, which is in the bottom of the
substrate-specificity pocket, changed to Asp. What effect
would you predict for this Ser 189 → Asp 189 mutation?


5. Conditional results. In carbonic anhydrase II, mutation of 
the proton-shuttle residue His 64 to Ala was expected to 
result in a decrease in the maximal catalytic rate. However, 
in buffers such as imidazole with relatively small molecular 
components, no rate reduction was observed. In buffers 
with larger molecular components, significant rate reduc- 
tions were observed. Propose an explanation. 


6. How many sites? A researcher has isolated a restriction 
endonuclease that cleaves at only one particular 10-base- 
pair site. Would this enzyme be useful in protecting cells 
from viral infections, given that a typical viral genome is 
50,000 base pairs long? Explain. 


7. Is faster better? Restriction endonucleases are, in general,
quite slow enzymes with typical turnover numbers of 1 s⁻¹.
Suppose that endonucleases were faster with turnover
numbers similar to those for carbonic anhydrase (10⁶ s⁻¹).
Would this increased rate be beneficial to host cells, assuming
that the fast enzymes have similar levels of specificity?


8. Adopting a new gene. Suppose that one species of bacteria 
obtained one gene encoding a restriction endonuclease by 
horizontal gene transfer. Would you expect this acquisition 
to be beneficial? 


9. Chelation therapy. Treatment of carbonic anhydrase with 
high concentrations of the metal chelator EDTA (ethylene- 
diaminetetraacetic acid) results in the loss of enzyme activ- 
ity. Propose an explanation. 


10. An aldehyde inhibitor. Elastase is specifically inhibited 
by an aldehyde derivative of one of its substrates: 


[Structure: N-acetyl-Pro-Ala-Pro linked through an NH group to a
CH(CH3)-CHO aldehyde terminus]

(a) Which residue in the active site of elastase is most likely
to form a covalent bond with this aldehyde?
(b) What type of covalent link would be formed?


11. Identify the enzyme. Consider the structure of molecule 
A. Which enzyme discussed in the chapter do you think 
molecule A will most effectively inhibit? 


[Structure of molecule A]


12. Acid test. At pH 7.0, carbonic anhydrase exhibits a
kcat of 600,000 s⁻¹. Estimate the value expected for kcat
at pH 6.0.


13. Restriction. To terminate a reaction in which a restriction
enzyme cleaves DNA, researchers often add high concentrations
of the metal chelator EDTA (ethylenediaminetetraacetic
acid). Why does the addition of EDTA
terminate the reaction?


14. Labeling strategy. ATP is added to the myosin ATPase
domain in water labeled with ¹⁸O. After 50% of the ATP
has been hydrolyzed, the remaining ATP is isolated and
found to contain ¹⁸O. Explain.


15. Viva le resistance. Many patients become resistant to
HIV protease inhibitors with the passage of time owing to
mutations in the HIV gene that encodes the protease.
Mutations are not found in the aspartate residue that interacts
with the drugs. Why not?


16. More than one way to skin kcat. Serine 236 in Dictyostelium
discoideum myosin has been mutated to alanine. The
mutated protein showed modestly reduced ATPase activity.
Analysis of the crystal structure of the mutated protein
revealed that a water molecule occupied the position of
the hydroxyl group of the serine residue in the wild-type
protein. Propose a mechanism for the ATPase activity of
the mutated enzyme.


Mechanism Problem 


17. Complete the mechanism. On the basis of the informa- 
tion provided in Figure 9.17, complete the mechanisms for 
peptide-bond cleavage by (a) a cysteine protease, (b) an 
aspartyl protease, and (c) a metalloprotease. 


Regulatory Strategies 


The activity of enzymes must often be regulated so that they function at
the proper time and place. This regulation is essential for coordination of
the vast array of biochemical processes taking place at any instant in an
organism. Enzymatic activity is regulated in five principal ways:


1. Allosteric Control. Allosteric proteins contain distinct regulatory sites 
and multiple functional sites. The binding of small signal molecules at regu- 
latory sites is a significant means of controlling the activity of these proteins. 
Moreover, allosteric proteins show the property of cooperativity: activity at 
one functional site affects the activity at others. Proteins displaying allo- 
steric control are thus information transducers: their activity can be modified 
in response to signal molecules or to information shared among active sites. 
This chapter examines one of the best-understood allosteric proteins: the 
enzyme aspartate transcarbamoylase (ATCase). Catalysis by aspartate trans- 
carbamoylase of the first step in pyrimidine biosynthesis is inhibited by 
cytidine triphosphate, the final product of that biosynthesis, in an example 
of feedback inhibition. We have already examined an allosteric protein— 
hemoglobin, the oxygen transport protein in the blood (Chapter 7). 


2. Multiple Forms of Enzymes. Isozymes, or isoenzymes, provide an avenue
for varying regulation of the same reaction at distinct locations or times to
meet the specific physiological needs in the particular tissue at a particular
time. Isozymes are homologous enzymes within a single organism that
catalyze the same reaction but differ slightly in structure and more obviously
in K_M and V_max values as well as in regulatory properties. Often,
isozymes are expressed in a distinct tissue or organelle or at a distinct stage
of development.

Like motor traffic, metabolic pathways flow more efficiently when
regulated by signals. Cytidine triphosphate (CTP), the final product
of a multistep pathway, controls flux through the pathway by
inhibiting the committed step catalyzed by aspartate
transcarbamoylase (ATCase). [(Left) Michael Winokur Photography/
Getty Images.]


OUTLINE

10.1 Aspartate Transcarbamoylase Is Allosterically Inhibited by the
End Product of Its Pathway

10.2 Isozymes Provide a Means of Regulation Specific to Distinct
Tissues and Developmental Stages

10.3 Covalent Modification Is a Means of Regulating Enzyme Activity

10.4 Many Enzymes Are Activated by Specific Proteolytic Cleavage


3. Reversible Covalent Modification. The catalytic properties of many 
enzymes are markedly altered by the covalent attachment of a modifying 
group, most commonly a phosphoryl group. ATP serves as the phosphoryl 
donor in these reactions, which are catalyzed by protein kinases. The removal 
of phosphoryl groups by hydrolysis is catalyzed by protein phosphatases. 
This chapter considers the structure, specificity, and control of protein 
kinase A (PKA), a ubiquitous eukaryotic enzyme that regulates diverse tar- 
get proteins. 


4. Proteolytic Activation. The enzymes controlled by some of these regulatory
mechanisms cycle between active and inactive states. A different regulatory
strategy is used to irreversibly convert an inactive enzyme into an
active one. Many enzymes are activated by the hydrolysis of a few peptide
bonds or even one such bond in inactive precursors called zymogens or
proenzymes. This regulatory mechanism generates digestive enzymes such
as chymotrypsin, trypsin, and pepsin. Blood clotting is due to a remarkable
cascade of zymogen activations. Active digestive and clotting enzymes are
switched off by the irreversible binding of specific inhibitory proteins that
are irresistible lures to their molecular prey.


5. Controlling the Amount of Enzyme Present. Enzyme activity can also be 
regulated by adjusting the amount of enzyme present. This important form 
of regulation usually takes place at the level of transcription. We will con- 
sider the control of gene transcription in Chapter 31. 


To begin, we will consider the principles of allostery by examining
the enzyme aspartate transcarbamoylase. 


10.1 Aspartate Transcarbamoylase Is Allosterically Inhibited 
by the End Product of Its Pathway 


Aspartate transcarbamoylase catalyzes the first step in the biosynthesis of
pyrimidines: the condensation of aspartate and carbamoyl phosphate to
form N-carbamoylaspartate and orthophosphate (Figure 10.1). This reaction
is the committed step in the pathway that will ultimately yield pyrimidine
nucleotides such as cytidine triphosphate (CTP). How is this enzyme regulated
to generate precisely the amount of CTP needed by the cell?

Figure 10.1 ATCase reaction. Aspartate
transcarbamoylase catalyzes the committed
step, the condensation of aspartate and
carbamoyl phosphate to form
N-carbamoylaspartate, in pyrimidine synthesis.

John Gerhart and Arthur Pardee found that ATCase is inhibited by 
CTP, the final product of the ATCase-initiated pathway. The rate of the 
reaction catalyzed by ATCase is fast at low concentrations of CTP but 
slows as CTP concentration increases (Figure 10.2). Thus, the pathway 
continues to make new pyrimidines until sufficient quantities of CTP have 
accumulated. The inhibition of ATCase by CTP is an example of feedback 
inhibition, the inhibition of an enzyme by the end product of the pathway. 
Feedback inhibition by CTP ensures that N-carbamoylaspartate and 
subsequent intermediates in the pathway are not needlessly formed when 
pyrimidines are abundant. 

The inhibitory ability of CTP is remarkable because CTP is structurally 
quite different from the substrates of the reaction (see Figure 10.1). Thus CTP 
must bind to a site distinct from the active site at which substrate binds. 
Such sites are called allosteric or regulatory sites. CTP is an example of an 
allosteric inhibitor. In ATCase (but not all allosterically regulated enzymes), 
the catalytic sites and the regulatory sites are on separate polypeptide 
chains. 


Allosterically regulated enzymes do not follow
Michaelis-Menten kinetics


Allosteric enzymes are distinguished by their response to changes in sub- 
strate concentration in addition to their susceptibility to regulation by other 
molecules. Let us examine the rate of product formation as a function of 
substrate concentration for ATCase (Figure 10.3). The curve differs from 
that expected for an enzyme that follows Michaelis-Menten kinetics. The 
observed curve is referred to as sigmoidal because it resembles the letter 
“S.” The vast majority of allosteric enzymes display sigmoidal kinetics. 
Recall from the discussion of hemoglobin that sigmoidal curves result from 
cooperation between subunits: the binding of substrate to one active site in 
a molecule increases the likelihood that substrate will bind to other active 
sites. To understand the basis of sigmoidal enzyme kinetics and inhibition 
by CTP, we need to examine the structure of ATCase. 
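
To make the contrast between hyperbolic and sigmoidal curves concrete, the sketch below compares a Michaelis-Menten curve with a curve from the phenomenological Hill equation. The Hill equation is not the model developed in this chapter; it is used here only as a convenient, assumed stand-in for cooperative behavior, and the function names are hypothetical. The helper `s90_over_s10` gives the fold increase in substrate needed to go from 10% to 90% of maximal velocity.

```python
def michaelis_menten(s, v_max, k_m):
    """Hyperbolic rate law: v = Vmax [S] / (K_M + [S])."""
    return v_max * s / (k_m + s)


def hill(s, v_max, k_half, n):
    """Hill rate law: v = Vmax [S]^n / (K^n + [S]^n); n > 1 gives a sigmoidal curve."""
    return v_max * s**n / (k_half**n + s**n)


def s90_over_s10(n):
    """Fold change in [S] needed to raise v from 10% to 90% of Vmax."""
    return 81.0 ** (1.0 / n)


# Arbitrary illustrative parameters (not measured values for ATCase).
for s in (0.5, 1.0, 2.0, 4.0, 8.0):
    v_hyperbolic = michaelis_menten(s, v_max=1.0, k_m=2.0)
    v_sigmoidal = hill(s, v_max=1.0, k_half=2.0, n=4)
    print(f"[S]={s:4.1f}  hyperbolic={v_hyperbolic:.3f}  sigmoidal={v_sigmoidal:.3f}")

print("fold change in [S], n=1:", s90_over_s10(1))
print("fold change in [S], n=4:", s90_over_s10(4))
```

With n = 1 the Hill form reduces to the Michaelis-Menten equation, so the two functions differ only in the exponent; a larger exponent compresses the substrate range over which the rate rises.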


ATCase consists of separable catalytic and regulatory subunits 


What is the evidence that ATCase has distinct regulatory and catalytic 
sites? ATCase can be literally separated into regulatory (r) and catalytic (c) 
subunits by treatment with a mercurial compound such as p-hydroxymer- 
curibenzoate, which reacts with sulfhydryl groups (Figure 10.4). 
Ultracentrifugation following treatment with mercurials revealed that 
ATCase is composed of two kinds of subunits (Figure 10.5). The subunits 
can be readily separated by ion-exchange chromatography because they dif- 
fer markedly in charge or by centrifugation in a sucrose density gradient 
because they differ in size. These size differences are manifested in the sedimentation
coefficients: that of the native enzyme is 11.6S, whereas those of
the dissociated subunits are 2.8S and 5.8S. The attached p-mercuribenzoate
groups can be removed from the separated subunits by adding an excess of 
mercaptoethanol, providing isolated subunits for study. 

Figure 10.2 CTP inhibits ATCase.
Cytidine triphosphate, an end product of the
pyrimidine-synthesis pathway, inhibits aspartate
transcarbamoylase despite having little
structural similarity to reactants or products.

Figure 10.3 ATCase displays sigmoidal
kinetics. A plot of product formation as a
function of substrate concentration produces
a sigmoidal curve because the binding of
substrate to one active site increases the
activity at the other active sites. Thus, the
enzyme shows cooperativity.

Figure 10.4 Modification of cysteine
residues. p-Hydroxymercuribenzoate reacts
with crucial cysteine residues in aspartate
transcarbamoylase.

Figure 10.5 Ultracentrifugation studies
of ATCase. Sedimentation velocity patterns
of (A) native ATCase and (B) the enzyme
after treatment with p-hydroxymercuribenzoate
show that the enzyme can be dissociated
into regulatory (r) and catalytic (c) subunits.
[After J. C. Gerhart and H. K. Schachman.
Biochemistry 4:1054-1062, 1965.]

Figure 10.6 Structure of ATCase.
(A) The quaternary structure of aspartate
transcarbamoylase as viewed from the top.
The drawing in the center is a simplified
representation of the relations between
subunits. A single catalytic trimer [catalytic (c)
chains, shown in yellow] is visible; in this view,
the second trimer is hidden below the one
visible. Notice that each r chain interacts with
a c chain through the zinc domain. (B) A side
view of the complex. [Drawn from 1RAI.pdb.]

The larger subunit is the catalytic subunit. This subunit displays catalytic
activity but is unresponsive to CTP and does not display sigmoidal
kinetics. The isolated smaller subunit can bind CTP, but has no catalytic
activity. Hence, that subunit is the regulatory subunit. The catalytic subunit
(c₃) consists of three chains (34 kd each), and the regulatory subunit (r₂)
consists of two chains (17 kd each). The catalytic and regulatory subunits
combine rapidly when they are mixed. The resulting complex has the same
structure, c₆r₆, as the native enzyme: two catalytic trimers and three regulatory
dimers.


2 c₃ + 3 r₂ → c₆r₆


Most strikingly, the reconstituted enzyme has the same allosteric and kinetic 
properties as those of the native enzyme. Thus, ATCase is composed of 
discrete catalytic and regulatory subunits, and the interaction of the subunits 
in the native enzyme produces its regulatory and catalytic properties. 
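
The subunit bookkeeping described above can be checked with a few lines of arithmetic. The sketch below uses only the chain masses and stoichiometry given in the text (34 kd per c chain, 17 kd per r chain, two trimers and three dimers); the function name is hypothetical, and the resulting total is simply the sum of the chain masses, not a measured value.

```python
CATALYTIC_CHAIN_KD = 34   # mass of one c chain, from the text
REGULATORY_CHAIN_KD = 17  # mass of one r chain, from the text


def holoenzyme_composition(n_catalytic_trimers=2, n_regulatory_dimers=3):
    """Return (c chains, r chains, summed mass in kd) for the assembled enzyme."""
    c_chains = n_catalytic_trimers * 3
    r_chains = n_regulatory_dimers * 2
    mass_kd = c_chains * CATALYTIC_CHAIN_KD + r_chains * REGULATORY_CHAIN_KD
    return c_chains, r_chains, mass_kd


c_chains, r_chains, mass_kd = holoenzyme_composition()
print(f"c chains: {c_chains}, r chains: {r_chains}, summed mass: {mass_kd} kd")
```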


Allosteric interactions in ATCase are mediated by large changes in 
quaternary structure 


What are the subunit interactions that account for the properties of 
ATCase? Significant clues have been provided by the three-dimensional 
structure of ATCase in various forms. Two catalytic trimers are stacked 
one on top of the other, linked by three dimers of the regulatory chains 
(Figure 10.6). There are significant contacts between the catalytic and the
regulatory subunits: each r chain within a regulatory dimer interacts with a
c chain within a catalytic trimer. The c chain makes contact with a structural
domain in the r chain that is stabilized by a zinc ion bound to four
cysteine residues. The mercurial compound p-hydroxymercuribenzoate is
able to dissociate the catalytic and regulatory subunits because mercury
binds strongly to the cysteine residues, displacing the zinc and destabilizing
this r-subunit domain.

To locate the active sites, the enzyme was crystallized in the presence of
N-(phosphonacetyl)-L-aspartate (PALA), a bisubstrate analog (an analog
of the two substrates) that resembles an intermediate along the pathway of
catalysis (Figure 10.7). PALA is a potent competitive inhibitor of ATCase;
it binds to the active sites and blocks them. The structure of the ATCase-PALA
complex reveals that PALA binds at sites lying at the boundaries
between pairs of c chains within a catalytic trimer (Figure 10.8). Each catalytic
trimer contributes three active sites to the complete enzyme. Further
examination of the ATCase-PALA complex reveals a remarkable change in
quaternary structure on binding of PALA. The two catalytic trimers move
12 Å farther apart and rotate approximately 10 degrees about their common
threefold axis of symmetry. Moreover, the regulatory dimers rotate approximately
15 degrees to accommodate this motion (Figure 10.9). The enzyme
literally expands on PALA binding. In essence, ATCase has two distinct
quaternary forms: one that predominates in the absence of substrate or substrate
analogs and another that predominates when substrates or analogs are
bound. We call these forms the T (for tense) state and the R (for relaxed)
state, respectively, as we did for the two quaternary states of hemoglobin.

Figure 10.7 PALA, a bisubstrate analog.
(Top) Nucleophilic attack by the amino
group of aspartate on the carbonyl carbon
atom of carbamoyl phosphate generates an
intermediate on the pathway to the
formation of N-carbamoylaspartate. (Bottom)
N-(Phosphonacetyl)-L-aspartate (PALA) is an
analog of the reaction intermediate and a
potent competitive inhibitor of aspartate
transcarbamoylase.

Figure 10.8 The active
site of ATCase. Some of the
crucial active-site residues are
shown binding to the inhibitor
PALA (shaded gray). Notice that
the active site is composed
mainly of residues from one c
chain, but an adjacent c chain
also contributes important
residues (boxed in green).
[Drawn from 8ATC.pdb.]

Figure 10.9 The T-to-R state transition
in ATCase. Aspartate transcarbamoylase
exists in two conformations: a compact,
relatively inactive form called the tense (T)
state and an expanded form called the
relaxed (R) state. Notice that the structure of
ATCase changes dramatically in the transition
from the T state to the R state. PALA binding
stabilizes the R state.

How can we explain the enzyme’s sigmoidal kinetics in light of the 
structural observations? Like hemoglobin, the enzyme exists in an equilib- 
rium between the T state and the R state. In the absence of substrate, almost 
all the enzyme molecules are in the T state. The T state has a low affinity for 
substrate and hence shows a low catalytic activity. The occasional binding 
of a substrate molecule to one active site in an enzyme increases the likeli- 
hood that the entire enzyme shifts to the R state with its higher binding 
affinity. The addition of more substrate has two effects. First, it increases 
the probability that each enzyme molecule will bind at least one substrate 
molecule. Second, it increases the average number of substrate molecules 
bound to each enzyme. The presence of additional substrate will increase 
the fraction of enzyme molecules in the more active R state because the 
position of the equilibrium depends on the number of active sites that are 
occupied by substrate. We considered this property, called cooperativity 
because the subunits cooperate with one another, when we discussed the 
sigmoidal oxygen-binding curve of hemoglobin. The effects of substrates 
on allosteric enzymes are referred to as homotropic effects (from the Greek 
homos, “same”).

This mechanism for allosteric regulation is referred to as the concerted 
mechanism because the change in the enzyme is “all or none”; the entire 
enzyme is converted from T into R, affecting all of the catalytic sites equally. 
In contrast, the sequential model assumes that the binding of ligand to one 
site on the complex can affect neighboring sites without causing all subunits 
to undergo the T-to-R transition. Although the concerted mechanism 
explains the behavior of ATCase well, most other allosteric enzymes have 
features of both models. 
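
The "all or none" equilibrium of the concerted model can be sketched numerically. The example below computes the fraction of enzyme molecules in the R state as a function of α, the substrate concentration divided by the R-state dissociation constant. It uses the standard concerted (Monod-Wyman-Changeux) state function, with L = 200, c = 0.1, and n = 6 taken from the values quoted in the captions of Figures 10.12 and 10.15; the function name is hypothetical, and treating these values as a description of ATCase is an assumption of this illustration.

```python
def fraction_in_r_state(alpha, big_l=200.0, c=0.1, n=6):
    """Fraction of enzyme in the R state in the concerted model.

    alpha : [S] divided by the dissociation constant for the R state
    big_l : [T]/[R] ratio in the absence of substrate
    c     : ratio of the R-state to the T-state dissociation constant
    n     : number of substrate-binding sites per enzyme molecule
    """
    r_weight = (1.0 + alpha) ** n
    t_weight = big_l * (1.0 + c * alpha) ** n
    return r_weight / (r_weight + t_weight)


for alpha in (0.0, 0.5, 1.0, 2.0, 5.0, 10.0):
    print(f"alpha={alpha:5.1f}  fraction in R state={fraction_in_r_state(alpha):.3f}")
```

With no substrate the R fraction is 1/(1 + L), so almost all molecules are in the T state; the fraction then rises steeply over a narrow range of α as more sites become occupied.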

The sigmoidal curve for ATCase can be pictured as a composite of two 
Michaelis-Menten curves, one corresponding to the T state and the other to 
the R state. An increase in substrate concentration favors a transition from 
the T-state curve to the R-state curve (Figure 10.10). Note that such sig- 
moidal behavior has an additional consequence: in the concentration range 
at which the T-to-R transition is taking place, the curve depends quite 
steeply on the substrate concentration. The enzyme is switched from a less
active state to a more active state within a narrow range of substrate concentration.
This behavior is beneficial when a response to small changes in
substrate concentration is physiologically important.

Figure 10.10 Basis for the sigmoidal
curve. The generation of the sigmoidal curve
by the property of cooperativity can be
understood by imagining an allosteric enzyme
as a mixture of two Michaelis-Menten
enzymes, one with a high value of K_M that
corresponds to the T state and another with a
low value of K_M that corresponds to the R state.
As the concentration of substrate is increased,
the equilibrium shifts from the T state to the R
state, which results in a steep rise in activity
with respect to substrate concentration.

In studies of the isolated catalytic trimer, the catalytic subunit shows 
the hyperbolic curve characteristic of Michaelis-Menten kinetics, which is 
indistinguishable from the curve deduced for the R state (see Figure 10.10). 
Thus, the term tense is apt: in the T state, the regulatory dimers hold the two 
catalytic trimers sufficiently close to each other that key loops on their sur- 
faces collide and interfere with conformational adjustments necessary for 
high-affinity substrate binding and catalysis. 


Allosteric regulators modulate the T-to-R equilibrium 


We now turn our attention to the effects of CTP. As noted earlier, CTP 
inhibits the action of ATCase. X-ray studies of ATCase in the presence of 
CTP revealed (1) that the enzyme is in the T state when bound to CTP and 
(2) that a binding site for this nucleotide exists in each regulatory chain in a 
domain that does not interact with the catalytic subunit (Figure 10.11). 
Each active site is more than 50 Å from the nearest CTP-binding site. The
question naturally arises, How can CTP inhibit the catalytic activity of the 
enzyme when it does not interact with the catalytic chain? 

The quaternary structural changes observed on substrate-analog bind- 
ing suggest a mechanism for inhibition by CTP (Figure 10.12). The binding 
of the inhibitor CTP shifts the equilibrium toward the T state, decreasing net 
enzyme activity. The binding of CTP makes it more difficult for substrate 
binding to convert the enzyme into the R state. Consequently, CTP prolongs
the initial, low-activity phase of the sigmoidal curve (Figure 10.13): more
substrate is required to attain a given reaction rate.


[Diagram: T state (less active), favored by CTP binding ⇌ R state (more active),
favored by substrate binding]


Figure 10.12 The R state and the T state are in equilibrium. Even in the absence of any 
substrate or regulators, aspartate transcarbamoylase exists in equilibrium between the R and the T 
states. Under these conditions, the T state is favored by a factor of approximately 200. 
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Figure 10.11 CTP stabilizes the T state. 
The binding of CTP to the regulatory subunit 
of aspartate transcarbamoylase stabilizes the 
T state. 


[Plot: rate of N-carbamoylaspartate formation versus [aspartate] in mM (ticks at
10 and 20), with a second curve for +0.4 mM CTP]


Figure 10.13 Effect of CTP on ATCase 
kinetics. Cytidine triphosphate (CTP) stabilizes 
the T state of aspartate transcarbamoylase, 
making it more difficult for substrate binding 
to convert the enzyme into the R state. As a 
result, the curve is shifted to the right, as 
shown in red. 
Figure 10.15 Quantitative description of
the concerted model. In this description of
the concerted model, fractional activity, Y, is
the fraction of active sites bound to substrate
and is directly proportional to reaction velocity;
α is the ratio of [S] to the dissociation
constant of S with the enzyme in the R state;
and L is the ratio of the concentration of
enzyme in the T state to that in the R state.
The binding of the regulators ATP and CTP to
ATCase changes the value of L and thus the
response to substrate concentration.

To construct these curves, the formula
describing the concerted model in the
appendix to Chapter 7 was used, with
c = 0.1 and n = 6.
Figure 10.14 Effect of ATP on ATCase
kinetics. ATP is an allosteric activator of aspartate
transcarbamoylase because it stabilizes the R
state, making it easier for substrate to bind. As
a result, the curve is shifted to the left, as
shown in blue. [Plot x-axis: [Aspartate], mM (0 to 20).]


Interestingly, ATP, too, is an allosteric effector of ATCase. However, 
the effect of ATP is to increase the reaction rate at a given aspartate concen- 
tration (Figure 10.14). At high concentrations of ATP, the kinetic profile 
shows a less-pronounced sigmoidal behavior. ATP competes with CTP for 
binding to regulatory sites. Consequently, high levels of ATP prevent CTP 
from inhibiting the enzyme. The effects of nonsubstrate molecules on allos- 
teric enzymes (such as those of CTP and ATP on ATCase) are referred to 
as heterotropic effects (from the Greek héteros, “different”). Substrates generate
the sigmoidal curve (homotropic effects), whereas regulators shift the
KM (heterotropic effects). Note, however, that both types of effect are generated
by altering the T/R ratio.

The increase in ATCase activity in response to increased ATP con- 
centration has two potential physiological explanations. First, high ATP 
concentration signals a high concentration of purine nucleotides in the cell; 
the increase in ATCase activity will tend to balance the purine and pyrimi- 
dine pools. Second, a high concentration of ATP indicates that energy 
is available for mRNA synthesis and DNA replication and leads to the 
synthesis of pyrimidines needed for these processes. 

The Appendix to Chapter 7 includes a quantitative description of the 
concerted model. Although developed to describe a binding process, the 
model also applies to enzyme activity because the fraction of enzymatic 
active sites with substrate bound is proportional to enzymatic activity. A 
key aspect of this model is the equilibrium between the T and the R states. 
We defined L as the equilibrium constant between the R and the T forms. 


$$\mathrm{R} \rightleftharpoons \mathrm{T} \qquad L = \frac{[\mathrm{T}]}{[\mathrm{R}]}$$


The effects of CTP and ATP can be modeled simply by changing the 
value of L. For the CTP-saturated form, the value of L increases from 
200 to 1250. Thus, more substrate is required to shift the equilibrium 
appreciably to the R form. For the ATP-saturated form, the value of 
L decreases to 70 (Figure 10.15). 

Thus, the concerted model provides us with a good description of the 
kinetic behavior of ATCase in the presence of its key regulators. 
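
As a rough numerical illustration of how changing L alone shifts the response to substrate, the sketch below evaluates fractional activity Y for the three L values quoted above. It assumes the standard Monod-Wyman-Changeux form of the concerted-model equation, Y = [α(1+α)^(n−1) + Lcα(1+cα)^(n−1)] / [(1+α)^n + L(1+cα)^n], with c = 0.1 and n = 6 as stated for Figure 10.15. The function name `fractional_activity`, the reading of c as the ratio of R-state to T-state dissociation constants, and the sample α values are illustrative assumptions, not taken from the text.

```python
def fractional_activity(alpha, L, c=0.1, n=6):
    """Fractional saturation Y under the concerted (MWC) model.

    alpha: [S] divided by the dissociation constant for the R state
    L:     [T]/[R] in the absence of substrate
    c:     ratio of R-state to T-state dissociation constants (assumed meaning)
    n:     number of active sites
    """
    numerator = (alpha * (1 + alpha) ** (n - 1)
                 + L * c * alpha * (1 + c * alpha) ** (n - 1))
    denominator = (1 + alpha) ** n + L * (1 + c * alpha) ** n
    return numerator / denominator


# L values quoted in the text for ATCase
L_VALUES = {"ATP-saturated": 70, "no regulator": 200, "CTP-saturated": 1250}

if __name__ == "__main__":
    for alpha in (0.5, 1.0, 2.0, 5.0):  # arbitrary sample points
        row = ", ".join(
            f"{label}: Y = {fractional_activity(alpha, L):.3f}"
            for label, L in L_VALUES.items()
        )
        print(f"alpha = {alpha}: {row}")
```

At any fixed α, the computed Y is highest for L = 70 and lowest for L = 1250, which matches the leftward shift with ATP and the rightward shift with CTP described in the text.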


10.2 Isozymes Provide a Means of Regulation Specific to 
Distinct Tissues and Developmental Stages 


Isozymes, or isoenzymes, are enzymes that differ in amino acid sequence yet 
catalyze the same reaction. Usually, these enzymes display different kinetic 
parameters, such as KM, or respond to different regulatory molecules. They
are encoded by different genes, which usually arise through gene duplication
and divergence. Isozymes can often be distinguished from one another by
biochemical properties such as electrophoretic mobility. 

The existence of isozymes permits the fine-tuning of metabolism to meet the 
needs of a given tissue or developmental stage. Consider the example of lactate 
dehydrogenase (LDH), an enzyme that catalyzes a step in anaerobic glucose 
metabolism and glucose synthesis. Human beings have two isozymic poly- 
peptide chains for this enzyme: the H isozyme is highly expressed in heart 
muscle and the M isozyme is expressed in skeletal muscle. The amino acid 
sequences are 75% identical. Each functional enzyme is tetrameric, and 
many different combinations of the two isozymic polypeptide chains are 
possible. The H4 isozyme, found in the heart, has a higher affinity for sub- 
strates than does the M4 isozyme. The two isozymes also differ in that high 
levels of pyruvate allosterically inhibit the H4 but not the M4 isozyme. The
other combinations, such as H3M, have intermediate properties. We will 
consider these isozymes in their biological context in Chapter 16. 
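
To make "many different combinations" concrete, the short sketch below enumerates the distinct subunit compositions of a tetramer built from two chain types. It counts compositions only and ignores the spatial arrangement of subunits; the function name `ldh_tetramers` is an illustrative choice.

```python
from itertools import combinations_with_replacement


def ldh_tetramers(chains="HM", size=4):
    """List the distinct subunit compositions of a tetramer, e.g. H3M."""
    names = []
    for combo in combinations_with_replacement(chains, size):
        name = ""
        for chain in chains:
            count = combo.count(chain)
            if count == 1:
                name += chain            # a single copy is written without a number
            elif count > 1:
                name += f"{chain}{count}"
        names.append(name)
    return names


if __name__ == "__main__":
    print(ldh_tetramers())
```

Under this counting, two chain types give five compositions, running from the all-H tetramer to the all-M tetramer.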

The M4 isozyme functions optimally in the anaerobic environment
of hard-working skeletal muscle, whereas the H4 isozyme does so in the 
aerobic environment of heart muscle. Indeed, the proportions of these 
isozymes change throughout the development of the rat heart as the tissue 
switches from an anaerobic environment to an aerobic one (Figure 10.16A). 
Figure 10.16B shows the tissue-specific forms of lactate dehydrogenase in 
adult rat tissues. 


The appearance of some isozymes in the blood is a sign of tissue
damage, useful for clinical diagnosis. For instance, an increase in
serum levels of H4 relative to H3M is an indication that a myocardial infarction,
or heart attack, has damaged heart-muscle cells, leading to the release
of cellular material.


10.3 Covalent Modification Is a Means of Regulating 
Enzyme Activity 


The covalent attachment of a molecule to an enzyme or protein can modify 
its activity. In these instances, a donor molecule provides the functional 
moiety being attached. Most modifications are reversible. Phosphorylation 
and dephosphorylation are the most common means of covalent modifi- 
cation. The attachment of acetyl groups and their removal are another 
common means. Histones—proteins that are packaged with DNA into 
chromosomes—are extensively acetylated and deacetylated in vivo on 
lysine residues (Section 31.3). More heavily acetylated histones are associ- 
ated with genes that are being actively transcribed. The acetyltransferase 


Figure 10.16 Isozymes of lactate
dehydrogenase. (A) The rat heart lactate
dehydrogenase (LDH) isozyme profile
changes in the course of development. The H
isozyme is represented by squares and the M
isozyme by circles. The negative and positive
numbers denote the days before and after
birth, respectively. (B) LDH isozyme content
varies by tissue. [(A) After W.-H. Li, Molecular
Evolution (Sinauer, 1997), p. 283; (B) after
K. Urich, Comparative Animal Biochemistry
(Springer Verlag, 1990), p. 542.]
[Structure: acetylated lysine]


Table 10.1 Common covalent modifications of protein activity


Modification      Donor molecule                         Example of modified protein  Protein function
Phosphorylation   ATP                                    Glycogen phosphorylase       Glucose homeostasis; energy transduction
Acetylation       Acetyl CoA                             Histones                     DNA packing; transcription
Myristoylation    Myristoyl CoA                          Src                          Signal transduction
ADP ribosylation  NAD⁺                                   RNA polymerase               Transcription
Farnesylation     Farnesyl pyrophosphate                 Ras                          Signal transduction
γ-Carboxylation   HCO₃⁻                                  Thrombin                     Blood clotting
Sulfation         3'-Phosphoadenosine-5'-phosphosulfate  Fibrinogen                   Blood-clot formation
Ubiquitination    Ubiquitin                              Cyclin                       Control of cell cycle


and deacetylase enzymes are themselves regulated by phosphorylation, 
showing that the covalent modification of a protein can be controlled by the 
covalent modification of the modifying enzymes. 

Modification is not readily reversible in some cases. The irreversible 
attachment of a lipid group causes some proteins in signal-transduction 
pathways, such as Ras (a GTPase) and Src (a protein tyrosine kinase), to 
become affixed to the cytoplasmic face of the plasma membrane. Fixed in 
this location, the proteins are better able to receive and transmit information 
that is being passed along their signaling pathways (Chapter 14). Mutations 
in both Ras and Src are seen in a wide array of cancers. The attachment of 
the small protein ubiquitin can signal that a protein is to be destroyed, the 
ultimate means of regulation (Chapter 23). The protein cyclin must be 
ubiquitinated and destroyed before a cell can enter anaphase and proceed 
through the cell cycle. 

Virtually all the metabolic processes that we will examine are regulated 
in part by covalent modification. Indeed, the allosteric properties of many 
enzymes are modified by covalent modification. Table 10.1 lists some of the 
common covalent modifications. 


Kinases and phosphatases control the extent of protein 
phosphorylation 


We will see phosphorylation used as a regulatory mechanism in virtually 
every metabolic process in eukaryotic cells. Indeed, as much as 30% of 
eukaryotic proteins are phosphorylated. The enzymes catalyzing phospho- 
rylation reactions are called protein kinases. These enzymes constitute one 
of the largest protein families known: there are more than 100 homologous 
protein kinases in yeast and more than 500 in human beings. This multi- 
plicity of enzymes allows regulation to be fine-tuned according to a specific 
tissue, time, or substrate. 

ATP is the most common donor of phosphoryl groups. The terminal (γ)
phosphoryl group of ATP is transferred to a specific amino acid of the 
acceptor protein or enzyme. In eukaryotes, the acceptor residue is com- 
monly one of the three containing a hydroxyl group in its side chain. 
Transfers to serine and threonine residues are handled by one class of protein 
kinases and to tyrosine residues by another. Tyrosine kinases, which are 
unique to multicellular organisms, play pivotal roles in growth regulation, 
and mutations in these enzymes are commonly observed in cancer cells. 
Table 10.2 lists a few of the known serine and threonine protein kinases.
The acceptors in protein-phosphorylation reactions are located inside cells, 
where the phosphoryl-group donor ATP is abundant. Proteins that are 
entirely extracellular are not regulated by reversible phosphorylation. 

Protein kinases vary in their degree of specificity. Dedicated protein 
kinases phosphorylate a single protein or several closely related ones. 
Multifunctional protein kinases modify many different targets; they have a 
wide reach and can coordinate diverse processes. Comparisons of amino 
acid sequences of many phosphorylation sites show that a multifunctional 
kinase recognizes related sequences. For example, the consensus sequence 
recognized by protein kinase A is Arg-Arg-X-Ser-Z or Arg-Arg-X-Thr-Z, 
in which X is a small residue, Z is a large hydrophobic one, and Ser or Thr 
is the site of phosphorylation. However, this sequence is not absolutely 
required. Lysine, for example, can substitute for one of the arginine resi- 
dues but with some loss of affinity. Short synthetic peptides containing a 
consensus motif are nearly always phosphorylated by serine-threonine
protein kinases. Thus, the primary determinant of specificity is the amino acid 
sequence surrounding the serine or threonine phosphorylation site. However, 
distant residues can contribute to specificity. For instance, a change in 
protein conformation can open or close access to a possible phosphoryla- 
tion site. 
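
The consensus rule described above can be sketched as a simple pattern search. The example below scans a one-letter amino acid sequence for Arg-Arg-X-Ser/Thr-Z. The residue sets chosen for "small" (X) and "large hydrophobic" (Z) are assumptions made for illustration, as is the function name `find_pka_sites`; the text does not define these sets, and real kinase recognition is less strict than an exact match (lysine can replace an arginine, for instance).

```python
import re

SMALL = "GAS"           # assumed set of "small" residues for position X
HYDROPHOBIC = "LIVMFW"  # assumed set of "large hydrophobic" residues for position Z

# A lookahead lets overlapping candidate sites be reported as well.
PKA_MOTIF = re.compile(rf"(?=(RR[{SMALL}][ST][{HYDROPHOBIC}]))")


def find_pka_sites(sequence):
    """Return (0-based index of the Ser/Thr acceptor, matched motif) pairs."""
    seq = sequence.upper()
    return [(m.start() + 3, m.group(1)) for m in PKA_MOTIF.finditer(seq)]


if __name__ == "__main__":
    print(find_pka_sites("LRRASLG"))  # short test peptide containing the motif
    print(find_pka_sites("RRGAI"))    # Ala in place of Ser: no acceptor residue
```

A sequence carrying alanine at the acceptor position, such as Arg-Arg-Gly-Ala-Ile, fits the flanking pattern but returns no site because there is no hydroxyl-bearing residue to phosphorylate.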


Table 10.2 Examples of serine and threonine kinases and their activating signals


Signal                        Enzyme
Cyclic nucleotides            Cyclic AMP-dependent protein kinase
                              Cyclic GMP-dependent protein kinase
Ca²⁺ and calmodulin           Ca²⁺-calmodulin protein kinase
                              Phosphorylase kinase or glycogen synthase kinase 2
AMP                           AMP-activated kinase
Diacylglycerol                Protein kinase C
Metabolic intermediates and   Many target-specific enzymes, such as pyruvate dehydrogenase
other “local” effectors       kinase and branched-chain ketoacid dehydrogenase kinase


Source: After D. Fell, Understanding the Control of Metabolism (Portland Press, 1997), Table 7.2.
[Free-energy diagram, listed from highest to lowest free energy: Protein-OH + ATP; Protein-OPO₃²⁻ + ADP; Protein-OH + HOPO₃²⁻]


Protein phosphatases reverse the effects of kinases by catalyzing the 
removal of phosphoryl groups attached to proteins. The enzyme hydrolyzes 
the bond attaching the phosphoryl group. 


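$$\underset{\text{Phosphorylated protein}}{\text{Protein-OPO}_3^{2-}} + \mathrm{H_2O} \xrightarrow{\text{Protein phosphatase}} \text{Protein-OH} + \underset{\text{Orthophosphate }(\mathrm{P_i})}{\mathrm{HOPO_3^{2-}}}$$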


The unmodified hydroxyl-containing side chain is regenerated and 
orthophosphate (Pi) is produced. These enzymes play a vital role in cells
because they turn off the signaling pathways that are activated by kinases. 
One class of highly conserved phosphatase called PP2A suppresses the 
cancer-promoting activity of certain kinases. 

Importantly, the phosphorylation and dephosphorylation reactions are 
not the reverse of one another; each is essentially irreversible under phys- 
iological conditions. Furthermore, both reactions take place at negligible 
rates in the absence of enzymes. Thus, phosphorylation of a protein sub- 
strate will take place only through the action of a specific protein kinase 
and at the expense of ATP cleavage, and dephosphorylation will take 
place only through the action of a phosphatase. The result is that target 
proteins cycle unidirectionally between unphosphorylated and phospho- 
rylated forms. The rate of cycling between the phosphorylated and the 
dephosphorylated states depends on the relative activities of kinases and 
phosphatases. 


Phosphorylation is a highly effective means of regulating the activities 
of target proteins 


Phosphorylation is a common covalent modification of proteins in all forms
of life, which leads to the question, What makes protein phosphorylation 
so valuable in regulating protein function that its use is ubiquitous? 
Phosphorylation is a highly effective means of controlling the activity of 
proteins for several reasons: 


1. The free energy of phosphorylation is large. Of the −50 kJ mol⁻¹
(−12 kcal mol⁻¹) provided by ATP, about half is consumed in making
phosphorylation irreversible; the other half is conserved in the phosphorylated
protein. A free-energy change of 5.69 kJ mol⁻¹ (1.36 kcal mol⁻¹)
corresponds to a factor of 10 in an equilibrium constant. Hence, phosphorylation
can change the conformational equilibrium between different functional
states by a large factor, of the order of 10⁴. In essence, the energy
expenditure allows for a stark shift from one state to another.


2. A phosphoryl group adds two negative charges to a modified protein. These 
new charges may disrupt electrostatic interactions in the unmodified pro- 
tein and allow new electrostatic interactions to be formed. Such structural 
changes can markedly alter substrate binding and catalytic activity. 


3. A phosphoryl group can form three or more hydrogen bonds. The tetrahe- 
dral geometry of a phosphoryl group makes these bonds highly directional, 
allowing for specific interactions with hydrogen-bond donors. 


4. Phosphorylation and dephosphorylation can take place in less than a second
or over a span of hours. The kinetics can be adjusted to meet the timing needs 
of a physiological process. 


5. Phosphorylation often evokes highly amplified effects. A single activated
kinase can phosphorylate hundreds of target proteins in a short interval. If
the target protein is an enzyme, it can in turn transform a large number of
substrate molecules.


6. ATP is the cellular energy currency (Chapter 15). The use of this com- 
pound as a phosphoryl-group donor links the energy status of the cell to the 
regulation of metabolism. 
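
The arithmetic behind the first reason in the list above can be checked with the relation ΔG° = −RT ln K. The sketch below assumes a temperature of 298 K (not stated in the text) and uses illustrative function names. It computes the free-energy change per tenfold change in an equilibrium constant and the fold change that corresponds to roughly half of the 50 kJ mol⁻¹ supplied by ATP.

```python
import math

R_KJ = 8.314e-3   # gas constant in kJ mol^-1 K^-1
T_KELVIN = 298.0  # assumed temperature (25 degrees C); not given in the text


def kj_per_tenfold(temperature=T_KELVIN):
    """Free-energy change (kJ/mol) that shifts an equilibrium constant tenfold."""
    return R_KJ * temperature * math.log(10)


def equilibrium_fold_change(delta_g_kj, temperature=T_KELVIN):
    """Factor by which an equilibrium constant changes for a given free energy."""
    return math.exp(delta_g_kj / (R_KJ * temperature))


if __name__ == "__main__":
    print(f"{kj_per_tenfold():.2f} kJ/mol per factor of 10")
    # About half of the ~50 kJ/mol from ATP is conserved in the phosphoprotein.
    print(f"fold change for 25 kJ/mol: {equilibrium_fold_change(25.0):.3g}")
```

With these assumptions the per-decade value comes out close to the 5.69 kJ mol⁻¹ quoted in the text, and 25 kJ mol⁻¹ corresponds to a factor of the order of 10⁴.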


Cyclic AMP activates protein kinase A by altering the quaternary structure


Let us examine a specific protein kinase that helps animals cope with stressful
situations. The “flight or fight” response is common to many animals
presented with a dangerous or exciting situation. Muscle becomes primed
for action. This priming is the result of the activity of a particular protein
kinase. In this case, the hormone epinephrine (adrenaline) triggers the formation
of cyclic AMP (cAMP), an intracellular messenger formed by the
cyclization of ATP. Cyclic AMP subsequently activates a key enzyme: protein
kinase A (PKA). The kinase alters the activities of target proteins by
phosphorylating specific serine or threonine residues. The striking finding
is that most effects of cAMP in eukaryotic cells are achieved through the activation
by cAMP of PKA.

[Structure: cyclic AMP (cAMP)]


PKA provides a clear example of the integration of allosteric regulation 
and phosphorylation. PKA is activated by cAMP concentrations near 
10 nM. The activation mechanism is reminiscent of that of aspartate trans- 
carbamoylase. Like that enzyme, PKA in muscle consists of two kinds of 
subunits: a 49-kd regulatory (R) subunit and a 38-kd catalytic (C) subunit. 
In the absence of cAMP, the regulatory and catalytic subunits form an R2C2
complex that is enzymatically inactive (Figure 10.17). The binding of two
molecules of cAMP to each of the regulatory subunits leads to the dissociation
of R2C2 into an R2 subunit and two C subunits. These free catalytic
subunits are then enzymatically active. Thus, the binding of cAMP to the
regulatory subunit relieves its inhibition of the catalytic subunit. PKA and
most other kinases exist in isozymic forms for fine-tuning regulation to 
meet the needs of a specific cell or developmental stage. 

How does the binding of cAMP activate the kinase? Each R chain con- 
tains the sequence Arg-Arg-Gly-Ala-Ile, which matches the consensus 
sequence for phosphorylation except for the presence of alanine in place of 
serine. In the R2C2 complex, this pseudosubstrate sequence of R occupies the
catalytic site of C, thereby preventing the entry of protein substrates (see 
Figure 10.17). The binding of cAMP to the R chains allosterically moves 
the pseudosubstrate sequences out of the catalytic sites. The released C 
chains are then free to bind and phosphorylate substrate proteins. 


Figure 10.17 Regulation of protein kinase A. The binding of
four molecules of cAMP activates protein kinase A by dissociating the
inhibited holoenzyme (R2C2) into a regulatory subunit (R2) and two
catalytically active subunits (C). Each R chain includes cAMP-binding
domains and a pseudosubstrate sequence.


Figure 10.18 Protein kinase A bound
to an inhibitor. This space-filling model 
shows a complex of the catalytic subunit of 
protein kinase A with an inhibitor bearing a 
pseudosubstrate sequence. Notice that the 
inhibitor (yellow) binds to the active site, a 
cleft between the domains of the enzyme. 
The bound ATP, shown in red, is in the active 
site adjacent to the site to which the inhibitor 
is bound. [Drawn from 1ATP.pdb.] 


Figure 10.19 Binding of pseudosubstrate 
to protein kinase A. Notice that the 
inhibitor makes multiple contacts with the 
enzyme. The two arginine side chains of the 
pseudosubstrate form salt bridges with three 
glutamate carboxylate groups. Hydrophobic 
interactions also are important in the 
recognition of substrate. The isoleucine 
residue of the pseudosubstrate is in contact 
with a pair of leucine residues of the enzyme. 
ATP and the target protein bind to a deep cleft in the catalytic
subunit of protein kinase A 


X-ray crystallography revealed the three-dimensional structure
of the catalytic subunit of PKA bound to ATP and a
20-residue peptide inhibitor. The 350-residue catalytic subunit of
PKA has two lobes (Figure 10.18). ATP and part of the inhibitor
fill a deep cleft between the lobes. The smaller lobe makes many
contacts with ATP-Mg²⁺, whereas the larger lobe binds the peptide
and contributes the key catalytic residues. As with other
kinases, the two lobes move closer to one another on substrate 
binding; mechanisms that restrict this domain closure provide a 
means of regulating protein kinase activity. The PKA structure has 
broad significance because residues 40 to 280 constitute a conserved 
catalytic core that is common to essentially all known protein kinases. 
We see here an example of a successful biochemical solution to a 
problem (in this case, protein phosphorylation) being employed 
many times in the course of evolution. 

The bound peptide in this crystal occupies the active site 
because it contains the pseudosubstrate sequence Arg-Arg-Asn- 
Ala-Ile (Figure 10.19). The structure of the complex reveals the 
interactions by which the enzyme recognizes the consensus sequence. The 
guanidinium group of the first arginine residue forms an ion pair with the 
carboxylate side chain of a glutamate residue (Glu 127) of the enzyme. The 
second arginine likewise interacts with two other carboxylate groups. The 
nonpolar side chain of isoleucine, which matches Z in the consensus 
sequence (p. 299), fits snugly in a hydrophobic groove formed by two leucine 
residues of the enzyme.


10.4 Many Enzymes Are Activated by Specific 
Proteolytic Cleavage 


We turn now to a different mechanism of enzyme regulation. Many 
enzymes acquire full enzymatic activity as they spontaneously fold into 
their characteristic three-dimensional forms. In contrast, the folded forms 


Table 10.3 Gastric and pancreatic zymogens


Site of synthesis  Zymogen              Active enzyme
Stomach            Pepsinogen           Pepsin
Pancreas           Chymotrypsinogen     Chymotrypsin
Pancreas           Trypsinogen          Trypsin
Pancreas           Procarboxypeptidase  Carboxypeptidase


of other enzymes are inactive until the cleavage of one or a few specific 
peptide bonds. The inactive precursor is called a zymogen or a proenzyme. 
An energy source such as ATP is not needed for cleavage. Therefore, in 
contrast with reversible regulation by phosphorylation, even proteins 
located outside cells can be activated by this means. Another noteworthy 
difference is that proteolytic activation, in contrast with allosteric control 
and reversible covalent modification, takes place just once in the life of an 
enzyme molecule. 

Specific proteolysis is a common means of activating enzymes and other
proteins in biological systems. For example: 


1. The digestive enzymes that hydrolyze proteins are synthesized as zymo- 
gens in the stomach and pancreas (Table 10.3). 


2. Blood clotting is mediated by a cascade of proteolytic activations that 
ensures a rapid and amplified response to trauma. 


3. Some protein hormones are synthesized as inactive precursors. For 
example, insulin is derived from proinsulin by proteolytic removal of a 
peptide. 


4. The fibrous protein collagen, the major constituent of skin and bone, is 
derived from procollagen, a soluble precursor. 


5. Many developmental processes are controlled by the activation of zymo- 
gens. For example, in the metamorphosis of a tadpole into a frog, large 
amounts of collagen are resorbed from the tail in the course of a few days. 
Likewise, much collagen is broken down in a mammalian uterus after 
delivery. The conversion of procollagenase into collagenase, the active 
protease, is precisely timed in these remodeling processes. 


6. Programmed cell death, or apoptosis, is mediated by proteolytic enzymes 
called caspases, which are synthesized in precursor form as procaspases. 
When activated by various signals, caspases function to cause cell death in 
most organisms, ranging from C. elegans to human beings. Apoptosis pro- 
vides a means of sculpting the shapes of body parts in the course of develop- 
ment and a means of eliminating damaged or infected cells. 


We next examine the activation and control of zymogens, using as 
examples several digestive enzymes as well as blood-clot formation. 


Chymotrypsinogen is activated by specific cleavage of a single 
peptide bond 


Chymotrypsin is a digestive enzyme that hydrolyzes proteins in the small 
intestine. Its mechanism of action was described in detail in Chapter 9. Its 
inactive precursor, chymotrypsinogen, is synthesized in the pancreas, as are 
several other zymogens and digestive enzymes. Indeed, the pancreas is one 
of the most active organs in synthesizing and secreting proteins. The 
enzymes and zymogens are synthesized in the acinar cells of the pancreas 
Figure 10.21 Proteolytic activation of
chymotrypsinogen. The three chains of
α-chymotrypsin are linked by two interchain
disulfide bonds (A to B, and B to C).
[Diagram labels: chymotrypsinogen (inactive); α-chymotrypsin (active).]


[Figure 10.20 labels: ribosomes attached to endoplasmic reticulum; zymogen granule]

Figure 10.20 Secretion of zymogens by 
an acinar cell of the pancreas. Zymogens 
are synthesized on ribosomes attached to the 
endoplasmic reticulum. They are subsequently 
processed in the Golgi apparatus and 
packaged into zymogen or secretory granules. 
With the proper signal, the granules fuse with 
the plasma membrane, discharging their 
contents into the lumen of the pancreatic 
ducts. Cell cytoplasm is depicted as pale 
green. Membranes and lumen are shown as 
dark green. 


and stored inside membrane-bounded granules (Figure 10.20). The zymo- 
gen granules accumulate at the apex of the acinar cell; when the cell is 
stimulated by a hormonal signal or a nerve impulse, the contents of the 
granules are released into a duct leading into the duodenum. 

Chymotrypsinogen, a single polypeptide chain consisting of 245 amino 
acid residues, is virtually devoid of enzymatic activity. It is converted into a 
fully active enzyme when the peptide bond joining arginine 15 and isoleu- 
cine 16 is cleaved by trypsin (Figure 10.21). The resulting active enzyme, 
called π-chymotrypsin, then acts on other π-chymotrypsin molecules by
removing two dipeptides to yield α-chymotrypsin, the stable form of the
enzyme. The three resulting chains in α-chymotrypsin remain linked to one
another by two interchain disulfide bonds. The striking feature of this 
activation process is that cleavage of a single specific peptide bond transforms 
the protein from a catalytically inactive form into one that is fully active. 
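
The bookkeeping of this activation can be sketched as residue ranges. The passage does not say which two dipeptides are removed; the example below assumes the commonly cited positions in chymotrypsinogen numbering (residues 14-15 and 147-148) and uses a hypothetical helper, `split_chain`, to show how a 245-residue chain becomes three chains.

```python
def split_chain(length, removed):
    """Return the (start, end) residue ranges left after excising `removed` ranges."""
    chains = []
    start = 1
    for first, last in sorted(removed):
        if first > start:
            chains.append((start, first - 1))
        start = last + 1
    if start <= length:
        chains.append((start, length))
    return chains


# Assumption: dipeptide positions are not given in the passage.
REMOVED_DIPEPTIDES = [(14, 15), (147, 148)]

if __name__ == "__main__":
    for label, (start, end) in zip("ABC", split_chain(245, REMOVED_DIPEPTIDES)):
        print(f"chain {label}: residues {start}-{end} ({end - start + 1} residues)")
```

Under that assumption the three chains together retain 241 of the original 245 residues, and the new amino terminus at isoleucine 16 begins the second chain.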


Proteolytic activation of chymotrypsinogen leads to the formation of a 
substrate-binding site 


How does cleavage of a single peptide bond activate the zymogen? The 
cleavage of the peptide bond between amino acids 15 and 16 triggers key 
conformational changes, which were revealed by the elucidation of the 
three-dimensional structure of chymotrypsinogen. 


1. The newly formed amino-terminal group of isoleucine 16 turns inward and 
forms an ionic bond with aspartate 194 in the interior of the chymotrypsin 
molecule (Figure 10.22). 


2. This electrostatic interaction triggers a number of conformational 
changes. Methionine 192 moves from a deeply buried position in the zymo- 
gen to the surface of the active enzyme, and residues 187 and 193 move 


Figure 10.22 Conformations
of chymotrypsinogen (red) and
chymotrypsin (blue). Notice the
alteration of the position of isoleucine
16 in chymotrypsin. The electrostatic
interaction between the α-amino
group of isoleucine 16 and the
carboxylate of aspartate 194, essential
for the structure of active
chymotrypsin, is possible only in
chymotrypsin. [Drawn from 1GCT.pdb
and 2GCA.pdb.]
farther apart from each other. These changes result in the formation of the
substrate-specificity site for aromatic and bulky nonpolar groups. One side of 
this site is made up of residues 189 through 192. This cavity for binding part 
of the substrate is not fully formed in the zymogen. 


3. The tetrahedral transition state in catalysis by chymotrypsin is stabilized 
by hydrogen bonds between the negatively charged carbonyl oxygen atom 
of the substrate and two NH groups of the main chain of the enzyme 
(see Figure 9.8). One of these NH groups is not appropriately located in 
chymotrypsinogen, and so the oxyanion hole is incomplete in the zymogen. 


4. The conformational changes elsewhere in the molecule are very small. 
Thus, the switching on of enzymatic activity in a protein can be accomplished 
by discrete, highly localized conformational changes that are triggered by the 
hydrolysis of a single peptide bond. 


The generation of trypsin from trypsinogen leads to the activation of 
other zymogens 


The structural changes accompanying the activation of trypsinogen, the 
precursor of the proteolytic enzyme trypsin, are somewhat different from 
those in the activation of chymotrypsinogen. X-ray analyses have shown 
that the conformation of four stretches of polypeptide, constituting about 
15% of the molecule, changes markedly on activation. These regions are very 
flexible in the zymogen, whereas they have a well-defined conformation in 
trypsin. Furthermore, the oxyanion hole in trypsinogen is too far from histi- 
dine 57 to promote the formation of the tetrahedral transition state. 

The digestion of proteins in the duodenum requires the concurrent 
action of several proteolytic enzymes, because each is specific for a limited 
number of side chains. Thus, the zymogens must be switched on at the same 
time. Coordinated control is achieved by the action of trypsin as the common 
activator of all the pancreatic zymogens—trypsinogen, chymotrypsinogen, 
proelastase, procarboxypeptidase, and prolipase, a lipid-degrading enzyme.
To produce active trypsin, the cells that line the duodenum secrete an
enzyme, enteropeptidase, which hydrolyzes a unique lysine-isoleucine peptide
bond in trypsinogen as the zymogen enters the duodenum from the
pancreas. The small amount of trypsin produced in this way activates more 
trypsinogen and the other zymogens (Figure 10.23). Thus, the formation of 
trypsin by enteropeptidase is the master activation step. 
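
Why a small amount of enteropeptidase suffices can be seen in a toy kinetic model. The sketch below integrates d[trypsin]/dt = (k_ep·E + k_auto·[trypsin])·[trypsinogen] with simple Euler steps. All rate constants, concentrations, and the function name are arbitrary illustrative assumptions in dimensionless units, not measured values.

```python
def simulate_trypsin_activation(enteropeptidase, k_ep=1.0, k_auto=5.0,
                                trypsinogen=1.0, trypsin=0.0,
                                dt=0.01, steps=2000):
    """Euler integration of trypsinogen activation; returns the trypsin time course.

    All parameters are dimensionless placeholders chosen for illustration.
    """
    history = [trypsin]
    for _ in range(steps):
        # Enteropeptidase seeds the process; trypsin then activates more trypsinogen.
        rate = (k_ep * enteropeptidase + k_auto * trypsin) * trypsinogen
        converted = min(rate * dt, trypsinogen)
        trypsinogen -= converted
        trypsin += converted
        history.append(trypsin)
    return history


if __name__ == "__main__":
    seeded = simulate_trypsin_activation(enteropeptidase=0.01)
    unseeded = simulate_trypsin_activation(enteropeptidase=0.0)
    print(f"final trypsin with enteropeptidase:    {seeded[-1]:.3f}")
    print(f"final trypsin without enteropeptidase: {unseeded[-1]:.3f}")
```

In this model nothing happens without the initial enteropeptidase step, whereas even a trace of it leads to nearly complete activation once trypsin begins to act on trypsinogen.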
Figure 10.23 Zymogen activation by
proteolytic cleavage. Enteropeptidase 
initiates the activation of the pancreatic 
zymogens by activating trypsin, which then 
activates other zymogens. Active enzymes are 
shown in yellow; zymogens are shown in 
orange. 
Some proteolytic enzymes have specific inhibitors


The conversion of a zymogen into a protease by cleavage of a single peptide 
bond is a precise means of switching on enzymatic activity. However, this 
activation step is irreversible, and so a different mechanism is needed to 
stop proteolysis. Specific protease inhibitors accomplish this task. For 
example, pancreatic trypsin inhibitor, a 6-kd protein, inhibits trypsin by 
binding very tightly to its active site. The dissociation constant of the complex
is 0.1 pM, which corresponds to a standard free energy of binding of
about −75 kJ mol⁻¹ (−18 kcal mol⁻¹). In contrast with nearly all known
protein assemblies, this complex is not dissociated into its constituent 
chains by treatment with denaturing agents such as 8 M urea or 6 M guani- 
dine hydrochloride. 
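
The quoted binding free energy follows from the dissociation constant through ΔG° = RT ln K_d. The sketch below checks the number, assuming a temperature of 298 K and a 1 M standard state (neither is stated in the text); the function name is illustrative.

```python
import math

R_KJ = 8.314e-3          # gas constant in kJ mol^-1 K^-1
KJ_PER_KCAL = 4.184      # conversion factor


def binding_free_energy_kj(kd_molar, temperature=298.0):
    """Standard free energy of binding (kJ/mol) from a dissociation constant in M."""
    return R_KJ * temperature * math.log(kd_molar)


if __name__ == "__main__":
    dg = binding_free_energy_kj(0.1e-12)  # K_d = 0.1 pM
    print(f"{dg:.1f} kJ/mol ({dg / KJ_PER_KCAL:.1f} kcal/mol)")
```

The result is close to the roughly −75 kJ mol⁻¹ (−18 kcal mol⁻¹) given in the text; the small difference depends on the temperature assumed.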

The reason for the exceptional stability of the complex is that pancreatic 
trypsin inhibitor is a very effective substrate analog. X-ray analyses showed 
that the inhibitor lies in the active site of the enzyme, positioned such that 
the side chain of lysine 15 of this inhibitor interacts with the aspartate side 
chain in the specificity pocket of trypsin. In addition, there are many hydro- 
gen bonds between the main chain of trypsin and that of its inhibitor. 
Furthermore, the carbonyl group of lysine 15 and the surrounding atoms of 
the inhibitor fit snugly in the active site of the enzyme. Comparison of the 
structure of the inhibitor bound to the enzyme with that of the free inhibitor 
reveals that the structure is essentially unchanged on binding to the enzyme 
(Figure 10.24). Thus, the inhibitor is preorganized into a structure that is 
highly complementary to the enzyme’s active site. Indeed, the peptide bond 
between lysine 15 and alanine 16 in pancreatic trypsin inhibitor is cleaved 
but at a very slow rate: the half-life of the trypsin-inhibitor complex is
several months. In essence, the inhibitor is a substrate, but its intrinsic
structure is so nicely complementary to the enzyme’s active site that it binds
very tightly, rarely progresses to the transition state, and is turned over slowly.


The amount of trypsin is much greater than the amount of inhibitor.
Why does trypsin inhibitor exist? Recall that trypsin activates other
zymogens. Consequently, preventing even small amounts of trypsin
from initiating the cascade prematurely is vital.
Trypsin inhibitor binds to trypsin molecules in the pancreas or pancreatic 
ducts. This inhibition prevents severe damage to those tissues, which could 
lead to acute pancreatitis. 

Pancreatic trypsin inhibitor is not the only important protease inhibitor. 
α1-Antitrypsin (also called α1-antiproteinase), a 53-kd plasma protein,
protects tissues from digestion by elastase, a secretory product of neutrophils
(white blood cells that engulf bacteria). Antielastase would be a more accu- 
rate name for this inhibitor, because it blocks elastase much more effectively 
than it blocks trypsin. Like pancreatic trypsin inhibitor, α1-antitrypsin
blocks the action of target enzymes by binding nearly irreversibly to their
active sites. Genetic disorders leading to a deficiency of α1-antitrypsin show
that this inhibitor is physiologically important. For example, the substitu- 
tion of lysine for glutamate at residue 53 in the type Z mutant slows the 
secretion of this inhibitor from liver cells. Serum levels of the inhibitor are 
about 15% of normal in people homozygous for this defect. The conse- 
quence is that excess elastase destroys alveolar walls in the lungs by digest- 
ing elastic fibers and other connective-tissue proteins. 

The resulting clinical condition is called emphysema (also known as 
destructive lung disease). People with emphysema must breathe much harder 
than normal people to exchange the same volume of air because their alveoli 
are much less resilient than normal. Cigarette smoking markedly increases 
the likelihood that even a type Z heterozygote will develop emphysema. The 
reason is that smoke oxidizes methionine 358 of the inhibitor (Figure 10.25), 
a residue essential for binding elastase. Indeed, this methionine side chain is 
the bait that selectively traps elastase. The methionine sulfoxide oxidation 
product, in contrast, does not lure elastase, a striking consequence of the 
insertion of just one oxygen atom into a protein and a striking example of 
the effect of behavior on biochemistry. We will consider another protease 
inhibitor, antithrombin III, when we examine the control of blood clotting. 


Blood clotting is accomplished by a cascade of zymogen activations 


Enzymatic cascades are often employed in biochemical systems to
achieve a rapid response. In a cascade, an initial signal institutes a 
series of steps, each of which is catalyzed by an enzyme. At each step, the 
signal is amplified. For instance, if a signal molecule activates an enzyme 
that in turn activates 10 enzymes and each of the 10 enzymes in turn acti- 
vates 10 additional enzymes, after four steps the original signal will have 
been amplified 10,000-fold. Blood clots are formed by a cascade of zymogen 
activations: the activated form of one clotting factor catalyzes the activation 
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Figure 10.24 Interaction of trypsin 
with its inhibitor. Structure of a complex of 
trypsin (yellow) and pancreatic trypsin 
inhibitor (red). Notice that lysine 15 of the 
inhibitor penetrates into the active site of the 
enzyme. There it forms a salt bridge with 
aspartate 189 in the active site. Also notice 
that bound inhibitor and the free inhibitor are 
almost identical in structure. [Drawn from 
1BPI.pdb.]
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Figure 10.25 Oxidation of methionine to 
methionine sulfoxide. 


[Figure 10.26 diagram. Panel labels: INTRINSIC PATHWAY (damaged surface; kininogen, kallikrein); EXTRINSIC PATHWAY (trauma); FINAL COMMON PATHWAY. Key: * = activated by thrombin.]

Figure 10.26 Blood-clotting cascade. A 
fibrin clot is formed by the interplay of the 
intrinsic, extrinsic, and final common 
pathways. The intrinsic pathway begins with 
the activation of factor XII (Hageman factor) 
by contact with abnormal surfaces produced 
by injury. The extrinsic pathway is triggered by 
trauma, which releases tissue factor (TF). TF 
forms a complex with VII, which initiates a 
cascade that activates thrombin. Inactive forms of
clotting factors are shown in red; their 
activated counterparts (indicated by the 
subscript “a”) are in yellow. Stimulatory 
proteins that are not themselves enzymes are 
shown in blue boxes. A striking feature of this 
process is that the activated form of one 
clotting factor catalyzes the activation of the 
next factor. 
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of the next (Figure 10.26). Thus, very small amounts of 
the initial factors suffice to trigger the cascade, ensuring 
a rapid response to trauma. 
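
The 10,000-fold figure given for a four-step cascade can be checked with a short calculation. The sketch below assumes, as in the text's example, that every step multiplies the signal by the same factor of 10; the function name and the uniform gain are illustrative simplifications, because real clotting factors differ in catalytic efficiency.

```python
def cascade_amplification(gain_per_step, steps):
    """Fold amplification after a number of sequential activation steps,
    assuming each step multiplies the signal by the same gain."""
    amplification = 1
    for _ in range(steps):
        amplification *= gain_per_step
    return amplification

# Amplification after each of the four steps in the text's example.
for n in range(1, 5):
    print(n, cascade_amplification(10, n))
```

Because the amplification grows geometrically with the number of steps, even a modest per-step gain produces a large overall response.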

Two means of initiating blood clotting have been 
described, the intrinsic pathway and the extrinsic path- 
way. The intrinsic clotting pathway is activated by 
exposure of anionic surfaces on rupture of the endothe- 
lial lining of the blood vessels. The extrinsic pathway, 
which appears to be most crucial in blood clotting, is 
initiated when trauma exposes tissue factor (TF), an 
integral membrane glycoprotein. Shortly after the tissue 
factor is exposed, small amounts of thrombin, the key 
protease in clotting, are generated. Thrombin then 
amplifies the clotting process by activating enzymes 
and factors that lead to the generation of yet more 
thrombin, an example of positive feedback. The extrin- 
sic and intrinsic pathways converge on a common 
sequence of final steps to form a clot composed of the 
protein fibrin (see Figure 10.26). Note that the active 
forms of the clotting factors are designated with a sub- 
script “a,” whereas factors that are activated by throm- 
bin are designated with an asterisk. 


Fibrinogen is converted by thrombin into a fibrin 
clot 


The best-characterized part of the clotting process is the 
final step in the cascade: the conversion of fibrinogen 


into fibrin by thrombin, a proteolytic enzyme. Fibrinogen is made up of 
three globular units connected by two rods (Figure 10.27). This 340-kd 
protein consists of six chains: two each of Aα, Bβ, and γ. The rod regions
are triple-stranded α-helical coiled coils, a recurring motif in proteins
(Section 2.3). Thrombin cleaves four arginine-glycine peptide bonds in
the central globular region of fibrinogen. On cleavage, an A peptide of
18 residues is released from each of the two Aα chains, as is a B peptide
of 20 residues from each of the two Bβ chains. These A and B peptides are
called fibrinopeptides. A fibrinogen molecule devoid of these fibrinopeptides
is called a fibrin monomer and has the subunit structure (αβγ)₂.


Figure 10.27 Structure of a fibrinogen molecule. (A) A ribbon diagram. The two rod
regions are α-helical coiled coils, connected to a globular region at each end. The structure of the
central globular region has not been determined. (B) A schematic representation showing the 
positions of the fibrinopeptides A and B. [Part A drawn from 1DEQ.pdb.] 


Fibrin monomers spontaneously assemble into ordered fibrous arrays
called fibrin. Electron micrographs and low-angle x-ray patterns show that
fibrin has a periodic structure that repeats every 23 nm (Figure 10.28).

Higher-resolution images reveal how the removal of the fibrinopeptides
permits the fibrin monomers to come together to form fibrin. The
homologous β and γ chains have globular domains at the carboxyl-terminal
ends (Figure 10.29). These domains have binding “holes” that interact
with peptides. The β domain is specific for sequences of the form
H₃N⁺-Gly-His-Arg-, whereas the γ domain binds H₃N⁺-Gly-Pro-Arg-.
Exactly these sequences (sometimes called “knobs”) are exposed at the
amino-terminal ends of the β and α chains, respectively, on thrombin
cleavage. The knobs of the α subunits fit into the holes on the γ subunits
of another monomer to form a protofibril. This protofibril is extended
when the knobs of the β subunits fit into the holes of β subunits of other
protofibrils. Thus, analogous to the activation of chymotrypsinogen,
peptide-bond cleavage exposes new amino termini that can participate in
specific interactions. The newly formed “soft clot” is stabilized by the
formation of amide bonds between the side chains of lysine and glutamine
residues in different monomers.

Figure 10.28 Electron micrograph of fibrin. The 23-nm period
along the fiber axis is half the length of a fibrinogen molecule.
[Courtesy of Dr. Henry Slayter.]


[Reaction scheme: transglutaminase links a glutamine side chain to a lysine side chain, forming the cross-link.]


This cross-linking reaction is catalyzed by transglutaminase (factor
XIIIa), which itself is activated from the protransglutaminase form by
thrombin.
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Figure 10.29 Formation of a fibrin clot. 


(1) Thrombin cleaves fibrinopeptides A and B
from the central globule of fibrinogen. (2)
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“knobs” exposed at the amino-terminal ends 
of the β and α chains to form clots.
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Figure 10.30 Modular structure of prothrombin. Cleavage of two peptide bonds yields 
thrombin. All the γ-carboxyglutamate residues are in the gla domain.


Prothrombin is readied for activation by a vitamin K-dependent 
modification 


Thrombin is synthesized as a zymogen called prothrombin. The inactive 
molecule comprises four major domains, with the serine protease domain 
at its carboxyl terminus. The first domain is called a gla domain (a
γ-carboxyglutamate-rich domain), and the second and third domains are
called kringle domains (named after a Danish pastry that they resemble;
Figure 10.30). These domains work in concert to keep prothrombin in an
inactive form and to target it to appropriate sites for its activation by factor
Xa (a serine protease) and factor Va (a stimulatory protein). Activation is
begun by proteolytic cleavage of the bond between arginine 274 and threo- 
nine 275 to release a fragment containing the first three domains. Cleavage 
of the bond between arginine 323 and isoleucine 324 (analogous to the key 
bond in chymotrypsinogen) yields active thrombin. 


Vitamin K (Figure 10.31) has been known for many years to be 
essential for the synthesis of prothrombin and several other clotting 
factors. Indeed, it is called vitamin K because a deficiency in this vitamin 
results in defective blood koagulation (Scandinavian spelling). The results 
of studies of the abnormal prothrombin synthesized in the absence of vita- 
min K or in the presence of vitamin K antagonists, such as dicoumarol, 
revealed the vitamin’s importance to proper clot formation. Dicoumarol is 
found in spoiled sweet clover and causes a fatal hemorrhagic disease in 
cattle fed on this hay. This coumarin derivative is used clinically as an anti- 
coagulant to prevent thromboses in patients prone to clot formation. 
Dicoumarol and such related vitamin K antagonists as warfarin also serve 
as effective rat poisons. Cows fed dicoumarol synthesize an abnormal
prothrombin that does not bind Ca²⁺, in contrast with normal prothrombin.
This difference was puzzling for some time because abnormal prothrombin
has the same number of amino acid residues as that of normal
prothrombin and gives the same amino acid analysis after acid hydrolysis.
Nuclear magnetic resonance studies revealed that normal prothrombin
contains γ-carboxyglutamate, a formerly unknown residue that evaded
detection because its second carboxyl group is lost on acid hydrolysis in the
course of amino acid analysis. The abnormal prothrombin formed subsequent
to the administration of anticoagulants lacks this modified amino
acid. In fact, the first 10 glutamate residues in the amino-terminal region of
prothrombin are carboxylated to γ-carboxyglutamate by a vitamin
K-dependent enzyme system (Figure 10.32). The vitamin K-dependent
carboxylation reaction converts glutamate, a weak chelator of Ca²⁺, into
γ-carboxyglutamate, a much stronger chelator. Prothrombin is thus able
to bind Ca²⁺, but what is the effect of this binding? The binding of Ca²⁺ by
prothrombin anchors the zymogen to phospholipid membranes derived
from blood platelets after injury. The binding of prothrombin to phospho- 
lipid surfaces is crucial because it brings prothrombin into close proximity 
to two clotting proteins that catalyze its conversion into thrombin. The 


Figure 10.32 The calcium-binding
region of prothrombin. Prothrombin
binds calcium ions with the modified
amino acid γ-carboxyglutamate (red).
[Drawn from 2PF2.pdb.]


calcium-binding domain is removed during activation, freeing the throm- 
bin from the membrane so that it can cleave fibrinogen and other targets. 


Hemophilia revealed an early step in clotting 


Some important breakthroughs in the elucidation of clotting path-
ways have come from studies of patients with bleeding disorders. 
Classic hemophilia, or hemophilia A, is the best-known clotting defect. This 
disorder is genetically transmitted as a sex-linked recessive characteristic. In 
classic hemophilia, factor VIII (antihemophilic factor) of the intrinsic pathway 
is missing or has markedly reduced activity. Although factor VIII is not itself 
a protease, it markedly stimulates the activation of factor X, the final
protease of the intrinsic pathway, by factor IXa, a serine protease (Figure 10.33).
Thus, activation of the intrinsic pathway is severely impaired in hemophilia. 

In the past, hemophiliacs were treated with transfusions of a concen- 
trated plasma fraction containing factor VIII. This therapy carried the risk 
of infection. Indeed, many hemophiliacs contracted hepatitis and, more 
recently, AIDS. A safer source of factor VIII was urgently needed. With the 
use of biochemical purification and recombinant DNA techniques, the gene 
for factor VIII was isolated and expressed in cells grown in culture. 
Recombinant factor VIII purified from these cells has largely replaced 
plasma concentrates in treating hemophilia. 


Figure 10.33 Action of antihemophilic factor. Antihemophilic factor
(Factor VIII) stimulates the activation of factor X by factor IXa.
Interestingly, the activity of factor VIII is markedly increased by limited
proteolysis by thrombin. This positive feedback amplifies the clotting
signal and accelerates clot formation after a threshold has been reached.


The clotting process must be precisely regulated 


There is a fine line between hemorrhage and thrombosis, the formation of 
blood clots in blood vessels. Clots must form rapidly yet remain confined 
to the area of injury. What are the mechanisms that normally limit clot 
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An account of a hemorrhagic disposition 
existing in certain families 


“About seventy or eighty years ago, a woman 
by the name of Smith settled in the vicinity 
of Plymouth, New Hampshire, and 
transmitted the following idiosyncrasy to her 
descendants. It is one, she observed, to 
which her family is unfortunately subject 
and has been the source not only of great 
solicitude, but frequently the cause of death. 
If the least scratch is made on the skin of 
some of them, as mortal a hemorrhage will 
eventually ensue as if the largest wound is 
inflicted. . . . It is a surprising circumstance 
that the males only are subject to this 
strange affection, and that all of them are 
not liable to it... . Although the females are 
exempt, they are still capable of transmitting 
it to their male children.”

John Otto (1803) 
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Figure 10.34 Electron micrograph of a 
mast cell. Heparin and other molecules in 
the dense granules are released into the 
extracellular space when the cell is triggered 
to secrete. [Courtesy of Lynne Mercer.] 


formation to the site of injury? The lability of clotting factors contributes 
significantly to the control of clotting. Activated factors are short-lived 
because they are diluted by blood flow, removed by the liver, and degraded 
by proteases. For example, the stimulatory protein factors Va and VIIIa
are digested by protein C, a protease that is switched on by the action of 
thrombin. Thus, thrombin has a dual function: it catalyzes the formation of 
fibrin and it initiates the deactivation of the clotting cascade. 

Specific inhibitors of clotting factors are also critical in the termination 
of clotting. For instance, tissue factor pathway inhibitor (TFPI) inhibits the 
complex of TF-VIIa-Xa. Separate domains in TFPI inhibit VIIa and Xa.
Another key inhibitor is antithrombin III, a plasma protein that inactivates
thrombin by forming an irreversible complex with it. Antithrombin III
resembles α1-antitrypsin except that it inhibits thrombin much more
strongly than it inhibits elastase (see Figure 10.24). Antithrombin III also
blocks other serine proteases in the clotting cascade, namely factors XIIa,
XIa, IXa, and Xa. The inhibitory action of antithrombin III is enhanced by
heparin, a negatively charged polysaccharide found in mast cells near the 
walls of blood vessels and on the surfaces of endothelial cells (Figure 10.34). 
Heparin acts as an anticoagulant by increasing the rate of formation of 
irreversible complexes between antithrombin III and the serine protease 
clotting factors. Antitrypsin and antithrombin are serpins, a family of serine 
protease inhibitors. 


The importance of the ratio of thrombin to antithrombin is illustrated
in the case of a 14-year-old boy who died of a bleeding disorder
because of a mutation in his α1-antitrypsin, which normally inhibits
elastase. Methionine 358 in α1-antitrypsin’s binding pocket for elastase was
replaced by arginine, resulting in a change in specificity from an elastase
inhibitor to a thrombin inhibitor. α1-Antitrypsin activity normally increases
markedly after injury to counteract excess elastase arising from stimulated
neutrophils. The mutant α1-antitrypsin caused the patient’s thrombin
activity to drop to such a low level that hemorrhage ensued. We see here a 
striking example of how a change of a single residue in a protein can dra- 
matically alter specificity and an example of the critical importance of hav- 
ing the right amount of a protease inhibitor. 

Antithrombin limits the extent of clot formation, but what happens to 
the clots themselves? Clots are not permanent structures but are designed 
to dissolve when the structural integrity of damaged areas is restored. 
Fibrin is split by plasmin, a serine protease that hydrolyzes peptide bonds 
in the coiled-coil regions. Plasmin molecules can diffuse through aque- 
ous channels in the porous fibrin clot to cut the accessible connector rods. 
Plasmin is formed by the proteolytic activation of plasminogen, an inac- 
tive precursor that has a high affinity for the fibrin clots. This conversion 
is carried out by tissue-type plasminogen activator (TPA), a 72-kd protein 
that has a domain structure closely related to that of prothrombin (Figure 
10.35). However, a domain that targets TPA to fibrin clots replaces the 
membrane-targeting gla domain of prothrombin. The TPA bound to 
fibrin clots swiftly activates adhering plasminogen. In contrast, TPA 
activates free plasminogen very slowly. The gene for TPA has been cloned 
and expressed in cultured mammalian cells. Clinical studies have shown 
that TPA administered intravenously within an hour of the formation of a 


Figure 10.35 Modular structure of tissue-type plasminogen activator (TPA). 



Figure 10.36 The effect of tissue-type plasminogen factor. TPA leads to the dissolution of 
blood clots, as shown by x-ray images of blood vessels in the heart (A) before and (B) 3 hours 
after the administration of TPA. The position of the clot is marked by the arrow in part A. [After 
F. Van de Werf, P. A. Ludbrook, S. R. Bergmann, A. J. Tiefenbrunn, K. A. A. Fox, H. de Geest, M. 
Verstraete, D. Collen, and B. E. Sobel. New Engl. J. Med. 310(1984):609-613.] 


blood clot in a coronary artery markedly increases the likelihood of 
surviving a heart attack (Figure 10.36). 
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Aspartate Transcarbamoylase Is Allosterically Inhibited by the End 
Product of Its Pathway 

Allosteric proteins constitute an important class of proteins whose 
biological activity can be regulated. Specific regulatory molecules can 
modulate the activity of allosteric proteins by binding to distinct regu- 
latory sites, separate from the functional sites. These proteins have 
multiple functional sites, which display cooperation as evidenced by a 
sigmoidal dependence of function on substrate concentration. 
Aspartate transcarbamoylase (ATCase), one of the best-understood 
allosteric enzymes, catalyzes the synthesis of N-carbamoylaspartate, 
the first intermediate in the synthesis of pyrimidines. ATCase is 
feedback inhibited by cytidine triphosphate, the final product of the 
pathway. ATP reverses this inhibition. ATCase consists of separable 
catalytic (c3) subunits (which bind the substrates) and regulatory (r2) 
subunits (which bind CTP and ATP). The inhibitory effect of CTP, 
the stimulatory action of ATP, and the cooperative binding of 
substrates are mediated by large changes in quaternary structure. On 
binding substrates, the c3 subunits of the c6r6 enzyme move apart and
reorient themselves. This allosteric transition is highly concerted. All 
subunits of an ATCase molecule simultaneously interconvert from 
the T (low-affinity) to the R (high-affinity) state. 


Isozymes Provide a Means of Regulation Specific to Distinct Tissues 
and Developmental Stages 

Isozymes differ in structural characteristics but catalyze the same reac- 
tion. They provide a means of fine-tuning metabolism to meet the 
needs of a given tissue or developmental stage. The results of gene- 
duplication events provide the means for subtle regulation of enzyme 
function. 


Covalent Modification Is a Means of Regulating Enzyme Activity 
The covalent modification of proteins is a potent means of controlling 
the activity of enzymes and other proteins. Phosphorylation is the 
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most common type of reversible covalent modification. Signals can be 
highly amplified by phosphorylation because a single kinase can act on 
many target molecules. The regulatory actions of protein kinases are 
reversed by protein phosphatases, which catalyze the hydrolysis of 
attached phosphoryl groups. 

Cyclic AMP serves as an intracellular messenger in the transduc- 
tion of many hormonal and sensory stimuli. Cyclic AMP switches on 
protein kinase A, a major multifunctional kinase, by binding to the 
regulatory subunit of the enzyme, thereby releasing the active cata- 
lytic subunits of PKA. In the absence of cAMP, the catalytic sites of 
PKA are occupied by pseudosubstrate sequences of the regulatory 
subunit. 


10.4 Many Enzymes Are Activated by Specific Proteolytic Cleavage 


The activation of an enzyme by the proteolytic cleavage of one or a few 
peptide bonds is a recurring control mechanism seen in processes as 
diverse as the activation of digestive enzymes and blood clotting. The 
inactive precursor is a zymogen (proenzyme). Trypsinogen is activated 
by enteropeptidase or trypsin, and trypsin then activates a host of 
other zymogens, leading to the digestion of foodstuffs. For instance, 
trypsin converts chymotrypsinogen, a zymogen, into active chy- 
motrypsin by hydrolyzing a single peptide bond. 

A striking feature of the clotting process is that it is accomplished 
by a cascade of zymogen conversions, in which the activated form 
of one clotting factor catalyzes the activation of the next precursor. 
Many of the activated clotting factors are serine proteases. In the 
final step of clot formation, fibrinogen, a highly soluble molecule in 
the plasma, is converted by thrombin into fibrin by the hydrolysis of 
four arginine-glycine bonds. The resulting fibrin monomer spontaneously
forms long, insoluble fibers called fibrin. Zymogen activation
is also essential in the lysis of clots. Plasminogen is converted
into plasmin, a serine protease that cleaves fibrin, by tissue-type 
plasminogen activator. Although zymogen activation is irreversible, 
specific inhibitors of some proteases exert control. The irreversible 
protein inhibitor antithrombin III holds blood clotting in check in 
the clotting cascade. 
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Problems 

1. Context please. The allosteric properties of aspartate
transcarbamoylase have been discussed in detail in this chapter.
What is the function of aspartate transcarbamoylase?


2. Activity profile. A histidine residue in the active site of
aspartate transcarbamoylase is thought to be important in
stabilizing the transition state of the bound substrates.
Predict the pH dependence of the catalytic rate, assuming
that this interaction is essential and dominates the pH-activity
profile of the enzyme. (See equations on p. 16.)


3. Knowing when to say when. What is feedback inhibition? 
Why is it a useful property? 


4. Knowing when to get going. What is the biochemical 
rationale for ATP serving as a positive regulator of 
ATCase? 


5. No T. What would be the effect of a mutation in an 
allosteric enzyme that resulted in a T/R ratio of 0? 


6. Turned upside down. An allosteric enzyme that follows 
the concerted mechanism has a T/R ratio of 300 in the 
absence of substrate. Suppose that a mutation reversed the 
ratio. How would this mutation affect the relation between 
the rate of the reaction and the substrate concentration? 


7. Partners. As shown in Figure 10.2, CTP inhibits 
ATCase; however, the inhibition is not complete. Can you 
suggest another molecule that might enhance the inhibition 
of ATCase? Hint: See Figure 25.2. 


8. RT equilibrium. Differentiate between homotropic and 
heterotropic effectors. 


9. Restoration project. If isolated regulatory subunits and 
catalytic subunits of ATCase are mixed, the native enzyme 
is reconstituted. What is the biological significance of the 
observation? 


10. Because it’s an enzyme. X-ray crystallographic studies 
of ATCase in the R form required the use of the bisubstrate 
analog PALA. Why was this analog, a competitive inhibitor,
used instead of the actual substrates? 


11. Allosteric switching. A substrate binds 100 times as 
tightly to the R state of an allosteric enzyme as to its T state. 
Assume that the concerted (MWC) model applies to this 
enzyme. (See equations for the Concerted Model in the 
Appendix to Chapter 7.) 


(a) By what factor does the binding of one substrate mole- 
cule per enzyme molecule alter the ratio of the concentra- 
tions of enzyme molecules in the R and T states? 


(b) Suppose that L, the ratio of [T] to [R] in the absence of 
substrate, is 10⁷ and that the enzyme contains four binding
sites for substrate. What is the ratio of enzyme molecules in 
the R state to those in the T state in the presence of saturat- 
ing amounts of substrate, assuming that the concerted 
model is obeyed? 


12. Allosteric transition. Consider an allosteric protein that 
obeys the concerted model. Suppose that the ratio of T to R 
formed in the absence of ligand is 10⁵, K_T = 2 mM, and
K_R = 5 μM. The protein contains four binding sites for
ligand. What is the fraction of molecules in the R form 
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when 0, 1, 2, 3, and 4 ligands are bound? (See equations for 
the Concerted Model in the Appendix to Chapter 7.) 


13. Negative cooperativity. You have isolated a dimeric 
enzyme that contains two identical active sites. The binding 
of substrate to one active site decreases the substrate affin- 
ity of the other active site. Can the concerted model account 
for this negative cooperativity? 


14. Paradoxical at first glance. Recall that phosphonacetyl-
L-aspartate (PALA) is a potent inhibitor of ATCase because
it mimics the two physiological substrates. However, low 
concentrations of this unreactive bisubstrate analog increase 
the reaction velocity. On the addition of PALA, the reaction 
rate increases until an average of three molecules of PALA 
are bound per molecule of enzyme. This maximal velocity 
is 17-fold greater than it is in the absence of PALA. The 
reaction rate then decreases to nearly zero on the addition of 
three more molecules of PALA per molecule of enzyme. 
Why do low concentrations of PALA activate ATCase? 


15. Regulation energetics. The phosphorylation and dephos- 
phorylation of proteins is a vital means of regulation. 
Protein kinases attach phosphoryl groups, whereas only a 
phosphatase will remove the phosphoryl group from the 
target protein. What is the energy cost of this means of 
covalent regulation? 


16. Viva la difference. What is an isozyme? 


17. Fine-tuning biochemistry. What is the advantage for an 
organism to have isozymic forms of an enzyme? 


18. Making matches. 


(a) ATCase                 1. Protein phosphorylation
(b) T state                2. Required to modify glutamate
(c) R state                3. Activates a particular kinase
(d) Phosphorylation        4. Proenzyme
(e) Kinase                 5. Activates trypsin
(f) Phosphatase            6. Common covalent modification
(g) cAMP                   7. Inhibited by CTP
(h) Zymogen                8. Less-active state of an allosteric protein
(i) Enteropeptidase        9. Initiates extrinsic pathway
(j) Vitamin K             10. Forms fibrin
(l) Tissue factor         11. More-active state of an allosteric protein
                          12. Removes phosphates


19. Powering change. Phosphorylation is a common covalent
modification of proteins in all forms of life. What energetic 
advantages accrue from the use of ATP as the phosphoryl 
donor? 
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20. No going back. What is the key difference between 
regulation by covalent modification and specific proteolytic 
cleavage? 


21. Zymogen activation. When very low concentrations of 
pepsinogen are added to acidic media, how does the half- 
time for activation depend on zymogen concentration? 


22. No protein shakes advised. Predict the physiological 
effects of a mutation that resulted in a deficiency of entero- 
peptidase. 


23. A revealing assay. Suppose that you have just examined 
a young boy with a bleeding disorder highly suggestive of 
classic hemophilia (factor VIII deficiency). Because of the 
late hour, the laboratory that carries out specialized coagu- 
lation assays is closed. However, you happen to have a 
sample of blood from a classic hemophiliac whom you 
admitted to the hospital an hour earlier. What is the sim- 
plest and most rapid test that you can perform to determine 
whether your present patient also is deficient in factor VIII 
activity? 


24. Counterpoint. The synthesis of factor X, like that of 
prothrombin, requires vitamin K. Factor X also contains 
γ-carboxyglutamate residues in its amino-terminal region. 
However, activated factor X, in contrast with thrombin, 
retains this region of the molecule. What is a likely func- 
tional consequence of this difference between the two acti- 
vated species? 


25. A discerning inhibitor. Antithrombin III forms an 
irreversible complex with thrombin but not with pro- 
thrombin. What is the most likely reason for this difference 
in reactivity? 


26. Repeating heptads. Each of the three types of fibrin 
chains contains repeating heptapeptide units (abcdefg) in 
which residues a and d are hydrophobic. Propose a reason 
for this regularity. 


27. Drug design. A drug company has decided to use recombinant 
DNA methods to prepare a modified α1-antitrypsin 
that will be more resistant to oxidation than is the naturally 
occurring inhibitor. Which single amino acid substitution 
would you recommend? 


28. Blood must flow. Why is inappropriate blood-clot 
formation dangerous? 


29. Dissolution row. What is tissue-type plasminogen 
activator, and what is its role in preventing heart attacks? 


30. Joining together. What differentiates a soft clot from a 
mature clot? 


Data Interpretation Problems 


31. Distinguishing between models. The following graph 
shows the fraction of an allosteric enzyme in the R state (fR) 
and the fraction of active sites bound to substrate (Y) as a 
function of substrate concentration. Which model, the 
concerted or sequential, best explains these results? 
[After M. W. Kirschner and H. K. Schachman. Biochemistry 12:2997-3004, 1966.] 


32. Reporting live from ATCase 1. ATCase underwent 
reaction with tetranitromethane to form a colored nitroty- 
rosine group (λmax = 430 nm) in each of its catalytic chains. 
The absorption by this reporter group depends on its 
immediate environment. An essential lysine residue at each 
catalytic site also was modified to block the binding of sub- 
strate. Catalytic trimers from this doubly modified enzyme 
were then combined with native trimers to form a hybrid 
enzyme. The absorption by the nitrotyrosine group was 
measured on addition of the substrate analog succinate. 
What is the significance of the alteration in the absorbance 
at 430 nm? 
[After H. K. Schachman. J. Biol. Chem. 263:18583-18586, 1988.] 


33. Reporting live from ATCase 2. A different ATCase 
hybrid was constructed to test the effects of allosteric acti- 
vators and inhibitors. Normal regulatory subunits were 
combined with nitrotyrosine-containing catalytic subunits. 
The addition of ATP in the absence of substrate increased 
the absorbance at 430 nm, the same change elicited by the 
addition of succinate (see the graph in Problem 32). 
Conversely, CTP in the absence of substrate decreased the 
absorbance at 430 nm. What is the significance of the 
changes in absorption of the reporter groups? 
[After H. K. Schachman. J. Biol. Chem. 263:18583-18586, 1988.] 


Chapter Integration Problems 


34. Density matters. The sedimentation value of aspartate 
transcarbamoylase decreases when the enzyme switches to 
the R state. On the basis of the allosteric properties of the 
enzyme, explain why the sedimentation value decreases. 


35. Too tight a grip. Trypsin cleaves proteins on the car- 
boxyl side of lysine. Trypsin inhibitor has a lysine residue, 
and binds to trypsin, yet it is not a substrate. Explain. 


Mechanism Problems 


36. Aspartate transcarbamoylase. Write the mechanism (in 
detail) for the conversion of aspartate and carbamoyl phosphate 
into N-carbamoylaspartate. Include a role for the 
histidine residue present in the active site. 


37. Protein kinases. Write a mechanism (in detail) for the 
phosphorylation of a serine residue by ATP catalyzed by a 
protein kinase. What groups might you expect to find in the 
enzyme’s active site? 
Carbohydrates 

Carbohydrates are important fuel molecules, but they play many other biochemical roles, including 
protection against high-impact forces. The cartilage of a runner's foot cushions the impact of each 
step she takes. Key components of cartilage are molecules called glycosaminoglycans, large 
polymers made up of many repeats of dimers such as the pair shown at the right. [Untitled x-ray/ 
Nick Veasey/Getty Images.] 


OUTLINE 

11.1 Monosaccharides Are the Simplest Carbohydrates 
11.2 Monosaccharides Are Linked to Form Complex Carbohydrates 
11.3 Carbohydrates Can Be Linked to Proteins to Form Glycoproteins 
11.4 Lectins Are Specific Carbohydrate-Binding Proteins 


For years, the study of carbohydrates was considered less exciting than 
many if not most topics of biochemistry. Carbohydrates were recognized 
as important fuels and structural components but were thought to be 
peripheral to most key activities of the cell. In essence, they were considered 
the underlying girders and fuel for a magnificent piece of biochemical 
architecture. This view has changed dramatically in the past few years. We have 
learned that cells of all organisms are coated in a dense and complex coat of 
carbohydrates. Secreted proteins are often extensively decorated with 
carbohydrates essential to a protein’s function. The extracellular matrix in 
higher eukaryotes (the environment in which the cells live) is rich in 
secreted carbohydrates central to cell survival and cell-to-cell 
communication. Carbohydrates are crucial for the development and functioning of 
all organisms, not only as fuels, but also as information-rich molecules. 
Carbohydrates, carbohydrate-containing proteins, and specific 
carbohydrate-binding proteins are required for interactions that allow cells to form tissues, 
are the basis of human blood groups, and are used by a variety of pathogens 
to gain access to their hosts. Indeed, rather than mere infrastructure 
components, carbohydrates supply details and enhancements to the biochemical 
architecture of the cell, helping to define the beauty, functionality, and 
uniqueness of the cell. 

A key property of carbohydrates that allows their many functions is the 
tremendous structural diversity possible within this class of molecules. 
Carbohydrates are built from monosaccharides, which are small molecules— 
typically containing from three to nine carbon atoms that are bound to 
hydroxyl groups—that vary in size and in the stereochemical configuration 
at one or more carbon centers. These monosaccharides can be linked together 
to form a large variety of oligosaccharide structures. The sheer number of 
possible oligosaccharides makes this class of molecules information rich. 
This information, when attached to proteins, can augment the already 
immense diversity of proteins. 

The realization of the importance of carbohydrates to so many aspects of 
biochemistry has spawned a field of study called glycobiology. Glycobiology 
is the study of the synthesis and structure of carbohydrates and how carbo- 
hydrates are attached to and recognized by other molecules such as 
proteins. Along with a new field comes a new “omics” to join genomics and 
proteomics—glycomics. Glycomics is the study of the glycome, all of the 
carbohydrates and carbohydrate-associated molecules that cells produce. 
Like the proteome, the glycome is not static and can change, depending on 
cellular and environmental conditions. Unraveling oligosaccharide struc- 
tures and elucidating the effects of their attachment to other molecules 
constitute a tremendous challenge in the field of biochemistry. 


11.1 Monosaccharides Are the Simplest Carbohydrates 


Carbohydrates are carbon-based molecules that are rich in hydroxyl groups. 
Indeed, the empirical formula for many carbohydrates is (CH2O)n, 
literally, a carbon hydrate. Simple carbohydrates are called monosaccharides. 
These simple sugars serve not only as fuel molecules but also as funda- 
mental constituents of living systems. For instance, DNA is built on simple 
sugars: its backbone consists of alternating phosphoryl groups and deoxyri- 
bose, a cyclic five-carbon sugar. 
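
As a quick check on the carbon-hydrate formula, the following Python sketch expands (CH2O)n for a chosen n and sums atomic masses. The function names and the rounded atomic masses are illustrative assumptions, not values taken from the text, and sugars such as deoxyribose do not follow this formula.

```python
# Rounded standard atomic masses in g/mol (assumed values for illustration).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def carbon_hydrate_counts(n):
    """Return the atom counts of (CH2O)n as a dictionary."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return {"C": n, "H": 2 * n, "O": n}


def formula_string(counts):
    """Format atom counts as a molecular formula such as C6H12O6."""
    return "".join(
        element + (str(count) if count > 1 else "")
        for element, count in counts.items()
    )


def molar_mass(counts):
    """Sum the atomic masses weighted by the atom counts (g/mol)."""
    return sum(ATOMIC_MASS[element] * count for element, count in counts.items())


if __name__ == "__main__":
    for n in (3, 5, 6):  # triose, pentose, hexose
        counts = carbon_hydrate_counts(n)
        print(n, formula_string(counts), round(molar_mass(counts), 2))
```

For n = 6 the sketch gives the familiar hexose formula C6H12O6, which glucose and fructose share.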

Monosaccharides are aldehydes or ketones that have two or more 
hydroxyl groups. The smallest monosaccharides, composed of three carbon 
atoms, are dihydroxyacetone and D- and L-glyceraldehyde. 
Dihydroxyacetone is called a ketose because it contains a keto group (in red 
above), whereas glyceraldehyde is called an aldose because it contains an 
aldehyde group. They are referred to as trioses (tri- for three, referring to the 
three carbon atoms that they contain). Similarly, simple monosaccharides 
with four, five, six, and seven carbon atoms are called tetroses, pentoses, 
hexoses, and heptoses, respectively. Perhaps the monosaccharides of which 
we are most aware are the hexoses, such as glucose and fructose. Glucose is 
an essential energy source for virtually all forms of life. Fructose is commonly 
used as a sweetener that is converted into glucose derivatives inside 
the cell. 

Carbohydrates can exist in a dazzling variety of isomeric forms (Figure 11.1). 
Dihydroxyacetone and glyceraldehyde are called constitutional isomers 
because they have identical molecular formulas but differ in how the atoms 
are ordered. Stereoisomers are isomers that differ in spatial arrangement. 
Recall from the discussion of amino acids (p. 27) that stereoisomers are 
designated as having either D or L configuration. Glyceraldehyde has a sin- 
gle asymmetric carbon atom and, thus, there are two stereoisomers of this 
sugar: D-glyceraldehyde and L-glyceraldehyde. These molecules are a type 
of stereoisomer called enantiomers, which are mirror images of each other. 
Most vertebrate monosaccharides have the D configuration. According to 
convention, the D and L isomers are determined by the configuration of 
the asymmetric carbon atom farthest from the aldehyde or keto group. 
Dihydroxyacetone is the only monosaccharide without at least one asym- 
metric carbon atom. 

Monosaccharides made up of more than three carbon atoms have mul- 
tiple asymmetric carbons, and so they can exist not only as enantiomers but 
also as diastereoisomers, isomers that are not mirror images of each other. 
The number of possible stereoisomers equals 2^n, where n is the number of 
asymmetric carbon atoms. Thus, a six-carbon aldose with 4 asymmetric 
carbon atoms can exist in 16 possible stereoisomers, of which glucose is 
one. 
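
The counting rule can be sketched in a few lines of Python. The sketch assumes straight-chain sugars in their open-chain form, with the keto group of a ketose at C-2; under those assumptions an aldose with c carbon atoms has c - 2 asymmetric centers and a ketose has c - 3. The function names are hypothetical.

```python
def asymmetric_centers(carbons, kind):
    """Number of asymmetric carbons in an open-chain aldose or 2-ketose."""
    if carbons < 3:
        raise ValueError("monosaccharides have at least three carbon atoms")
    if kind == "aldose":
        return carbons - 2  # every carbon except C-1 and the terminal CH2OH
    if kind == "ketose":
        return carbons - 3  # C-1, C-2 (keto), and the terminal CH2OH are excluded
    raise ValueError("kind must be 'aldose' or 'ketose'")


def stereoisomer_count(carbons, kind):
    """Apply the 2**n rule, where n is the number of asymmetric centers."""
    return 2 ** asymmetric_centers(carbons, kind)


if __name__ == "__main__":
    for carbons in range(3, 8):
        print(
            carbons,
            stereoisomer_count(carbons, "aldose"),
            stereoisomer_count(carbons, "ketose"),
        )
```

The three-carbon ketose row reproduces the observation that dihydroxyacetone has no asymmetric carbon and therefore only one form.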

Figure 11.2 shows the common sugars that we will see most frequently 
in our study of biochemistry. D-Ribose, the carbohydrate component of 
RNA, is a five-carbon aldose, as is deoxyribose, the monosaccharide com- 
ponent of deoxynucleotides. D-Glucose, D-mannose, and D-galactose are 


Figure 11.1 Isomeric forms of 
carbohydrates. 
Figure 11.2 Common monosaccharides. 
Aldoses contain an aldehyde (shown in blue), 
whereas ketoses, such as fructose, contain a 
ketone (also shown in blue). The asymmetric 
carbon atom farthest from the aldehyde or 
ketone (shown in red) designates the 
structures as being in the D configuration. 
abundant six-carbon aldoses. Note that D-glucose and D-mannose differ in 
configuration only at C-2, the carbon atom in the second position. Sugars 
that are diastereoisomers differing in configuration at only a single asym- 
metric center are called epimers. Thus, D-glucose and D-mannose are epi- 
meric at C-2; D-glucose and D-galactose are epimeric at C-4. 

Note that ketoses have one less asymmetric center than aldoses with 
the same number of carbon atoms. D-Fructose is the most abundant 
ketohexose. 
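
The definition of an epimer can be made concrete with a small Python sketch. Each sugar is encoded by the side (R for right, L for left) on which the hydroxyl group sits at C-2 through C-5 in its Fischer projection; this encoding and the function names are illustrative choices rather than a standard notation.

```python
# Side of the hydroxyl group at each asymmetric carbon in the Fischer projection.
FISCHER = {
    "D-glucose":   {2: "R", 3: "L", 4: "R", 5: "R"},
    "D-mannose":   {2: "L", 3: "L", 4: "R", 5: "R"},
    "D-galactose": {2: "R", 3: "L", 4: "L", 5: "R"},
}


def differing_centers(sugar_a, sugar_b):
    """List the carbon positions at which two sugars differ in configuration."""
    a, b = FISCHER[sugar_a], FISCHER[sugar_b]
    return [carbon for carbon in sorted(a) if a[carbon] != b[carbon]]


def epimeric_carbon(sugar_a, sugar_b):
    """Return the single differing carbon if the sugars are epimers, else None."""
    diff = differing_centers(sugar_a, sugar_b)
    return diff[0] if len(diff) == 1 else None


if __name__ == "__main__":
    print(epimeric_carbon("D-glucose", "D-mannose"))
    print(epimeric_carbon("D-glucose", "D-galactose"))
    print(epimeric_carbon("D-mannose", "D-galactose"))
```

D-Mannose and D-galactose differ at two centers (C-2 and C-4), so they are diastereoisomers but not epimers of each other.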


Many common sugars exist in cyclic forms 


The predominant forms of ribose, glucose, fructose, and many other sugars 
in solution, as is the case inside the cell, are not open chains. Rather, the 
open-chain forms of these sugars cyclize into rings. The chemical basis 
for ring formation is that an aldehyde can react with an alcohol to form a 
hemiacetal. 
[Reaction scheme: aldehyde + alcohol ⇌ hemiacetal] 
For an aldohexose such as glucose, a single molecule provides both the alde- 
hyde and the alcohol: the C-1 aldehyde in the open-chain form of glucose 
reacts with the C-5 hydroxyl group to form an intramolecular hemiacetal 
(Figure 11.3). The resulting cyclic hemiacetal, a six-membered ring, is 
called pyranose because of its similarity to pyran. 

Similarly, a ketone can react with an alcohol to form a hemiketal. 
The C-2 keto group in the open-chain form of a ketohexose, such as 
fructose, can form an intramolecular hemiketal by reacting with either the 
C-6 hydroxyl group to form a six-membered cyclic hemiketal or the C-5 
hydroxyl group to form a five-membered cyclic hemiketal (Figure 11.4). 
The five-membered ring is called a furanose because of its similarity to 
furan. 


The depictions of glucopyranose (glucose) and fructofuranose (fructose) 
shown in Figures 11.3 and 11.4 are Haworth projections. In such projections, 
the carbon atoms in the ring are not written out. The approximate plane of 
the ring is perpendicular to the plane of the paper, with the heavy line on the 
ring projecting toward the reader. 

We have seen that carbohydrates can contain many asymmetric carbon 
atoms. An additional asymmetric center is created when a cyclic hemiacetal 
is formed, creating yet another diastereoisomeric form of sugars called 
anomers. In glucose, C-1 (the carbonyl carbon atom in the open-chain 
form) becomes an asymmetric center. Thus, two ring structures can be 
formed: α-D-glucopyranose and β-D-glucopyranose (see Figure 11.3). For 
D sugars drawn as Haworth projections in the standard orientation as 
shown in Figure 11.3, the designation α means that the hydroxyl group 
attached to C-1 is on the opposite side of the ring from C-6; β means that the 
hydroxyl group is on the same side of the ring as C-6. The C-1 carbon atom 
is called the anomeric carbon atom, and the α and β forms are called 
anomers. An equilibrium mixture of glucose contains approximately 
one-third α anomer, two-thirds β anomer, and <1% of the open-chain form. 
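
The equilibrium proportions can be turned into a rough free-energy difference between the two anomers. The sketch below is a back-of-the-envelope estimate: it neglects the small open-chain fraction, assumes a temperature of 25 °C (298.15 K), and uses the standard relation ΔG° = -RT ln K. The function name and the chosen temperature are assumptions.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1


def standard_free_energy_change(k_eq, temperature_k=298.15):
    """Return delta G standard (J/mol) for an equilibrium constant k_eq."""
    if k_eq <= 0:
        raise ValueError("the equilibrium constant must be positive")
    return -R * temperature_k * math.log(k_eq)


# Equilibrium constant for alpha -> beta, from the approximate fractions.
fraction_alpha = 1 / 3
fraction_beta = 2 / 3
k_eq = fraction_beta / fraction_alpha

delta_g = standard_free_energy_change(k_eq)

if __name__ == "__main__":
    print(f"K = {k_eq:.2f}, delta G = {delta_g / 1000:.2f} kJ/mol")
```

Under these assumptions the β anomer is favored by roughly 1.7 kJ/mol, a small difference, which is consistent with both anomers being present in substantial amounts.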

The furanose-ring form of fructose also has anomeric forms, in which α 
and β refer to the hydroxyl groups attached to C-2, the anomeric carbon 
atom (see Figure 11.4). Fructose forms both pyranose and furanose rings. 




Figure 11.3 Pyranose formation. The 
open-chain form of glucose cyclizes when the 
C-5 hydroxyl group attacks the carbonyl carbon 
atom of the C-1 aldehyde group to form an 
intramolecular hemiacetal. Two anomeric 
forms, designated α and β, can result. 


Figure 11.4 Furanose formation. The 
open-chain form of fructose cyclizes to a 
five-membered ring when the C-5 hydroxyl 
group attacks the C-2 ketone to form an 
intramolecular hemiketal. Two anomers are 
possible, but only the α anomer is shown. 
Figure 11.6 Common monosaccharides in their ring forms. 

[Figure caption fragment: ... β anomers are possible. Structures labeled α-D-fructopyranose and β-D-fructopyranose.] 


The pyranose form predominates in fructose free in solution, and the fura- 
nose form predominates in many fructose derivatives (Figure 11.5). 

β-D-Fructopyranose, found in honey, is one of the sweetest chemicals 
known. The β-D-fructofuranose form is not nearly as sweet. Heating 
converts β-fructopyranose into the β-fructofuranose form, reducing the 
sweetness of the solution. For this reason, corn syrup with a high concentration of 
fructose in the β-D-pyranose form is used as a sweetener in cold, but not 
hot, drinks. Figure 11.6 shows the common sugars discussed previously in 
their ring forms. 


[Figure 11.6 structures: D-ribose, 2-deoxy-D-ribose, α-D-glucose, α-D-fructose, α-D-galactose, α-D-mannose.] 


Pyranose and furanose rings can assume different conformations 


The six-membered pyranose ring is not planar, because of the tetrahedral 
geometry of its saturated carbon atoms. Instead, pyranose rings adopt two 
classes of conformations, termed chair and boat because of the resem- 
blance to these objects (Figure 11.7). In the chair form, the substituents on 
the ring carbon atoms have two orientations: axial and equatorial. Axial 
bonds are nearly perpendicular to the average plane of the ring, whereas 
equatorial bonds are nearly parallel to this plane. Axial substituents steri- 
cally hinder each other if they emerge on the same side of the ring (e.g., 
1,3-diaxial groups). In contrast, equatorial substituents are less crowded. 


The chair form of β-D-glucopyranose predominates because all axial 
positions are occupied by hydrogen atoms. The bulkier -OH and 
-CH2OH groups emerge at the less-hindered periphery. The 
boat form of glucose is disfavored because it is quite sterically 
hindered. 

Furanose rings, like pyranose rings, are not planar. They can be 
puckered so that four atoms are nearly coplanar and the fifth is 
about 0.5 Å away from this plane (Figure 11.8). This conformation 
is called an envelope form because the structure resembles an opened 
envelope with the back flap raised. In the ribose moiety of most 
biomolecules, either C-2 or C-3 is out of the plane on the same side 
as C-5. These conformations are called C-2-endo and C-3-endo, 
respectively. 


Figure 11.8 Envelope conformations of β-D-ribose. The C-3-endo and 
C-2-endo forms of β-D-ribose are shown. The color indicates the four 
atoms that lie approximately in a plane. 


Glucose is a reducing sugar 


Because the α and β isomers of glucose are in an equilibrium that passes 
through the open-chain form, glucose has some of the chemical properties 
of free aldehydes, such as the ability to react with oxidizing agents. For 
example, glucose can react with cupric ion (Cu²⁺), reducing it to cuprous 
ion (Cu⁺), while being oxidized to gluconic acid. 
Solutions of cupric ion (known as Fehling’s solution) provide a simple 
test for the presence of sugars such as glucose. Sugars that react are 
called reducing sugars; those that do not are called nonreducing sugars. 
Reducing sugars can often nonspecifically react with other molecules. For 
instance, as a reducing sugar, glucose can react with hemoglobin to form 
glycosylated hemoglobin. Monitoring changes in the amount of glycosy- 
lated hemoglobin is an especially useful means of assessing the effectiveness 
of treatments for diabetes mellitus, a condition characterized by high levels of 
blood glucose (Section 27.3). Because the glycosylated hemoglobin remains 
in circulation, the amount of the modified hemoglobin corresponds to the 
long-term regulation—over several months—of glucose levels. In nondia- 
betic people, less than 6% of the hemoglobin is glycosylated, whereas, in 
uncontrolled diabetics, almost 10% of the hemoglobin is glycosylated. 
Although the glycosylation of hemoglobin has no effect on oxygen binding 
and is thus benign, similar reducing reactions between sugars and other 


Figure 11.7 Chair and boat forms of β-D-glucopyranose. The chair form is 
more stable owing to less steric hindrance because the axial positions are 
occupied by hydrogen atoms. Abbreviations: a, axial; e, equatorial. 
(Panels: boat form; chair form.) 
the normal biochemical function of the modified proteins. Modifications 
known as advanced glycosylation end products (AGE) have been implicated 
in aging, arteriosclerosis, and diabetes, as well as other pathological condi- 
tions. AGE is the name given to a series of reactions between an amino 
group not participating in a peptide bond in a protein and the aldehyde form 
of a carbohydrate. 


Monosaccharides are joined to alcohols and amines through 
glycosidic bonds 


The biochemical properties of monosaccharides can be modified by reaction 
with other molecules. These modifications increase the biochemical 
versatility of carbohydrates, enabling them to serve as signal molecules or 
rendering them more susceptible to combustion. Three common reactants 
are alcohols, amines, and phosphates. A bond formed between the ano- 
meric carbon atom of glucose and the oxygen atom of an alcohol is called a 
glycosidic bond—specifically, an O-glycosidic bond. O-Glycosidic bonds are 
prominent when carbohydrates are linked together to form long polymers 
and when they are attached to proteins. In addition, the anomeric carbon 
atom of a sugar can be linked to the nitrogen atom of an amine to form an 
N-glycosidic bond, such as when nitrogenous bases are attached to ribose 
units to form nucleosides. Examples of modified carbohydrates are shown 
in Figure 11.9. 
Figure 11.9 Modified monosaccharides. 
Carbohydrates can be modified by the 
addition of substituents (shown in red) other 
than hydroxyl groups. Such modified 
carbohydrates are often expressed on cell 
surfaces. 


Phosphorylated sugars are key intermediates in energy generation 
and biosyntheses 


One sugar modification deserves special note because of its prominence in 
metabolism. The addition of phosphoryl groups is a common modification 
of sugars. For instance, the first step in the breakdown of glucose to obtain 
energy is its conversion into glucose 6-phosphate. Several subsequent inter- 
mediates in this metabolic pathway, such as dihydroxyacetone phosphate 
and glyceraldehyde 3-phosphate, are phosphorylated sugars. 
Phosphorylation makes sugars anionic; the negative charge prevents these 
sugars from spontaneously leaving the cell by crossing lipid-bilayer mem- 
branes. Phosphorylation also creates reactive intermediates that will more 
readily form linkages to other molecules. For example, a multiply phospho- 
rylated derivative of ribose plays key roles in the biosyntheses of purine and 
pyrimidine nucleotides (Chapter 25). 


11.2 Monosaccharides Are Linked to Form 
Complex Carbohydrates 


Because sugars contain many hydroxyl groups, glycosidic bonds can join 
one monosaccharide to another. Oligosaccharides are built by the linkage of 
two or more monosaccharides by O-glycosidic bonds (Figure 11.10). In the 
disaccharide maltose, for example, two D-glucose residues are joined by a 
glycosidic linkage between the α-anomeric form of C-1 on one sugar and 
the hydroxyl oxygen atom on C-4 of the adjacent sugar. Such a linkage is 
called an α-1,4-glycosidic bond. Just as proteins have a polarity defined by 
the amino and carboxyl termini, oligosaccharides have a polarity defined by 
their reducing and nonreducing ends. The carbohydrate unit at the reducing 
end has a free anomeric carbon atom that has reducing activity because it 
can form the open-chain form, as discussed earlier (p. 325). By convention, 
this end of the oligosaccharide is still called the reducing end even when 
it is bound to another molecule such as a protein and thus no longer has 
reducing properties. 

The fact that monosaccharides have multiple hydroxyl groups means 
that many different glycosidic linkages are possible. For example, consider 
three monosaccharides—glucose, mannose, and galactose. These molecules 
can be linked together in the laboratory to form more than 12,000 different 
structures differing in the order of the monosaccharides and the hydroxyl 
groups participating in the glycosidic linkages. For instance, the hydroxyl 
group on carbon 1 of one monosaccharide can link to carbons 4 or 6 of the 
next monosaccharide. In this section, we will look at some of the most 
common oligosaccharides found in nature. 


Sucrose, lactose, and maltose are the common disaccharides 


A disaccharide consists of two sugars joined by an O-glycosidic bond. Three 
abundant disaccharides that we encounter frequently are sucrose, lactose, 
and maltose (Figure 11.11). Sucrose (common table sugar) is obtained com- 
mercially from sugar cane or sugar beets. The anomeric carbon atoms of a 
glucose unit and a fructose unit are joined in this disaccharide; the configuration 
of this glycosidic linkage is α for glucose and β for fructose. Sucrose 
can be cleaved into its component monosaccharides by the enzyme sucrase. 
Figure 11.11 Common disaccharides. Sucrose, lactose, and maltose are common dietary 
components. The angles in the bonds to the central oxygen atoms do not denote carbon atoms. 
Figure 11.10 Maltose, a disaccharide. 
Two molecules of glucose are linked by an 
α-1,4-glycosidic bond to form the disaccharide 
maltose (α-D-glucopyranosyl-(1→4)-α-D-glucopyranose). 
The angles in the bonds to the 
central oxygen atom do not denote carbon 
atoms. The angles are added only for ease of 
illustration. The glucose molecule on the right 
is capable of assuming the open-chain form, 
which is capable of acting as a reducing agent. 
The glucose molecule on the left cannot 
assume the open-chain form, because the 
C-1 carbon atom is bound to another 
molecule. 
Figure 11.12 Electron micrograph of 
microvilli. Lactase and other enzymes that 
hydrolyze carbohydrates are present on 
microvilli that project from the outer face of 
the plasma membrane of intestinal epithelial 
cells. [From Louisa Howard and Katherine 
Connolly. Courtesy of Louisa Howard, 
Dartmouth College.] 


Lactose, the disaccharide of milk, consists of galactose joined to glucose 
by a β-1,4-glycosidic linkage. Lactose is hydrolyzed to these monosaccharides 
by lactase in human beings and by β-galactosidase in bacteria. In 
maltose, two glucose units are joined by an α-1,4-glycosidic linkage. Maltose 
comes from the hydrolysis of large polymeric oligosaccharides such as 
starch and glycogen and is in turn hydrolyzed to glucose by maltase. Sucrase, 
lactase, and maltase are located on the outer surfaces of epithelial cells lin- 
ing the small intestine (Figure 11.12). The cleavage products of sucrose, 
lactose, and maltose can be further processed to provide energy in the form 
of ATP. 


Glycogen and starch are storage forms of glucose 


Glucose is an important energy source in virtually all life forms. However, 
free glucose molecules cannot be stored because in high concentrations, 
glucose will disturb the osmotic balance of the cell, with the potential result 
being cell death. The solution is to store glucose as units in a large polymer, 
which is not osmotically active. 

Large polymeric oligosaccharides, formed by the linkage of multiple 
monosaccharides, are called polysaccharides and play vital roles in energy 
storage and in maintaining the structural integrity of an organism. If all of 
the monosaccharide units in a polysaccharide are the same, the polymer is 
called a homopolymer. The most common homopolymer in animal 
cells is glycogen, the storage form of glucose. Glycogen is present 
in most of our tissues but is most common in muscle and liver. As 
will be considered in detail in Chapter 21, glycogen is a large, 
branched polymer of glucose residues. Most of the glucose units 
in glycogen are linked by α-1,4-glycosidic bonds. The branches are 
formed by α-1,6-glycosidic bonds, present about once in 10 units 
(Figure 11.13). 

The nutritional reservoir in plants is the homopolymer starch, of 
which there are two forms. Amylose, the unbranched type of starch, 
consists of glucose residues in α-1,4 linkage. Amylopectin, the 
branched form, has about 1 α-1,6 linkage per 30 α-1,4 linkages, 
in similar fashion to glycogen except for its lower degree of branching. 
More than half the carbohydrate ingested by human beings is starch 
found in wheat, potatoes, and rice, to name just a few sources. 
Amylopectin, amylose, and glycogen are rapidly hydrolyzed by 
α-amylase, an enzyme secreted by the salivary glands and the 
pancreas. 

Figure 11.13 Branch point in glycogen. Two chains 
of glucose molecules joined by α-1,4-glycosidic bonds 
are linked by an α-1,6-glycosidic bond to create a 
branch point. Such an α-1,6-glycosidic bond forms at 
approximately every 10 glucose units, making glycogen 
a highly branched molecule. 
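
To get a feel for the difference in branching, the sketch below estimates the number of branch points in polymers of equal size. It is a rough estimate only: it treats the quoted frequencies as exact averages (one branch per 10 units for glycogen, and one α-1,6 linkage per 31 linkages for amylopectin, taken here as one per 31 units), and the polymer size of 3100 units is an arbitrary illustrative number.

```python
# Average number of glucose units per branch point (approximations of the
# frequencies quoted in the text; treated here as exact for the estimate).
UNITS_PER_BRANCH = {
    "glycogen": 10,     # about one alpha-1,6 bond per 10 units
    "amylopectin": 31,  # about 1 alpha-1,6 linkage per 30 alpha-1,4 linkages
}


def estimated_branch_points(glucose_units, polymer):
    """Estimate the number of alpha-1,6 branch points in a polymer."""
    if glucose_units < 0:
        raise ValueError("the number of glucose units cannot be negative")
    return round(glucose_units / UNITS_PER_BRANCH[polymer])


if __name__ == "__main__":
    size = 3100  # hypothetical polymer size, chosen only for illustration
    for polymer in UNITS_PER_BRANCH:
        print(polymer, estimated_branch_points(size, polymer))
```

For the same number of glucose units, the estimate gives about three times as many branch points in glycogen as in amylopectin.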

We have considered only homopolymers of glucose. However, given the 
variety of different monosaccharides that can be put together in any number 
of arrangements, the number of possible polysaccharides is huge. We will 
consider some of these polysaccharides shortly. 


Cellulose, a structural component of plants, is made of 
chains of glucose 


Cellulose, the other major polysaccharide of glucose found in plants, serves 
a structural rather than a nutritional role as an important component of the 
plant cell wall. Cellulose is among the most abundant organic compounds in the 
biosphere. Some 10^15 kg of cellulose is synthesized and degraded on Earth 
each year, an amount 1000 times as great as the combined weight of the 
human race. Cellulose is an unbranched polymer of glucose residues joined 
by β-1,4 linkages, in contrast with the α-1,4 linkage seen in starch and 
glycogen. This simple difference in stereochemistry yields two molecules with 
vastly different properties and biological functions. The β configuration 
allows cellulose to form very long, straight chains. Fibrils are formed by 
parallel chains that interact with one another through hydrogen bonds, 
generating a rigid, supportive structure. The straight chains formed by β 
linkages are optimal for the construction of fibers having a high tensile 
strength. The α-1,4 linkages in glycogen and starch produce a very different 
molecular architecture: a hollow helix is formed instead of a straight chain 
(Figure 11.14). The hollow helix formed by α linkages is well suited to the 
formation of a more-compact, accessible store of sugar. Although mammals 
lack cellulases and therefore cannot digest wood and vegetable fibers, 
cellulose and other plant fibers are still an important constituent of the 
mammalian diet as a component of dietary fiber. Soluble fiber such as pectin 
(polygalacturonic acid) slows the movement of food through the gastroin- 
testinal tract, allowing improved digestion and the absorption of nutrients. 
Insoluble fibers, such as cellulose, increase the rate at which digestion prod- 
ucts pass through the large intestine. This increase in rate can minimize 
exposure to toxins in the diet. Cellulose is currently being investigated as 
a potential source of ethanol for biofuels. 


11.3 Carbohydrates Can Be Linked to Proteins to 
Form Glycoproteins 


A carbohydrate group can be covalently attached to a protein to form a gly- 
coprotein. We will examine three classes of glycoproteins. The first class is 
simply referred to as glycoproteins. In glycoproteins of this class, the pro- 
tein constituent is the largest component by weight. This versatile class 
plays a variety of biochemical roles. Many glycoproteins are components of 
cell membranes, where they take part in processes such as cell adhesion and 
the binding of sperm to eggs. Other glycoproteins are formed by linking 
carbohydrates to soluble proteins. In particular, many of the proteins 
secreted from cells are glycosylated, or modified by the attachment of 
carbohydrates, including most proteins present in the serum component of 
blood. 

The second class of glycoproteins comprises the proteoglycans. The 
protein component of proteoglycans is conjugated to a particular type of 
polysaccharide called a glycosaminoglycan. Carbohydrates make up a much 
larger percentage by weight of the proteoglycan compared with simple 
glycoproteins. Proteoglycans function as structural components and 
lubricants. 

Figure 11.15 Glycosidic bonds between 
proteins and carbohydrates. A glycosidic 
bond links a carbohydrate to the side chain of 
asparagine (N-linked) or to the side chain of 
serine or threonine (O-linked). The glycosidic 
bonds are shown in red. 


Mucins, or mucoproteins, are, like proteoglycans, predominately carbo- 
hydrate. N-Acetylgalactosamine is usually the carbohydrate moiety bound 
to the protein in mucins. N-Acetylgalactosamine is an example of an amino 
sugar, so named because an amino group replaces a hydroxyl group. Mucins, 
a key component of mucus, serve as lubricants. 

Glycosylation greatly increases the complexity of the proteome. A given 
protein with several potential glycosylation sites can have many different 
glycosylated forms (sometimes called glycoforms), each of which can be 
generated only in a specific cell type or developmental stage. 


Carbohydrates can be linked to proteins through asparagine (N-linked) 
or through serine or threonine (O-linked) residues 


Sugars in glycoproteins are attached either to the amide nitrogen atom in the 
side chain of asparagine (termed an N-linkage) or to the oxygen atom in the 
side chain of serine or threonine (termed an O-linkage), as shown in 
Figure 11.15. An asparagine residue can accept an oligosaccharide only if 
the residue is part of an Asn-X-Ser or Asn-X-Thr sequence, in which X can 
be any residue, except proline. Thus, potential glycosylation sites can be 
detected within amino acid sequences. However, not all potential sites are 
glycosylated. Which sites are glycosylated depends on other aspects of the 
protein structure and on the cell type in which the protein is expressed. All 
N-linked oligosaccharides have in common a pentasaccharide core consist- 
ing of three mannose and two N-acetylglucosamine residues. Additional 
sugars are attached to this core to form the great variety of oligosaccharide 
patterns found in glycoproteins (Figure 11.16). 
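
The Asn-X-Ser/Thr rule lends itself to a simple sequence scan. The following Python sketch reports the position of each asparagine that lies in such a sequence. The function name and the example peptide are invented for illustration and do not come from any established tool. A match marks only a potential site, because actual glycosylation depends on protein structure and cell type.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr.
# The lookahead consumes only the Asn, so overlapping sites are all found.
SEQUON = re.compile(r"N(?=[^P][ST])")

def potential_n_glycosylation_sites(sequence: str) -> list[int]:
    """Return 1-based positions of Asn residues in Asn-X-Ser/Thr sequences."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence.upper())]

# Hypothetical peptide, made up for this example (one-letter amino acid codes).
example = "MKNGSALNPTQQNNTSW"
print(potential_n_glycosylation_sites(example))  # [3, 13, 14]
```

The asparagine at position 8 is skipped because proline follows it, whereas the adjacent asparagines at positions 13 and 14 both qualify.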
Figure 11.16 N-linked oligosaccharides. A pentasaccharide core (shaded gray) is common 
to all N-linked oligosaccharides and serves as the foundation for a wide variety of N-linked 
oligosaccharides, two of which are illustrated: (A) high-mannose type; (B) complex type. 


The glycoprotein erythropoietin is a vital hormone 


Let us look at a glycoprotein present in the blood serum that has 
dramatically improved treatment for anemia, particularly that 
induced by cancer chemotherapy. The glycoprotein hormone erythropoietin 
(EPO) is secreted by the kidneys and stimulates the production of red 
blood cells. EPO is composed of 165 amino acids and is N-glycosylated 
at three asparagine residues 
and O-glycosylated on a serine residue (Figure 11.17). 
The mature EPO is 40% carbohydrate by weight, and 
glycosylation enhances the stability of the protein in the 
blood. Unglycosylated protein has only about 10% of 
the bioactivity of the glycosylated form because the pro- 
tein is rapidly removed from the blood by the kidneys. 
The availability of recombinant human EPO has greatly 
aided the treatment of anemias. However, some endur- 
ance athletes have used recombinant human EPO to 
increase the red-blood-cell count and hence their oxygen- 
carrying capacity. Drug-testing laboratories are able to 
distinguish some forms of prohibited human recombi- 
nant EPO from natural EPO in athletes by detecting 
differences in their glycosylation patterns through the 
use of isoelectric focusing (p. 75). 
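
A rough calculation shows what 40% carbohydrate by weight implies for the size of EPO. The sketch below assumes an average mass of about 110 Da per amino acid residue, a common rule of thumb that is not taken from the text, and the function name is hypothetical.

```python
# Assumed rule-of-thumb average mass of one amino acid residue, in daltons.
AVERAGE_RESIDUE_MASS_DA = 110.0

def glycoprotein_mass(n_residues: int, carbohydrate_fraction: float) -> tuple[float, float]:
    """Return (polypeptide mass, total glycoprotein mass) in daltons.

    carbohydrate_fraction is the share of the total mass that is carbohydrate,
    so the polypeptide accounts for the remaining (1 - fraction).
    """
    protein = n_residues * AVERAGE_RESIDUE_MASS_DA
    total = protein / (1.0 - carbohydrate_fraction)
    return protein, total

protein, total = glycoprotein_mass(165, 0.40)
print(f"polypeptide: about {protein:.0f} Da; glycoprotein: about {total:.0f} Da")
```

Under that assumption, the 165-residue polypeptide contributes about 18 kDa and the mature glycoprotein comes to roughly 30 kDa, so the attached oligosaccharides add on the order of 12 kDa.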


Proteoglycans, composed of polysaccharides and 
protein, have important structural roles 


As stated earlier, proteoglycans are proteins attached to 
glycosaminoglycans. The glycosaminoglycan makes up 
as much as 95% of the biomolecule by weight, and so the 
proteoglycan resembles a polysaccharide more than a 
protein. Proteoglycans not only function as lubricants 
and structural components in connective tissue, but also 
mediate the adhesion of cells to the extracellular matrix, 
and bind factors that stimulate cell proliferation. 

Figure 11.17 Oligosaccharides attached to erythropoietin. 
Erythropoietin has oligosaccharides linked to three asparagine residues 
and one serine residue. The structures shown are approximately to scale. 
See Figure 11.16 for the carbohydrate key. [Drawn from 1BUY.pdb.] 


The properties of proteoglycans are determined 
primarily by the glycosaminoglycan component. Many glycosaminoglycans 
are made of repeating units of disaccharides containing a derivative 
of an amino sugar, either glucosamine or galactosamine (Figure 11.18). 

Figure 11.18 Repeating units in glycosaminoglycans. Structural formulas for five repeating 
units of important glycosaminoglycans illustrate the variety of modifications and linkages that are 
possible. Amino groups are shown in blue and negatively charged groups in red. Hydrogen atoms 
have been omitted for clarity. The right-hand structure is a glucosamine derivative in each case. 


Figure 11.19 Hurler disease. Formerly 
called gargoylism, Hurler disease is a 
mucopolysaccharidosis having symptoms that 
include wide nostrils, a depressed nasal 
bridge, thick lips and earlobes, and irregular 
teeth. In Hurler disease, glycosaminoglycans 
cannot be degraded. The excess of these 
molecules is stored in the soft tissue of the 
facial regions, resulting in the characteristic 
facial features. [Courtesy National MPS 
Society, www.mpssociety.org.] 
At least one of the two sugars in the repeating unit has a negatively charged 
carboxylate or sulfate group. The major glycosaminoglycans in animals are 
chondroitin sulfate, keratan sulfate, heparin, heparan sulfate, dermatan 
sulfate, and hyaluronate. Mucopolysaccharidoses are a collection of diseases, 
such as Hurler disease, that result from the inability to degrade glycosamin- 
oglycans (Figure 11.19). Although precise clinical features vary with the 
disease, all mucopolysaccharidoses result in skeletal deformities and 
reduced life expectancies. 


Proteoglycans are important components of cartilage 


Among the best-characterized members of this diverse class is the proteo- 
glycan in the extracellular matrix of cartilage. The proteoglycan aggrecan 
and the protein collagen are key components of cartilage. The triple helix of 
collagen (p. 43) provides structure and tensile strength, whereas aggrecan 
serves as a shock absorber. The protein component of aggrecan is a large 
molecule composed of 2397 amino acids. The protein has three globular 
domains, and the site of glycosaminoglycan attachment is the extended 
region between globular domains 2 and 3. This linear region contains 
highly repetitive amino acid sequences, which are sites for the attachment of 
keratan sulfate and chondroitin sulfate. Many molecules of aggrecan are in 
turn noncovalently bound through the first globular domain to a very long 
filament formed by linking together molecules of the glycosaminoglycan 
hyaluronan (Figure 11.20). Water is bound to the glycosaminoglycans, 
attracted by the many negative charges. Aggrecan can cushion compressive 
forces because the absorbed water enables it to spring back after having been 
deformed. When pressure is exerted, as when the foot hits the ground while 
walking, water is squeezed from the glycosaminoglycan, cushioning the 
impact. When the pressure is released, the water rebinds. Osteoarthritis can 
result from the proteolytic degradation of aggrecan and collagen in the 
cartilage. 
Figure 11.20 Structure of proteoglycan from cartilage. (A) Electron micrograph of a 
proteoglycan from cartilage (with false color added). Proteoglycan monomers emerge laterally at 
regular intervals from opposite sides of a central filament of hyaluronan. (B) Schematic 
representation. G = globular domain. [(A) Courtesy of Dr. Lawrence Rosenberg. From J. A. 
Buckwalter and L. Rosenberg. Collagen Relat. Res. 3:489-504, 1983.] 


In addition to being a key component of structural tissues, glycosamino- 
glycans are common throughout the biosphere. Chitin is a glycosaminogly- 
can found in the exoskeleton of insects, crustaceans, and arachnids and is, 
next to cellulose, the second most abundant polysaccharide in nature 
(Figure 11.21). 


Mucins are glycoprotein components of mucus 


As stated earlier, another class of glycoproteins consists of the mucins (muco- 
proteins). In mucins, the protein component is extensively glycosylated to 
serine or threonine residues by N-acetylgalactosamine (see Figure 11.9). 
Mucins are capable of forming large polymeric structures and are common 
in mucous secretions. These glycoproteins are synthesized by specialized 
cells in the tracheobronchial, gastrointestinal, and genitourinary tracts. 
Because a key function of mucins is to act as a lubricant, mucins are abundant 
in saliva. 

A model of a mucin is shown in Figure 11.22A. The defining feature of 
the mucins is a region of the protein backbone termed the variable number of 
tandem repeats (VNTR) region, which is rich in serine and threonine 
residues that are O-glycosylated. Indeed, the carbohydrate moiety can 
account for as much as 80% of the molecule by weight. A number of core 
carbohydrate structures are conjugated to the protein component of mucin. 
Figure 11.22B shows one such structure. 

Mucins adhere to epithelial cells and act as a protective barrier; they also 
hydrate the underlying cells. In addition to protecting cells from environ- 
mental insults, such as stomach acid, inhaled chemicals in the lungs, and 
bacterial infections, mucins have roles in fertilization, the immune response, 
and cell adhesion. Mucins are overexpressed in bronchitis and cystic fibro- 
sis, and the overexpression of mucins is characteristic of adenocarcinomas— 
cancers of the glandular cells of epithelial origin. 


Protein glycosylation takes place in the lumen of the endoplasmic 
reticulum and in the Golgi complex 


The major pathway for protein glycosylation takes place inside the lumen of 
the endoplasmic reticulum (ER) and in the Golgi complex, organelles that play 
central roles in protein trafficking (Figure 11.23). The protein is synthesized 
by ribosomes attached to the cytoplasmic face of the ER membrane, and the 


Figure 11.23 Golgi complex and 
endoplasmic reticulum. The electron 
micrograph shows the Golgi complex and 
adjacent endoplasmic reticulum. The black dots 
on the cytoplasmic surface of the ER membrane 
are ribosomes. [Micrograph courtesy of Lynne 
Mercer.] 
Figure 11.21 Chitin, a glycosaminoglycan, 
is present in insect wings and the 
exoskeleton. Glycosaminoglycans are 
components of the exoskeletons of insects, 
crustaceans, and arachnids. [FLPA/Alamy.] 
Figure 11.22 Mucin structure. (A) A 
schematic representation of a mucoprotein. 
The VNTR region is highly glycosylated, forcing 
the molecule into an extended conformation. 
The Cys-rich domains and the D domain 
facilitate the polymerization of many such 
molecules. (B) An example of an 
oligosaccharide that is bound to the VNTR 
region of the protein. [After A. Varki et al. 
(Eds.), Essentials of Glycobiology, 2d ed. 
(Cold Spring Harbor Press, 2009), 
pp. 117, 118.] 

Isoprene 


Figure 11.24 Golgi complex as sorting 
center. The Golgi complex is the sorting 
center in the targeting of proteins to 
lysosomes, secretory vesicles, and the plasma 
membrane. The cis face of the Golgi complex 
receives vesicles from the endoplasmic 
reticulum, and the trans face sends a different 
set of vesicles to target sites. Vesicles also 
transfer proteins from one compartment of 
the Golgi complex to another. [Courtesy of Dr. 
Marilyn Farquhar.] 


peptide chain is inserted into the lumen of the ER (Section 30.6). The 
N-linked glycosylation begins in the ER and continues in the Golgi com- 
plex, whereas the O-linked glycosylation takes place exclusively in the 
Golgi complex. 

A large oligosaccharide destined for attachment to the asparagine residue 
of a protein is assembled on dolichol phosphate, a specialized lipid molecule 
located in the ER membrane and containing about 20 isoprene (C5) units. 


n=15-19 
Dolichol phosphate 


The terminal phosphate group of the dolichol phosphate is the site of 
attachment of the activated oligosaccharide, which is subsequently trans- 
ferred to a specific asparagine residue of the growing polypeptide chain. 
Both the activated sugars and the complex enzyme that is responsible for 
transferring the oligosaccharide to the protein are located on the lumenal 
side of the ER. Thus, proteins in the cytoplasm are not glycosylated by this 
pathway. 

Proteins in the lumen of the ER and in the ER membrane are trans- 
ported to the Golgi complex, which is a stack of flattened membranous sacs. 
Carbohydrate units of glycoproteins are altered and elaborated in the Golgi 
complex. The O-linked sugar units are fashioned there, and the N-linked 
sugars, arriving from the ER as a component of a glycoprotein, are modified 
in many different ways. The Golgi complex is the major sorting center of the 
cell. Proteins proceed from the Golgi complex to lysosomes, secretory gran- 
ules, or the plasma membrane, according to signals encoded within their 
amino acid sequences and three-dimensional structures (Figure 11.24). 
Specific enzymes are responsible for oligosaccharide 
assembly 


How are the complex carbohydrates formed, be they unconju- 
gated molecules such as glycogen or components of glycopro- 
teins? Complex carbohydrates are synthesized through the 
action of specific enzymes, glycosyltransferases, which catalyze 
the formation of glycosidic bonds. Given the diversity of 
known glycosidic linkages, many different enzymes are 
required. Indeed, glycosyltransferases account for 1% to 2% of 
gene products in all organisms examined. 

The general form of the reaction catalyzed by a glycosyl- 
transferase is shown in Figure 11.25. The sugar to be added 
comes in the form of an activated (energy-rich) sugar nucle- 
otide, such as UDP-glucose (UDP is the abbreviation for uri- 
dine diphosphate). The attachment of a nucleotide to enhance 
the energy content of a molecule is a common strategy in 
biosynthesis that we will see many times in our study of bio- 
chemistry. The acceptor substrates for glycosyltransferases are 
quite varied and include carbohydrates, serine, threonine and 
asparagine residues of proteins, lipids, and even nucleic acids. 


Blood groups are based on protein glycosylation patterns 


The human ABO blood groups illustrate the effects of 
glycosyltransferases on the formation of glycoproteins. 
Each blood group is designated by the presence of one of the 
three different carbohydrates, termed A, B, or O, attached to 
glycoproteins and glycolipids on the surfaces of red blood cells 
(Figure 11.26). These structures have in common an oligosac- 
charide foundation called the O (or sometimes H) antigen. 
The A and B antigens differ from the O antigen by the addition 
of one extra monosaccharide, either N-acetylgalactosamine 
(for A) or galactose (for B), through an α-1,3 linkage to a galactose 
moiety of the O antigen. 

Specific glycosyltransferases add the extra monosaccharide 
to the O antigen. Each person inherits the gene for one glycosyl- 
transferase of this type from each parent. The type A transferase 
specifically adds N-acetylgalactosamine, whereas the type B 
transferase adds galactose. These enzymes are identical in all but 
4 of 354 positions. The O phenotype is the result of a mutation 
that leads to premature termination of translation and, hence, to 
the production of neither of the required glycosyltransferases. 
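
The relation between inherited transferase genes and blood group can be sketched in a few lines of Python. This is a deliberately simplified model with hypothetical function names: it treats each person as a pair of alleles (A, B, or O), considers only the red-cell antigens described here, and ignores every other blood-group system.

```python
# Sugar that each allele's glycosyltransferase adds to the O (H) antigen.
# The O allele encodes no functional transferase, so it adds nothing.
ADDED_SUGAR = {
    "A": "N-acetylgalactosamine",
    "B": "galactose",
    "O": None,
}

def antigens(genotype):
    """Return the set of antigens (A and/or B) built on the O foundation."""
    return {allele for allele in genotype if ADDED_SUGAR[allele] is not None}

def blood_type(genotype):
    """Name the blood group for a pair of inherited alleles."""
    return "".join(sorted(antigens(genotype))) or "O"

def introduces_foreign_antigen(donor, recipient):
    """True if donor red cells carry an antigen the recipient lacks."""
    return bool(antigens(donor) - antigens(recipient))

for genotype in [("A", "O"), ("A", "B"), ("O", "O")]:
    print(genotype, "->", blood_type(genotype))
```

In this model a person with one A allele and one O allele is type A, because a single functional transferase suffices to add N-acetylgalactosamine, and red cells from a type O donor introduce no antigen that is new to any recipient.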

These structures have important implications for blood 
transfusions and other transplantation procedures. If an anti- 
gen not normally present in a person is introduced, the per- 
son’s immune system recognizes it as foreign. Red-blood-cell 
lysis occurs rapidly, leading to a severe drop in blood pressure 
(hypotension), shock, kidney failure, and death from circula- 
tory collapse. 
Figure 11.25 General form of a glycosyltransferase 
reaction. The sugar to be added comes from a sugar 
nucleotide—in this case, UDP-glucose. The acceptor, designated 
X in this illustration, can be one of a variety of biomolecules, 
including other carbohydrates or proteins. 
Figure 11.26 Structures of A, B, and O oligosaccharide 
antigens. The carbohydrate structures shown are depicted 
symbolically by employing a scheme (see the key in Figure 11.16) 
that is becoming widely used. 


Why are different blood types present in the human population? 
Suppose that a pathogenic organism such as a parasite expresses on its 
cell surface a carbohydrate antigen similar to one of the blood-group anti- 
gens. This antigen may not be readily detected as foreign in a person whose 
blood type matches the parasite antigen, and the parasite will flourish. 
However, other people with different blood types will be protected. Hence, 
there will be selective pressure on human beings to vary blood type to 
prevent parasitic mimicry and a corresponding selective pressure on 
parasites to enhance mimicry. The constant “arms race” between pathogenic 
microorganisms and human beings drives the evolution of diversity of 
surface antigens within the human population. 

Figure 11.27 Formation of a mannose 6-phosphate marker. A glycoprotein 
destined for delivery to lysosomes acquires a phosphate marker in the 
Golgi compartment in a two-step process. First, GlcNAc phosphotransferase 
adds a phospho-N-acetylglucosamine unit to the 6-OH group of a mannose, 
and then an N-acetylglucosaminidase removes the added sugar to generate 
a mannose 6-phosphate residue in the core oligosaccharide. 


Errors in glycosylation can result in pathological conditions 


Although the role of carbohydrate attachment to proteins is not 
known in detail in most cases, data indicate that this glycosylation is 
important for the processing and stability of these proteins, as it is for EPO. 
For instance, certain types of muscular dystrophy can be traced to improper 
glycosylation of membrane proteins. Indeed, an entire family of severe 
inherited human diseases called congenital disorders of glycosylation has been 
identified. These pathological conditions reveal the importance of proper 
modification of proteins by carbohydrates and their derivatives. 

An especially clear example of the role of glycosylation is provided by 
I-cell disease (also called mucolipidosis II), a lysosomal storage disease. 
Normally, a carbohydrate marker directs certain digestive enzymes from the 
Golgi complex to lysosomes where they normally function. Lysosomes are 
organelles that degrade and recycle damaged cellular components or 
material brought into the cell by endocytosis. In patients with I-cell disease, 
lysosomes contain large inclusions of undigested glycosaminoglycans and 
glycolipids, hence the “I” in the name of the disease. These inclusions are 
present because the enzymes normally responsible for the degradation of 
glycosaminoglycans are missing from affected lysosomes. Remarkably, the 
enzymes are present at very high levels in the blood and urine. Thus, active 
enzymes are synthesized, but, in the absence of appropriate glycosylation, 
they are exported instead of being sequestered in lysosomes. In other words, 
in I-cell disease, a whole series of enzymes are incorrectly addressed and 
delivered to the wrong location. Normally, these enzymes contain a mannose 
6-phosphate residue, a component of an N-oligosaccharide attached to 
proteins bound for the lysosome. In I-cell disease, however, the attached 
mannose lacks a phosphate (Figure 11.27). Mannose 6-phosphate is in fact 
the marker that normally directs many hydrolytic enzymes from the Golgi 
complex to lysosomes. I-cell patients are deficient in the N-acetylglucosamine 
phosphotransferase catalyzing the first step in the addition of the phospho- 
ryl group; the consequence is the mistargeting of eight essential enzymes. 
I-cell disease causes the patient to suffer severe psychomotor retardation 
and skeletal deformities, similar to those in Hurler disease. 


Oligosaccharides can be “sequenced” 


How is it possible to determine the structure of a glycoprotein—the 
oligosaccharide structures and their points of attachment? Most ap- 
proaches make use of enzymes that cleave oligosaccharides at specific types 
of linkages. 

The first step is to detach the oligosaccharide from the protein. For 
example, N-linked oligosaccharides can be released from proteins by an 
enzyme such as peptide N-glycosidase F, which cleaves the N-glycosidic 
bonds linking the oligosaccharide to the protein. The oligosaccharides can 
then be isolated and analyzed. MALDI-TOF or other mass spectrometric 
techniques (Section 3.4) provide the mass of an oligosaccharide fragment. 
However, many possible oligosaccharide structures are consistent with 
a given mass. More-complete information can be obtained by cleaving the 
oligosaccharide with enzymes of varying specificities. For example, 
β-1,4-galactosidase cleaves β-glycosidic bonds exclusively at galactose residues. 
Figure 11.28 Mass spectrometric “sequencing” of oligosaccharides. Carbohydrate-cleaving 
enzymes were used to release and specifically cleave the oligosaccharide component of the 
glycoprotein fetuin from bovine serum. Parts A and B show the masses obtained with MALDI-TOF 
spectrometry as well as the corresponding structures of the oligosaccharide-digestion products 
(using the same scheme as that in Figure 11.16): (A) digestion with peptide N-glycosidase F 
(to release the oligosaccharide from the protein) and neuraminidase; (B) digestion with peptide 
N-glycosidase F, neuraminidase, and β-1,4-galactosidase. Knowledge of the enzyme specificities 
and the masses of the products permits the characterization of the oligosaccharide. See Figure 11.16 
for the carbohydrate key. [After A. Varki, R. D. Cummings, J. D. Esko, H. H. Freeze, G. W. Hart, and 
J. Marth (Eds.), Essentials of Glycobiology (Cold Spring Harbor Laboratory Press, 1999), p. 596.] 


The products can again be analyzed by mass spectrometry (Figure 11.28). 
The repetition of this process with the use of an array of enzymes of differ- 
ent specificity will eventually reveal the structure of the oligosaccharide. 
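
The reasoning behind this approach can be made concrete with a short Python sketch. It is illustrative only: it uses standard monoisotopic residue masses for neutral, underivatized sugars, ignores the ion adducts seen in real spectra, and all function names are hypothetical. The first function shows why a mass alone is ambiguous (it fixes only the composition, and galactose and mannose are both hexoses), and the last shows how a mass shift after treatment with a galactose-specific enzyme counts the residues that were removed.

```python
from itertools import product

# Monoisotopic masses (Da) of sugar residues, that is, the sugar minus water.
RESIDUE_MASS = {
    "Hex": 162.0528,     # hexose: galactose, mannose, or glucose
    "HexNAc": 203.0794,  # N-acetylhexosamine: GlcNAc or GalNAc
    "Fuc": 146.0579,     # fucose (a deoxyhexose)
    "NeuAc": 291.0954,   # N-acetylneuraminic (sialic) acid
}
WATER = 18.0106  # one water accounts for the free ends of the chain

def glycan_mass(composition):
    """Mass of a free oligosaccharide with the given residue counts."""
    return WATER + sum(RESIDUE_MASS[name] * n for name, n in composition.items())

def compositions_matching(target, tolerance=0.05, max_count=8):
    """List every residue composition whose mass lies within the tolerance."""
    names = list(RESIDUE_MASS)
    hits = []
    for counts in product(range(max_count + 1), repeat=len(names)):
        candidate = dict(zip(names, counts))
        if abs(glycan_mass(candidate) - target) <= tolerance:
            hits.append({name: n for name, n in candidate.items() if n})
    return hits

def hexoses_removed(mass_before, mass_after):
    """Number of hexose residues lost between two measurements."""
    return round((mass_before - mass_after) / RESIDUE_MASS["Hex"])

before = glycan_mass({"Hex": 5, "HexNAc": 4})  # hypothetical example glycan
after = glycan_mass({"Hex": 3, "HexNAc": 4})   # same glycan minus two galactoses
print(compositions_matching(before))
print(hexoses_removed(before, after))
```

A composition such as five hexoses and four N-acetylhexosamines says nothing about which hexoses are galactose or how the residues are linked. That information comes from the enzyme specificities, which is why successive digestions are needed.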

Proteases applied to glycoproteins can reveal the points of oligosaccha- 
ride attachment. Cleavage by a specific protease yields a characteristic pat- 
tern of peptide fragments that can be analyzed chromatographically. 
Fragments attached to oligosaccharides can be picked out because their 
chromatographic properties will change on glycosidase treatment. Mass 
spectrometric analysis or direct peptide sequencing can reveal the identity 
of the peptide in question and, with additional effort, the exact site of oligo- 
saccharide attachment. 

Now that the sequencing of the human genome is complete, the charac- 
terization of the much more complex proteome, including the biological 
roles of specifically modified proteins, can begin in earnest. 


11.4 Lectins Are Specific Carbohydrate-Binding Proteins 


The diversity and complexity of the carbohydrate units and the variety of 
ways in which they can be joined in oligosaccharides and polysaccharides 
suggest that they are functionally important. Nature does not construct 
complex patterns when simple ones suffice. Why all this intricacy and 
diversity? It is now clear that these carbohydrate structures are the 
recognition sites for a special class of proteins. Such proteins, termed 
glycan-binding proteins, bind specific carbohydrate structures on 
neighboring cell surfaces. 
Originally discovered in plants, glycan-binding proteins are ubiquitous, 
and no living organisms have been found that lack these key proteins. 
We will focus on a particular class of glycan-binding proteins termed lectins 
(from Latin legere, “to select”). The interaction of lectins with their 
carbohydrate partners is another example of carbohydrates being 
information-rich molecules that guide many biological processes. The diverse 
carbohydrate structures displayed on cell surfaces are well suited to serving 
as sites of interaction between cells and their environments. Interestingly, 
the partners for lectin binding are often the carbohydrate moiety of 
glycoproteins. 

Figure 11.29 Structure of a C-type carbohydrate-binding domain of an 
animal lectin. Notice that a calcium ion links a mannose residue to the 
lectin. Selected interactions are shown, with some hydrogen atoms omitted 
for clarity. [Drawn from 2MSC.pdb.] 


Lectins promote interactions between cells 


Cell-cell contact is a vital interaction in a host of biochemical functions, 
ranging from building a tissue from isolated cells to facilitating the trans- 
mission of information. The chief function of lectins, carbohydrate-binding 
proteins, is to facilitate cell-cell contact. A lectin usually contains two or 
more binding sites for carbohydrate units. These carbohydrate-binding 
sites on the surface of one cell interact with arrays of carbohydrates dis- 
played on the surface of another cell. Lectins and carbohydrates are linked 
by a number of weak noncovalent interactions that ensure specificity yet 
permit unlinking as needed. The weak interactions between one cell surface 
and another resemble the action of Velcro; each interaction is weak, but the 
composite is strong. 

We have already met a lectin obliquely. Recall that, in I-cell disease, 
lysosomal enzymes lack the appropriate mannose 6-phosphate, a molecule 
that directs the enzymes to the lysosome. Under normal circumstances, the 
mannose 6-phosphate receptor, a lectin, binds the enzymes in the Golgi appa- 
ratus and directs them to the lysosome. 


Lectins are organized into different classes 


Lectins can be divided into classes on the basis of their amino acid sequences 
and biochemical properties. One large class is the C type (for calcium- 
requiring) found in animals. These proteins each have a homologous do- 
main of 120 amino acids that is responsible for carbohydrate binding. The 
structure of one such domain bound to a carbohydrate target is shown in 
Figure 11.29. 

A calcium ion on the protein acts as a bridge between the protein and 
the sugar through direct interactions with sugar OH groups. In addition, 
two glutamate residues in the protein bind to both the calcium ion and the 
sugar, and other protein side chains form hydrogen bonds with other OH 
groups on the carbohydrate. The carbohydrate-binding specificity of a 
particular lectin is determined by the amino acid residues that bind the 
carbohydrate. 

Figure 11.30 Selectins mediate cell-cell 
interactions. The scanning electron 
micrograph shows lymphocytes adhering to 
the endothelial lining of a lymph node. The L 
selectins on the lymphocyte surface bind 
specifically to carbohydrates on the lining of 
the lymph-node vessels. [Courtesy of Dr. 
Eugene Butcher.] 

Proteins termed selectins are members of the C-type family. Selectins 
bind immune-system cells to sites of injury in the inflammatory response 
(Figure 11.30). The L, E, and P forms of selectins bind specifically to 
carbohydrates on lymph-node vessels, endothelium, or activated blood 
platelets, respectively. New therapeutic agents that control inflammation 
may emerge from a deeper understanding of how selectins bind and distin- 
guish different carbohydrates. L-Selectin, originally thought to participate 
only in the immune response, is produced by embryos when they are ready 
to attach to the endometrium of the mother’s uterus. For a short period 
of time, the endometrial cells present an oligosaccharide on the cell sur- 
face. When the embryo attaches through lectins, the attachment activates 
signal pathways in the endometrium to make implantation of the embryo 
possible. 

Another large class of lectins comprises the L-lectins. These lectins are 
especially rich in the seeds of leguminous plants, and many of the initial 
biochemical characterizations of lectins were performed on this readily 
available lectin. Although the exact role of lectins in plants is unclear, they 
can serve as potent insecticides. Other L-type lectins, such as calnexin and 
calreticulin, are prominent chaperones in the eukaryotic endoplasmic retic- 
ulum. Recall that chaperones are proteins that facilitate the folding of other 
proteins. 


Influenza virus binds to sialic acid residues 


Many pathogens gain entry into specific host cells by adhering to 
cell-surface carbohydrates. For example, influenza virus recognizes 
sialic acid residues linked to galactose residues that are present on cell-surface 
glycoproteins. The viral protein that binds to these sugars is called hemag- 
glutinin (Figure 11.31). 

After binding hemagglutinin, the virus is engulfed by the cell and begins 
to replicate. To exit the cell, the new virions must bind to hemagglutinin in 
what is essentially the reverse of viral entry. Another viral protein, neuramin- 
idase (sialidase), cleaves the glycosidic bonds to the sialic acid residues of 
hemagglutinin, freeing the virus to infect new cells, spreading the infection 
throughout the respiratory tract. Inhibitors of this enzyme such as oseltami- 
vir (Tamiflu) and zanamivir (Relenza) are important anti-influenza agents. 

Viral hemagglutinin’s carbohydrate-binding specificity may play an 
important role in species specificity of infection and ease of transmission. 
For instance, avian influenza H5N1 (bird flu) is especially lethal and is 
readily spread from bird to bird. Although human beings can be infected by 
this virus, infection is rare and human-to-human transmission is rarer still. 
The biochemical basis of these characteristics is that the avian-virus 
hemagglutinin recognizes a different carbohydrate sequence from that recognized 
in human influenza. Although human beings have the sequence to which 
the avian virus binds, it is located deep in the lungs. Infection by the avian 
virus is thus difficult, and, when it does occur, the avian virus is not readily 
transmitted by sneezing or coughing. 

Figure 11.31 Viral receptors. Influenza 
virus targets cells by binding to sialic acid 
residues (purple diamonds) located at the 
termini of oligosaccharides present on 
cell-surface glycoproteins and glycolipids. These 
carbohydrates are bound by hemagglutinin 
(interaction circles), one of the major proteins 
expressed on the surface of the virus. The 
other major viral-surface protein, 
neuraminidase, is an enzyme that cleaves 
oligosaccharide chains to release the viral 
particle at a later stage of the viral life cycle. 

Plasmodium falciparum, the parasitic protozoan that causes malaria, also 
relies on glycan binding to infect and colonize its host. Glycan-binding 
proteins of the parasitic form initially injected by the mosquito bind to the 
glycosaminoglycan heparan sulfate on liver cells, initiating the parasite’s 
entry into the cell. On exiting from the liver later in its life cycle, the para- 
site invades red blood cells by using another glycan-binding protein to 
bind to the carbohydrate moiety of glycophorin, a prominent membrane 
glycoprotein in red blood cells. Developing means to disrupt the carbo- 
hydrate interactions between pathogens and host cells may prove to be 
clinically useful. 


Summary 


11.1 Monosaccharides Are the Simplest Carbohydrates 
Carbohydrates are aldoses or ketoses that are rich in hydroxyl groups. 
An aldose is a carbohydrate with an aldehyde group (as in glyceralde- 
hyde and glucose), whereas a ketose contains a keto group (as in dihy- 
droxyacetone and fructose). A sugar belongs to the D series if the 
absolute configuration of its asymmetric carbon atom farthest from 
the aldehyde or keto group is the same as that of D-glyceraldehyde. 
Most naturally occurring sugars belong to the D series. The C-1 aldehyde
in the open-chain form of glucose reacts with the C-5 hydroxyl group to
form a six-membered pyranose ring. The C-2 keto group in the open-chain
form of fructose reacts with the C-5 hydroxyl group to form a
five-membered furanose ring. Pentoses such as ribose and deoxyribose also
form furanose rings. An additional asymmetric center is formed at the
anomeric carbon atom (C-1 in aldoses and C-2 in ketoses) in these
cyclizations. The hydroxyl group attached to the anomeric carbon atom is on
the opposite side of the ring from the CH2OH group attached to the chiral
center in the α anomer, whereas it is on the same side of the ring as the
CH2OH group in the β anomer. Not all atoms in the ring lie in the same
plane. Rather, pyranose rings usually adopt the chair conformation, and
furanose rings usually adopt the envelope conformation. Sugars are joined to
alcohols and amines by glycosidic bonds from the anomeric carbon atom. For
example, N-glycosidic bonds link sugars to purines and pyrimidines in
nucleotides, RNA, and DNA.


11.2 Monosaccharides Are Linked to Form Complex Carbohydrates 

Sugars are linked to one another in disaccharides and polysaccharides 
by O-glycosidic bonds. Sucrose, lactose, and maltose are the common 
disaccharides. Sucrose (common table sugar) consists of α-glucose and
β-fructose joined by a glycosidic linkage between their anomeric carbon
atoms. Lactose (in milk) consists of galactose joined to glucose by a
β-1,4 linkage. Maltose (in starch) consists of two glucoses joined by an
α-1,4 linkage. Starch is a polymeric form of glucose in plants, and
glycogen serves a similar role in animals. Most of the glucose units in
starch and glycogen are in α-1,4 linkage. Cellulose, the major structural
polymer of plant cell walls, consists of glucose units joined by β-1,4
linkages. These β linkages give rise to long straight chains that form
fibrils with high tensile strength. In contrast, the α linkages in starch
and glycogen lead to open helices, in keeping with their roles as
mobilizable energy stores. 


11.3 Carbohydrates Can Be Linked to Proteins to Form Glycoproteins 
Carbohydrates are commonly conjugated to proteins. If the protein 
component is predominant, the conjugate of protein and carbohydrate 
is called a glycoprotein. Most secreted proteins are glycoproteins. The 
signal molecule erythropoietin is a glycoprotein. Glycoproteins are 
also prominent on the external surface of the plasma membrane. 
Proteins bearing covalently linked glycosaminoglycans are proteogly- 
cans. Glycosaminoglycans are polymers of repeating disaccharides. 
One of the units in each repeat is a derivative of glucosamine or galac- 
tosamine. These highly anionic carbohydrates have a high density of 
carboxylate or sulfate groups. Proteoglycans are found in the extracel- 
lular matrices of animals and are key components of cartilage. 
Mucoproteins, like proteoglycans, are predominantly carbohydrate by 
weight. The protein component is heavily O-glycosylated with 
N-acetylgalactosamine joining the oligosaccharide to the protein. 
Mucoproteins serve as lubricants. 

Specific enzymes link the oligosaccharide units on proteins either
to the side-chain oxygen atom of a serine or threonine residue or to the
side-chain amide nitrogen atom of an asparagine residue. Protein
glycosylation takes place in the lumen of the endoplasmic reticulum and in
the Golgi complex. The N-linked oligosaccharides are synthesized on dolichol
phosphate and subsequently transferred to the protein acceptor. Additional
sugars are attached in the Golgi complex to form diverse patterns.


11.4 Lectins Are Specific Carbohydrate-Binding Proteins
Carbohydrates on cell surfaces are recognized by proteins called lectins.
In animals, the interplay of lectins and their sugar targets guides
cell-cell contact. The viral protein hemagglutinin on the surface of the
influenza virus recognizes sialic acid residues on the surfaces of cells
invaded by the virus. A small number of carbohydrate residues can be
joined in many different ways to form highly diverse patterns that can
be distinguished by the lectins.


Problems


1. Word origin. Account for the origin of the term carbohydrate.


2. Diversity. How many different oligosaccharides can be
made by linking one glucose, one mannose, and one galactose?
Assume that each sugar is in its pyranose form.
Compare this number with the number of tripeptides that
can be made from three different amino acids.


3. Couples. Indicate whether each of the following pairs of
sugars consists of anomers, epimers, or an aldose-ketose
pair:

(a) D-glyceraldehyde and dihydroxyacetone
(b) D-glucose and D-mannose
(c) D-glucose and D-fructose
(d) α-D-glucose and β-D-glucose
(e) D-ribose and D-ribulose
(f) D-galactose and D-glucose


4. Carbons and carbonyls. To which classes of sugars do the
monosaccharides shown here belong?

[Structures: D-erythrose, D-erythrulose, D-ribose, D-glyceraldehyde,
dihydroxyacetone]


5. Chemical cousins. Although an aldose with 4 asymmetric
carbon atoms is capable of forming 16 diastereoisomers,
only 8 of the isomers are commonly observed, including
glucose. They are listed below with their structural relation
to glucose. Using the structure of glucose as a reference,
draw the structures.

D-Allose: Epimeric at C-3
D-Altrose: Isomeric at C-2 and C-3
D-Mannose: Epimeric at C-2
D-Gulose: Isomeric at C-3 and C-4
D-Idose: Isomeric at C-2, C-3, and C-4
D-Galactose: Epimeric at C-4
D-Talose: Isomeric at C-2 and C-4

      CHO
    H-C-OH
   HO-C-H
    H-C-OH
    H-C-OH
      CH2OH
    D-Glucose


6. Mutarotation. The specific rotations of the α and β
anomers of D-glucose are +112 degrees and +18.7 degrees,
respectively. Specific rotation, [α]D, is defined as the
observed rotation of light of wavelength 589 nm (the D line
of a sodium lamp) passing through 10 cm of a 1 g ml⁻¹ solution
of a sample. When a crystalline sample of α-D-glucopyranose
is dissolved in water, the specific rotation decreases from 112
degrees to an equilibrium value of 52.7 degrees. On the basis
of this result, what are the proportions of the α and β anomers
at equilibrium? Assume that the concentration of the open-chain
form is negligible.


7. Telltale marker. Glucose reacts slowly with hemoglobin 
and other proteins to form covalent compounds. Why is 
glucose reactive? What is the nature of the adduct 
formed? 


8. Periodate cleavage. Compounds containing hydroxyl 
groups on adjacent carbon atoms undergo carbon-carbon 
bond cleavage when treated with periodate ion (IO4−). 
How can this reaction be used to distinguish between pyra- 
nosides and furanosides? 


9. Oxygen source. Does the oxygen atom attached to C-1 
in methyl α-D-glucopyranoside come from glucose or 
methanol? 


10. Sugar lineup. Identify the following four sugars. 


[Structures of the four sugars]


11. Cellular glue. A trisaccharide unit of a cell-surface gly- 
coprotein is postulated to play a critical role in mediating 
cell-cell adhesion in a particular tissue. Design a simple 
experiment to test this hypothesis. 


12. Mapping the molecule. Each of the hydroxyl groups of 
glucose can be methylated with reagents such as dimethyl- 
sulfate under basic conditions. Explain how exhaustive methy- 
lation followed by the complete digestion of a known amount 
of glycogen would enable you to determine the number of 
branch points and reducing ends. 


13. Component parts. Raffinose is a trisaccharide and a 
minor constituent in sugar beets. 


(a) Is raffinose a reducing sugar? Explain. 
(b) What are the monosaccharides that compose raffinose? 


(c) β-Galactosidase is an enzyme that will remove galactose
residues from an oligosaccharide. What are the products of
β-galactosidase treatment of raffinose?

[Structure of raffinose]


14. Anomeric differences. α-D-Mannose is a sweet-tasting sugar.
β-D-Mannose, on the other hand, tastes bitter. A pure solution
of α-D-mannose loses its sweet taste with time as it is converted
into the β anomer. Draw the β anomer and explain how it is formed
from the α anomer.

[Structure of α-D-mannose]


15. A taste of honey. Fructose in its β-D-pyranose form
accounts for the powerful sweetness of honey. The β-D-furanose
form, although sweet, is not as sweet as the pyranose form.
The furanose form is the more stable form. Draw the two forms
and explain why it may not always be wise to cook with honey.


16. Making ends meet. (a) Compare the number of reducing 
ends to nonreducing ends in a molecule of glycogen. (b) As 
we will see in Chapter 21, glycogen is an important 
fuel-storage form that is rapidly mobilized. At which end— 
the reducing or nonreducing—would you expect most 
metabolism to take place? 


17. A lost property. Glucose and fructose are reducing sug- 
ars. Sucrose, or table sugar, is a disaccharide consisting of 
both fructose and glucose. Is sucrose a reducing sugar? 
Explain. 


18. Meat and potatoes. Compare the structures of glycogen 
and starch. 


19. Straight or with a twist? Account for the different struc- 
tures of glycogen and cellulose. 


20. Sweet proteins. List the key classes of glycoprotein, 
their defining characteristics, and their biological func- 
tions. 


21. Life extender. What is the function of the carbohydrate 
moiety that is attached to EPO? 


22. Cushioning. What is the role of the glycosaminoglycan 
in the cushioning provided by cartilage? 


23. Undelivered mail. Not returned to sender. I-cell disease 
results when proteins normally destined to the lysosomes 
lack the appropriate carbohydrate-addressing molecule 
(p. 337). Suggest another possible means by which I-cell 
disease might arise. 


24. From one, many. What is meant by a glycoform? 
25. Ome. What is meant by the glycome? 


26. Exponential expansion? Compare the amount of infor- 
mation inherent in the genome, the proteome, and the 
glycome. 


27. Locks and keys. What does the fact that all organisms 
contain lectins suggest about the role of carbohydrates? 


28. Carbohydrates—not just for breakfast anymore. 
Differentiate between a glycoprotein and a lectin. 


29. Carbohydrates and proteomics. Suppose that a protein 
contains six potential N-linked glycosylation sites. How 
many possible proteins can be generated, depending on 
which of these sites is actually glycosylated? Do not include 
the effects of diversity within the carbohydrate added. 


Chapter Integration Problems 


30. Stereospecificity. Sucrose, a major product of photosynthesis
in green leaves, is synthesized by a battery of
enzymes. The substrates for sucrose synthesis, D-glucose
and D-fructose, are a mixture of α and β anomers as well as
acyclic compounds in solution. Nonetheless, sucrose consists
of α-D-glucose linked by its carbon-1 atom to the carbon-2
atom of β-D-fructose. How can the specificity of sucrose be
explained in light of the potential substrates?


31. Specific recognition. How might the technique of affin- 
ity chromatography be used to purify lectins? 


Data Interpretation Problem 


32. Sore joints. A contributing factor to the development 
of arthritis is the inappropriate proteolytic destruction of 
the aggrecan component of cartilage by the proteolytic 
enzyme aggrecanase. The immune-system signal molecule 
interleukin 2 (IL-2) activates aggrecanase; in fact, IL-2 
blockers are sometimes used to treat arthritis. Studies were 
undertaken to determine whether inhibitors of aggrecanase 
can counteract the effects of IL-2. Pieces of cartilage were 
incubated in media with various additions and the amount 
of aggrecan destruction was measured as a function of 
time. 

[After M. A. Pratta et al. J. Biol. Chem.
278:45539-45545, 2003, Fig. 7B.]


(a) Aggrecan degradation was measured by the release of
glycosaminoglycan. What is the rationale for this assay?


(b) Why might glycosaminoglycan release not indicate
aggrecan degradation?

(c) What is the purpose of the control—cartilage incubated 
with no additions? 

(d) What is the effect of adding IL-2 to the system? 

(e) What is the response when an aggrecanase inhibitor 
(ST154) is added in addition to IL-2? 


(f) Why is there some aggrecan destruction in the control 
with the passage of time? 


Lipids and Cell Membranes 


CHAPTER 12 


The boundaries of all cells are defined by biological membranes
(Figure 12.1). These barriers prevent molecules generated inside the cell
from leaking out and unwanted molecules from diffusing in; yet they also 
contain transport systems that allow the cell to take up specific molecules 
and remove unwanted ones. Such transport systems confer on membranes 
the important property of selective permeability. 

Membranes are dynamic structures in which proteins float in a sea of 
lipids. The lipid components of the membrane form the barrier to permea- 
bility, and protein components act as a transport system of pumps and 
channels that allow selected molecules into and out of the cell. This trans- 
port system will be considered in the next chapter. 

In addition to an external cell membrane (called the plasma membrane), 
eukaryotic cells also contain internal membranes that form the boundaries 
of organelles such as mitochondria, chloroplasts, peroxisomes, and lyso- 
somes. Functional specialization in the course of evolution has been closely 
linked to the formation of such compartments. Specific systems have 
evolved to allow the targeting of selected proteins into or through particular 
internal membranes and, hence, into specific organelles. External and inter- 
nal membranes share essential properties; these features are the subject of 
this chapter. 

Biological membranes serve several additional functions indispensable
for life, such as energy storage and information transduction, that are
dictated by the proteins associated with them. In this chapter, we will
examine the properties of membrane proteins that enable them to exist in the
hydrophobic environment of the membrane while connecting two hydrophilic
environments, and defer a discussion of the functions of these proteins
until later chapters.

An HIV particle exits an infected cell by membrane budding.
Cellular membranes are highly dynamic structures that
spontaneously self-assemble. Driven by hydrophobic
interactions, as shown in the diagram at right, the fatty acid tails
of membrane lipids pack together (green), while the polar
heads (red) remain exposed on the surfaces. [Micrographs
from Eye of Science/Photo Researchers.]

Figure 12.1 Electron micrograph of a plasma cell. This image has been
colored to indicate the distinct boundary of the cell, formed by its plasma
membrane. [Steve Gschmeissner/Photo Researchers.]


Many Common Features Underlie the Diversity of 
Biological Membranes 


Membranes are as diverse in structure as they are in function. However, 
they do have in common a number of important attributes: 


1. Membranes are sheetlike structures, only two molecules thick, that form 
closed boundaries between different compartments. The thickness of most 
membranes is between 60 Å (6 nm) and 100 Å (10 nm). 


2. Membranes consist mainly of lipids and proteins. The mass ratio of lipids 
to proteins ranges from 1:4 to 4:1. Membranes also contain carbohydrates 
that are linked to lipids and proteins. 


3. Membrane lipids are small molecules that have both hydrophilic and 
hydrophobic moieties. These lipids spontaneously form closed bimolecular 
sheets in aqueous media. These lipid bilayers are barriers to the flow of polar 
molecules. 


4. Specific proteins mediate distinctive functions of membranes. Proteins serve 
as pumps, channels, receptors, energy transducers, and enzymes. Membrane 
proteins are embedded in lipid bilayers, which create suitable environments 
for their action. 


5. Membranes are noncovalent assemblies. The constituent protein and lipid 
molecules are held together by many noncovalent interactions, which act 
cooperatively. 


6. Membranes are asymmetric. The two faces of biological membranes 
always differ from each other. 


7. Membranes are fluid structures. Lipid molecules diffuse rapidly in the 
plane of the membrane, as do proteins, unless they are anchored by specific 
interactions. In contrast, lipid molecules and proteins do not readily rotate 
across the membrane. Membranes can be regarded as two-dimensional 
solutions of oriented proteins and lipids. 


8. Most cell membranes are electrically polarized, such that the inside is 
negative [typically −60 millivolts (mV)]. Membrane potential plays a key 
role in transport, energy conversion, and excitability (Chapter 13). 


12.1 Fatty Acids Are Key Constituents of Lipids 


The hydrophobic properties of lipids are essential to their ability to form 
membranes. Most lipids owe their hydrophobic properties to one compo- 
nent, their fatty acids. 


Fatty acid names are based on their parent hydrocarbons 


Fatty acids are long hydrocarbon chains of various lengths and degrees of
unsaturation terminated with carboxylic acid groups. The systematic name
for a fatty acid is derived from the name of its parent hydrocarbon by the
substitution of oic for the final e. For example, the C18 saturated fatty acid
is called octadecanoic acid because the parent hydrocarbon is octadecane. A
C18 fatty acid with one double bond is called octadecenoic acid; with two
double bonds, octadecadienoic acid; and with three double bonds,
octadecatrienoic acid. The notation 18:0 denotes a C18 fatty acid with no
double bonds, whereas 18:2 signifies that there are two double bonds. The
structures of the ionized forms of two common fatty acids, palmitic acid
(16:0) and oleic acid (18:1), are shown in Figure 12.2.
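
The naming rule just described can be sketched in a few lines of Python. This is only an illustration: the function names and the stem table are assumptions introduced here, and the table covers just the chain lengths that appear in Table 12.1.

```python
# Illustrative sketch only: the names below are hypothetical, and the stem
# table covers just the chain lengths listed in Table 12.1.
ALKANE_STEMS = {
    12: "dodeca", 14: "tetradeca", 16: "hexadeca", 18: "octadeca",
    20: "eicosa", 22: "docosa", 24: "tetracosa",
}
MULTIPLIERS = {2: "di", 3: "tri", 4: "tetra"}


def systematic_name(notation: str) -> str:
    """Convert shorthand such as '18:2' into the systematic acid name."""
    carbons, double_bonds = (int(part) for part in notation.split(":"))
    stem = ALKANE_STEMS[carbons]
    if double_bonds == 0:
        # Parent alkane: octadecane -> octadecanoic acid.
        return stem + "noic acid"
    if double_bonds == 1:
        # Parent alkene: the stem's final "a" is dropped before "enoic".
        return stem[:-1] + "enoic acid"
    # Two or more double bonds: insert di, tri, or tetra before "enoic".
    return stem + MULTIPLIERS[double_bonds] + "enoic acid"


def saturated_carboxylate(carbons: int) -> str:
    """Condensed formula of an ionized, unbranched saturated fatty acid."""
    return f"CH3(CH2){carbons - 2}COO-"


for shorthand in ("18:0", "18:1", "18:2", "18:3"):
    print(shorthand, systematic_name(shorthand))
print("16:0", saturated_carboxylate(16))  # palmitate
```

The shorthand alone does not say where the double bonds lie or whether they are cis or trans, so the sketch returns only the unlocated name.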

Fatty acid carbon atoms are numbered starting at the carboxyl terminus,
as shown in the margin. Carbon atoms 2 and 3 are often referred to as α and
β, respectively. The methyl carbon atom at the distal end of the chain is
called the ω-carbon atom. The position of a double bond is represented by the
symbol Δ followed by a superscript number. For example, cis-Δ⁹ means that
there is a cis double bond between carbon atoms 9 and 10; trans-Δ² means
that there is a trans double bond between carbon atoms 2 and 3. Alternatively,
the position of a double bond can be denoted by counting from the distal
end, with the ω-carbon atom (the methyl carbon) as number 1. An ω-3 fatty
acid, for example, has the structure shown in the margin. Fatty acids are
ionized at physiological pH, and so it is appropriate to refer to them
according to their carboxylate form: for example, palmitate or hexadecanoate.
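
The two numbering schemes are related by simple arithmetic, which the following Python sketch makes explicit. The function name is hypothetical and introduced only for illustration. In a chain of n carbons, a double bond that starts at carbon k counted from the carboxyl end starts at carbon n − k counted from the ω end, so the double bond farthest from the carboxyl group determines the ω class.

```python
from typing import Sequence


def omega_class(carbons: int, delta_positions: Sequence[int]) -> int:
    """Return n in 'omega-n' given chain length and delta positions.

    A delta-k double bond joins carbons k and k + 1 counted from the
    carboxyl end. Counted from the methyl (omega) end, those same carbons
    are numbers carbons - k + 1 and carbons - k, so the bond begins at
    carbons - k. The omega class refers to the double bond nearest the
    methyl end, that is, the largest delta position.
    """
    if not delta_positions:
        raise ValueError("a saturated fatty acid has no omega class")
    last = max(delta_positions)
    if not 2 <= last <= carbons - 1:
        raise ValueError("delta position lies outside the carbon chain")
    return carbons - last


# Delta positions as conventionally given for these fatty acids.
print(omega_class(18, [9, 12, 15]))     # linolenate
print(omega_class(18, [9, 12]))         # linoleate
print(omega_class(20, [5, 8, 11, 14]))  # arachidonate
```

Under this convention, linolenate is an ω-3 fatty acid, whereas linoleate and arachidonate are both ω-6 fatty acids.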


Fatty acids vary in chain length and degree of unsaturation 


Fatty acids in biological systems usually contain an even number of carbon
atoms, typically between 14 and 24 (Table 12.1). The 16- and 18-carbon
fatty acids are most common. The dominance of fatty acid chains containing
an even number of carbon atoms is in accord with the way in which fatty
acids are biosynthesized (Chapter 26). The hydrocarbon chain is almost
invariably unbranched in animal fatty acids. The alkyl chain may be saturated
or it may contain one or more double bonds. The configuration of the double
bonds in most unsaturated fatty acids is cis. The double bonds in
polyunsaturated fatty acids are separated by at least one methylene group.

The properties of fatty acids and of lipids derived from them are markedly
dependent on chain length and degree of saturation. Unsaturated
fatty acids have lower melting points than do saturated fatty acids of the
same length. For example, the melting point of stearic acid is 69.6°C,
whereas that of oleic acid (which contains one cis double bond) is 13.4°C.
The melting points of polyunsaturated fatty acids of the C18 series are
even lower. Chain length also affects the melting point, as illustrated by
the fact that the melting temperature of palmitic acid (C16) is 6.5 degrees
lower than that of stearic acid (C18). Thus, short chain length and
unsaturation enhance the fluidity of fatty acids and of their derivatives.


Table 12.1 Some naturally occurring fatty acids in animals

Number of  Number of
carbons    double bonds  Common name   Systematic name                           Formula
12         0             Laurate       n-Dodecanoate                             CH3(CH2)10COO−
14         0             Myristate     n-Tetradecanoate                          CH3(CH2)12COO−
16         0             Palmitate     n-Hexadecanoate                           CH3(CH2)14COO−
18         0             Stearate      n-Octadecanoate                           CH3(CH2)16COO−
20         0             Arachidate    n-Eicosanoate                             CH3(CH2)18COO−
22         0             Behenate      n-Docosanoate                             CH3(CH2)20COO−
24         0             Lignocerate   n-Tetracosanoate                          CH3(CH2)22COO−
16         1             Palmitoleate  cis-Δ⁹-Hexadecenoate                      CH3(CH2)5CH=CH(CH2)7COO−
18         1             Oleate        cis-Δ⁹-Octadecenoate                      CH3(CH2)7CH=CH(CH2)7COO−
18         2             Linoleate     cis,cis-Δ⁹,Δ¹²-Octadecadienoate           CH3(CH2)4(CH=CHCH2)2(CH2)6COO−
18         3             Linolenate    all-cis-Δ⁹,Δ¹²,Δ¹⁵-Octadecatrienoate      CH3CH2(CH=CHCH2)3(CH2)6COO−
20         4             Arachidonate  all-cis-Δ⁵,Δ⁸,Δ¹¹,Δ¹⁴-Eicosatetraenoate   CH3(CH2)4(CH=CHCH2)4(CH2)2COO−


Figure 12.2 Structures of two fatty acids. Palmitate is a 16-carbon,
saturated fatty acid, and oleate is an 18-carbon fatty acid with a
single cis double bond.

[Margin structure: An ω-3 fatty acid]

Figure 12.3 Schematic structure of a phospholipid.

Figure 12.4 Structure of phosphatidate (diacylglycerol 3-phosphate). The
absolute configuration of the center carbon (C-2) is shown.


12.2 There Are Three Common Types of Membrane Lipids 


By definition, lipids are water-insoluble biomolecules that are highly soluble in 
organic solvents such as chloroform. Lipids have a variety of biological roles: 
they serve as fuel molecules, highly concentrated energy stores, signal mol- 
ecules and messengers in signal-transduction pathways, and components of 
membranes. The first three roles of lipids will be considered in later chap- 
ters. Here, our focus is on lipids as membrane constituents. The three major 
kinds of membrane lipids are phospholipids, glycolipids, and cholesterol. We 
begin with lipids found in eukaryotes and bacteria. The lipids in archaea are 
distinct, although they have many features related to membrane formation 
in common with lipids of other organisms. 


Phospholipids are the major class of membrane lipids 


Phospholipids are abundant in all biological membranes. A phospholipid 
molecule is constructed from four components: one or more fatty acids, a 
platform to which the fatty acids are attached, a phosphate, and an alcohol 
attached to the phosphate (Figure 12.3). The fatty acid components provide 
a hydrophobic barrier, whereas the remainder of the molecule has hydro- 
philic properties that enable interaction with the aqueous environment. 

The platform on which phospholipids are built may be glycerol, a three- 
carbon alcohol, or sphingosine, a more complex alcohol. Phospholipids 
derived from glycerol are called phosphoglycerides. A phosphoglyceride con- 
sists of a glycerol backbone to which are attached two fatty acid chains and 
a phosphorylated alcohol. 

In phosphoglycerides, the hydroxyl groups at C-1 and C-2 of
glycerol are esterified to the carboxyl groups of the two fatty acid
chains. The C-3 hydroxyl group of the glycerol backbone is esterified
to phosphoric acid. When no further additions are made, the resulting
compound is phosphatidate (diacylglycerol 3-phosphate), the simplest
phosphoglyceride. Only small amounts of phosphatidate are present
in membranes. However, the molecule is a key intermediate in the
biosynthesis of the other phosphoglycerides (Section 26.1). The
absolute configuration of the glycerol 3-phosphate moiety of membrane
lipids is shown in Figure 12.4.

The major phosphoglycerides are derived from phosphatidate
by the formation of an ester bond between the phosphate group of
phosphatidate and the hydroxyl group of one of several alcohols. The
common alcohol moieties of phosphoglycerides are the amino acid serine,
ethanolamine, choline, glycerol, and inositol.

The structural formulas of phosphatidylcholine and the other principal
phosphoglycerides, namely phosphatidylethanolamine, phosphatidylserine,
phosphatidylinositol, and diphosphatidylglycerol, are given in
Figure 12.5.

Figure 12.5 Some common phosphoglycerides found in membranes.

Sphingomyelin is a phospholipid found in membranes that is not derived
from glycerol. Instead, the backbone in sphingomyelin is sphingosine, an
amino alcohol that contains a long, unsaturated hydrocarbon chain
(Figure 12.6). In sphingomyelin, the amino group of the sphingosine
backbone is linked to a fatty acid by an amide bond. In addition, the
primary hydroxyl group of sphingosine is esterified to phosphorylcholine.

Figure 12.6 Structures of sphingosine and sphingomyelin. The sphingosine
moiety of sphingomyelin is highlighted in blue.


Membrane lipids can include carbohydrate moieties


The second major class of membrane lipids, glycolipids, are
sugar-containing lipids. Like sphingomyelin, the glycolipids in animal cells
are derived from sphingosine. The amino group of the sphingosine backbone is
acylated by a fatty acid, as in sphingomyelin. Glycolipids differ
from sphingomyelin in the identity of the unit that is linked to the primary
hydroxyl group of the sphingosine backbone. In glycolipids, one or more
sugars (rather than phosphorylcholine) are attached to this group. The
simplest glycolipid, called a cerebroside, contains a single sugar residue,
either glucose or galactose.

More-complex glycolipids, such as gangliosides, contain a branched chain of
as many as seven sugar residues. Glycolipids are oriented in a completely
asymmetric fashion with the sugar residues always on the extracellular side of
the membrane.

Figure 12.7 An archaeon and its environment. Archaea can thrive in
habitats as harsh as a volcanic vent. Here, the archaea form an orange mat
surrounded by yellow sulfurous deposits. [Krafft-Explorer/Photo Researchers.]


Cholesterol is a lipid based on a steroid nucleus 


Cholesterol, the third major type of membrane lipid, has a structure that is 
quite different from that of phospholipids. It is a steroid, built from four 
linked hydrocarbon rings. 


[Structure of cholesterol]


A hydrocarbon tail is linked to the steroid at one end, and a hydroxyl group 
is attached at the other end. In membranes, the orientation of the molecule 
is parallel to the fatty acid chains of the phospholipids, and the hydroxyl 
group interacts with the nearby phospholipid head groups. Cholesterol is 
absent from prokaryotes but is found to varying degrees in virtually all ani- 
mal membranes. It constitutes almost 25% of the membrane lipids in certain 
nerve cells but is essentially absent from some intracellular membranes. 


Archaeal membranes are built from ether lipids with branched chains 


The membranes of archaea differ in composition from those of
eukaryotes or bacteria in three important ways. Two of these differences
clearly relate to the hostile living conditions of many archaea
(Figure 12.7). First, the nonpolar chains are joined to a glycerol backbone
by ether rather than ester linkages. The ether linkage is more resistant to
hydrolysis. Second, the alkyl chains are branched rather than linear. They
are built up from repeats of a fully saturated five-carbon fragment. These
branched, saturated hydrocarbons are more resistant to oxidation. The
ability of archaeal lipids to resist hydrolysis and oxidation may help these
organisms to withstand the extreme conditions, such as high temperature,
low pH, or high salt concentration, under which some of these archaea grow.
Finally, the stereochemistry of the central glycerol is inverted compared
with that shown in Figure 12.4.

[Structure: Membrane lipid from the archaeon Methanococcus jannaschii]


A membrane lipid is an amphipathic molecule containing a hydrophilic 
and a hydrophobic moiety 


The repertoire of membrane lipids is extensive. However, these lipids pos- 
sess a critical common structural theme: membrane lipids are amphipathic 
molecules (amphiphilic molecules). A membrane lipid contains both a hydro- 
philic and a hydrophobic moiety. 

Let us look at a model of a phosphoglyceride, such as phosphatidylcholine.
Its overall shape is roughly rectangular (Figure 12.8A). The two
hydrophobic fatty acid chains are approximately parallel to each other, 
whereas the hydrophilic phosphorylcholine moiety points in the opposite 
direction. Sphingomyelin has a similar conformation, as does the archaeal 
lipid depicted. Therefore, the following shorthand has been adopted to rep- 
resent these membrane lipids: the hydrophilic unit, also called the polar 
head group, is represented by a circle, and the hydrocarbon tails are depicted 
by straight or wavy lines (Figure 12.8B). 


Figure 12.8 Representations of 
membrane lipids. (A) Space-filling 
models of a phosphoglyceride, 
sphingomyelin, and an archaeal lipid 
show their shapes and distribution of 
hydrophilic and hydrophobic moieties. 
(B) A shorthand depiction of a 
membrane lipid. 




12.2 Types of Membrane Lipids 


CHAPTER 12 Lipids and Cell Membranes 




Figure 12.9 Diagram of a section of a 
micelle. Ionized fatty acids readily form such 
structures, but most phospholipids do not. 


Figure 12.10 Diagram of a section of a 
bilayer membrane. 


12.3 Phospholipids and Glycolipids Readily Form 
Bimolecular Sheets in Aqueous Media 


What properties enable phospholipids to form membranes? Membrane for- 
mation is a consequence of the amphipathic nature of the molecules. Their polar 
head groups favor contact with water, whereas their hydrocarbon tails inter- 
act with one another in preference to water. How can molecules with these 
preferences arrange themselves in aqueous solutions? One way is to form a 
globular structure called a micelle. The polar head groups form the outside 
surface of the micelle, which is surrounded by water, and the hydrocarbon 
tails are sequestered inside, interacting with one another (Figure 12.9). 

Alternatively, the strongly opposed preferences of the hydrophilic and 
hydrophobic moieties of membrane lipids can be satisfied by forming a lipid 
bilayer, composed of two lipid sheets (Figure 12.10). A lipid bilayer is also 
called a bimolecular sheet. The hydrophobic tails of each individual sheet 
interact with one another, forming a hydrophobic interior that acts as a 
permeability barrier. The hydrophilic head groups interact with the aque- 
ous medium on each side of the bilayer. The two opposing sheets are called 
leaflets. 

The favored structure for most phospholipids and glycolipids in aqueous 
media is a bimolecular sheet rather than a micelle. The reason is that the two 
fatty acid chains of a phospholipid or a glycolipid are too bulky to fit into the 
interior of a micelle. In contrast, salts of fatty acids (such as sodium 
palmitate, a constituent of soap) readily form micelles because they contain 
only one chain. The formation of bilayers instead of micelles by phospholipids 
is of critical biological importance. A micelle is a limited structure, usually 
less than 200 Å (20 nm) in diameter. In contrast, a bimolecular sheet can 
extend to macroscopic dimensions, as much as a millimeter (10⁷ Å, or 
10⁶ nm) or more. Phospholipids and related molecules are important 
membrane constituents because they readily form extensive bimolecular 
sheets (Figure 12.11). 

Lipid bilayers form spontaneously by a self-assembly process. In other 
words, the structure of a bimolecular sheet is inherent in the structure of 
the constituent lipid molecules. The growth of lipid bilayers from phospho- 
lipids is rapid and spontaneous in water. Hydrophobic interactions are the 
major driving force for the formation of lipid bilayers. Recall that hydrophobic 


Figure 12.11 Space-filling model of a section of phospholipid bilayer membrane. (A) An 
idealized view showing regular structures. (B) A more realistic view of a fluid bilayer showing more 
irregular structures of the fatty acid chains. 


interactions also play a dominant role in the stacking of bases in nucleic 
acids and in the folding of proteins (Sections 1.3 and 2.4). Water molecules 
are released from the hydrocarbon tails of membrane lipids as these tails 
become sequestered in the nonpolar interior of the bilayer. Furthermore, 
van der Waals attractive forces between the hydrocarbon tails favor close pack- 
ing of the tails. Finally, there are electrostatic and hydrogen-bonding attrac- 
tions between the polar head groups and water molecules. Thus, lipid bilayers 
are stabilized by the full array of forces that mediate molecular interactions 
in biological systems. Because lipid bilayers are held together by many rein- 
forcing, noncovalent interactions (predominantly hydrophobic), they are coop- 
erative structures. These hydrophobic interactions have three significant 
biological consequences: (1) lipid bilayers have an inherent tendency to be 
extensive; (2) lipid bilayers will tend to close on themselves so that there are 
no edges with exposed hydrocarbon chains, and so they form compart- 
ments; and (3) lipid bilayers are self-sealing because a hole in a bilayer is 
energetically unfavorable. 


Lipid vesicles can be formed from phospholipids 


The propensity of phospholipids to form membranes has been used to cre- 
ate an important experimental and clinical tool. Lipid vesicles, or liposomes, 
are aqueous compartments enclosed by a lipid bilayer (Figure 12.12). These 
structures can be used to study membrane permeability or to deliver chem- 
icals to cells. Liposomes are formed by suspending a suitable lipid, such as 
phosphatidylcholine, in an aqueous medium, and then sonicating (i.e., agitating 
by high-frequency sound waves) to give a dispersion of closed vesicles 
that are quite uniform in size. Vesicles formed by this method are nearly 
spherical and have a diameter of about 500 Å (50 nm). Larger vesicles 
(of the order of 1 μm, or 10⁴ Å, in diameter) can be prepared by slowly 
evaporating the organic solvent from a suspension of phospholipid in a 
mixed-solvent system. 

Figure 12.12 Liposome. A liposome, or lipid vesicle, is a small aqueous 
compartment surrounded by a lipid bilayer. 


Ions or molecules can be trapped in the aqueous compartments of 
lipid vesicles by forming the vesicles in the presence of these substances 
(Figure 12.13). For example, 500-Å-diameter vesicles formed in a 0.1 M 
glycine solution will trap about 2000 molecules of glycine in each inner 
aqueous compartment. These glycine-containing vesicles can be separated 
from the surrounding solution of glycine by dialysis or by gel-filtration 
chromatography. The permeability of the bilayer membrane to glycine can 
then be determined by measuring the rate of efflux of glycine from the inner 
compartment of the vesicle to the ambient solution. Liposomes can be 
formed with specific membrane proteins embedded in them by solubilizing 
the proteins in the presence of detergents and then adding them to the 
phospholipids from which liposomes will be formed. Protein-liposome 
complexes provide valuable experimental tools for examining a range of 
membrane-protein functions. 

Figure 12.13 Preparation of glycine-containing liposomes. Liposomes 
containing glycine are formed by the sonication of phospholipids in the 
presence of glycine. Free glycine is removed by gel filtration. 

Figure 12.14 Experimental arrangement for the study of a planar bilayer 
membrane. A bilayer membrane is formed across a 1-mm hole in a septum that 
separates two aqueous compartments. This arrangement permits measurements 
of the permeability and electrical conductance of lipid bilayers. 
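
The figure of about 2000 trapped glycine molecules quoted above can be checked with a short calculation: the inner aqueous volume of the vesicle multiplied by the glycine concentration and Avogadro's number. The sketch below is only an estimate. The bilayer thickness of 5 nm is an assumption made for illustration (the text does not state it here), and the function name and parameters are hypothetical.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole


def trapped_molecules(outer_diameter_nm, bilayer_thickness_nm, concentration_molar):
    """Estimate the number of solute molecules inside a spherical vesicle.

    The bilayer thickness is subtracted from the outer radius to get the
    radius of the inner aqueous compartment (an assumed geometry).
    """
    inner_radius_nm = outer_diameter_nm / 2.0 - bilayer_thickness_nm
    if inner_radius_nm <= 0:
        raise ValueError("bilayer is thicker than the vesicle radius")
    inner_volume_nm3 = (4.0 / 3.0) * math.pi * inner_radius_nm ** 3
    inner_volume_liters = inner_volume_nm3 * 1e-24  # 1 nm^3 = 1e-24 L
    return concentration_molar * inner_volume_liters * AVOGADRO


if __name__ == "__main__":
    # 500-angstrom (50-nm) vesicle, assumed 5-nm bilayer, 0.1 M glycine
    print(round(trapped_molecules(50.0, 5.0, 0.1)))
```

With these assumptions the estimate comes out close to 2000 molecules. Because the inner volume scales with the cube of the inner radius, the result is sensitive to the bilayer thickness that is assumed.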


Therapeutic applications for liposomes are currently under active 
investigation. For example, liposomes containing drugs or DNA for 
gene-therapy experiments can be injected into patients. These liposomes 
fuse with the plasma membrane of many kinds of cells, introducing into the 
cells the molecules that they contain. Delivering a drug in liposomes often 
lessens its toxicity. Less of the drug is distributed to normal tissues because 
long-circulating liposomes concentrate in regions of increased blood circu- 
lation, such as solid tumors and sites of inflammation. Moreover, the selec- 
tive fusion of lipid vesicles with particular kinds of cells is a promising 
means of controlling the delivery of drugs to target cells. 

Another well-defined synthetic membrane is a planar bilayer membrane. 
This structure can be formed across a 1-mm hole in a partition between two 
aqueous compartments by dipping a fine paintbrush into a membrane- 
forming solution, such as phosphatidylcholine in decane, and stroking the 
tip of the brush across the hole. The lipid film across the hole thins sponta- 
neously into a lipid bilayer. The electrical conduction properties of this 
macroscopic bilayer membrane are readily studied by inserting electrodes 
into each aqueous compartment (Figure 12.14). For example, the permea- 
bility of the membrane to ions is determined by measuring the current 
across the membrane as a function of the applied voltage. 


Lipid bilayers are highly impermeable to ions 
and most polar molecules 


Permeability studies of lipid vesicles and electrical-conductance measure- 
ments of planar bilayers have shown that lipid bilayer membranes have a very 
low permeability for ions and most polar molecules. Water is a conspicuous 
exception to this generalization; it traverses such membranes relatively easily 
because of its low molecular weight, high concentration, and lack of a 
complete charge. The range of measured permeability coefficients is very 
wide (Figure 12.15). For example, Na⁺ and K⁺ traverse these membranes 
10⁹ times as slowly as does H₂O. Tryptophan, a zwitterion at pH 7, crosses 
the membrane 10³ times as slowly as does indole, a structurally related 
molecule that lacks ionic groups. In fact, the permeability of small molecules is 
correlated with their solubility in a nonpolar solvent relative to their solubility 
in water. This relation suggests that a small molecule might traverse a lipid 
bilayer membrane in the following way: first, it sheds its solvation shell of 
water; then, it is dissolved in the hydrocarbon core of the membrane; and, 
finally, it diffuses through this core to the other side of the membrane, 
where it becomes resolvated by water. An ion such as Na⁺ traverses 
membranes very slowly because the replacement of its coordination shell of 
polar water molecules by nonpolar interactions with the membrane interior 
is highly unfavorable energetically. 

Figure 12.15 Permeability coefficients (P) of ions and molecules in a lipid bilayer. 
The ability of molecules to cross a lipid bilayer spans a wide range of values. 


12.4 Proteins Carry Out Most Membrane Processes 


We now turn to membrane proteins, which are responsible for most of the 
dynamic processes carried out by membranes. Membrane lipids form a 
permeability barrier and thereby establish compartments, whereas specific 
proteins mediate nearly all other membrane functions. In particular, proteins 
transport chemicals and information across a membrane. Membrane lipids 
create the appropriate environment for the action of such proteins. 

Membranes differ in their protein content. Myelin, a membrane that 
serves as an electrical insulator around certain nerve fibers, has a low 
content of protein (18%). Relatively pure lipids are well suited for insula- 
tion. In contrast, the plasma membranes, or exterior membranes, of most 
other cells are much more metabolically active. They contain many pumps, 
channels, receptors, and enzymes. The protein content of these plasma 
membranes is typically 50%. Energy-transduction membranes, such as the 
internal membranes of mitochondria and chloroplasts, have the highest 
content of protein, typically 75%. 

The protein components of a membrane can be readily visualized by 
SDS-polyacrylamide gel electrophoresis. As stated earlier (p. 71), the 
electrophoretic mobility of many proteins in SDS-containing gels depends on the 
mass rather than on the net charge of the protein. The gel-electrophoresis 
patterns of three membranes—the plasma membrane of erythrocytes, the 
photoreceptor membrane of retinal rod cells, and the sarcoplasmic reticu- 
lum membrane of muscle—are shown in Figure 12.16. It is evident that 
each of these three membranes contains many proteins but has a distinct 
protein composition. In general, membranes performing different functions 
contain different repertoires of proteins. 


Proteins associate with the lipid bilayer in a variety 
of ways 


The ease with which a protein can be dissociated from a 
membrane indicates how intimately it is associated with 
the membrane. Some membrane proteins can be solubi- 
lized by relatively mild means, such as extraction by a 
solution of high ionic strength (e.g., 1 M NaCl). Other 
membrane proteins are bound much more tenaciously; 
they can be solubilized only by using a detergent or an 
organic solvent. Membrane proteins can be classified as 
being either peripheral or integral on the basis of this dif- 
ference in dissociability (Figure 12.17). Integral mem- 
brane proteins interact extensively with the hydrocarbon 
chains of membrane lipids, and they can be released only 
by agents that compete for these nonpolar interactions. 
In fact, most integral membrane proteins span the lipid bilayer. In contrast, 
peripheral membrane proteins are bound to membranes primarily by 
electrostatic and hydrogen-bond interactions with the head groups of lipids. 
These polar interactions can be disrupted by adding salts or by changing the 
pH. Many peripheral membrane proteins are bound to the surfaces of integral 
proteins, on either the cytoplasmic or the extracellular side of the membrane. 
Others are anchored to the lipid bilayer by a covalently attached hydrophobic 
chain, such as a fatty acid. 

Figure 12.16 SDS-acrylamide gel patterns of membrane proteins. 
(A) The plasma membrane of erythrocytes. (B) The photoreceptor membranes of 
retinal rod cells. (C) The sarcoplasmic reticulum membrane of muscle cells. 
[Courtesy of Dr. Theodore Steck (part A) and Dr. David MacLennan (part C).] 

Figure 12.17 Integral and peripheral membrane proteins. Integral membrane 
proteins (a and b) interact extensively with the hydrocarbon region of the 
bilayer. Most known integral membrane proteins traverse the lipid bilayer. 
Peripheral membrane proteins interact with the polar head groups of the 
lipids (c) or bind to the surfaces of integral proteins (d). Other proteins are 
tightly anchored to the membrane by a covalently attached lipid molecule (e). 


Proteins interact with membranes in a variety of ways 


Membrane proteins are more difficult to purify and crystallize than are 
water-soluble proteins. Nonetheless, researchers using x-ray crystallo- 
graphic or electron microscopic methods have determined the three- 
dimensional structures of more than 200 such proteins at sufficiently high 
resolution to discern the molecular details. As noted in Chapter 2, membrane 
proteins differ from soluble proteins in the distribution of hydrophobic 
and hydrophilic groups. We will consider the structures of three membrane 
proteins in some detail. 


Figure 12.18 Structure of bacteriorhodopsin. Notice that 
bacteriorhodopsin consists largely of membrane-spanning α helices 
(represented by yellow cylinders). (A) View through the membrane bilayer. 
The interior of the membrane is green and the head groups are red. 
(B) View from the cytoplasmic side of the membrane. [Drawn from 1BRX.pdb.] 


Proteins can span the membrane with alpha helices. The first membrane 
protein that we consider is the archaeal protein bacteriorhodopsin, shown in 
Figure 12.18. This protein uses light energy to transport protons from 
inside to outside the cell, generating a proton gradient used to form 
ATP. Bacteriorhodopsin is built almost entirely of α helices; seven closely 
packed α helices, arranged almost perpendicularly to the plane of the 
cell membrane, span its 45-Å width. Examination of the primary structure 
of bacteriorhodopsin reveals that most of the amino acids in these 
membrane-spanning α helices are nonpolar and only a very few are charged 
(Figure 12.19). This distribution of nonpolar amino acids is sensible because 
these residues are either in contact with the hydrocarbon core of the 
membrane or with one another. Membrane-spanning α helices are the most 
common structural motif in membrane proteins. As will be considered in 
Section 12.5, such regions can often be detected by examining amino acid 
sequence alone. 


AQ|TGRPBWIWLALGTALMGLGTLYFLVRIGMGV SBPDBARIFYAITTLVPA 
WAFTMYLSMLELGYGLTMVPFGGBIQNP | YWARYABWLFTTPLELLBLALLV 
BA TIVYS YRIFVWWAISTAAMLYILYV 
LFFGFT SMABS MRPEVAST FRIVERINVTVVLWSAYVVVWLEIGSBIGAGIVPL 
NIBTULEMVLBVSABVGFGL I ULRISBIA | F GBIABAP ERIS ABGAAATS 


Figure 12.19 Amino acid sequence of bacteriorhodopsin. The seven helical regions are 
highlighted in yellow and the charged residues in red. 




A channel protein can be formed from beta strands. Porin, a protein 
from the outer membranes of bacteria such as E. coli and Rhodobacter 
capsulatus, represents a class of membrane proteins with a completely different 
type of structure. Structures of this type are built from β strands and contain 
essentially no α helices (Figure 12.20). 

The arrangement of β strands is quite simple: each strand is hydrogen 
bonded to its neighbor in an antiparallel arrangement, forming a single β 
sheet. The β sheet curls up to form a hollow cylinder that, as its name 
suggests, forms a pore, or channel, in the membrane. The outside surface of 
porin is appropriately nonpolar, given that it interacts with the hydrocarbon 
core of the membrane. In contrast, the inside of the channel is quite 
hydrophilic and is filled with water. This arrangement of nonpolar and polar 
surfaces is accomplished by the alternation of hydrophobic and hydrophilic 
amino acids along each β strand (Figure 12.21). 

Figure 12.21 Amino acid sequence of a porin. Some membrane proteins, such as porins, are 
built from β strands that tend to have hydrophobic and hydrophilic amino acids in adjacent 
positions. The secondary structure of porin from Rhodopseudomonas blastica is shown, with the 
diagonal lines indicating the direction of hydrogen bonding along the β sheet. Hydrophobic residues 
(F, I, L, M, V, W, and Y) are shown in yellow. These residues tend to lie on the outside of the 
structure, in contact with the hydrophobic core of the membrane. 


Embedding part of a protein in a membrane can link the protein to 
the membrane surface. The structure of the endoplasmic reticulum 
membrane-bound enzyme prostaglandin H₂ synthase-1 reveals a rather 
different role for α helices in protein-membrane associations. This enzyme 
catalyzes the conversion of arachidonic acid into prostaglandin H₂ in 
two steps: (1) a cyclooxygenase reaction and (2) a peroxidase reaction 
(Figure 12.22). Prostaglandin H₂ promotes inflammation and modulates 
gastric acid secretion. The enzyme that produces prostaglandin H₂ is a 
homodimer with a rather complicated structure consisting primarily of α 
helices. Unlike bacteriorhodopsin, this protein is not largely embedded in 
the membrane. Instead, it lies along the outer surface of the membrane, 
firmly bound by a set of α helices with hydrophobic surfaces that extend 
from the bottom of the protein into the membrane (Figure 12.23). This 
linkage is sufficiently strong that only the action of detergents can release the 
protein from the membrane. Thus, this enzyme is classified as an integral 
membrane protein, although it does not span the membrane. 

Figure 12.20 Structure of bacterial porin (from Rhodopseudomonas 
blastica). Notice that this membrane protein is built entirely of β strands. 
(A) Side view. (B) View from the periplasmic space. Only one monomer of the 
trimeric protein is shown. [Drawn from 1PRN.pdb.] 

Figure 12.22 Formation of prostaglandin H₂. Prostaglandin H₂ synthase-1 
catalyzes the formation of prostaglandin H₂ from arachidonic acid in two steps. 

Figure 12.23 Attachment of prostaglandin H₂ synthase-1 to the membrane. 
Notice that prostaglandin H₂ synthase-1 is held in the membrane by a set of 
α helices (orange) coated with hydrophobic side chains. One monomer of the 
dimeric enzyme is shown. [Drawn from 1PTH.pdb.] 

Figure 12.24 Hydrophobic channel of prostaglandin H₂ synthase-1. A view of 
prostaglandin H₂ synthase-1 from the membrane shows the hydrophobic channel 
that leads to the active site. The membrane-anchoring helices are shown in 
orange. [Drawn from 1PTH.pdb.] 


The localization of prostaglandin H₂ synthase-1 in 
the membrane is crucial to its function. The substrate 
for this enzyme, arachidonic acid, is a hydrophobic 
molecule generated by the hydrolysis of membrane lip- 
ids. Arachidonic acid reaches the active site of the enzyme 
from the membrane without entering an aqueous envi- 
ronment by traveling through a hydrophobic channel in 
the protein (Figure 12.24). Indeed, nearly all of us have 
experienced the importance of this channel: drugs such 
as aspirin and ibuprofen block the channel and prevent 
prostaglandin synthesis by inhibiting the cyclooxygenase 
activity of the synthase. In particular, aspirin acts through 
the transfer of its acetyl group to a serine residue (Ser 530) 
that lies along the path to the active site (Figure 12.25). 
Two important features emerge from our examination of these three 
examples of membrane-protein structure. First, the parts of the protein that 
interact with the hydrophobic parts of the membrane are coated with 
nonpolar amino acid side chains, whereas those parts that interact with the 
aqueous environment are much more hydrophilic. Second, the structures 
positioned within the membrane are quite regular and, in particular, all 
backbone hydrogen-bond donors and acceptors participate in hydrogen 
bonds. Breaking a hydrogen bond within a membrane is quite unfavorable, 
because little or no water is present to compete for the polar groups. 

Figure 12.25 Aspirin's effects on prostaglandin H₂ synthase-1. Aspirin acts 
by transferring an acetyl group to a serine residue in prostaglandin H₂ 
synthase-1. [Panel labels: aspirin (acetylsalicylic acid); Ser 530.] 


Some proteins associate with membranes through covalently 
attached hydrophobic groups 


The membrane proteins considered thus far associate with the membrane 
through surfaces generated by hydrophobic amino acid side chains. 
However, even otherwise soluble proteins can associate with membranes if 
hydrophobic groups are attached to the proteins. Three such groups are 
shown in Figure 12.26: (1) a palmitoyl group attached to a specific cysteine 
residue by a thioester bond, (2) a farnesyl group attached to a cysteine 
residue at the carboxyl terminus, and (3) a glycolipid structure termed a 
glycosylphosphatidylinositol (GPI) anchor attached to the carboxyl terminus. 
These modifications are attached by enzyme systems that recognize 
specific signal sequences near the site of attachment. 

Figure 12.26 Membrane anchors. Membrane anchors are hydrophobic 
groups that are covalently attached to proteins (in blue) and tether the 
proteins to the membrane. The green circles and blue square correspond to 
mannose and GlcNAc, respectively. R groups represent points of additional 
modification. 


Transmembrane helices can be accurately predicted from 
amino acid sequences 


Many membrane proteins, like bacteriorhodopsin, employ α helices to span 
the hydrophobic part of a membrane. As noted earlier, typically most of the 
residues in these α helices are nonpolar and almost none of them are 
charged. Can we use this information to identify likely membrane-spanning 
regions from sequence data alone? One approach to identifying transmem- 
brane helices is to ask whether a postulated helical segment is likely to be 
more stable in a hydrocarbon environment or in water. Specifically, we want 
to estimate the free-energy change when a helical segment is transferred 
from the interior of a membrane to water. Free-energy changes for the 
transfer of individual amino acid residues from a hydrophobic to an 
aqueous environment are given in Table 12.2. For example, the transfer of a 
helix formed entirely of L-arginine residues, a positively charged amino 
acid, from the interior of a membrane to water would be highly favorable 
[−51.5 kJ mol⁻¹ (−12.3 kcal mol⁻¹) per arginine residue in the helix]. In 
contrast, the transfer of a helix formed entirely of L-phenylalanine, a 
hydrophobic amino acid, would be unfavorable [+15.5 kJ mol⁻¹ (+3.7 kcal 
mol⁻¹) per phenylalanine residue in the helix]. 


Table 12.2 Polarity scale for identifying transmembrane helices 

Amino acid     Transfer free energy in 
residue        kJ mol⁻¹ (kcal mol⁻¹) 

Phe             15.5   (3.7) 
Met             14.3   (3.4) 
Ile             13.0   (3.1) 
Leu             11.8   (2.8) 
Val             10.9   (2.6) 
Cys              8.4   (2.0) 
Trp              8.0   (1.9) 
Ala              6.7   (1.6) 
Thr              5.0   (1.2) 
Gly              4.2   (1.0) 
Ser              2.5   (0.6) 
Pro             −0.8  (−0.2) 
Tyr             −2.9  (−0.7) 
His            −12.6  (−3.0) 
Gln            −17.2  (−4.1) 
Asn            −20.2  (−4.8) 
Glu            −34.4  (−8.2) 
Lys            −37.0  (−8.8) 
Asp            −38.6  (−9.2) 
Arg            −51.5 (−12.3) 

Source: After D. M. Engelman, T. A. Steitz, and A. Goldman. Annu. Rev. 
Biophys. Biophys. Chem. 15(1986):321-353. 

Note: The free energies are for the transfer of an amino acid residue in an 
α helix from the membrane interior (assumed to have a dielectric constant 
of 2) to water. 

The hydrocarbon core of a membrane is typically 30 Å wide, a length 
that can be traversed by an α helix consisting of 20 residues. We can take 
the amino acid sequence of a protein and estimate the free-energy change 
that takes place when a hypothetical α helix formed of residues 1 through 
20 is transferred from the membrane interior to water. The same 
calculation can be made for residues 2 through 21, 3 through 22, and so forth, 
until we reach the end of the sequence. The span of 20 residues chosen for 
this calculation is called a window. The free-energy change for each window 
is plotted against the position of the first amino acid in the window to create 
a hydropathy plot. Empirically, a peak of +84 kJ mol⁻¹ (+20 kcal mol⁻¹) or 
more in a hydropathy plot based on a window of 20 residues indicates that a 
polypeptide segment could be a membrane-spanning α helix. For example, 
glycophorin, a protein found in the membranes of red blood cells, is predicted by 
this criterion to have one membrane-spanning helix, in agreement with 
experimental findings (Figure 12.27). Note, however, that a peak in the hydropathy 
plot does not prove that a segment is a transmembrane helix. Even soluble 
proteins may have highly nonpolar regions. Conversely, some membrane 
proteins contain membrane-spanning features (such as a set of cylinder-forming 
β strands) that escape detection by these plots (Figure 12.28). 
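
The sliding-window calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the original authors' program. The per-residue values are transcribed from Table 12.2 and should be checked against a clean copy of the table before any real use. The rise of 1.5 Å per residue in an α helix is a standard value that the passage does not state explicitly, and the toy sequence and all function names are hypothetical.

```python
import math

# Transfer free energies (kJ/mol) for moving a residue in an alpha helix from
# the membrane interior to water, keyed by one-letter code (after Table 12.2).
# Positive values mean that transfer to water is unfavorable.
TRANSFER_KJ_PER_MOL = {
    "F": 15.5, "M": 14.3, "I": 13.0, "L": 11.8, "V": 10.9,
    "C": 8.4, "W": 8.0, "A": 6.7, "T": 5.0, "G": 4.2,
    "S": 2.5, "P": -0.8, "Y": -2.9, "H": -12.6, "Q": -17.2,
    "N": -20.2, "E": -34.4, "K": -37.0, "D": -38.6, "R": -51.5,
}

RISE_PER_RESIDUE_ANGSTROM = 1.5  # assumed axial rise of an alpha helix


def residues_to_span(width_angstrom):
    """Number of alpha-helical residues needed to cross a given width."""
    return math.ceil(width_angstrom / RISE_PER_RESIDUE_ANGSTROM)


def hydropathy_profile(sequence, window=20):
    """Summed transfer free energy (kJ/mol) for every window in the sequence."""
    energies = [TRANSFER_KJ_PER_MOL[residue] for residue in sequence]
    return [
        sum(energies[start:start + window])
        for start in range(len(energies) - window + 1)
    ]


def candidate_helix_starts(sequence, window=20, threshold=84.0):
    """1-based start positions of windows at or above the empirical threshold."""
    profile = hydropathy_profile(sequence, window)
    return [start + 1 for start, energy in enumerate(profile) if energy >= threshold]


# Hypothetical toy sequence (NOT a real protein): polar flanks around a
# 20-residue nonpolar stretch.
TOY_SEQUENCE = "SEDKRNQTSG" + "LLIVFALLGVIAFLLIVGAL" + "RKKDESNQRE"

if __name__ == "__main__":
    profile = hydropathy_profile(TOY_SEQUENCE)
    peak_start = max(range(len(profile)), key=profile.__getitem__) + 1
    print("window size for a 30-angstrom core:", residues_to_span(30.0))
    print("peak window starts at residue", peak_start)
    print("candidate window starts:", candidate_helix_starts(TOY_SEQUENCE))
```

For the toy sequence, the highest-scoring window is the one that exactly covers the nonpolar stretch, and several neighboring windows also exceed the threshold. As the text cautions, such a peak marks only a candidate segment, and β-strand proteins such as porin would not be detected this way.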
Figure 12.27 Locating the membrane-spanning helix of glycophorin. (A) Amino acid 
sequence and transmembrane disposition of glycophorin A from the red-blood-cell membrane. 
Fifteen O-linked carbohydrate units are shown as diamond shapes, and an N-linked unit is shown 
as a lozenge shape. The hydrophobic residues (yellow) buried in the bilayer form a transmembrane 
α helix. The carboxyl-terminal part of the molecule, located on the cytoplasmic side of the 
membrane, is rich in negatively charged (red) and positively charged (blue) residues. 
(B) Hydropathy plot for glycophorin. The free energy for transferring a helix of 20 residues from the 
membrane to water is plotted as a function of the position of the first residue of the helix in the 
sequence of the protein. Peaks of greater than +84 kJ mol⁻¹ (+20 kcal mol⁻¹) in hydropathy 
plots are indicative of potential transmembrane helices. [(A) Courtesy of Dr. Vincent Marchesi; 
(B) after D. M. Engelman, T. A. Steitz, and A. Goldman. Annu. Rev. Biophys. Biophys. Chem. 15: 
321-353, 1986. Copyright © 1986 by Annual Reviews, Inc. All rights reserved.] 


12.5 Lipids and Many Membrane Proteins Diffuse Rapidly 
in the Plane of the Membrane 


Biological membranes are not rigid, static structures. On the contrary, lipids 
and many membrane proteins are constantly in lateral motion, a process 
called lateral diffusion. The rapid lateral movement of membrane proteins 
has been visualized by means of fluorescence microscopy using the tech- 
nique of fluorescence recovery after photobleaching (FRAP; Figure 12.29). 
First, a cell-surface component is specifically labeled with a fluorescent 
chromophore. A small region of the cell surface (~3 μm²) is viewed through 
a fluorescence microscope. The fluorescent molecules in this region are then 
destroyed (bleached) by a very intense light pulse from a laser. The fluores- 
cence of this region is subsequently monitored as a function of time by using 
a light level sufficiently low to prevent further bleaching. If the labeled 
component is mobile, bleached molecules leave and unbleached molecules 
enter the illuminated region, resulting in an increase in the fluorescence 
intensity. The rate of recovery of fluorescence depends on the lateral mobil- 
ity of the fluorescence-labeled component, which can be expressed in terms 
of a diffusion coefficient, D. The average distance S traversed in time t 
depends on D according to the expression 


S = (4Dt)^{1/2} 


The diffusion coefficient of lipids in a variety of membranes is about 
1 μm² s⁻¹. Thus, a phospholipid molecule diffuses an average distance of 
2 μm in 1 s. This rate means that a lipid molecule can travel from one end of 
a bacterium to the other in a second. The magnitude of the observed diffusion 
coefficient indicates that the viscosity of the membrane is about 100 times 
that of water, rather like that of olive oil. 
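
The distance quoted above follows directly from the expression relating S to D and t, in which S is the square root of 4Dt. The short sketch below evaluates it for the lipid diffusion coefficient given in the text. The function name is hypothetical and is used only for illustration.

```python
import math


def mean_lateral_distance_um(diffusion_um2_per_s, time_s):
    """Average distance S = (4 D t)^(1/2), in micrometers.

    diffusion_um2_per_s: diffusion coefficient D in square micrometers per second
    time_s: elapsed time t in seconds
    """
    if diffusion_um2_per_s < 0 or time_s < 0:
        raise ValueError("D and t must be non-negative")
    return math.sqrt(4.0 * diffusion_um2_per_s * time_s)


if __name__ == "__main__":
    # Lipid in a membrane: D of about 1 square micrometer per second, t = 1 s
    print(mean_lateral_distance_um(1.0, 1.0), "micrometers")
```

Because S grows with the square root of time, covering twice the distance takes four times as long.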

In contrast, proteins vary markedly in their lateral mobility. Some pro- 
teins are nearly as mobile as lipids, whereas others are virtually immobile. For 
example, the photoreceptor protein rhodopsin (Section 33.3), a very mobile 
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Figure 12.28 Hydropathy plot for porin. 
No strong peaks are observed for this intrinsic 
membrane protein, because it is constructed 
from membrane-spanning B strands rather 
than « helices. 


Figure 12.29 Fluorescence recovery after 
photobleaching (FRAP) technique. (A) 
The cell surface fluoresces because of a 
labeled surface component. (B) The 
fluorescent molecules of a small part of the 
surface are bleached by an intense light pulse. 
(C) The fluorescence intensity recovers as 
bleached molecules diffuse out of the region 
and unbleached molecules diffuse into it. 

(D) The rate of recovery depends on the 
diffusion coefficient. 


Bleach 


= Recovery 


Fluorescence intensity —~ 


Time —— 


=> 
is) 
YS 


362 
CHAPTER 12 Lipids and Cell Membranes 


[Figure 12.30 labels: lateral diffusion, rapid; transverse diffusion (flip-flop), very slow.]


Figure 12.30 Lipid movement in 
membranes. Lateral diffusion of lipids is 
much more rapid than transverse diffusion 
(flip-flop).


Figure 12.31 The phase-transition, or
melting, temperature (T_m) for a
phospholipid membrane. As the 
temperature is raised, the phospholipid 
membrane changes from a packed, ordered 
state to a more random one. 


protein, has a diffusion coefficient of 0.4 µm² s⁻¹. The rapid movement of
rhodopsin is essential for fast signaling. At the other extreme is fibronectin,
a peripheral glycoprotein that interacts with the extracellular matrix. For
fibronectin, D is less than 10⁻⁴ µm² s⁻¹. Fibronectin has a very low
mobility because it is anchored to actin filaments on the inside of the 
plasma membrane through integrin, a transmembrane protein that links the 
extracellular matrix to the cytoskeleton. 
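
To make the contrast in mobility concrete, the sketch below applies S = (4Dt)^{1/2} to the diffusion coefficients discussed above (1 µm² s⁻¹ for lipids, 0.4 µm² s⁻¹ for rhodopsin, and an upper bound of 10⁻⁴ µm² s⁻¹ for fibronectin). The dictionary layout and the 1-s time window are assumptions chosen for illustration.

```python
import math

# Diffusion coefficients in um^2 s^-1. The fibronectin entry is an upper
# bound ("less than 10^-4"), so its distance is also an upper bound.
D_VALUES = {
    "lipid": 1.0,
    "rhodopsin": 0.4,
    "fibronectin (upper bound)": 1e-4,
}


def mean_distance_um(d_um2_per_s: float, t_s: float) -> float:
    """Average distance (micrometers) from S = (4Dt)^(1/2)."""
    return math.sqrt(4.0 * d_um2_per_s * t_s)


# Distance covered by each component in 1 s (illustrative time window).
distances = {name: mean_distance_um(d, 1.0) for name, d in D_VALUES.items()}
for name, s in distances.items():
    print(f"{name}: {s:.3f} um in 1 s")
```

Under these values, rhodopsin covers roughly two-thirds of the distance a lipid does, whereas anchored fibronectin moves at most about 1% as far.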


The fluid mosaic model allows lateral movement but not 
rotation through the membrane 


On the basis of the mobility of proteins in membranes, in 1972 S. Jonathan
Singer and Garth Nicolson proposed a fluid mosaic model to describe the 
overall organization of biological membranes. The essence of their model is 
that membranes are two-dimensional solutions of oriented lipids and globular 
proteins. The lipid bilayer has a dual role: it is both a solvent for integral 
membrane proteins and a permeability barrier. Membrane proteins are 
free to diffuse laterally in the lipid matrix unless restricted by special 
interactions. 

Although the lateral diffusion of membrane components can be rapid, 
the spontaneous rotation of lipids from one face of a membrane to the other 
is a very slow process. The transition of a molecule from one membrane 
surface to the other is called transverse diffusion or flip-flop (Figure 12.30).
The flip-flop of phospholipid molecules in phosphatidylcholine vesicles
has been directly measured by electron spin resonance techniques, which
show that a phospholipid molecule flip-flops once in several hours. Thus, a
phospholipid molecule takes about 10⁹ times as long to flip-flop across
a membrane as it takes to diffuse a distance of 50 Å in the lateral direction.
The free-energy barriers to flip-flopping are even larger for protein mole- 
cules than for lipids because proteins have more-extensive polar regions. In 
fact, the flip-flop of a protein molecule has not been observed. Hence, mem- 
brane asymmetry can be preserved for long periods. 
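
The comparison between lateral and transverse diffusion can be checked with a short calculation. The sketch below assumes the lipid diffusion coefficient of 1 µm² s⁻¹ given earlier, rearranges S = (4Dt)^{1/2} to t = S²/(4D), and takes the flip-flop to lateral time ratio as about 10⁹; the variable names are illustrative.

```python
D_LIPID = 1.0        # um^2 s^-1, lateral diffusion coefficient of a lipid
S_UM = 50 * 1e-4     # 50 angstroms in micrometers (1 angstrom = 1e-4 um)
RATIO = 1e9          # flip-flop time divided by lateral diffusion time

# Time to diffuse 50 angstroms laterally, from t = S^2 / (4D).
t_lateral_s = S_UM**2 / (4.0 * D_LIPID)

# Corresponding flip-flop time, converted to hours.
t_flip_flop_h = RATIO * t_lateral_s / 3600.0

print(f"lateral: {t_lateral_s:.2e} s, flip-flop: {t_flip_flop_h:.1f} h")
```

The result, a few microseconds for lateral movement against a flip-flop time on the order of hours, agrees with the measurement of one flip-flop in several hours.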


Membrane fluidity is controlled by fatty acid composition 
and cholesterol content 


Many membrane processes, such as transport or signal transduction, 
depend on the fluidity of the membrane lipids, which in turn depends on 
the properties of fatty acid chains. Fatty acid chains in membrane bilayers 
can exist in an ordered, rigid state or in a relatively disordered, fluid state. 
The transition from the rigid to the fluid state takes place abruptly as the 
temperature is raised above T_m, the melting temperature (Figure 12.31).
This transition temperature depends on the length of the fatty acid chains and 
on their degree of unsaturation (Table 12.3). The presence of saturated fatty 


Table 12.3 The melting temperature of phosphatidylcholine containing different
pairs of identical fatty acid chains

Number of   Number of      Fatty acid       Fatty acid
carbons     double bonds   (common name)    (systematic name)       T_m (°C)
22          0              Behenate         n-Docosanoate            75
18          0              Stearate         n-Octadecanoate          58
16          0              Palmitate        n-Hexadecanoate          41
14          0              Myristate        n-Tetradecanoate         24
18          1              Oleate           cis-Δ⁹-Octadecenoate    −22


acid residues favors the rigid state because their straight hydrocarbon 
chains interact very favorably with one another. On the other hand, a cis 
double bond produces a bend in the hydrocarbon chain. This bend interferes 
with a highly ordered packing of fatty acid chains, and so T_m is lowered
(Figure 12.32). The length of the fatty acid chain also affects the transition
temperature. Long hydrocarbon chains interact more strongly than do short
ones. Specifically, each additional -CH₂- group makes a favorable contribution
of about −2 kJ mol⁻¹ (−0.5 kcal mol⁻¹) to the free energy of interaction
of two adjacent hydrocarbon chains.
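
The two trends described here, chain length and unsaturation, can be read directly from Table 12.3. The following sketch uses the tabulated melting temperatures (with oleate taken as −22°C) to estimate the change in T_m per added carbon and the shift caused by one cis double bond; the variable names are illustrative assumptions.

```python
# Saturated entries from Table 12.3: number of carbons -> T_m in deg C.
TM_SATURATED = {14: 24, 16: 41, 18: 58, 22: 75}
TM_OLEATE = -22  # 18 carbons, one cis double bond

carbons = sorted(TM_SATURATED)

# Change in T_m per added carbon between neighboring table entries.
slopes = [
    (TM_SATURATED[b] - TM_SATURATED[a]) / (b - a)
    for a, b in zip(carbons, carbons[1:])
]

# Effect of a single cis double bond at constant chain length (18 carbons).
cis_shift = TM_OLEATE - TM_SATURATED[18]

print("deg C per added carbon:", slopes)
print("shift from one cis double bond:", cis_shift, "deg C")
```

In this data set, one cis double bond lowers T_m far more than removing several carbons does.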

Bacteria regulate the fluidity of their membranes by varying the number of 
double bonds and the length of their fatty acid chains. For example, the ratio 
of saturated to unsaturated fatty acid chains in the E. coli membrane 
decreases from 1.6 to 1.0 as the growth temperature is lowered from 42°C 
to 27°C. This decrease in the proportion of saturated residues prevents the 
membrane from becoming too rigid at the lower temperature. 
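
The ratios quoted for E. coli are easier to interpret as fractions of the total fatty acid chains. A minimal sketch, assuming only that the chains are either saturated or unsaturated (the function name is hypothetical):

```python
def saturated_fraction(ratio_sat_to_unsat: float) -> float:
    """Fraction of chains that are saturated, given the saturated:unsaturated ratio."""
    return ratio_sat_to_unsat / (1.0 + ratio_sat_to_unsat)


frac_42c = saturated_fraction(1.6)  # growth at 42 deg C
frac_27c = saturated_fraction(1.0)  # growth at 27 deg C

print(f"saturated chains at 42 C: {frac_42c:.1%}")
print(f"saturated chains at 27 C: {frac_27c:.1%}")
```

The drop in ratio from 1.6 to 1.0 corresponds to the saturated share falling from roughly three-fifths to one-half of all chains.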

In animals, cholesterol is the key regulator of membrane fluidity. Cholesterol 
contains a bulky steroid nucleus with a hydroxyl group at one end and a 
flexible hydrocarbon tail at the other end. Cholesterol inserts into bilayers 
with its long axis perpendicular to the plane of the membrane. The hydroxyl 
group of cholesterol forms a hydrogen bond with a carbonyl oxygen atom of 
a phospholipid head group, whereas the hydrocarbon tail of cholesterol is 
located in the nonpolar core of the bilayer. The different shape of cholesterol 
compared with that of phospholipids disrupts the regular interactions 
between fatty acid chains (Figure 12.33).


Lipid rafts are highly dynamic complexes formed between 
cholesterol and specific lipids 


In addition to its nonspecific effects on membrane fluidity, cholesterol can 
form specific complexes with lipids that contain the sphingosine backbone, 
including sphingomyelin and certain glycolipids, and with GPI-anchored 
proteins. These complexes concentrate within small (10-200 nm) and 
highly dynamic regions within membranes. The resulting structures are 
often referred to as lipid rafts. One result of these interactions is the 
moderation of membrane fluidity, making membranes less fluid but at 
the same time less subject to phase transitions. The presence of lipid rafts 
thus represents a modification of the original fluid mosaic model for 
biological membranes. Although their small size and dynamic nature have 
made them very difficult to study, it appears that lipid rafts may play a role 
in concentrating proteins that participate in signal transduction pathways 
and may also serve to regulate membrane curvature and budding. 


All biological membranes are asymmetric 


Membranes are structurally and functionally asymmetric. The outer and 
inner surfaces of all known biological membranes have different components 


Figure 12.32 Packing of fatty acid chains
in a membrane. The highly ordered packing
of fatty acid chains is disrupted by the
presence of cis double bonds. The space-filling
models show the packing of (A) three
molecules of stearate (C₁₈, saturated) and (B)
a molecule of oleate (C₁₈, unsaturated)
between two molecules of stearate.


Figure 12.33 Cholesterol disrupts the
tight packing of the fatty acid chains.
[After S. L. Wolfe, Molecular and Cellular
Biology (Wadsworth, 1993).]


Figure 12.34 Asymmetry of the Na⁺–K⁺ transport system
in plasma membranes. The Na⁺–K⁺ transport system pumps
Na⁺ out of the cell and K⁺ into the cell by hydrolyzing ATP on
the intracellular side of the membrane.


and different enzymatic activities. A clear-cut example is the
pump that regulates the concentration of Na⁺ and K⁺ ions in
cells (Figure 12.34). This transport protein is located in the
plasma membrane of nearly all cells in higher organisms. The
Na⁺–K⁺ pump is oriented so that it pumps Na⁺ out of the cell
and K⁺ into it. Furthermore, ATP must be on the inside of the
cell to drive the pump. Ouabain, a specific inhibitor of the 
pump, is effective only if it is located outside. We shall con- 
sider the mechanism of this important and fascinating pump 
and others in its family in Chapter 13. 

Membrane proteins have a unique orientation because, 
after synthesis, they are inserted into the membrane in an 
asymmetric manner. This absolute asymmetry is preserved 
because membrane proteins do not rotate from one side of the 
membrane to the other and because membranes are always 
synthesized by the growth of preexisting membranes. Lipids, too, 
are asymmetrically distributed as a consequence of their mode 
of biosynthesis, but this asymmetry is usually not absolute, 
except for glycolipids. In the red-blood-cell membrane, sphingomyelin
and phosphatidylcholine are preferentially located in the outer
leaflet of the bilayer, whereas phosphatidylethanolamine and phosphatidyl- 
serine are located mainly in the inner leaflet. Large amounts of cholesterol 
are present in both leaflets. 


12.6 Eukaryotic Cells Contain Compartments Bounded 
by Internal Membranes 


Thus far, we have considered only the plasma membrane of cells. Some 
bacteria and archaea have only this single membrane, surrounded by a cell 
wall. Other bacteria, such as E. coli, have two membranes separated by a cell 
wall (made of proteins, peptides, and carbohydrates) lying between them 
(Figure 12.35). The inner membrane acts as the permeability barrier, and 
the outer membrane and the cell wall provide additional protection. The
outer membrane is quite permeable to small molecules, owing to the presence
of porins. The region between the two membranes containing the cell
wall is called the periplasm.


Figure 12.35 Cell membranes of prokaryotes. A schematic view of the membrane of bacterial
cells surrounded by (A) two membranes or (B) one membrane.

Eukaryotic cells, with the exception of plant cells, do
not have cell walls, and their cell membranes consist of a
single lipid bilayer. In plant cells, the cell wall is on the
outside of the plasma membrane. Eukaryotic cells are
distinguished from prokaryotic cells by the presence of
membranes inside the cell that form internal compartments.
For example, peroxisomes, organelles that play a
major role in the oxidation of fatty acids for energy conversion,
are defined by a single membrane. Mitochondria,
the organelles in which ATP is synthesized, are surrounded
by two membranes. As in the case for a bacterium,
the outer membrane is quite permeable to small
molecules, whereas the inner membrane is not. Indeed,
considerable evidence now indicates that mitochondria
evolved from bacteria by endosymbiosis (Section 18.1).

The nucleus is also surrounded by a double membrane,
the nuclear envelope, that consists of a set of closed membranes
that come together at structures called nuclear
pores (Figure 12.36). These pores regulate transport into
and out of the nucleus. The nuclear envelope is linked to
another membrane-defined structure, the endoplasmic
reticulum, which plays a host of cellular roles, including drug detoxification
and the modification of proteins for secretion. Thus, a eukaryotic cell contains
interacting compartments, and transport into and out of these compartments
is essential to many biochemical processes.


Figure 12.36 Nuclear envelope. The nuclear envelope is a double
membrane connected to another membrane system of eukaryotes, the
endoplasmic reticulum. [After E. C. Schirmer and L. Gerace. Genome Biol. (2002).]

Membranes must be able to separate or join together so that cells and 
compartments may take up, transport, and release molecules. Many cells 
take up molecules through the process of receptor-mediated endocytosis. 

Here, a protein or larger complex initially binds to a receptor on the cell 
surface. After the receptor is bound, specialized proteins act to cause the 
membrane in this region to invaginate. One of these specialized proteins 
is clathrin, which polymerizes into a lattice network around the growing 
membrane bud, often referred to as a clathrin-coated pit (Figure 12.37). The 
invaginated membrane eventually breaks off and fuses to form a vesicle. 
Various hormones, transport proteins, and antibodies employ receptor-mediated
endocytosis to gain entry into a cell. A less-advantageous consequence
is that this pathway is available to viruses and toxins as a means
of entry into cells. The reverse process, the fusion of a vesicle to a
membrane, is a key step in the release of neurotransmitters from a neuron
into the synaptic cleft (Figure 12.38).


Figure 12.37 Vesicle formation by receptor-mediated endocytosis. Receptor
binding on the surface of the cell induces the membrane to invaginate, with the assistance of
specialized intracellular proteins such as clathrin. The process results in the formation of a vesicle
within the cell. [M. M. Perry and A. B. Gilbert. J. Cell Sci. 39:266, 1979.]


Figure 12.38 Neurotransmitter release.
Neurotransmitter-containing synaptic vesicles
are arrayed near the plasma membrane of a
nerve cell. Synaptic vesicles fuse with the
plasma membrane, releasing the neurotransmitter
into the synaptic cleft. [T. Reese/
Don Fawcett/Photo Researchers.]

Let us consider one example of receptor-mediated endocytosis. Iron is a 
critical element for the function and structure of many proteins, including 
hemoglobin and myoglobin (Chapter 7). However, free iron ions are highly 
toxic to cells, owing to their ability to catalyze the formation of free radicals. 
Hence, the transport of iron atoms from the digestive tract to the cells where 
they are most needed must be tightly controlled. In the bloodstream, iron is 
bound very tightly by the protein transferrin, which can bind two Fe³⁺ ions
with a dissociation constant of 10⁻²³ M at neutral pH. Cells requiring iron
express the transferrin receptor in their plasma membranes (Section 32.4). 
Formation of a complex between the transferrin receptor and iron-bound 
transferrin initiates receptor-mediated endocytosis, internalizing these 
complexes within vesicles called endosomes (Figure 12.39). As the endo- 
somes mature, proton pumps within the vesicle membrane lower the 
lumenal pH to about 5.5. Under these conditions, the affinity of iron ions for 
transferrin is reduced; these ions are released and are free to pass through 
channels in the endosomal membranes into the cytoplasm. The iron-free 
transferrin complex is recycled to the plasma membrane, where transferrin 
is released back into the bloodstream and the transferrin receptor can 
participate in another uptake cycle. 
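
The size of the acidification step in this cycle can be estimated from the definition pH = −log₁₀[H⁺]. The sketch below assumes that "neutral pH" means exactly 7.0, which the text does not specify; the endosomal value of 5.5 is taken from the paragraph above.

```python
def proton_concentration_molar(ph: float) -> float:
    """[H+] in mol/L from pH, using pH = -log10([H+])."""
    return 10.0 ** (-ph)


PH_NEUTRAL = 7.0   # assumption: "neutral pH" taken as exactly 7.0
PH_ENDOSOME = 5.5  # lumenal pH of the mature endosome

# Fold increase in proton concentration on endosome acidification.
fold_increase = (
    proton_concentration_molar(PH_ENDOSOME)
    / proton_concentration_molar(PH_NEUTRAL)
)
print(f"[H+] rises about {fold_increase:.0f}-fold")
```

Under this assumption, the proton pumps raise the lumenal proton concentration by about 30-fold, the change that accompanies the loss of iron affinity.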

Although budding and fusion appear deceptively simple, the structures
of the intermediates in these processes and the detailed mechanisms
remain ongoing areas of investigation. Key membrane components called
SNARE (soluble N-ethylmaleimide-sensitive-factor attachment protein 
receptor) proteins help draw appropriate membranes together to initiate the 
fusion process. These proteins, encoded by gene families in all eukaryotic 
cells, largely determine the compartment with which a vesicle will fuse. 
The specificity of membrane fusion ensures the orderly trafficking of 
membrane vesicles and their cargos through eukaryotic cells.


Figure 12.39 The transferrin receptor cycle. Iron-bound transferrin binds to the transferrin
receptor (TfR) on the surface of cells. Receptor-mediated endocytosis occurs, leading to the 
formation of a vesicle called an endosome. As the lumen of the endosome is acidified by the 
action of proton pumps, iron is released from transferrin, passes through channels in the membrane, 
and is utilized by the cell. The complex between iron-free transferrin and the transferrin receptor is 
returned to the plasma membrane for another cycle. [After L. Zecca et al. Nat. Rev. Neurosci. 
5:863-873, 2004, Fig.1.] 


Summary 


Biological membranes are sheetlike structures, typically from 60 to 
100 Å thick, that are composed of protein and lipid molecules held
together by noncovalent interactions. Membranes are highly selective 
permeability barriers. They create closed compartments, which may 
be entire cells or organelles within a cell. Proteins in membranes regu- 
late the molecular and ionic compositions of these compartments. 
Membranes also control the flow of information between cells. 


12.1 Fatty Acids Are Key Constituents of Lipids

Fatty acids are hydrocarbon chains of various lengths and degrees of 
unsaturation that terminate with a carboxylic acid group. The fatty 
acid chains in membranes usually contain between 14 and 24 carbon 
atoms; they may be saturated or unsaturated. Short chain length and 
unsaturation enhance the fluidity of fatty acids and their derivatives 
by lowering the melting temperature. 


12.2 There Are Three Common Types of Membrane Lipids

The major types of membrane lipids are phospholipids, glycolipids,
and cholesterol. Phosphoglycerides, a type of phospholipid, consist of
a glycerol backbone, two fatty acid chains, and a phosphorylated alcohol.
Phosphatidylcholine, phosphatidylserine, and phosphatidylethanolamine
are major phosphoglycerides. Sphingomyelin, a different
type of phospholipid, contains a sphingosine backbone instead of
glycerol. Glycolipids are sugar-containing lipids derived from sphingosine.
Cholesterol, which modulates membrane fluidity, is constructed
from a steroid nucleus. A common feature of these membrane
lipids is that they are amphipathic molecules, having one hydrophobic
and one hydrophilic end.


12.3 Phospholipids and Glycolipids Readily Form Bimolecular Sheets
in Aqueous Media

Membrane lipids spontaneously form extensive bimolecular sheets in
aqueous solutions. The driving force for membrane formation is the
hydrophobic interactions among the fatty acid tails of membrane lipids.
The hydrophilic head groups interact with the aqueous medium.
Lipid bilayers are cooperative structures, held together by many weak
bonds. These lipid bilayers are highly impermeable to ions and most
polar molecules, yet they are quite fluid, which enables them to act as
a solvent for membrane proteins.


12.4 Proteins Carry Out Most Membrane Processes

Specific proteins mediate distinctive membrane functions such as
transport, communication, and energy transduction. Many integral
membrane proteins span the lipid bilayer, whereas others are only
partly embedded in the membrane. Peripheral membrane proteins are
bound to membrane surfaces by electrostatic and hydrogen-bond
interactions. Membrane-spanning proteins have regular structures,
including β strands, although the α helix is the most common
membrane-spanning structure. Sequences of 20 consecutive nonpolar
amino acids can be diagnostic of a membrane-spanning α-helical
region of a protein.


12.5 Lipids and Many Membrane Proteins Diffuse Rapidly in the
Plane of the Membrane

Membranes are structurally and functionally asymmetric, as exemplified
by the restriction of sugar residues to the external surface of
mammalian plasma membranes. Membranes are dynamic structures
in which proteins and lipids diffuse rapidly in the plane of the mem- 
brane (lateral diffusion), unless restricted by special interactions. In 
contrast, the rotation of lipids from one face of a membrane to the 
other (transverse diffusion, or flip-flop) is usually very slow. Proteins 
do not rotate across bilayers; hence, membrane asymmetry can be 
preserved. The degree of fluidity of a membrane depends on the chain 
length of its lipids and on the extent to which their constituent fatty 
acids are unsaturated. In animals, cholesterol content also regulates 
membrane fluidity. 


12.6 Eukaryotic Cells Contain Compartments Bounded by
Internal Membranes

An extensive array of internal membranes in eukaryotes creates com- 
partments within a cell for distinct biochemical functions. For instance, 
a double membrane surrounds the nucleus, the location of most of the 
cell’s genetic material, and the mitochondria, the location of most ATP 
synthesis. A single membrane defines the other internal compart- 
ments, such as the endoplasmic reticulum. Receptor-mediated endo- 
cytosis enables the formation of intracellular vesicles when ligands 
bind to their corresponding receptor proteins in the plasma mem- 
brane. The reverse process—the fusion of a vesicle to a membrane—is 
a key step in the release of signaling molecules outside the cell. 


amphipathic (amphiphilic) molecule (p. 351)
lipid bilayer (p. 352)
liposome (p. 353)
integral membrane protein (p. 355)
peripheral membrane protein (p. 355)
hydropathy plot (p. 360)
lateral diffusion (p. 361)
fluid mosaic model (p. 362)
lipid raft (p. 363)
receptor-mediated endocytosis (p. 365)
clathrin (p. 365)
transferrin (p. 366)
transferrin receptor (p. 366)
endosome (p. 366)
SNARE (soluble N-ethylmaleimide-sensitive-factor attachment protein
receptor) proteins (p. 366)


Problems


1. Population density. How many phospholipid molecules
are there in a 1-µm² region of a phospholipid bilayer
membrane? Assume that a phospholipid molecule occupies
70 Å² of the surface area.


2. Through the looking-glass. Phospholipids form lipid 
bilayers in water. What structure might form if phospholip- 
ids were placed in an organic solvent? 


3. Lipid diffusion. What is the average distance traversed by
a membrane lipid in 1 µs, 1 ms, and 1 s? Assume a diffusion
coefficient of 10⁻⁸ cm² s⁻¹.


4. Protein diffusion. The diffusion coefficient, D, of a rigid
spherical molecule is given by


D = \frac{kT}{6\pi\eta r}


in which η is the viscosity of the solvent, r is the radius of
the sphere, k is the Boltzmann constant (1.38 × 10⁻¹⁶ erg
degree⁻¹), and T is the absolute temperature. What is the
diffusion coefficient at 37°C of a 100-kd protein in a membrane
that has an effective viscosity of 1 poise (1 poise =
1 erg s cm⁻³)? What is the average distance traversed by
this protein in 1 µs, 1 ms, and 1 s? Assume that this protein
is an unhydrated, rigid sphere of density 1.35 g cm⁻³.


5. Cold sensitivity. Some antibiotics act as carriers that bind 
an ion on one side of a membrane, diffuse through the 
membrane, and release the ion on the other side. The con- 
ductance of a lipid-bilayer membrane containing a carrier 
antibiotic decreased abruptly when the temperature was 
lowered from 40°C to 36°C. In contrast, there was little 
change in conductance of the same bilayer membrane 
when it contained a channel-forming antibiotic. Why? 


6. Melting point 1. Explain why oleic acid (18 carbons, one 
cis bond) has a lower melting point than stearic acid, which 
has the same number of carbon atoms but is saturated. How 
would you expect the melting point of trans-oleic acid to 
compare with that of cis-oleic acid? Why might most unsat- 
urated fatty acids in phospholipids be in the cis rather than 
the trans conformation? 


7. Melting point 2. Explain why the melting point of
palmitic acid (C₁₆) is 6.5 degrees lower than that of stearic
acid (C₁₈).


8. A sound diet. Small mammalian hibernators can with- 
stand body temperatures of 0° to 5°C without injury. 
However, the body fats of most mammals have melting 
temperatures of approximately 25°C. Predict how the com- 
position of the body fat of hibernators might differ from 
that of their nonhibernating cousins. 


9. Flip-flop 1. The transverse diffusion of phospholipids in 
a bilayer membrane was investigated by using a fluores- 
cently labeled analog of phosphatidylserine called NBD-PS. 


[Structure of NBD-phosphatidylserine (NBD-PS)]


The fluorescence signal of NBD-PS is quenched when 
exposed to sodium dithionite, a reducing agent that is not 
membrane permeable. 

Lipid vesicles containing phosphatidylserine (98%) and 
NBD-PS (2%) were prepared by sonication and purified. 
Within a few minutes of the addition of sodium dithionite, 
the fluorescence signal of these vesicles decreased to ~45% 
of its initial value. An immediate second addition of
sodium dithionite yielded no change in the fluorescence
signal. However, if the vesicles were allowed to incubate for 
6.5 hours, a third addition of sodium dithionite decreased 
the remaining fluorescence signal by 50%. How would you 
interpret the fluorescence changes at each addition of 
sodium dithionite? 


10. Flip-flop 2. Although proteins rarely if ever flip-flop 
across a membrane, the distribution of membrane lipids 
between the membrane leaflets is not absolute except for 
glycolipids. Why are glycosylated lipids less likely to 
flip-flop? 


11. Linkages. Platelet-activating factor (PAF) is a phospholipid
that plays a role in allergic and inflammatory
responses, as well as in toxic shock syndrome. The structure
of PAF is shown here. How does it differ from the structures
of the phospholipids discussed in this chapter?


[Structure of platelet-activating factor (PAF)]


12. A question of competition. Would a homopolymer of
alanine be more likely to form an α helix in water or in a
hydrophobic medium? Explain.


13. A false positive. Hydropathy plot analysis of your pro- 
tein of interest reveals a single, prominent hydrophobic 
peak. However, you later discover that this protein is solu- 
ble and not membrane associated. Explain how the hydrop- 
athy plot may have been misleading. 


14. Maintaining fluidity. A culture of bacteria growing at
37°C was shifted to 25°C. How would you expect this shift
to alter the fatty acid composition of the membrane
phospholipids? Explain.


15. Let me count the ways. Each intracellular 
fusion of a vesicle with a membrane requires a 
SNARE protein on the vesicle (called the 
v-SNARE) and a SNARE protein on the target 
membrane (called the t-SNARE). Assume that a 
genome encodes 21 members of the v-SNARE 
family and 7 members of the t-SNARE family. 
With the assumption of no specificity, how many potential 
v-SNARE-t-SNARE interactions could take place? 


Data Interpretation Problems 


16. Cholesterol effects. The red curve on the following 
graph shows the fluidity of the fatty acids of a phospholipid 
bilayer as a function of temperature. The blue curve shows 
the fluidity in the presence of cholesterol. 


[Graph: fluidity versus temperature, with one curve labeled "No cholesterol" and one labeled "+ Cholesterol".]


(a) What is the effect of cholesterol? 
(b) Why might this effect be biologically important?


17. Hydropathy plots. On the basis of the following hydropathy
plots for three proteins (A–C), predict which would be
membrane proteins. What are the ambiguities with respect 
to using such plots to determine if a protein is a membrane 
protein?


Chapter Integration Problem


18. The proper environment. An understanding of the struc- 
ture and function of membrane proteins has lagged behind 
that of other proteins. The primary reason is that membrane 
proteins are more difficult to purify and crystallize. Why 
might this be the case? 


CHAPTER 13 


Membrane Channels and Pumps 




The flow of ions through a single membrane channel (channels are shown in red in the 
illustration at the left) can be detected by the patch-clamp technique, which records 
current changes as the channel transits between open and closed states. [(Left) After 

E. Neher and B. Sakmann. The patch clamp technique. Copyright © 1992 by Scientific 
American, Inc. All rights reserved. (Right) Courtesy of Dr. Mauricio Montal.] 


OUTLINE

13.1 The Transport of Molecules Across a Membrane May Be Active or Passive
13.2 Two Families of Membrane Proteins Use ATP Hydrolysis to Pump Ions and Molecules Across Membranes
13.3 Lactose Permease Is an Archetype of Secondary Transporters That Use One Concentration Gradient to Power the Formation of Another
13.4 Specific Channels Can Rapidly Transport Ions Across Membranes
13.5 Gap Junctions Allow Ions and Small Molecules to Flow Between Communicating Cells
13.6 Specific Channels Increase the Permeability of Some Membranes to Water


The lipid bilayer of biological membranes is intrinsically impermeable to
ions and polar molecules, yet certain such species must be able to cross
these membranes for normal cell function. Permeability is conferred by
three classes of membrane proteins: pumps, carriers, and channels. Pumps
use a source of free energy such as ATP hydrolysis or light absorption to
drive the thermodynamically uphill transport of ions or molecules. Pump
action is an example of active transport. Carriers mediate the transport of
ions and small molecules across the membrane without consumption of
ATP. Channels provide a membrane pore through which ions can flow very
rapidly in a thermodynamically downhill direction. The action of channels
illustrates passive transport, or facilitated diffusion.

Pumps are energy transducers in that they convert one form of free
energy into another. Two types of ATP-driven pumps, P-type ATPases and
the ATP-binding cassette (ABC) transporters, undergo conformational
changes on ATP binding and hydrolysis that cause a bound ion to be transported
across the membrane. The free energy of ATP hydrolysis is used to
drive the movement of ions against their concentration gradients, a process
referred to as primary active transport. In contrast, carriers utilize the gradient
of one ion to drive the transport of another against its gradient. An
example of this process, termed secondary active transport, is mediated by
the E. coli lactose transporter, a well-studied protein responsible for the
uptake of a specific sugar from the environment of a bacterium. Many
transporters of this class are present in the membranes of our cells. The
expression of these transporters determines which metabolites a cell can
import from the environment. Hence, adjusting the level of transporter
expression is a primary means of controlling metabolism.

Pumps can establish persistent gradients of particular ions across membranes.
Specific ion channels can allow these ions to flow rapidly across
membranes down these gradients. These channels are among the most fas- 
cinating molecules in biochemistry in their ability to allow some ions to flow 
freely through a membrane while blocking the flow of even closely related 
species. The opening, or gating, of these channels can be controlled by the 
presence of certain ligands or a particular membrane voltage. Gated ion 
channels are central to the functioning of our nervous systems, acting as 
elaborately switched wires that allow the rapid flow of current. 

Finally, a different class of channel, the cell-to-cell channel, or gap 
junction, allows the flow of metabolites or ions between cells. For example, gap 
junctions are responsible for synchronizing muscle-cell contraction in the 
beating heart. 


The expression of transporters largely defines the metabolic activities 
of a given cell type 


Each cell type expresses a specific set of transporters in its plasma mem- 
brane. This collection of expressed transporters is important because it 
largely determines the ionic composition inside cells and the compounds 
that can be taken up from the cell’s environment. In some senses, the cell- 
specific array of transporters defines the cell’s characteristics because a cell 
can execute only those biochemical reactions for which it has taken up the 
necessary substrates. 

An example from glucose metabolism illustrates this point. As we will 
see in the discussion of glucose metabolism in Chapter 16, tissues differ in 
their ability to employ different molecules as energy sources. Which tissues 
can utilize glucose is largely governed by the expression of different mem- 
bers of a family of homologous glucose transporters called GLUT1, 
GLUT2, GLUT3, GLUT4, and GLUT5. For example, GLUT3 is 
expressed only on neurons and a few other cell types. This transporter binds 
glucose relatively tightly so that these cells have first call on glucose when it 
is present at relatively low concentrations. These are just the first of many 
examples that we will encounter that demonstrate the critical role that 
transporter expression plays in the control and integration of metabolism. 


13.1 The Transport of Molecules Across a Membrane 
May Be Active or Passive 


We first consider some general principles of membrane transport. Two 
factors determine whether a molecule will cross a membrane: (1) the perme- 
ability of the molecule in a lipid bilayer and (2) the availability of an energy 
source. 


Many molecules require protein transporters to cross membranes 


As stated in Chapter 12, some molecules can pass through cell membranes 
because they dissolve in the lipid bilayer. Such molecules are called lipophilic 
molecules. The steroid hormones provide a physiological example. These 
cholesterol relatives can pass through a membrane, but what determines the 
direction in which they will move? Such molecules will pass through a 
membrane down their concentration gradient in a process called simple 
diffusion. In accord with the Second Law of Thermodynamics, molecules 
spontaneously move from a region of higher concentration to one of lower 
concentration. 


Matters become more complicated when the molecule is highly polar. 
For example, sodium ions are present at 143 mM outside a typical cell and 
at 14 mM inside the cell. However, sodium does not freely enter the cell, 
because the charged ion cannot pass through the hydrophobic membrane 
interior. In some circumstances, as during a nerve impulse, sodium ions 
must enter the cell. How are they able to do so? Sodium ions pass through 
specific channels in the hydrophobic barrier formed by membrane proteins. 
This means of crossing the membrane is called facilitated diffusion because 
the diffusion across the membrane is facilitated by the channel. It is also 
called passive transport because the energy driving the ion movement 
originates from the ion gradient itself, without any contribution by the 
transport system. Channels, like enzymes, display substrate specificity in 
that they facilitate the transport of some ions, but not other, even closely 
related, ions. 

How is the sodium gradient established in the first place? In this case, 
sodium must move, or be pumped, against a concentration gradient. Because 
moving the ion from a low concentration to a higher concentration results 
in a decrease in entropy, it requires an input of free energy. Protein trans- 
porters embedded in the membrane are capable of using an energy source to 
move the molecule up a concentration gradient. Because an input of energy 
from another source is required, this means of crossing the membrane is 
called active transport. 


Free energy stored in concentration gradients can be quantified 


An unequal distribution of molecules is an energy-rich condition because 
free energy is minimized when all concentrations are equal. Consequently, 
to attain such an unequal distribution of molecules requires an input of free 
energy. How can we quantify the amount of energy required to generate a 
concentration gradient (Figure 13.1)? Consider an uncharged solute molecule.
The free-energy change in transporting this species from side 1, where
it is present at a concentration of c₁, to side 2, where it is present at
concentration c₂, is

$$\Delta G = RT \ln (c_2/c_1)$$

where R is the gas constant (8.315 × 10⁻³ kJ mol⁻¹ deg⁻¹, or 1.987 ×
10⁻³ kcal mol⁻¹ deg⁻¹) and T is the temperature in kelvins. For a charged
species, the unequal distribution across the membrane generates an electrical 
potential that also must be considered because the ions will be repelled by 
the like charges. The sum of the concentration and electrical terms is called 
the electrochemical potential or membrane potential. The free-energy change 
is then given by 


$$\Delta G = RT \ln (c_2/c_1) + ZF\Delta V$$

in which Z is the electrical charge of the transported species, ΔV is
the potential in volts across the membrane, and F is the Faraday constant
(96.5 kJ V⁻¹ mol⁻¹, or 23.1 kcal V⁻¹ mol⁻¹).
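
As a worked check on the relative size of the two terms, the following sketch asks what membrane potential contributes as much free energy as a tenfold concentration ratio. It assumes a singly charged ion (Z = 1) at 25°C and uses only the constants quoted above.

```latex
\begin{aligned}
ZF\,\Delta V &= RT \ln 10 \\
\Delta V &= \frac{RT \ln 10}{ZF}
  = \frac{(8.315 \times 10^{-3}\ \mathrm{kJ\,mol^{-1}\,K^{-1}})\,(298\ \mathrm{K})\,(2.303)}
         {(1)\,(96.5\ \mathrm{kJ\,V^{-1}\,mol^{-1}})}
  \approx 0.059\ \mathrm{V} = 59\ \mathrm{mV}
\end{aligned}
```

Under these assumptions, each 59 mV of membrane potential is energetically equivalent to one factor of 10 in concentration.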

A transport process must be active when ΔG is positive, whereas it can
be passive when ΔG is negative. For example, consider the transport of an
uncharged molecule from c₁ = 10⁻³ M to c₂ = 10⁻¹ M.

$$
\begin{aligned}
\Delta G &= RT \ln (10^{-1}/10^{-3}) \\
&= (8.315 \times 10^{-3}) \times 298 \times \ln (10^{2}) \\
&= +11.4\ \mathrm{kJ\ mol^{-1}}\ (+2.7\ \mathrm{kcal\ mol^{-1}})
\end{aligned}
$$

Figure 13.1 Free energy and transport.
The free-energy change in transporting (A) an
uncharged solute from a compartment at
concentration c₁ to one at c₂ and (B) a singly
charged species across a membrane to the
side having the same charge as that of the
transported ion. Note that the free-energy
change imposed by a membrane potential of
59 mV is equivalent to that imposed by a
concentration ratio of 10 for a singly charged
ion at 25°C.

At 25°C (298 K), ΔG is +11.4 kJ mol⁻¹ (+2.7 kcal mol⁻¹), indicating that
this transport process requires an input of free energy.
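
The calculation above can be reproduced with a few lines of Python. This is a minimal sketch: the function names `transport_free_energy` and `transport_mode` are illustrative only, and the constants are the values quoted in the text.

```python
import math

R = 8.315e-3   # gas constant, kJ mol^-1 K^-1 (value quoted in the text)
F = 96.5       # Faraday constant, kJ V^-1 mol^-1 (value quoted in the text)

def transport_free_energy(c1, c2, z=0, delta_v=0.0, temp_k=298.0):
    """Free-energy change (kJ/mol) for moving a solute from side 1 to side 2.

    c1, c2   concentrations on the two sides (same units)
    z        charge of the transported species (0 for an uncharged solute)
    delta_v  potential across the membrane in volts
    """
    return R * temp_k * math.log(c2 / c1) + z * F * delta_v

def transport_mode(delta_g):
    """A positive free-energy change must be driven; otherwise it can be passive."""
    return "active" if delta_g > 0 else "passive"

# The uncharged-solute example from the text: 10^-3 M to 10^-1 M at 298 K.
dg = transport_free_energy(c1=1e-3, c2=1e-1)
print(f"{dg:+.1f} kJ/mol, {transport_mode(dg)}")
```

Reversing the direction (`c1=1e-1, c2=1e-3`) changes only the sign of the result, so the same movement down the gradient can proceed passively.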


13.2 Two Families of Membrane Proteins Use ATP Hydrolysis
to Pump Ions and Molecules Across Membranes


The extracellular fluid of animal cells has a salt concentration similar to that
of seawater. However, cells must control their intracellular salt concentrations
to facilitate specific processes, such as signal transduction and action
potential propagation, and prevent unfavorable interactions with high
concentrations of ions such as Ca²⁺. For instance, most animal cells contain a
high concentration of K⁺ and a low concentration of Na⁺ relative to the
external medium. These ionic gradients are generated by a specific transport
system, an enzyme that is called the Na⁺–K⁺ pump or the Na⁺–K⁺
ATPase. The hydrolysis of ATP by the pump provides the energy needed
for the active transport of Na⁺ out of the cell and K⁺ into the cell, generating
the gradients. The pump is called the Na⁺–K⁺ ATPase because the
hydrolysis of ATP takes place only when Na⁺ and K⁺ are present. This
ATPase, like all such enzymes, requires Mg²⁺.

The change in free energy accompanying the transport of Na⁺ and K⁺
can be calculated. Suppose that the concentrations of Na⁺ outside and
inside the cell are 143 and 14 mM, respectively, and the corresponding
values for K⁺ are 4 and 157 mM. At a membrane potential of −50 mV and
a temperature of 37°C, we can use the free-energy equation for a charged
species given in Section 13.1 to determine
that the free-energy change in transporting 3 mol of Na⁺ out of the cell
and 2 mol of K⁺ into the cell is 3(5.99) + 2(9.46) = +36.9 kJ mol⁻¹
(+8.8 kcal mol⁻¹). Under typical cellular conditions, the hydrolysis of a
single ATP molecule per transport cycle provides sufficient free energy,
about −50 kJ mol⁻¹ (−12 kcal mol⁻¹), to drive the uphill transport of these
ions. The active transport of Na⁺ and K⁺ is of great physiological significance.
Indeed, more than a third of the ATP consumed by a resting animal
is used to pump these ions. The Na⁺–K⁺ gradient in animal cells controls
cell volume, renders neurons and muscle cells electrically excitable, and
drives the active transport of sugars and amino acids.
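
The per-ion figures in this calculation can be checked numerically. The sketch below is illustrative only: the helper names are hypothetical, and the sign convention (inside of the cell at −50 mV relative to outside) is an assumption. The concentration terms reproduce the quoted values of 5.99 and 9.46 kJ mol⁻¹; the electrical term is computed separately, because each cycle moves one net positive charge out of the cell.

```python
import math

R = 8.315e-3   # gas constant, kJ mol^-1 K^-1
F = 96.5       # Faraday constant, kJ V^-1 mol^-1
T = 310.0      # 37 degrees C in kelvins

def concentration_term(c_from, c_to):
    """RT ln(c_to / c_from), in kJ/mol."""
    return R * T * math.log(c_to / c_from)

def electrical_term(z, v_from, v_to):
    """ZF(V_to - V_from), in kJ/mol."""
    return z * F * (v_to - v_from)

# Assumed convention: potentials are measured relative to the outside of the cell.
V_IN, V_OUT = -0.050, 0.0

na_conc = concentration_term(14, 143)       # Na+ moved from inside to outside
k_conc = concentration_term(4, 157)         # K+ moved from outside to inside
na_elec = electrical_term(+1, V_IN, V_OUT)  # Na+ leaves a negative interior
k_elec = electrical_term(+1, V_OUT, V_IN)   # K+ enters a negative interior

conc_total = 3 * na_conc + 2 * k_conc
elec_total = 3 * na_elec + 2 * k_elec

print(f"Na+ concentration term: {na_conc:.2f} kJ/mol")
print(f"K+ concentration term:  {k_conc:.2f} kJ/mol")
print(f"3 Na+ and 2 K+, concentration terms: {conc_total:.1f} kJ/mol")
print(f"3 Na+ and 2 K+, electrical terms:    {elec_total:.1f} kJ/mol")
```

Under these assumptions the electrical contribution is a few kJ mol⁻¹ on top of the concentration terms, and the sum remains below the roughly 50 kJ mol⁻¹ available from ATP hydrolysis.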


The purification of other ion pumps has revealed a large family of
evolutionarily related ion pumps including proteins from bacteria,
archaea, and all eukaryotes. Each of these pumps is specific for a particular
ion or set of ions. Two are of particular interest: the sarcoplasmic reticulum
Ca²⁺ ATPase (or SERCA) transports Ca²⁺ out of the cytoplasm and
into the sarcoplasmic reticulum of muscle cells, and the gastric H⁺–K⁺
ATPase is the enzyme responsible for pumping sufficient protons into
the stomach to lower the pH to 1.0. These enzymes and the hundreds
of known homologs, including the Na⁺–K⁺ ATPase, are referred to as
P-type ATPases because they form a key phosphorylated intermediate. In
the formation of this intermediate, a phosphoryl group from ATP is
linked to the side chain of a specific conserved aspartate residue in the
ATPase to form phosphorylaspartate.


P-type ATPases couple phosphorylation and conformational changes to 
pump calcium ions across membranes 


Membrane pumps function by mechanisms that are simple in principle but
often complex in detail. Fundamentally, each pump protein can exist in two
principal conformational states, one with ion-binding sites open to one side
of the membrane and the other with ion-binding sites open to the other
side (Figure 13.2). To pump ions in a single direction across a membrane,
the free energy of ATP hydrolysis must be coupled to the interconversion
between these conformational states.

We will consider the structural and mechanistic features of P-type
ATPases by examining SERCA. The properties of this P-type ATPase have
been established in great detail by relying on crystal structures of the pump
in five different states. This enzyme, which constitutes 80% of the protein in
the sarcoplasmic reticulum membrane, plays an important role in relaxation
of contracted muscle. Muscle contraction is triggered by an abrupt rise in
the cytoplasmic calcium ion level. Subsequent muscle relaxation depends
on the rapid removal of Ca²⁺ from the cytoplasm into the sarcoplasmic
reticulum, a specialized compartment for Ca²⁺ storage, by SERCA.
This pump maintains a Ca²⁺ concentration of approximately 0.1 µM in the
cytoplasm compared with 1.5 mM in the sarcoplasmic reticulum.

The first structure of SERCA to be determined had Ca²⁺ bound, but
no nucleotides present (Figure 13.3). SERCA is a single 110-kd polypeptide
with a transmembrane domain consisting of 10 α helices. The transmembrane
domain includes sites for binding two calcium ions. Each calcium ion
is coordinated to seven oxygen atoms coming from a combination of side-chain
glutamate, aspartate, threonine, and asparagine residues, backbone
carbonyl groups, and water molecules. A large cytoplasmic headpiece constitutes
nearly half the molecular weight of the protein and consists of three
distinct domains, each with a distinct function. One domain (N) binds the
ATP nucleotide, another (P) accepts the phosphoryl group on a conserved
aspartate residue, and the third (A) serves as an actuator, linking changes in
the N and P domains to the transmembrane part of the enzyme.

Figure 13.2 Pump action. A simple scheme
for the pumping of a molecule across a
membrane. The pump interconverts between two
conformational states, each with a binding site
accessible to a different side of the
membrane.

Figure 13.3 Calcium-pump structure.
The overall structure of the SERCA P-type
ATPase. Notice the two calcium ions (green)
that lie in the center of the transmembrane
domain. A conserved aspartate residue (Asp
351) that binds a phosphoryl group lies in the
P domain. The designation bb refers to
backbone carbonyl groups. [Drawn from
1SU4.pdb.]

Figure 13.4 Conformational changes
associated with calcium pumping. This
structure was determined in the absence of
bound calcium and with a phosphorylaspartate
analog present in the P domain. Notice how
different this structure is from the calcium-bound
form shown in Figure 13.3: both the
transmembrane part (yellow) and the A, P,
and N domains have substantially rearranged.
[Drawn from 1WPG.pdb.]

Figure 13.5 Pumping calcium. Ca²⁺ ATPase
transports Ca²⁺ through the membrane by a
mechanism that includes (1) Ca²⁺ binding
from the cytoplasm, (2) ATP binding, (3) ATP
cleavage with the transfer of a phosphoryl
group to Asp 351 on the enzyme, (4) ADP
release and eversion of the enzyme to release
Ca²⁺ on the opposite side of the membrane,
(5) hydrolysis of the phosphorylaspartate
residue, and (6) eversion to prepare for the
binding of Ca²⁺ from cytoplasm.

SERCA is remarkably structurally dynamic. For example, the structure
of SERCA without bound Ca²⁺ and with a phosphorylaspartate analog
present in the P domain is shown in Figure 13.4. The N and P domains are
now closed around the phosphorylaspartate analog, and the A domain has
rotated substantially relative to its position in SERCA with Ca²⁺ bound
and without the phosphoryl analog. Furthermore, the transmembrane
part of the enzyme has rearranged substantially and the well-organized
Ca²⁺-binding sites are disrupted. These sites are now accessible from the
side of the membrane opposite the N, P, and A domains.

The structural results can be combined with other studies to construct a
detailed mechanism for Ca²⁺ pumping by SERCA (Figure 13.5).


1. The catalytic cycle begins with the enzyme in its unphosphorylated state
with two calcium ions bound. We will refer to the overall enzyme conformation
in this state as E₁; with Ca²⁺ bound, it is E₁-(Ca²⁺)₂. In this conformation,
SERCA can exchange calcium ions but only with calcium ions from
the cytoplasmic side of the membrane. This conformation is shown in
Figure 13.3.


2. In the E₁ conformation, the enzyme can bind ATP. The N, P, and A
domains undergo substantial rearrangement as they close around the bound
ATP, but there is no substantial conformational change in the transmembrane
domain. The calcium ions are now trapped inside the enzyme.


3. The phosphoryl group is then transferred from ATP to Asp 351.


4. Upon ADP release, the enzyme again changes its overall conformation,
including the membrane domain this time. This new conformation is
referred to as E₂ or E₂-P in its phosphorylated form. The process of interconverting
the E₁ and E₂ conformations is sometimes referred to as eversion.
In the E₂-P conformation, the Ca²⁺-binding sites become disrupted and
the calcium ions are released to the side of the membrane opposite that at
which they entered; ion transport has been achieved. This conformation is
shown in Figure 13.4.


5. The phosphorylaspartate residue is hydrolyzed to release inorganic
phosphate.


6. With the release of phosphate, the interactions stabilizing the E₂ conformation
are lost, and the enzyme everts to the E₁ conformation.


The binding of two calcium ions from the cytoplasmic side of the mem- 
brane completes the cycle. 
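
The six steps above can be sketched as a simple bookkeeping model. This is a toy illustration rather than a kinetic or structural model: the function name `serca_cycle` and the state labels are hypothetical, and the only facts encoded are the stoichiometry described in the text (two Ca²⁺ moved from the cytoplasm to the lumen per ATP) and the order of the steps.

```python
def serca_cycle(cytoplasm_ca, lumen_ca, atp):
    """Run one pump cycle; return updated counts and the states visited.

    Counts are whole numbers of ions or molecules. State labels are
    informal shorthand chosen for this sketch.
    """
    if cytoplasm_ca < 2 or atp < 1:
        raise ValueError("one cycle needs two cytoplasmic Ca2+ and one ATP")

    states = []

    cytoplasm_ca -= 2
    states.append("E1-(Ca2+)2")          # step 1: two Ca2+ bound from the cytoplasm
    atp -= 1
    states.append("E1-(Ca2+)2-ATP")      # step 2: ATP binds, Ca2+ is trapped
    states.append("E1-P-(Ca2+)2-ADP")    # step 3: phosphoryl transfer to Asp 351
    lumen_ca += 2
    states.append("E2-P")                # step 4: ADP leaves, eversion, Ca2+ released
    states.append("E2")                  # step 5: phosphorylaspartate hydrolyzed
    states.append("E1")                  # step 6: eversion back to E1

    return cytoplasm_ca, lumen_ca, atp, states


cyto, lumen, atp_left, visited = serca_cycle(cytoplasm_ca=10, lumen_ca=0, atp=3)
print(cyto, lumen, atp_left)
print(" -> ".join(visited))
```

Calling the function repeatedly until it raises shows that transport in this model stops when either cytoplasmic Ca²⁺ or ATP runs out, which is the sense in which ion movement is coupled to ATP consumption.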

This mechanism likely applies to other P-type ATPases. For example,
Na⁺–K⁺ ATPase is an α₂β₂ tetramer. Its α subunit is homologous to
SERCA and includes a key aspartate residue analogous to Asp 351. The β
subunit does not directly take part in ion transport. A mechanism analogous
to that shown in Figure 13.5 applies, with three Na⁺ ions binding from the
inside of the cell to the E₁ conformation and two K⁺ ions binding from outside
the cell to the E₂ conformation.


Digitalis specifically inhibits the Na⁺–K⁺ pump by blocking its
dephosphorylation


Certain steroids derived from plants are potent inhibitors (Kᵢ ≈ 10 nM)
of the Na⁺–K⁺ pump. Digitoxigenin and ouabain are members of
this class of inhibitors, which are known as cardiotonic steroids because of
their strong effects on the heart (Figure 13.6). These compounds inhibit the
dephosphorylation of the E₂-P form of the ATPase when applied on the
extracellular face of the membrane.

Digitalis is a mixture of cardiotonic steroids derived from the dried leaf
of the foxglove plant (Digitalis purpurea). The compound increases the force
of contraction of heart muscle and is consequently a choice drug in the treatment
of congestive heart failure. Inhibition of the Na⁺–K⁺ pump by digitalis
leads to a higher level of Na⁺ inside the cell. The diminished Na⁺
gradient results in slower extrusion of Ca²⁺ by the sodium-calcium
exchanger. The subsequent increase in the intracellular level of Ca²⁺
enhances the ability of cardiac muscle to contract. It is interesting to note
that digitalis was used effectively long before the discovery of the Na⁺–K⁺
ATPase. In 1785, William Withering, a British physician, heard tales of an
elderly woman, known as “the old woman of Shropshire,” who cured people
of “dropsy” (which today would be recognized as congestive heart failure)
with an extract of foxglove. Withering conducted the first scientific study
of the effects of foxglove on congestive heart failure and documented its
effectiveness.

Foxglove (Digitalis purpurea) is the source of
digitalis, one of the most widely used drugs.
[Inga Spence/Visuals Unlimited.]


Figure 13.6 Digitoxigenin. Cardiotonic
steroids such as digitoxigenin inhibit the
Na⁺–K⁺ pump by blocking the
dephosphorylation of E₂-P.

Figure 13.7 Domain arrangement of ABC
transporters. ABC transporters are a large
family of homologous proteins composed of
two transmembrane domains and two ATP-binding
domains called ATP-binding cassettes
(ABCs). (A) The multidrug-resistance protein
is a single polypeptide chain containing all
four domains, whereas (B) the Vibrio cholerae
lipid transporter MsbA consists of a dimer of
two identical chains, containing one of each
domain.


P-type ATPases are evolutionarily conserved and play a wide 
range of roles 


Analysis of the complete yeast genome revealed the presence of
16 proteins that clearly belong to the P-type ATPase family. More-detailed
sequence analysis suggests that 2 of these proteins transport H⁺
ions, 2 transport Ca²⁺, 3 transport Na⁺, and 2 transport metals such as
Cu²⁺. In addition, 5 members of this family appear to participate in the
transport of phospholipids with amino acid head groups. These 5 proteins
help maintain membrane asymmetry by transporting lipids such as
phosphatidylserine from the inner to the outer leaflet of the bilayer membrane.
Such enzymes have been termed “flippases.” Remarkably, the human
genome encodes 70 P-type ATPases. All members of this protein family
employ the same fundamental mechanism: the free energy of ATP hydrolysis
drives membrane transport by means of conformational changes, which
are induced by the addition and removal of a phosphoryl group at an analogous
aspartate site in each protein.


Multidrug resistance highlights a family of membrane pumps 
with ATP-binding cassette domains 


Studies of human disease revealed another large and important family
of active-transport proteins, with structures and mechanisms
quite different from those of the P-type ATPase family. These pumps were
identified from studies on tumor cells in culture that developed resistance to
drugs that had been initially quite toxic to the cells. Remarkably, the development
of resistance to one drug had made the cells less sensitive to a range
of other compounds. This phenomenon is known as multidrug resistance. In
a significant discovery, the onset of multidrug resistance was found to correlate
with the expression and activity of a membrane protein with an
apparent molecular mass of 170 kd. This protein acts as an ATP-dependent
pump that extrudes a wide range of small molecules from cells that express
it. The protein is called the multidrug-resistance (MDR) protein or
P-glycoprotein (“glyco” because it includes a carbohydrate moiety). Thus,
when cells are exposed to a drug, the MDR pumps the drug out of the cell
before the drug can exert its effects.


Analysis of the amino acid sequences of MDR and homologous
proteins revealed a common architecture (Figure 13.7A). Each protein
comprises four domains: two membrane-spanning domains and two
ATP-binding domains. The ATP-binding domains of these proteins are
called ATP-binding cassettes (ABCs) and are homologous to domains in a
large family of transport proteins of bacteria and archaea. Transporters that
include these domains are called ABC transporters. With 79 members, the
ABC transporters are the largest single family identified in the E. coli
genome. The human genome includes more than 150 ABC transporter
genes.

The ABC proteins are members of the P-loop NTPase superfamily
(Section 9.4). The three-dimensional structures of several members of the
ABC transporter family have now been determined, including that of the
lipid transporter MsbA from Vibrio cholerae. In contrast with the eukaryotic
MDR protein, this protein is a dimer of 62-kd chains: the amino-terminal
half of each protein contains the membrane-spanning domain, and the
carboxyl-terminal half contains the ATP-binding cassette (Figure 13.7B).
Prokaryotic ABC proteins are often made up of multiple subunits, such as a
dimer of identical chains, as above, or as a heterotetramer of two
membrane-spanning domain subunits and two ATP-binding-cassette subunits. The
consolidation of the enzymatic activities of several polypeptide chains in
prokaryotes to a single chain in eukaryotes is a theme that we will see again.
The two ATP-binding cassettes are in contact, but they do not interact
strongly in the absence of bound ATP (Figure 13.8). On the basis of this
structure and others, as well as on other experiments, a mechanism for
active transport by these proteins has been developed (Figure 13.9).

Figure 13.8 ABC transporter
structure. Two structures of the lipid
transporter MsbA from Vibrio cholerae, a
representative ABC transporter. The open form
is on the left and the closed, ATP-bound form
is on the right. The two ATP-binding cassettes
(blue) are related to the P-loop NTPases and,
like them, contain P-loops (green). The
α helix adjacent to the P-loop is shown in
red. [Drawn from 3B5W and 3B60.pdb.]


1. The catalytic cycle begins with the transporter free of both ATP and 
substrate. The transporter can interconvert between closed and open 
forms. 


2. Substrate enters the central cavity of the open form of the transporter 
from inside the cell. Substrate binding induces conformational changes in 
the ATP-binding cassettes that increase their affinity for ATP. 


3. ATP binds to the ATP-binding cassettes, changing their conformations 
so that the two domains interact strongly with one another. 


4. The strong interaction between the ATP-binding cassettes induces a
change in the relation between the two membrane-spanning domains,
releasing the substrate to the outside of the cell.


5. The hydrolysis of ATP and the release of ADP and inorganic phosphate
reset the transporter for another cycle.


Figure 13.9 ABC transporter mechanism.
The mechanism includes the following steps:
(1) opening of the channel toward the inside
of the cell; (2) substrate binding and
conformational changes in the ATP-binding
cassettes; (3) ATP binding and further
conformational changes; (4) separation of the
membrane-binding domains and release of
the substrate to the other side of the
membrane; and (5) ATP hydrolysis to reset
the transporter to its initial state.

Figure 13.10 Antiporters,
symporters, and uniporters.
Secondary transporters can
transport two substrates in
opposite directions (antiporters),
two substrates in the same
direction (symporters), or one
substrate in either direction
(uniporter).


Whereas eukaryotic ABC transporters generally act to export molecules 
from inside the cell, prokaryotic ABC transporters often act to import spe- 
cific molecules from outside the cell. A specific binding protein acts in con- 
cert with the bacterial ABC transporter, delivering the substrate to the 
transporter and stimulating ATP hydrolysis inside the cell. These binding 
proteins are present in the periplasm, the compartment between the two 
membranes that surround some bacterial cells (see Figure 12.35A). 

Thus, ABC transporters use a substantially different mechanism from 
the P-type ATPases to couple the ATP hydrolysis reaction to conforma- 
tional changes. Nonetheless, the net result is the same: the transporters are 
converted from one conformation capable of binding substrate from 
one side of the membrane to another that releases the substrate on the 
other side. 


13.3 Lactose Permease Is an Archetype of Secondary
Transporters That Use One Concentration Gradient
to Power the Formation of Another


Carriers are proteins that transport ions or molecules across the membrane 
without hydrolysis of ATP. The mechanism of carriers involves both large 
conformational changes and the interaction of the protein with only a few 
molecules per transport cycle, limiting the maximum rate at which trans- 
port can occur. Although carriers cannot mediate primary active transport, 
owing to their inability to hydrolyze ATP, they can couple the thermody- 
namically unfavorable flow of one species of ion or molecule up a concentra- 
tion gradient to the favorable flow of a different species down a concentration 
gradient, a process referred to as secondary active transport. Carriers that 
move ions or molecules “uphill” by this means are termed secondary trans- 
porters or cotransporters. These proteins can be classified as either antiporters 
or symporters. Antiporters couple the downhill flow of one species to the 
uphill flow of another in the opposite direction across the membrane; 
symporters use the flow of one species to drive the flow of a different species 
in the same direction across the membrane. Uniporters, another class of car- 
riers, are able to transport a specific species in either direction governed 
only by concentrations of that species on either side of the membrane 
(Figure 13.10). 




Secondary transporters are ancient molecular machines, common today
in bacteria and archaea as well as in eukaryotes. For example, approximately
160 (of approximately 4000) proteins encoded by the E. coli genome appear
to be secondary transporters. Sequence comparison and hydropathy analysis
suggest that members of the largest family have 12 transmembrane helices
that appear to have arisen by duplication and fusion of a membrane
protein with 6 transmembrane helices. Included in this family is the lactose
permease of E. coli. This symporter uses the H⁺ gradient across the E. coli
membrane (outside has higher H⁺ concentration) generated by the oxidation
of fuel molecules to drive the uptake of lactose and other sugars against
a concentration gradient. This transporter has been extensively studied for
many decades and is a useful archetype for this family.
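
To make the energetics of this coupling concrete, the sketch below estimates how steep a lactose gradient a proton gradient could sustain. It rests on several assumptions not stated in the text: one H⁺ enters per lactose, the pH difference and membrane potential are hypothetical round numbers, and the function names are illustrative. It applies the same free-energy expression used for charged species in Section 13.1.

```python
import math

R = 8.315e-3   # gas constant, kJ mol^-1 K^-1
F = 96.5       # Faraday constant, kJ V^-1 mol^-1

def proton_entry_free_energy(delta_ph, delta_psi, temp_k=298.0):
    """Free-energy change (kJ/mol) for one H+ moving from outside to inside.

    delta_ph   pH_in - pH_out (positive when the outside is more acidic)
    delta_psi  V_in - V_out in volts (negative when the inside is negative)
    """
    # [H+]in / [H+]out equals 10 ** (-delta_ph)
    return R * temp_k * math.log(10 ** (-delta_ph)) + F * delta_psi

def max_lactose_ratio(dg_proton, temp_k=298.0):
    """Largest [lactose]in / [lactose]out a 1:1 symporter could sustain.

    At this ratio the uphill lactose term exactly cancels the downhill
    proton term, so the coupled process has a free-energy change of zero.
    """
    return math.exp(-dg_proton / (R * temp_k))

# Hypothetical gradient values, chosen only for illustration.
dg_h = proton_entry_free_energy(delta_ph=1.0, delta_psi=-0.10)
ratio = max_lactose_ratio(dg_h)
print(f"H+ entry: {dg_h:.1f} kJ/mol; limiting lactose ratio: {ratio:.0f}")
```

The point of the sketch is qualitative: a modest proton gradient is, in principle, enough to concentrate a sugar several hundredfold, which is why a symporter needs no ATP of its own.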

The structure of lactose permease has been determined (Figure 13.11). 
As expected from the sequence analysis, this structure consists of two 
halves, each of which comprises six membrane-spanning a helices. Some of 
these helices are somewhat irregular. The two halves are well separated and 
are joined by a single stretch of polypeptide. In this structure, the sugar lies 
in a pocket in the center of the protein and is accessible from a path that 
leads from the interior of the cell. On the basis of these structures and a wide 
range of other experiments, a mechanism for symporter action has been 
developed. This mechanism (Figure 13.12) has many features similar to 
those for P-type ATPases and ABC transporters. 


1. The cycle begins with the two halves oriented so that the opening to the 
binding pocket faces outside the cell, in a conformation different from that 
observed in the structures solved to date. A proton from outside the cell 
binds to a residue in the permease, quite possibly Glu 269. 


2. In the protonated form, the permease binds lactose from outside the cell. 


3. The structure everts to the form observed in the crystal structure (see 
Figure 13.11). 


4. The permease releases lactose to the inside of the cell.


5. The permease releases a proton to the inside of the cell.


6. The permease everts to complete the cycle.


The site of protonation likely changes in the course of this cycle.
The same eversion mechanism very likely applies to all classes of
secondary transporters, which appear to resemble the lactose permease in
overall architecture.

Figure 13.11 Structure of lactose
permease with a bound lactose analog.
The amino-terminal half of the protein is
shown in blue and the carboxy-terminal half
in red. (A) Side view. (B) Bottom view (from
inside the cell). Notice that the structure
consists of two halves that surround the
sugar and are linked to one another by only
a single stretch of polypeptide. [Drawn from
1PV7.pdb.]


Figure 13.12 Lactose permease
mechanism. The mechanism begins with the
permease open to the outside of the cell
(upper left). The permease binds a proton
from the outside of the cell (1) and then
binds its substrate (2). The permease everts
(3) and then releases its substrate (4) and a
proton (5) to the inside of the cell. It then
everts (6) to complete the cycle.

Figure 13.13 Action potential. Signals are
sent along neurons by the transient
depolarization and repolarization of the
membrane.


13.4 Specific Channels Can Rapidly Transport Ions
Across Membranes


Pumps and carriers can move ions across the membrane at rates appro- 
aching several thousand ions per second. Other membrane proteins, the 
passive-transport systems called ion channels, are capable of ion-transport 
rates that are more than 1000 times as fast. These rates of transport through 
ion channels are close to rates expected for ions diffusing freely through 
aqueous solution. Yet ion channels are not simply tubes that span mem- 
branes through which ions can rapidly flow. Instead, they are highly 
sophisticated molecular machines that respond to chemical and physical 
changes in their environments and undergo precisely timed conformational 
changes. 


Action potentials are mediated by transient changes in Na⁺
and K⁺ permeability


One of the most important manifestations of ion-channel action is the
nerve impulse, which is the fundamental means of communication in the
nervous system. A nerve impulse is an electrical signal produced by the flow
of ions across the plasma membrane of a neuron. The interior of a neuron,
like that of most other cells, contains a high concentration of K⁺ and a low
concentration of Na⁺. These ionic gradients are generated by the Na⁺–K⁺
ATPase. The cell membrane has an electrical potential determined by the
ratio of the internal to the external concentration of ions. In the resting
state, the membrane potential is typically −60 mV. A nerve impulse, or
action potential, is generated when the membrane potential is depolarized
beyond a critical threshold value (e.g., from −60 to −40 mV). The membrane
potential becomes positive within about a millisecond and attains
a value of about +30 mV before turning negative again (repolarization).
This amplified depolarization is propagated along the nerve terminal
(Figure 13.13).

Ingenious experiments carried out by Alan Hodgkin and Andrew 
Huxley revealed that action potentials arise from large, transient changes 
in the permeability of the axon membrane to Na⁺ and K⁺ ions. Depolarization 
of the membrane beyond the threshold level leads to an increase in 
permeability to Na⁺. Sodium ions begin to flow into the cell because of the 
large electrochemical gradient across the plasma membrane. The entry of 
Na⁺ further depolarizes the membrane, leading to a further increase in 
Na⁺ permeability. This positive feedback leads to a very rapid and large 
change in membrane potential, from about −60 mV to +30 mV in a 
millisecond. 

The membrane spontaneously becomes less permeable to Na⁺ and more 
permeable to K⁺. Consequently, K⁺ flows outward, and so the membrane 
potential returns to a negative value. The resting level of −60 mV is restored 
in a few milliseconds as the K⁺ conductance decreases to the value 
characteristic of the unstimulated state. The wave of depolarization followed by 
repolarization moves rapidly along a nerve cell. The propagation of these 
waves allows a touch at the tip of your toe to be detected in your brain in a 
few milliseconds. 

This model for the action potential postulated the existence of ion 
channels specific for Na⁺ and K⁺. These channels must open in response 
to changes in membrane potential and then close after having remained 
open for a brief period of time. This bold hypothesis predicted the existence 
of molecules with a well-defined set of properties long before tools existed 
for their direct detection and characterization. 
Patch-clamp conductance measurements reveal the activities 
of single channels 


Direct evidence for the existence of these channels was provided by the 
patch-clamp technique, which was introduced by Erwin Neher and Bert 
Sakmann in 1976. This powerful technique enables the measurement of the 
ion conductance through a small patch of cell membrane. In this technique, 
a clean glass pipette with a tip diameter of about 1 μm is pressed against an 
intact cell to form a seal (Figure 13.14). Slight suction leads to the formation 
of a very tight seal so that the resistance between the inside of the pipette 
and the bathing solution is many gigaohms (1 gigaohm is equal to 10⁹ ohms). 
Thus, a gigaohm seal (called a gigaseal) ensures that an electric current 
flowing through the pipette is identical with the current flowing through 
the membrane covered by the pipette. The gigaseal makes possible high- 
resolution current measurements while a known voltage is applied across 
the membrane. Remarkably, the flow of ions through a single channel and 
transitions between the open and the closed states of a channel can be 
monitored with a time resolution of microseconds (Figure 13.15). 
Furthermore, the activity of a channel in its native membrane environment, 
even in an intact cell, can be directly observed. Patch-clamp methods pro- 
vided one of the first views of single biomolecules in action. Subsequently, 
other methods for observing single molecules were invented, opening new 
vistas on biochemistry at its most fundamental level. 
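
As a rough illustration of how a single-channel record like the one in Figure 13.15 might be analyzed, the following Python sketch idealizes a current trace by thresholding and measures how long the channel stays open during each opening. The trace values, the sampling interval, the threshold, and the function name open_dwell_times are all invented for illustration; they are not data or methods taken from the experiments described above.

```python
def open_dwell_times(current_pa, dt_ms, threshold_pa):
    """Return the duration (ms) of each contiguous run of 'open' samples.

    A sample counts as open when its absolute current is at or above
    threshold_pa (a hypothetical half-amplitude threshold).
    """
    dwells = []
    run = 0  # number of consecutive open samples seen so far
    for sample in current_pa:
        if abs(sample) >= threshold_pa:
            run += 1
        elif run:
            dwells.append(run * dt_ms)
            run = 0
    if run:  # trace ended while the channel was still open
        dwells.append(run * dt_ms)
    return dwells


# Synthetic, noise-free trace (pA): closed, open, closed, open, closed.
trace = [0.0] * 5 + [2.0] * 3 + [0.0] * 4 + [2.1] * 6 + [0.0] * 2
dwells = open_dwell_times(trace, dt_ms=0.1, threshold_pa=1.0)

for i, d in enumerate(dwells, start=1):
    print(f"opening {i}: {d:.1f} ms")
```

Real recordings are noisy, so a practical analysis would filter the trace first and would collect many openings to estimate a mean open time; this sketch omits both steps.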


The structure of a potassium ion channel is an archetype for many 
ion-channel structures 


With the existence of ion channels firmly established by patch-clamp 
methods, scientists sought to identify the molecules that form ion channels. 
The Na⁺ channel was first purified from the electric organ of the electric eel, 
Figure 13.14 Patch-clamp modes. The 
patch-clamp technique for monitoring channel 
activity is highly versatile. A high-resistance 
seal (gigaseal) is formed between the pipette 
and a small patch of plasma membrane. This 
configuration is called cell-attached mode. 
The breaking of the membrane patch by 
increased suction produces a low-resistance 
pathway between the pipette and the interior 
of the cell. The activity of the channels in the 
entire plasma membrane can be monitored in 
this whole-cell mode. To prepare a membrane 
in the excised-patch mode, the pipette is 
pulled away from the cell. A piece of plasma 
membrane with its cytoplasmic side now 
facing the medium is monitored by the patch 
pipette. 


Figure 13.15 Observing single channels. 
(A) The results of a patch-clamp experiment 
revealing a single ion channel undergoing 
transitions between closed and open states. 
(B) Closer inspection of the trace in (A) 
reveals the length of time the channel is in 
the open state. 
Tetrodotoxin 


Figure 13.16 Sequence relations of ion 
channels. Like colors indicate structurally 
similar regions of the sodium, calcium, and 
potassium channels. Each of these channels 
exhibits approximate fourfold symmetry, either 
within one chain (sodium, calcium channels) 
or by forming tetramers (potassium channels). 




which is a rich source of the protein forming this channel. The channel was 
purified on the basis of its ability to bind tetrodotoxin, a neurotoxin 
from the puffer fish that binds to Na⁺ channels very tightly (Kᵢ ≈ 1 nM). 
The lethal dose of this poison for an adult human being is about 10 ng. 

The isolated Na⁺ channel is a single 260-kd chain. Cloning and sequencing 
of cDNAs encoding Na⁺ channels revealed that the channel contains 
four internal repeats, each having a similar amino acid sequence, suggesting 
that gene duplication and divergence have produced the gene for this channel. 
Hydrophobicity profiles indicate that each repeat contains five hydrophobic 
segments (S1, S2, S3, S5, and S6). Each repeat also contains a highly 
positively charged S4 segment; positively charged arginine or lysine residues 
are present at nearly every third residue. It was proposed that segments 
S1 through S6 are membrane-spanning α helices. The positively charged 
residues in S4 were proposed to act as the voltage sensors of the channel. 
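
The idea behind a hydrophobicity profile can be sketched in a few lines of Python. The version below averages Kyte-Doolittle hydropathy values over a sliding window and reports stretches whose average is high enough to suggest a membrane-spanning segment. The text does not say which scale or parameters were used for the Na⁺ channel, so the choice of the Kyte-Doolittle scale, the 19-residue window, and the 1.6 cutoff (a commonly quoted heuristic) are assumptions, and the test sequence is made up rather than taken from any real channel.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=19):
    """Mean hydropathy of every window of the given length along seq."""
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def hydrophobic_segments(seq, window=19, cutoff=1.6):
    """Merge windows scoring >= cutoff into (start, end) spans.

    Indices are 0-based and end is exclusive.
    """
    segments = []
    for i, score in enumerate(hydropathy_profile(seq, window)):
        if score >= cutoff:
            start, end = i, i + window
            if segments and start <= segments[-1][1]:
                # Overlaps or touches the previous span: extend it.
                segments[-1] = (segments[-1][0], end)
            else:
                segments.append((start, end))
    return segments


# Hypothetical toy sequence: polar flank, hydrophobic core, polar flank.
polar = "DEKRSNQ" * 3
core = "LIVALLFAVILGLLAVIALLV"
toy = polar + core + polar

segs = hydrophobic_segments(toy)
print(segs)
```

Because a window that overlaps the hydrophobic core only partly can still exceed the cutoff, the reported span is somewhat wider than the core itself; the method locates candidate segments rather than exact boundaries.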

The purification of K⁺ channels proved to be much more difficult 
because of their low abundance and the lack of known high-affinity ligands 
comparable to tetrodotoxin. The breakthrough came in studies of mutant 
fruit flies that shake violently when anesthetized with ether. The mapping 
and cloning of the gene, termed shaker, responsible for this defect revealed 
the amino acid sequence encoded by a K⁺-channel gene. The shaker gene 
encodes a 70-kd protein that contains sequences corresponding to segments 
S1 through S6 in one of the repeated units of the Na⁺ channel. Thus, a 
K⁺-channel subunit is homologous to one of the repeated units of Na⁺ 
channels. Consistent with this homology, four Shaker polypeptides come 
together to form a functional channel. More recently, bacterial K⁺ channels 
were discovered that contain only the two membrane-spanning regions 
corresponding to segments S5 and S6. This and other information suggested 
that S5 and S6, including the region between them, form the actual pore 
in the K⁺ channel. Segments S1 through S4 contain the apparatus that 
opens the pore. The sequence relations between these ion channels are 
summarized in Figure 13.16. 
In 1998, Roderick MacKinnon and coworkers determined the structure 
of a K⁺ channel from the bacterium Streptomyces lividans by x-ray 
crystallography. This channel contains only the pore-forming segments S5 and S6. 
As expected, the K⁺ channel is a tetramer of identical subunits, each of 
which includes two membrane-spanning α helices (Figure 13.17). The four 
subunits come together to form a pore in the shape of a cone that runs 
through the center of the structure. 


The structure of the potassium ion channel reveals the basis 
of ion specificity 


The structure presented in Figure 13.17 probably represents the K⁺ channel 
in a closed form. Nonetheless, it suggests how the channel is able to 
exclude all but K⁺ ions. Beginning from the inside of the cell, the pore starts 
with a diameter of approximately 10 Å and then constricts to a smaller 


Figure 13.17 Structure of the potassium ion channel. The K⁺ channel, composed of 
four identical subunits, is cone shaped, with the larger opening facing the inside of the cell (center). 
A view down the pore, looking toward the outside of the cell, shows the relations of the individual 
subunits (left). One of the four identical subunits of the pore is illustrated at the right, with the 
pore-forming region shown in gray. [Drawn from 1K4C.pdb.] 


cavity with a diameter of 8 Å. Both the opening to the outside 
and the central cavity of the pore are filled with water, and a K⁺ 
ion can fit in the pore without losing its shell of bound water 
molecules. Approximately two-thirds of the way through the 
membrane, the pore becomes more constricted (3-Å diameter). 
At that point, any K⁺ ions must give up their water molecules 
and interact directly with groups from the protein. The channel 
structure effectively reduces the thickness of the membrane 
from 34 Å to 12 Å by allowing the solvated ions to penetrate 
into the membrane before the ions must directly interact with 
the channel (Figure 13.18). 

For K⁺ ions to relinquish their water molecules, other polar 
interactions must replace those with water. The restricted part 
of the pore is built from residues contributed by the two 
transmembrane α helices. In particular, a five-amino-acid stretch 
within this region functions as the selectivity filter that 
determines the preference for K⁺ over other ions (Figure 13.19). The 
stretch has the sequence Thr-Val-Gly-Tyr-Gly (TVGYG) and 
is nearly completely conserved in all K⁺ channels. The region of 
the strand containing the conserved sequence lies in an extended 
conformation and is oriented such that the peptide carbonyl 
Figure 13.18 Path through a channel. A potassium ion 
entering the K⁺ channel can pass a distance of 22 Å into the 
membrane while remaining solvated with water (blue). At this 
point, the pore diameter narrows to 3 Å (yellow), and 
potassium ions must shed their water and interact with 
carbonyl groups (red) of the pore amino acids. 


Figure 13.19 Selectivity filter of the 
potassium ion channel. Potassium ions 
interact with the carbonyl groups of the 
TVGYG sequence of the selectivity filter, 
located at the 3-Å-diameter pore of the K⁺ 
channel. Only two of the four channel 
subunits are shown. 


Table 13.1 Properties of alkali cations 

Ion    Ionic radius (Å)    Hydration free energy in kJ mol⁻¹ (kcal mol⁻¹) 
Li⁺    0.60                −410 (−98) 
Na⁺    0.95                −301 (−72) 
K⁺     1.33                −230 (−55) 
Rb⁺    1.48                −213 (−51) 
Cs⁺    1.69                −197 (−47) 
groups are directed into the channel, in good position to interact with the 
potassium ions. 

Potassium ion channels are 100-fold more permeable to K⁺ than to 
Na⁺. How is this high degree of selectivity achieved? Ions having a radius 
larger than 1.5 Å cannot pass into the narrow diameter (3 Å) of the selectivity 
filter of the K⁺ channel. However, a bare Na⁺ is small enough (Table 13.1) 
to pass through the pore. Indeed, the ionic radius of Na⁺ is substantially 
smaller than that of K⁺. How then is Na⁺ rejected? 

The key point is that the free-energy costs of dehydrating these ions 
are considerable [Na⁺, 301 kJ mol⁻¹ (72 kcal mol⁻¹), and K⁺, 230 kJ mol⁻¹ 
(55 kcal mol⁻¹)]. The channel pays the cost of dehydrating K⁺ by providing 
compensating interactions with the carbonyl oxygen atoms lining the selectivity 
filter. However, these oxygen atoms are positioned such that they do not 
interact favorably with Na⁺, because the ion is too small (Figure 13.20). 
Sodium ions are rejected because the higher cost of dehydrating them would 
be unrecovered. The potassium ion channel avoids closely embracing sodium 
ions, which must stay hydrated and hence cannot pass through the 
channel. 
Figure 13.20 Energetic basis of ion selectivity. The energy cost of dehydrating a potassium ion 
is compensated by favorable interactions with the selectivity filter. Because a sodium ion is too 
small to interact favorably with the selectivity filter, the free energy of desolvation cannot be 
compensated and the sodium ion does not pass through the channel. 


The K⁺ channel structure enables a clearer understanding of the 
structure and function of Na⁺ and Ca²⁺ channels because of their 
homology to K⁺ channels. Sequence comparisons and the results of 
mutagenesis experiments have implicated the region between segments S5 and 
S6 in ion selectivity in the Ca²⁺ channel. In Ca²⁺ channels, one glutamate 
residue of this region in each of the four repeated units plays a major role in 
determining ion selectivity. Residues in the positions corresponding to the 
glutamate residues in Ca²⁺ channels are major components of the selectivity 
filter of the Na⁺ channel. These residues (aspartate, glutamate, lysine, 
and alanine) are located one in each of the four internal repeats of the Na⁺ channel, 
forming a region termed the DEKA locus. Thus, the potential fourfold 
symmetry of the channel is clearly broken in this region, which explains 
why Na⁺ channels consist of a single large polypeptide chain rather than a 
noncovalent assembly of four identical subunits. The preference of the Na⁺ 
channel for Na⁺ over K⁺ depends on ionic radius; the diameter of the pore 
determined by these residues and others is sufficiently restricted that small 
ions such as Na⁺ and Li⁺ can pass through the channel, but larger ions such 
as K⁺ are significantly hindered. 




The structure of the potassium ion channel explains its rapid 
rate of transport 


The tight binding sites required for ion selectivity should slow the progress 
of ions through a channel, yet ion channels achieve rapid rates of ion transport. 
How is this paradox resolved? A structural analysis of the K⁺ channel 
at high resolution provides an appealing explanation. Four K⁺-binding 
sites crucial for rapid ion flow are present in the constricted region of the 
K⁺ channel. Consider the process of ion conductance starting from inside 
the cell (Figure 13.21). A hydrated potassium ion proceeds into the channel 
and through the relatively unrestricted part of the channel. The ion then 
gives up its coordinated water molecules and binds to a site within the 
selectivity-filter region. The ion can move between the four sites within the 
selectivity filter because they have similar ion affinities. As each subsequent 
potassium ion moves into the selectivity filter, its positive charge will repel 
the potassium ion at the nearest site, causing it to shift to a site farther up the 
channel and in turn push upward any potassium ion already bound to a site 
farther up. Thus, each ion that binds anew favors the release of an ion from 
the other side of the channel. This multiple-binding-site mechanism solves 
the paradox of high ion selectivity and rapid flow. 
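
The knock-on behavior described above can be mimicked with a deliberately simple Python sketch: four sites in a row, ions entering at the cytoplasmic end, and each entering ion displacing its neighbor one site toward the exit. This is a cartoon of the text's description only. It ignores energetics, water, and the actual occupancy pattern of the filter, and the function name push_ion and the ion labels are hypothetical.

```python
def push_ion(sites, ion):
    """Insert an ion at site 0 and shift occupied sites toward the exit.

    sites is a list ordered from the cytoplasmic end (index 0) to the
    extracellular end; None marks an empty site. Returns the new site
    list and the ion released from the far end (or None).
    """
    new_sites = list(sites)
    incoming = ion
    for i in range(len(new_sites)):
        # The incoming ion takes this site and displaces its occupant.
        incoming, new_sites[i] = new_sites[i], incoming
        if incoming is None:
            break  # displaced nothing, so the shifting stops here
    else:
        # Every site was occupied: the last displaced ion leaves the filter.
        return new_sites, incoming
    return new_sites, None


filter_sites = [None, None, None, None]
released_log = []
for label in ["K1", "K2", "K3", "K4", "K5", "K6"]:
    filter_sites, released = push_ion(filter_sites, label)
    released_log.append(released)
    print(label, "enters ->", filter_sites, "released:", released)
```

Once all four sites are full, every new arrival at one end releases an ion at the other, which is the sense in which binding and release are coupled in this mechanism.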


Voltage gating requires substantial conformational changes in specific 
ion-channel domains 


Some Na⁺ and K⁺ channels are gated by membrane potential; that is, they 
change conformation to a highly conducting form in response to changes in 
voltage across the membrane. As already noted, these voltage-gated channels 
include segments S1 through S4 in addition to the pore itself formed by 
S5 and S6. The structure of a voltage-gated K⁺ channel from Aeropyrum 
pernix has been determined by x-ray crystallography (Figure 13.22). The 
segments S1 through S4 form domains, termed “paddles,” that extend from 
the core of the channel. These paddles include the segment S4, the voltage 
sensor itself. Segment S4 forms an α helix lined with positively charged 
residues. In contrast with expectations, segments S1 through S4 are not 
enclosed within the protein but, instead, are positioned to lie in the 
membrane itself. 


Figure 13.21 Model for K⁺-channel ion 
transport. The selectivity filter has four 
binding sites. Hydrated potassium ions can 
enter these sites, one at a time, losing their 
hydration shells. When two ions occupy 
adjacent sites, electrostatic repulsion forces 
them apart. Thus, as ions enter the channel 
from one side, other ions are pushed out the 
other side. 
Figure 13.23 A model for voltage 
gating of ion channels. The voltage-sensing 
paddles lie in the “down” 
position below the closed channel (left). 
Membrane depolarization pulls these 
paddles through the membrane. The 
motion pulls the base of the channel 
apart, opening the channel (right). 
Figure 13.22 Structure of a voltage-gated potassium channel. (A) A view looking down 
through the pore. (B) A side view. Notice that the positively charged S4 region (red) lies on the 
outside of the structure at the bottom of the pore. [Drawn from 1ORQ.pdb.] 


A model for voltage gating has been proposed by Roderick MacKinnon 
and coworkers on the basis of this structure and a range of other experi- 
ments (Figure 13.23). In the closed state, the paddles lie in a “down” 
position. On membrane depolarization, the cytoplasmic side of the mem- 
brane becomes more positively charged, and the paddles are pulled through 
the membrane into an “up” position. In this position, they pull the four 
sides of the base of the pore apart, increasing access to the selectivity filter 
and opening the channel. 
A channel can be inactivated by occlusion of the pore: the 
ball-and-chain model 


The K⁺ channel and the Na⁺ channel undergo inactivation within 
milliseconds of opening (Figure 13.24). A first clue to the mechanism of 
inactivation came from exposing the cytoplasmic side of either channel to trypsin; 
cleavage by trypsin produced trimmed channels that stayed persistently 
open after depolarization. Furthermore, a mutant Shaker channel lacking 
42 amino acids near the amino terminus opened in response to depolar- 
ization but did not inactivate. Remarkably, inactivation was restored by 
adding a synthetic peptide corresponding to the first 20 residues of the 
native channel. 

These experiments strongly support the ball-and-chain model for chan- 
nel inactivation that had been proposed years earlier (Figure 13.25). 
According to this model, the first 20 residues of the K⁺ channel form a 
cytoplasmic unit (the ball) that is attached to a flexible segment of the 
Figure 13.24 Inactivation of the potassium 
ion channel. The amino-terminal region of the 
K⁺ chain is critical for inactivation. (A) The 
wild-type Shaker K⁺ channel displays rapid 
inactivation after opening. (B) A mutant channel 
lacking residues 6 through 46 does not 
inactivate. (C) Inactivation can be restored by 
adding a peptide consisting of residues 1 
through 20 at a concentration of 100 μM. [After 
W. N. Zagotta, T. Hoshi, and R. W. Aldrich. 
Science 250(1990):568-571.] 
polypeptide (the chain). When the channel is closed, the ball rotates freely 
in the aqueous solution. When the channel opens, the ball quickly finds a 
complementary site in the open pore and occludes it. Hence, the channel 
opens for only a brief interval before it undergoes inactivation by occlusion. 
Shortening the chain speeds inactivation because the ball finds its target 
more quickly. Conversely, lengthening the chain slows inactivation. Thus, 
the duration of the open state can be controlled by the length and flexibility 
of the tether. In some senses, the “ball” domains, which include substan- 
tial regions of positive charge, can be thought of as large, tethered cations 
that are pulled into the open channel but get stuck and block further ion 
conductance. 




The acetylcholine receptor is an archetype for ligand-gated 
ion channels 


Nerve impulses are communicated across synapses by small, diffusible 
molecules called neurotransmitters. One neurotransmitter is acetylcholine. 
The presynaptic membrane of a synapse is separated from the postsynaptic 
membrane by a gap of about 50 nm called the synaptic cleft. The arrival of a 
nerve impulse at the end of an axon leads to the synchronous export of the 
contents of some 300 vesicles of acetylcholine into the cleft (Figure 13.26). 
The binding of acetylcholine to the postsynaptic membrane markedly 
changes its ionic permeability, triggering an action potential. Acetylcholine 
opens a single kind of cation channel, called the acetylcholine receptor, which 
is almost equally permeable to Na⁺ and to K⁺. 
Figure 13.25 Ball-and-chain model for 
channel inactivation. The inactivation 
domain, or “ball” (gray), is tethered to the 
channel by a flexible “chain.” In the closed 
state, the ball is located in the cytoplasm. 
Depolarization opens the channel and creates 
a binding site for the positively charged ball in 
the mouth of the pore. Movement of the ball 
into this site inactivates the channel by 
occluding it. 
Figure 13.26 Schematic representation 
of a synapse. 


The torpedo (Torpedo marmorata, also 
known as the electric ray) has an electric 
organ, rich in acetylcholine receptors, that can 
deliver a shock of as much as 200 V for 
approximately 1 s. [Yves Gladu/Jacana/Photo 
Researchers.] 
The acetylcholine receptor is the best-understood ligand-gated channel. 
This type of channel is gated not by voltage but by the presence of specific 
ligands. The binding of acetylcholine to the channel is followed by its 
transient opening. The electric organ of Torpedo marmorata, an electric ray, is a 
choice source of acetylcholine receptors for study because its electroplaxes 
(voltage-generating cells) are very rich in postsynaptic membranes that 
respond to this neurotransmitter. The receptor is very densely packed in 
these membranes (~20,000 μm⁻²). The acetylcholine receptor of the 
electric organ has been solubilized by adding a nonionic detergent to a 
postsynaptic membrane preparation and purified by affinity chromatography 
on a column bearing covalently attached cobratoxin, a small protein toxin 
from snakes that has a high affinity for acetylcholine receptors. With the 
use of techniques presented in Chapter 3, the 268-kd receptor was identified 
as a pentamer of four kinds of membrane-spanning subunits (α₂, β, γ, 
and δ) arranged in the form of a ring that creates a pore through the 
membrane. 

The cloning and sequencing of the cDNAs for the four kinds of subunits 
(50-58 kd) showed that they have clearly similar sequences; the genes for 
the α, β, γ, and δ subunits arose by duplication and divergence of a common 
ancestral gene. Each subunit has a large extracellular domain, followed at 
the carboxyl end by four predominantly hydrophobic segments that span 
the bilayer membrane. Acetylcholine binds at the α–γ and α–δ interfaces. 
Electron microscopic studies of purified acetylcholine receptors 
demonstrated that the structure has approximate fivefold symmetry, in harmony 
with the similarity of its five constituent subunits (Figure 13.27). 

What is the basis of channel opening? A comparison of the structures of 
the closed and open forms of the channel would be highly revealing, but 
such comparisons have been difficult to obtain. Cryoelectron micrographs 
indicate that the binding of acetylcholine to the extracellular domain causes 
a structural alteration that initiates rotations of the α-helical rods lining the 
membrane-spanning pore. The amino acid sequences of these helices point 
to the presence of alternating ridges of small polar or neutral residues 


Figure 13.27 Structure of the acetylcholine 
receptor. A model for the structure of the acetylcholine 
receptor deduced from high-resolution electron microscopic 
studies reveals that each subunit consists of a large 
extracellular domain consisting primarily of β strands, four 
membrane-spanning α helices, and a final α helix inside the 
cell. (A) A side view shows the pentameric receptor with each 
subunit type in a different color. One copy of the α subunit is 
shown in isolation. (B) A view down the channel from outside 
the cell. [Drawn from 2BG9.pdb.] 


(serine, threonine, glycine) and large nonpolar ones (isoleucine, leucine, 
phenylalanine). In the closed state, the large residues may occlude the 
channel by forming a tight hydrophobic ring (Figure 13.28). Indeed, each 
subunit has a bulky leucine residue at a critical position. The binding of 
acetylcholine could allosterically rotate the membrane-spanning helices so 
that the pore would be lined by small polar residues rather than by large 
hydrophobic ones. The wider, more polar pore would then be open to the 
passage of Na⁺ and K⁺ ions. 


Action potentials integrate the activities of several ion channels 
working in concert 


To see how ligand-gated and voltage-gated channels work together to generate 
a sophisticated physiological response, we now revisit the action potential 
introduced at the beginning of this section. First, we need to introduce the 
concept of equilibrium potential. Suppose that a membrane separates two 
solutions that contain different concentrations of some cation X⁺ 
(Figure 13.29). Let [X⁺]in be the concentration of X⁺ on one side of the 
membrane (corresponding to the inside of a cell) and [X⁺]out be the concentration 
of X⁺ on the other side (corresponding to the outside of a cell). 
Suppose that an ion channel opens that allows X⁺ to move across the 
membrane. What will happen? It seems clear that X⁺ will move through 
the channel from the side with the higher concentration to the side with the 
lower concentration. However, positive charges will start to accumulate on 
the side with the lower concentration, making it more difficult to move each 
additional positively charged ion. An equilibrium will be achieved when the 
driving force due to the concentration gradient is balanced by the electro- 
static force resisting the motion of an additional charge. In these circum- 
stances, the membrane potential is given by the Nernst equation: 


$$V_{eq} = -\frac{RT}{zF}\,\ln\frac{[X]_{in}}{[X]_{out}}$$


where R is the gas constant, T is the absolute temperature, F is the Faraday 
constant (96.5 kJ V⁻¹ mol⁻¹, or 23.1 kcal V⁻¹ mol⁻¹), and z is the charge 
on the ion X (e.g., +1 for X⁺). 
The membrane potential at equilibrium is called the equilibrium potential 
for a given ion at a given concentration ratio across a membrane. For 
sodium with [Na⁺]in = 14 mM and [Na⁺]out = 143 mM, the equilibrium 
potential is +62 mV at 37°C. Similarly, for potassium with [K⁺]in = 157 mM 
and [K⁺]out = 4 mM, the equilibrium potential is −98 mV. In the absence 
Figure 13.28 Opening the acetylcholine 
receptor. Cross sections from electron 
microscopic reconstructions of the 
acetylcholine receptor in (top) its closed form 
and (bottom) its open form. (The open form 
corresponds to the structure shown in 
Figure 13.27.) The areas labeled M1, M2, M3, 
and M4 correspond to the four membrane-spanning 
α helices of one subunit. The cross 
section of the open channel was generated 
by treating the receptor with acetylcholine 
and freezing the sample within 20 ms. Notice 
that the hole in the center of the channel is 
substantially larger in the open structure. 
The enlargement of the hole is due to the 
rotation of the M2 helices by approximately 
15 degrees along their long axes. [Courtesy of 
Nigel Unwin.] 


Figure 13.29 Equilibrium potential. The 
membrane potential reaches an equilibrium 
when the driving force due to the 
concentration gradient is exactly balanced by 
the opposing force due to the repulsion of 
like charges. 


Figure 13.30 Action-potential mechanism. 
(A) On the initiation of an action potential, the 
membrane potential moves from the resting 
potential upward toward the Na⁺ equilibrium 
potential and then downward toward the K⁺ 
equilibrium potential. (B) The currents through 
the Na⁺ and K⁺ channels underlying the action 
potential. (C) The states of the Na⁺ and K⁺ 
channels during the action potential. 


of stimulation, the resting potential for a typical neuron is −60 mV. This 
value is close to the equilibrium potential for K⁺ owing to the fact that a 
small number of K⁺ channels are open. 
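
The equilibrium potentials quoted above can be checked with a short calculation. The Python sketch below evaluates the Nernst equation at 37°C for the sodium and potassium concentrations given in the text. The numerical values used for the gas constant and the Faraday constant are standard ones supplied here, and the function name equilibrium_potential_mv is only illustrative.

```python
import math

R = 8.314      # gas constant, J K^-1 mol^-1 (standard value, not from the text)
F = 96485.0    # Faraday constant, C mol^-1 (equivalent to 96.5 kJ V^-1 mol^-1)


def equilibrium_potential_mv(conc_in_mm, conc_out_mm, z=1, temp_c=37.0):
    """Nernst equilibrium potential in millivolts.

    V_eq = -(RT / zF) * ln([X]in / [X]out), with T in kelvins.
    """
    temp_k = temp_c + 273.15
    volts = -(R * temp_k) / (z * F) * math.log(conc_in_mm / conc_out_mm)
    return 1000.0 * volts


v_na = equilibrium_potential_mv(14, 143)   # sodium: about +62 mV
v_k = equilibrium_potential_mv(157, 4)     # potassium: about -98 mV

print(f"Na+ equilibrium potential: {v_na:+.0f} mV")
print(f"K+  equilibrium potential: {v_k:+.0f} mV")
```

Only the ratio of the two concentrations enters the equation, so the units cancel as long as both sides are expressed in the same unit.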

We are now prepared to consider what happens in the generation of 
an action potential (Figure 13.30). Initially, a neurotransmitter such as ace- 
tylcholine is released from an adjacent cell. The released acetylcholine binds 
to the acetylcholine receptor, causing it to open within less than a millisec- 
ond. The acetylcholine receptor is a nonspecific cation channel. Sodium 
ions flow into the cell and potassium ions flow out of the cell. Without any 
further events, the membrane potential would move to a value corresponding 
to the average of the equilibrium potentials for Na⁺ and K⁺, 
approximately −20 mV. However, as the membrane potential approaches −40 mV, 
the voltage-sensing paddles of Na⁺ channels are pulled into the membrane, 
opening the Na⁺ channels. With these channels open, sodium ions flow 
rapidly into the cell and the membrane potential rises rapidly toward the 
Na⁺ equilibrium potential. The voltage-sensing paddles of K⁺ channels 
also are pulled into the membrane by the changed membrane potential, but 
more slowly than Na⁺ channel paddles. Nonetheless, after approximately 
1 ms, many K⁺ channels start to open. At the same time, inactivation “ball” 
domains plug the open Na⁺ channels, decreasing the Na⁺ current. The 
acetylcholine receptors that initiated these events are also inactivated on 
this time scale. With the Na⁺ channels inactivated and only the K⁺ channels 
open, the membrane potential drops rapidly toward the K⁺ equilibrium 
potential. The open K⁺ channels are susceptible to inactivation by their 
“ball” domains, and these K⁺ currents, too, are blocked. With the membrane 
potential returned to close to its initial value, the inactivation domains 
are released and the channels return to their original closed states. These 
events propagate along the neuron as the depolarization of the membrane 
opens channels in nearby patches of membrane. 

How much current actually flows across the membrane over the course 
of an action potential? This question can be addressed from two 
complementary directions. First, a typical nerve cell contains 100 Na⁺ channels per 
square micrometer. At a membrane potential of +20 mV, each channel 
conducts 10⁷ ions per second. Thus, in a period of 1 millisecond, 
approximately 10⁵ ions flow through each square micrometer of membrane surface. 
Assuming a cell volume of 10⁴ μm³ and a surface area of 10⁴ μm², this rate 
of ion flow corresponds to an increase in the Na⁺ concentration of less 
than 1%. How can this be? A robust action potential is generated because 
the membrane potential is very sensitive to even a slight change in the 
distribution of charge. This sensitivity makes the action potential a very 
efficient means of signaling over long distances and with rapid repetition 
rates. 
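
The estimate above can be turned into a short calculation. The Python sketch below is an illustration under stated assumptions, not a definitive treatment. It takes the ion flow to be roughly 10⁵ ions per square micrometer in 1 ms and uses the cell volume and surface area given in the text. The second function relies on a specific membrane capacitance of about 1 μF cm⁻², a commonly quoted value that does not appear in this passage, and on a 100 mV swing chosen for illustration. Both function names are hypothetical.

```python
AVOGADRO = 6.02214076e23             # ions per mole
ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs per monovalent ion
LITERS_PER_UM3 = 1e-15               # 1 cubic micrometer expressed in liters


def concentration_rise_mM(ions_per_um2, area_um2, volume_um3):
    """Rise in intracellular concentration (mM) when ions enter across the whole surface."""
    total_ions = ions_per_um2 * area_um2
    moles = total_ions / AVOGADRO
    liters = volume_um3 * LITERS_PER_UM3
    return 1000.0 * moles / liters


def ions_to_shift_potential(delta_v_volts, capacitance_uF_per_cm2=1.0):
    """Monovalent ions per square micrometer needed to change the membrane potential."""
    # 1 cm^2 = 1e8 um^2, so convert the assumed capacitance to farads per um^2.
    farads_per_um2 = capacitance_uF_per_cm2 * 1e-6 / 1e8
    return farads_per_um2 * delta_v_volts / ELEMENTARY_CHARGE


rise = concentration_rise_mM(ions_per_um2=1e5, area_um2=1e4, volume_um3=1e4)
needed = ions_to_shift_potential(delta_v_volts=0.1)

print(f"Na+ concentration rise: {rise:.2f} mM")
print(f"Ions per square micrometer for a 100 mV change: {needed:.0f}")
```

Under these assumptions the Na⁺ concentration rises by a fraction of a millimolar, which is small compared with an intracellular Na⁺ concentration on the order of 10 mM, and only a few thousand ions per square micrometer are needed to move the membrane potential by 100 mV. This is the sense in which the membrane potential is very sensitive to a slight change in the distribution of charge.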


Disruption of ion channels by mutations or chemicals can be 
potentially life threatening 


The generation of an action potential requires the precise coordination
of gating events of a collection of ion channels. Perturbation of
this timing can have devastating effects. For example, the rhythmic genera- 
tion of action potentials by the heart is absolutely essential to maintain 
delivery of oxygenated blood to the peripheral tissues. Long QT syndrome 
(LQTS) is a genetic disorder in which the recovery of the action potential 
from its peak potential to the resting equilibrium potential is delayed. The 
term “QT” refers to a specific feature of the cardiac electrical activity 
pattern as measured by electrocardiography. LQTS can lead to brief 
losses of consciousness (syncope), disruption of normal cardiac rhythm
(arrhythmia), and sudden death. The most common mutations identified
in LQTS patients inactivate K⁺ channels or prevent the proper trafficking
of these channels to the plasma membrane. The resulting loss in potassium 
permeability slows the repolarization of the membrane and delays the 
induction of the subsequent cardiac contraction, rendering the cardiac 
tissue susceptible to arrhythmias. 

Prolongation of the cardiac action potential in this manner can also be 
induced by a number of therapeutic drugs. In particular, the K⁺ channel
hERG (for human ether-a-go-go-related gene, named for its ortholog in
Drosophila melanogaster) is highly susceptible to interactions with certain
drugs. The hydrophobic regions of these drugs can block hERG by binding
to two nonconserved aromatic residues on the internal surface of the channel
cavity. In addition, this cavity is predicted to be wider than that of other K⁺
channels because of the absence of a conserved Pro-X-Pro motif within the
S6 hydrophobic segment. Inhibition of hERG by these drugs can lead to an
increased risk of cardiac arrhythmias and sudden death. Accordingly, a 
number of these agents, such as the antihistamine terfenadine, have been 
withdrawn from the market. Screening for the inhibition of hERG is now a 
critical safety hurdle for the pharmaceutical advancement of a molecule to 
an approved drug. 


13.5 Gap Junctions Allow Ions and Small Molecules to
Flow Between Communicating Cells


The ion channels that we have considered thus far have narrow pores and 
are moderately to highly selective in the ions that they allow to pass through 
them. They are closed in the resting state and have short lifetimes in the 
open state, typically a millisecond, that enable them to transmit highly 
frequent neural signals. We turn now to a channel with a very different role. 
Gap junctions, also known as cell-to-cell channels, serve as passageways 
between the interiors of contiguous cells. Gap junctions are clustered in 
discrete regions of the plasma membranes of apposed cells. Electron micro- 
graphs of sheets of gap junctions show them tightly packed in a regular 
hexagonal array (Figure 13.31). A 20-Å central hole, the lumen of the channel,
is prominent in each gap junction. These channels span the intervening
space, or gap, between apposed cells (hence, the name “gap junction”). The
width of the gap between the cytoplasms of the two cells is about 35 Å.

Small hydrophilic molecules as well as ions can pass through gap 
junctions. The pore size of the junctions was determined by microinjecting 
a series of fluorescent molecules into cells and observing their passage into 
adjoining cells. All polar molecules with a mass of less than about 1 kd can 
readily pass through these cell-to-cell channels. Thus, inorganic ions and 
most metabolites (e.g., sugars, amino acids, and nucleotides) can flow between 
the interiors of cells joined by gap junctions. In contrast, proteins, nucleic 
acids, and polysaccharides are too large to traverse these channels. Gap 
junctions are important for intercellular communication. Cells in some excit- 
able tissues, such as heart muscle, are coupled by the rapid flow of ions 
through these junctions, which ensures a rapid and synchronous response to 
stimuli. Gap junctions are also essential for the nourishment of cells that are 
distant from blood vessels, as in lens and bone. Moreover, communicating 
channels are important in development and differentiation. For example, 
the quiescent uterus transforms to a forcefully contracting organ at the onset 
of labor; the formation of functional gap junctions at that time creates a 
syncytium of muscle cells that contract in synchrony. 

Figure 13.31 Gap junctions. This electron
micrograph shows a sheet of isolated gap
junctions. The cylindrical connexons form a
hexagonal lattice having a unit-cell length of
85 Å. The densely stained central hole has
a diameter of about 20 Å. [Don W. Fawcett/
Photo Researchers.]

Figure 13.32 Schematic representation
of a gap junction. [Courtesy of Dr. Werner
Loewenstein.]


A cell-to-cell channel is made of 12 molecules of connexin, one of a 
family of transmembrane proteins with molecular masses ranging from 
30 to 42 kd. Each connexin molecule appears to have four membrane- 
spanning helices. Six connexin molecules are hexagonally arrayed to form a 
half-channel, called a connexon or hemichannel (Figure 13.32). Two connex- 
ons join end to end in the intercellular space to form a functional channel 
between the communicating cells. Cell-to-cell channels differ from other 
membrane channels in three respects: (1) they traverse two membranes 
rather than one; (2) they connect cytoplasm to cytoplasm, rather than to the 
extracellular space or the lumen of an organelle; and (3) the connexons 
forming a channel are synthesized by different cells. Gap junctions form 
readily when cells are brought together. A cell-to-cell channel, once formed, 
tends to stay open for seconds to minutes. Gap junctions are closed by high
concentrations of calcium ion and by low pH. The closing of gap junctions by Ca²⁺
and H⁺ serves to seal normal cells from injured or dying neighbors. Gap
junctions are also controlled by membrane potential and by hormone- 
induced phosphorylation. 

The human genome encodes 21 distinct connexins. Different members 
of this family are expressed in different tissues. For example, connexin 26 is 
expressed in key tissues in the ear. Mutations in this connexin are associated 
with hereditary deafness. The mechanistic basis for this deafness appears to 
be insufficient transport of ions or second-messenger molecules, such as 
inositol trisphosphate, between sensory cells. 


13.6 Specific Channels Increase the Permeability of Some 
Membranes to Water 


One more important class of channels does not take part in ion transport at 
all. Instead, these channels increase the rate at which water flows through 
membranes. As noted in Chapter 12, membranes are reasonably permeable 
to water. Why, then, are water-specific channels required? In certain tis- 
sues, in some circumstances, rapid water transport through membranes is 
necessary. In the kidney, for example, water must be rapidly reabsorbed into 
the bloodstream after filtration. Similarly, in the secretion of saliva and 
tears, water must flow quickly through membranes. These observations 
suggested the existence of specific water channels, but initially the channels 
could not be identified. 

The channels (now called aquaporins) were discovered serendipitously. 
Peter Agre noticed a protein present at high levels in red-blood-cell mem- 
branes that had been missed because the protein does not stain well with 
Coomassie brilliant blue. This protein was found in large quantities in red 
blood cells as well as in tissues such as kidneys and corneas, precisely the 
tissues thought to contain water channels. On the basis of this observation, 
further studies were designed that revealed that this 24-kd membrane pro- 
tein is, indeed, a water channel. 

The structure of aquaporin has been determined (Figure 13.33). The 
protein consists of six membrane-spanning α helices. Two loops containing
hydrophilic residues line the actual channel. Water molecules pass through
in single file at a rate of 10⁶ molecules per second. Importantly, specific
positively charged residues toward the center of the channel prevent the 
transport of protons through aquaporin. Thus, aquaporin channels will not 
disrupt proton gradients, which play fundamental roles in energy transduc- 
tion, as we will see in Chapter 18. The aquaporins reveal that channels can 
evolve that specifically do not conduct ions, as can those that do. 


Figure 13.33 Structure of aquaporin.
The structure of aquaporin viewed from the
side (left) and from the top (right). Notice the
hydrophilic residues (shown as space-filling
models) that line the water channel. [Drawn
from 1J4N.pdb.]


Summary 


13.1 The Transport of Molecules Across a Membrane May Be
Active or Passive

For a net movement of molecules across a membrane, two features are 
required: (1) the molecule must be able to cross a hydrophobic barrier 
and (2) an energy source must power the movement. Lipophilic mol- 
ecules can pass through a membrane’s hydrophobic interior by simple 
diffusion. These molecules will move down their concentration gradi- 
ents. Polar or charged molecules require proteins to form passages 
through the hydrophobic barrier. Passive transport or facilitated diffu- 
sion takes place when an ion or polar molecule moves down its concen- 
tration gradient. If a molecule moves against a concentration gradient, 
an external energy source is required; this movement is referred to as 
active transport and results in the generation of concentration gradi- 
ents. The electrochemical potential measures the combined ability of a 
concentration gradient and an uneven distribution of charge to drive 
species across a membrane. 
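
The electrochemical potential can be illustrated with a short calculation based on the standard expression ΔG = RT ln(c₂/c₁) + ZFΔV. The Python sketch below is an illustration only. The function name transport_free_energy_kJ is hypothetical, the Na⁺ concentrations and membrane potential are the ones quoted in the problem set for this chapter (143 mM outside, 14 mM inside, −50 mV), and the temperature of 37°C is an assumption.

```python
import math

R = 8.314462618   # gas constant, J mol^-1 K^-1
F = 96485.33212   # Faraday constant, C mol^-1


def transport_free_energy_kJ(c_from_mM, c_to_mM, z, delta_v_mV, temp_C=37.0):
    """Free-energy change (kJ/mol) for moving an ion of charge z across a membrane.

    delta_v_mV is the potential on the destination side minus the potential
    on the side the ion starts from.
    """
    temp_K = temp_C + 273.15
    chemical = R * temp_K * math.log(c_to_mM / c_from_mM)   # concentration term
    electrical = z * F * (delta_v_mV / 1000.0)              # charge term
    return (chemical + electrical) / 1000.0


# Na+ entering a cell: 143 mM outside, 14 mM inside, inside at -50 mV.
dg_na_entry = transport_free_energy_kJ(143.0, 14.0, z=1, delta_v_mV=-50.0)
print(f"Free-energy change for Na+ entry: {dg_na_entry:.1f} kJ/mol")
```

The negative result, close to −10.8 kJ mol⁻¹ under these assumptions, shows that both the concentration gradient and the membrane potential favor Na⁺ entry. Reversing the direction changes the sign, which is the energy that a pump must supply to move the ion back out.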


13.2 Two Families of Membrane Proteins Use ATP Hydrolysis to Pump Ions
Across Membranes

Active transport is often carried out at the expense of ATP hydrolysis. 
P-type ATPases pump ions against a concentration gradient and 
become transiently phosphorylated on an aspartic acid residue in the 
process of transport. P-type ATPases, which include the sarcoplasmic 
reticulum Ca²⁺ ATPase and the Na⁺–K⁺ ATPase, are integral membrane
proteins with conserved structures and catalytic mechanisms.
Membrane proteins containing ATP-binding cassette domains are 
another family of ATP-dependent pumps. Each pump includes four 
major domains: two domains span the membrane and two others con- 
tain ABC P-loop ATPase structures. These pumps are not phospho- 
rylated during pumping; rather, they use the energy of ATP binding 
and hydrolysis to drive conformational changes that result in the 
transport of specific substrates across membranes. The multidrug- 
resistance proteins confer resistance on cancer cells by pumping che- 
motherapeutic drugs out of a cancer cell before the drugs can exert 
their effects. 

13.3 Secondary Transporters Use One Concentration Gradient to Power
the Formation of Another

Carriers are proteins that transport ions or molecules across the mem- 
brane without hydrolysis of ATP. They can be classified as uniporters, 
antiporters, and symporters. Uniporters transport a substrate in either 
direction, determined by the concentration differences. Antiporters 
and symporters can mediate secondary active transport by coupling 
the uphill flow of one ion or molecule to the downhill flow of another. 
Antiporters couple the downhill flow of one type of ion in one direc- 
tion to the uphill flow of another in the opposite direction. Symporters 
move both ions in the same direction. Studies of the lactose permease 
from E. coli have been a source of insight into both the structures and 
the mechanisms of secondary transporters. 


13.4 Specific Channels Can Rapidly Transport Ions Across Membranes

Ion channels allow the rapid movement of ions across the hydrophobic 
barrier of the membrane. The activity of individual ion-channel mol- 
ecules can be observed by using patch-clamp techniques. Many ion 
channels have a common structural framework. In regard to K⁺ channels,
hydrated potassium ions must transiently lose their coordinated
water molecules as they move to the narrowest part of the channel,
termed the selectivity filter. In the selectivity filter, peptide carbonyl
groups coordinate the ions. Rapid ion flow through the selectivity filter
is facilitated by ion–ion repulsion, with one ion pushing the next
ion through the channel. Some ion channels are voltage gated: changes 
in membrane potential induce conformational changes that open these 
channels. Many channels spontaneously inactivate after having been 
open for a short period of time. In some cases, inactivation is due to the 
binding of a domain of the channel termed the “ball” in the mouth of 
the channel to block it. Other channels, typified by the acetylcholine 
receptor, are opened or closed by the binding of ligands. Ligand-gated 
and voltage-gated channels work in concert to generate action poten- 
tials. Inherited mutations or drugs that interfere with the ion channels 
that produce the action potential can result in potentially life threaten- 
ing conditions. 


13.5 Gap Junctions Allow Ions and Small Molecules to Flow Between
Communicating Cells

In contrast with many channels, which connect the cell interior with 
the environment, gap junctions, or cell-to-cell channels, serve to con- 
nect the interiors of contiguous groups of cells. A cell-to-cell channel 
is composed of 12 molecules of connexin, which associate to form two 
6-membered connexons. 


13.6 Specific Channels Increase the Permeability of Some
Membranes to Water

Some tissues contain proteins that increase the permeability of mem- 
branes to water. Each water-channel-forming protein, termed an 
aquaporin, consists of six membrane-spanning α helices and a central
channel lined with hydrophilic residues that allow water molecules to 
pass in single file. Aquaporins do not transport protons. 

Problems


1. A helping hand. Differentiate between simple diffusion 
and facilitated diffusion. 


2. Powering movement. What are the two forms of energy 
that can power active transport? 


3. Carriers. Name the three types of carrier proteins. 
Which of these can mediate secondary active transport? 


4. The price of extrusion. What is the free-energy cost of 
pumping Ca²⁺ out of a cell when the cytoplasmic concentration
is 0.4 μM, the extracellular concentration is
1.5 mM, and the membrane potential is −60 mV?


5. Equilibrium potentials. For a typical mammalian cell, the
intracellular and extracellular concentrations of the chloride
ion (Cl⁻) are 4 mM and 150 mM, respectively. For the calcium
ion (Ca²⁺), the intracellular and extracellular concentrations
are 0.2 μM and 1.8 mM, respectively. Calculate the
equilibrium potentials at 37°C for these two ions. 


6. How sweet it is. Some animal cells take up glucose by a 
symporter powered by the simultaneous entry of Na⁺. The
entry of Na⁺ provides a free-energy input of 10.8 kJ mol⁻¹
(2.6 kcal mol⁻¹) under typical cellular conditions (external
[Na⁺] = 143 mM, internal [Na⁺] = 14 mM, and membrane
potential = −50 mV). How large a concentration of
glucose can be generated by this free-energy input? 


7. Variations on a theme. Write a detailed mechanism for 
transport by the Na⁺–K⁺ ATPase based on analogy with
the mechanism of the Ca²⁺ ATPase shown in Figure 13.5.


8. Pumping protons. Design an experiment to show that the 
action of lactose permease can be reversed in vitro to pump 
protons. 


9. Opening channels. Differentiate between ligand-gated 
and voltage-gated channels. 


10. Different directions. The K⁺ channel and the Na⁺ channel
have similar structures and are arranged in the same
orientation in the cell membrane. Yet the Na⁺ channel
allows sodium ions to flow into the cell and the K⁺ channel
allows potassium ions to flow out of the cell. Explain. 


11. Differing mechanisms. Distinguish the mechanisms by 
which uniporters and channels transport ions or molecules 
across the membrane. 


12. Short circuit. Carbonyl cyanide 4-(trifluoromethoxy) 
phenylhydrazone (FCCP) is a proton ionophore: it enables 
protons to pass freely through membranes. Treatment of 
E. coli with FCCP prevents the accumulation of lactose in 
these cells. Explain. 


13. Working together. The human genome contains more 
than 20 connexin-encoding genes. Several of these genes are 
expressed in high levels in the heart. Why are connexins so 
highly expressed in cardiac tissue? 


14. Structure—activity relations. On the basis of the struc- 
ture of tetrodotoxin, propose a mechanism by which the 
toxin inhibits Na⁺ flow through the Na⁺ channel.


15. Hot stuff. When SERCA is incubated with [γ-³²P]ATP
(a form of ATP in which the terminal phosphate is labeled
with radioactive ³²P) and calcium at 0°C for 20 seconds
and analyzed by gel electrophoresis, a radioactive band is 
observed at the molecular weight corresponding to full- 
length SERCA. Why is a labeled band observed? Would 
you expect a similar band if you were performing a similar 
assay, with a suitable substrate, for the MDR protein? 

16. A dangerous snail. Cone snails are carnivores that inject
a powerful set of toxins into their prey, leading to rapid 
paralysis. Many of these toxins are found to bind to specific 
ion-channel proteins. Why are such molecules so toxic? 
How might such toxins be useful for biochemical studies? 


17. Pause for effect. Immediately after the repolarization 
phase of an action potential, the neuronal membrane is tem- 
porarily unable to respond to the stimulation of a second 
action potential, a phenomenon referred to as the refractory 
period. What is the mechanistic basis for the refractory 
period? 


18. Only a few. Why do only a small number of sodium 
ions need to flow through the Na⁺ channel to change the
membrane potential significantly? 


19. More than one mechanism. How might a mutation in a
cardiac voltage-dependent sodium channel cause long QT 
syndrome? 


20. Mechanosensitive channels. Many species contain ion 
channels that respond to mechanical stimuli. On the basis 
of the properties of other ion channels, would you expect 
the flow of ions through a single open mechanosensitive 
channel to increase in response to an appropriate stimulus? 
Why or why not? 


21. Concerted opening. Suppose that a channel obeys the 
concerted allosteric model (MWC model, Section 7.2). The 
binding of ligand to the R state (the open form) is 20 times 
as tight as that to the T state (the closed form). In the 
absence of ligand, the ratio of closed to open channels is 10⁵.
If the channel is a tetramer, what is the fraction of open 
channels when 1, 2, 3, and 4 ligands are bound? 


22. Respiratory paralysis. The neurotransmitter acetylcho- 
line is degraded by a specific enzyme that is inactivated by 
Tabun, sarin, and parathion. On the basis of the structures 
below, propose a possible basis for their lethal actions. 


[Chemical structures of tabun, sarin, and parathion]


23. Ligand-induced channel opening. The ratio of open to 
closed forms of the acetylcholine receptor channel containing
zero, one, and two bound acetylcholine molecules is 5 ×
10⁻⁶, 1.2 × 10⁻³, and 14, respectively.


(a) By what factor is the open-to-closed ratio increased by 
the binding of the first acetylcholine molecule? The second 
acetylcholine molecule? 


(b) What are the corresponding free-energy contributions 
to channel opening at 25°C? 


(c) Can the allosteric transition be accounted for by the 
MWC concerted model (Section 7.2)? 


24. Frog poison. Batrachotoxin (BTX) is a steroidal alkaloid
from the skin of Phyllobates terribilis, a poisonous
Colombian frog (the source of the poison used on blowgun
darts). In the presence of BTX, Na⁺ channels in an excised
patch stay persistently open when the membrane is depolar- 
ized. They close when the membrane is repolarized. Which 
transition is blocked by BTX? 


25. Valium target. γ-Aminobutyric acid (GABA) opens
channels that are specific for chloride ions. The GABAA
receptor channel is pharmacologically important because it
is the target of Valium, which is used to diminish anxiety. 


(a) The extracellular concentration of Cl⁻ is 123 mM and
the intracellular concentration is 4 mM. In which direction
does Cl⁻ flow through an open channel when the membrane
potential is in the −60 mV to +30 mV range?


(b) What is the effect of Cl⁻-channel opening on the
excitability of a neuron? 


(c) The hydropathy profile of the GABAA receptor resembles
that of the acetylcholine receptor. Predict the number
of subunits in this Cl⁻ channel.


26. Understanding SERCA. To study the mechanism of 
SERCA, you prepare membrane vesicles containing this 
protein oriented such that its ATP binding site is on the 
outer surface of the vesicle. To measure pump activity, you 
use an assay that detects the formation of inorganic phos- 
phate in the medium. When you add calcium and ATP to 
the medium, you observe phosphate production for only a 
short period of time. Only after the addition of calcimycin, 
a molecule that makes membranes selectively permeable to 
calcium, do you observe sustained phosphate production. 
Explain. 


Chapter Integration Problem 


27. Speed and efficiency matter. Acetylcholine is rapidly 
destroyed by the enzyme acetylcholinesterase. This 
enzyme, which has a turnover number of 25,000 per second, 
has attained catalytic perfection with a kcat/KM of 2 ×
10⁸ M⁻¹ s⁻¹. Why is the efficiency of this enzyme physiologically
crucial?


Mechanism Problem 


28. Remembrance of mechanisms past. Acetylcholinesterase 
converts acetylcholine into acetate and choline. Like serine 
proteases, acetylcholinesterase is inhibited by DIPF. 
Propose a catalytic mechanism for acetylcholine digestion 
by acetylcholinesterase. Show the reaction as chemical 
structures. 


Data Interpretation Problems 


29. Tarantula toxin. Acid sensing is associated with pain, 
tasting, and other biological activities (Chapter 33). Acid 
sensing is carried out by a ligand-gated channel that permits 
Na⁺ influx in response to H⁺. This family of acid-sensitive
ion channels (ASICs) includes a number of members. 
Psalmotoxin 1 (PcTX1), a venom from the tarantula, 
inhibits some members of this family. The following 
electrophysiological recordings of cells containing several 
members of the ASIC family were made in the presence of 
the toxin at a concentration of 10 nM. The channels were 
opened by changing the pH from 7.4 to the indicated 
values. The PcTX1 was present for a short time (indicated 
by the black bar above the recordings below), after which 
time it was rapidly washed from the system. 


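[Figure: (A) electrophysiological recordings for ASIC1a, ASIC1b, ASIC2a, and
ASIC3, with the channels opened at the indicated pH values; time scale bar, 100 s.
(B) peak current (scale 0 to 100) versus [PcTX1] in nM, plotted from 0.01 to 10 nM.]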


(A) Electrophysiological recordings of cells exposed to tarantula toxin. 
(B) Plot of peak current of a cell containing the ASIC1a protein 
versus the toxin concentration. [From P. Escoubas et al. J. Biol. Chem.
275(2000):25116–25121.]


(a) Which member of the ASIC family (ASIC1a, ASIC1b,
ASIC2a, or ASIC3—is most sensitive to the toxin? 


(b) Is the effect of the toxin reversible? Explain. 


(c) What concentration of PcTX1 yields 50% inhibition of
the sensitive channel? 


30. Channel problems 1. A number of pathological condi- 
tions result from mutations in the acetylcholine receptor 
channel. One such mutation in the β subunit, βV266M,
causes muscle weakness and rapid fatigue. An investigation 
of the acetylcholine-generated currents through the acetyl- 
choline receptor channel for both a control and a patient 
yielded the following results. 


[Figure: single-channel current recordings for a control and a patient, each
trace marked with the closed-channel and open-channel current levels.]


What is the effect of the mutation on channel function? 
Suggest some possible biochemical explanations for the 
effect. 


31. Channel problems 2. The acetylcholine receptor channel 
can also undergo mutation leading to fast-channel syn- 
drome (FCS), with clinical manifestations similar to those 
of slow-channel syndrome (SCS). What would the record- 
ings of ion movement look like in this syndrome? Suggest a 
biochemical explanation. 


32. Transport differences. The rate of transport of two mol- 
ecules, indole and glucose, across a cell membrane is shown 
below. What are the differences between the transport 
mechanisms of the two molecules? Suppose that ouabain 
inhibited the transport of glucose. What would this inhibi- 
tion suggest about the mechanism of transport? 


[Figure: rate of transport versus solute concentration (mM, axis marked at
20, 40, and 60) for indole and glucose.]

Signal-Transduction Pathways

Signal-transduction circuits in biological systems have molecular on–off
switches that, like those in a computer chip (above), transmit information
when “on.” Common among these circuits are those including G proteins
(right), which transmit a signal when bound to GTP and are silent when
bound to GDP. [(Left) Courtesy of Intel.]


OUTLINE

14.1 Heterotrimeric G Proteins Transmit Signals and Reset Themselves

14.2 Insulin Signaling: Phosphorylation Cascades Are Central to Many
Signal-Transduction Processes

14.3 EGF Signaling: Signal-Transduction Systems Are Poised to Respond

14.4 Many Elements Recur with Variation in Different
Signal-Transduction Pathways

14.5 Defects in Signal-Transduction Pathways Can Lead to Cancer
and Other Diseases


A cell is highly responsive to specific chemicals in its environment: it may
adjust its metabolism or alter gene-expression patterns on sensing their
presence. In multicellular organisms, these chemical signals are crucial to
coordinating physiological responses (Figure 14.1). Three examples of
molecular signals that stimulate a physiological response are epinephrine
(sometimes called adrenaline), insulin, and epidermal growth factor (EGF).
When a mammal is threatened, its adrenal glands release the hormone
epinephrine, which stimulates the mobilization of energy stores and leads to
improved cardiac function. After a full meal, the β cells in the pancreas
release insulin, which stimulates a host of physiological responses, including
the uptake of glucose from the bloodstream and its storage as glycogen.
The release of EGF in response to a wound stimulates specific cells to
grow and divide. In all these cases, the cell receives information that a
certain molecule within its environment is present above some threshold
concentration. The chain of events that converts the message “this molecule
is present” into the ultimate physiological response is called signal
transduction.

Signal-transduction pathways often comprise many components and
branches. They can thus be immensely complicated and confusing. However,
the logic of signal transduction can be revealed by examining the
common strategies and classes of molecules that recur in these pathways.
These principles are introduced here because signal-transduction
pathways affect essentially all of the metabolic pathways that we will be
exploring throughout the rest of the book.


Figure 14.1 Three signal-transduction pathways. The binding of signaling molecules to their
receptors initiates pathways that lead to important physiological responses.

Epinephrine + β-adrenergic receptor → energy-store mobilization
Insulin + insulin receptor → increased glucose uptake
Epidermal growth factor (EGF) + EGF receptor → expression of growth-promoting genes


Figure 14.2 Principles of signal
transduction. An environmental signal is first
received by interaction with a cellular
component, most often a cell-surface
receptor. The information that the signal has
arrived is then converted into other chemical
forms, or transduced. The transduction
process often comprises many steps. The
signal is often amplified before evoking a
response. Feedback pathways regulate the
entire signaling process.


Signal transduction depends on molecular circuits 


Signal-transduction pathways follow a broadly similar course that can be 
viewed as a molecular circuit (Figure 14.2). All such circuits contain certain 
key steps: 


1. Release of the Primary Messenger. A stimulus such as a wound or digested 
meal triggers the release of the signal molecule, also called the primary 
messenger. 


2. Reception of the Primary Messenger. Most signal molecules do not enter 
cells. Instead, proteins in the cell membrane act as receptors that bind the 
signal molecules and transfer the information that the molecule has bound 
from the environment to the cell’s interior. Receptors span the cell mem- 
brane and thus have both extracellular and intracellular components. A 
binding site on the extracellular side specifically recognizes the signal mol- 
ecule (often referred to as the ligand). Such binding sites are analogous to 
enzyme active sites except that no catalysis takes place within them. The 
interaction of the ligand and the receptor alters the tertiary or quaternary 
structure of the receptor so as to induce a structural change on the intracel- 
lular side. 


3. Delivery of the Message Inside the Cell by the Second Messenger. Other 
small molecules, called second messengers, are used to relay information 
from receptor–ligand complexes. Second messengers are intracellular molecules
that change in concentration in response to environmental signals
and mediate the next step in the molecular information circuit. Some particularly
important second messengers are cyclic AMP (cAMP) and cyclic
GMP (cGMP), calcium ion, inositol 1,4,5-trisphosphate (IP₃), and diacylglycerol
(DAG; Figure 14.3).

The use of second messengers has several consequences. First, the signal 
may be amplified significantly in the generation of second messengers. 
Only a small number of receptor molecules may be activated by the direct 
binding of signal molecules, but each activated receptor molecule can lead 
to the generation of many second messengers. Thus, a low concentration of 
signal in the environment, even as little as a single molecule, can yield a large 
intracellular signal and response. Second, second messengers are often free to 
diffuse to other cellular compartments where they can influence processes 
throughout the cell. Third, the use of common second messengers in multiple
signaling pathways creates both opportunities and potential problems.
Input from several signaling pathways, often called cross talk, may alter the 
concentration of a common second messenger. Cross talk permits more 
finely tuned regulation of cell activity than would the action of individual 
independent pathways. However, inappropriate cross talk can result in the 
misinterpretation of changes in second-messenger concentration. 


4. Activation of Effectors That Directly Alter the Physiological Response. The 
ultimate effect of the signal pathway is to activate (or inhibit) the pumps, 
channels, enzymes, and transcription factors that directly control metabolic 
pathways, gene expression, and the permeability of membranes to specific 
ions. 


5. Termination of the Signal. After a cell has completed its response to a 
signal, the signaling process must be terminated or the cell loses its respon- 
siveness to new signals. Moreover, signaling processes that fail to terminate 
properly can have highly undesirable consequences. As we will see, many 
cancers are associated with signal-transduction processes that are not prop- 
erly terminated, especially processes that control cell growth. 


In this chapter, we will examine components of the three signal-trans- 
duction pathways shown in Figure 14.1. In doing so, we will see several 
classes of adaptor domains present in signal-transduction proteins. These 
domains usually recognize specific classes of molecules and help transfer 
information from one protein to another. The components described in the 
context of these three pathways recur in many other signal-transduction 
pathways; so bear in mind that the specific examples are representative of 
many such pathways. 


14.1 Heterotrimeric G Proteins Transmit Signals 
and Reset Themselves 


Epinephrine is a hormone secreted by the adrenal glands of mammals 
in response to internal and external stressors. It exerts a wide range of 
effects—referred to as the “fight or flight” response—to help organisms 
anticipate the need for rapid muscular activity, including acceleration of 
heart rate, dilation of the smooth muscle of the airways, and initiation of 
Figure 14.3 Common second messengers. Second messengers are intracellular
molecules that change in concentration in response to environmental signals.
That change in concentration conveys information inside the cell.
Table 14.1 Biological functions mediated by 7TM receptors

Hormone action
Hormone secretion
Neurotransmission
Chemotaxis
Exocytosis
Control of blood pressure
Embryogenesis
Cell growth and differentiation
Development
Smell
Taste
Vision
Viral infection

Source: After J. S. Gutkind, J. Biol. Chem. 273:1839-1842, 1998.
Figure 14.4 The 7TM receptor. Schematic
representation of a 7TM receptor showing its 
passage through the membrane seven times. 


Figure 14.5 Structures of rhodopsin and the β2-adrenergic receptor.
Three-dimensional structure of rhodopsin (A) and the β2-adrenergic
receptor (B). Notice the resemblance in the overall architecture of
both receptors and the similar locations of the rhodopsin ligand
11-cis-retinal and the β2-AR blocker carazolol. [Drawn from 1F88.pdb and
2RH1.pdb.]


the breakdown of glycogen (Section 21.3) and fatty acids (Section 22.2). 
Epinephrine signaling begins with ligand binding to a protein called the 
β-adrenergic receptor (β-AR). The β-AR is a member of the largest class of
cell-surface receptors, called the seven-transmembrane-helix (7TM) recep- 
tors. Members of this family are responsible for transmitting information 
initiated by signals as diverse as hormones, neurotransmitters, odorants, 
tastants, and even photons (Table 14.1). More than 20,000 such receptors 
are now known. Furthermore, about one-third of the therapeutic drugs that 
we use target receptors of this class. As the name indicates, these receptors 
contain seven helices that span the membrane bilayer. The receptors are 
sometimes referred to as serpentine receptors because the single polypeptide 
chain “snakes” through the membrane seven times (Figure 14.4). 

The first member of the 7TM receptor family to have its three- 
dimensional structure determined was rhodopsin (Figure 14.5A), a protein 
in the retina of the eye that senses the presence of photons and initiates 
the signaling cascade responsible for visual sensation. A single lysine resi- 
due within rhodopsin is covalently modified by a form of vitamin A, 
11-cis-retinal. This modification is located near the extracellular side of the 
receptor, within the region surrounded by the seven transmembrane heli- 
ces. As will be considered in greater detail in Section 33.3, exposure to light 
induces the isomerization of 11-cis-retinal to its all-trans form, producing a 
structural change in the receptor that results in the initiation of an action 
potential that is ultimately interpreted by the brain as visual stimulus. 

In 2007, the three-dimensional structure of the β2 subtype of the human
adrenergic receptor (β2-AR) bound to an inhibitor was solved by x-ray
crystallography. This inhibitor, carazolol, competes with epinephrine for
binding to the β2-AR, much in the same way that competitive inhibitors
act at enzyme active sites (Section 8.5). The structure of the β2-AR revealed
considerable similarities with that of rhodopsin, particularly with respect to
the locations of 11-cis-retinal in rhodopsin and the binding site for carazolol
(Figure 14.5B). Although the precise details of the conformational changes
induced by ligand binding to the β-AR remain to be established, epinephrine
likely binds to the β-AR in a similar region of the receptor as carazolol




binds, triggering conformational changes in the cytoplasmic parts of the
β-AR comparable to those induced by retinal isomerization in rhodopsin.
Thus, the binding of a ligand from outside the cell induces a structural
rearrangement in the part of the 7TM receptor that is positioned inside
the cell.


Ligand binding to 7TM receptors leads to the 
activation of heterotrimeric G proteins 


What is the next step in the pathway? The conformational change in the
receptor's cytoplasmic domain activates a G protein, so named because
it binds guanyl nucleotides. The activated G protein stimulates the
activity of adenylate cyclase, an enzyme that catalyzes the conversion of
ATP into cAMP. The G protein and adenylate cyclase remain attached to the
membrane, whereas cAMP can travel throughout the cell carrying the signal
originally brought by the binding of epinephrine. Figure 14.6 provides a
broad overview of these steps.

Let us consider the role of the G protein in this signaling pathway in
greater detail. In its unactivated state, the G protein is bound to GDP.
In this form, the G protein exists as a heterotrimer consisting of α, β,
and γ subunits; the α subunit (referred to as Gα) binds the nucleotide
(Figure 14.7). The α subunit is a member of the P-loop NTPase family
(Section 9.4), and the P-loop participates in nucleotide binding. The α
and γ subunits are usually anchored to the membrane by covalently attached
fatty acids. The role of the hormone-bound receptor is to catalyze the
exchange of GTP for bound GDP. The hormone-receptor complex interacts with
the heterotrimeric G protein and opens the nucleotide-binding site such
that GTP in the cell can displace GDP.

Figure 14.6 Activation of protein kinase A by a G-protein pathway.
Hormone binding to a 7TM receptor initiates a signal-transduction pathway
that acts through a G protein and cAMP to activate protein kinase A.

On GTP binding, the α subunit simultaneously dissociates from the
βγ dimer (Gβγ), transmitting the signal that the receptor has bound its
ligand. A single hormone-receptor complex can stimulate nucleotide exchange
in many G-protein heterotrimers. Thus, hundreds of Gα molecules are


Figure 14.7 A heterotrimeric G protein. (A) A ribbon diagram shows the
relation between the three subunits. In this complex, the α subunit (gray
and purple) is bound to GDP. Notice that GDP is bound in a pocket close to
the surface at which the α subunit interacts with the βγ dimer. (B) A
schematic representation of the heterotrimeric G protein. [Drawn from
1GOT.pdb.]




Figure 14.8 Adenylate cyclase activation. (A) Adenylate cyclase is a
membrane protein with two large intracellular domains that contain the
catalytic apparatus. (B) The structure of a complex between Gα in its GTP
form bound to a catalytic fragment from adenylate cyclase. Notice that the
surface of Gα that had been bound to the βγ dimer now binds adenylate
cyclase. [Drawn from 1AZS.pdb.]
Figure 14.9 Epinephrine signaling pathway. The binding of epinephrine to
the β-adrenergic receptor initiates the signal-transduction pathway. The
process in each step is indicated (in black) at the left of each arrow.
Steps that have the potential for signal amplification are indicated at
the right in green.
converted from their GDP form into their GTP form for each bound molecule
of hormone, giving an amplified response. Because they signal through
G proteins, 7TM receptors are often called G-protein-coupled
receptors (GPCRs).


Activated G proteins transmit signals by binding to other proteins 


In the GTP form, the surface of Gα that had been bound to Gβγ has changed
its conformation from the GDP form so that it no longer has a high affinity
for Gβγ. This surface is now exposed for binding to other proteins. In the
β-AR pathway, the new binding partner is adenylate cyclase, the enzyme
that converts ATP into cAMP. This enzyme is a membrane protein that
contains 12 membrane-spanning helices; two large cytoplasmic domains
form the catalytic part of the enzyme (Figure 14.8). The interaction of Gα
with adenylate cyclase favors a more catalytically active conformation of
the enzyme, thus stimulating cAMP production. Indeed, the Gα subunit
that participates in the β-AR pathway is called Gαs ("s" stands for
stimulatory). The net result is that the binding of epinephrine to the
receptor on the cell surface increases the rate of cAMP production inside
the cell. The generation of cAMP by adenylate cyclase provides a second
level of amplification because each activated adenylate cyclase can
convert many molecules of ATP into cAMP.
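
The two amplification stages just described can be made concrete with a little arithmetic. The following Python sketch is only an illustration under a purely multiplicative model: the function name `amplified_camp` and the gain values (100 Gα activated per receptor, 1,000 cAMP formed per cyclase) are hypothetical placeholders, not measured quantities, and the one-to-one pairing of Gα with adenylate cyclase is a simplifying assumption.

```python
def amplified_camp(hormone_molecules, g_alpha_per_receptor, camp_per_cyclase):
    """Number of cAMP molecules formed under a simple multiplicative model.

    All gain values passed in are hypothetical; they only illustrate how
    two amplification stages compound.
    """
    # Stage 1: each hormone-receptor complex activates many G-alpha subunits.
    active_g_alpha = hormone_molecules * g_alpha_per_receptor
    # Assumption: each active G-alpha stimulates exactly one adenylate cyclase.
    active_cyclase = active_g_alpha
    # Stage 2: each stimulated cyclase converts many ATP molecules into cAMP.
    return active_cyclase * camp_per_cyclase


# One bound hormone molecule with the hypothetical gains named above.
print(amplified_camp(1, 100, 1000))  # 100000
```

Because the stages multiply rather than add, even modest gains at each step compound into a large intracellular signal from a single bound hormone molecule.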


Cyclic AMP stimulates the phosphorylation of many target 
proteins by activating protein kinase A 


The increased concentration of cAMP can affect a wide range of cellular 
processes. In the muscle, cAMP stimulates the production of ATP for 
muscle contraction. In other cell types, cAMP enhances the degradation of 
storage fuels, increases the secretion of acid by the gastric mucosa, leads to 
the dispersion of melanin pigment granules, diminishes the aggregation of 
blood platelets, and induces the opening of chloride channels. How does 
cAMP influence so many cellular processes? Most effects of cAMP in 
eukaryotic cells are mediated by the activation of a single protein kinase. This
key enzyme is protein kinase A (PKA). 

As described earlier, PKA consists of two regulatory (R) chains and two
catalytic (C) chains (see Figure 10.17). In the absence of cAMP, the R2C2
complex is catalytically inactive. The binding of cAMP to the regulatory
chains releases the catalytic chains, which are catalytically active on their
own. Activated PKA then phosphorylates specific serine and threonine
residues in many targets to alter their activity. For instance, PKA
phosphorylates two enzymes that lead to the breakdown of glycogen, the
polymeric store of glucose, and the inhibition of further glycogen synthesis
(Section 21.3). PKA stimulates the expression of specific genes by
phosphorylating a transcriptional activator called the cAMP response
element binding (CREB) protein. This activity of PKA illustrates that
signal-transduction pathways can extend into the nucleus to alter gene
expression.

The signal-transduction pathway initiated by epinephrine is summa- 
rized in Figure 14.9. 


G proteins spontaneously reset themselves through GTP hydrolysis 


How is the signal initiated by epinephrine switched off? Gα subunits have
intrinsic GTPase activity, which is used to hydrolyze bound GTP to GDP
and Pi. This hydrolysis reaction is slow, however, requiring from seconds to
minutes. Thus, the GTP form of Gα is able to activate downstream
components of the signal-transduction pathway before it is deactivated by
GTP hydrolysis. In essence, the bound GTP acts as a built-in clock that
spontaneously resets the Gα subunit after a short time period. After GTP
hydrolysis and the release of Pi, the GDP-bound form of Gα then reassociates
with Gβγ to re-form the inactive heterotrimeric protein (Figure 14.10).
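
The "built-in clock" can be pictured as a first-order decay of the GTP-bound form. The Python sketch below is a minimal illustration that assumes simple first-order hydrolysis with no re-activation by the receptor; the function names and the rate constant `k` are hypothetical, with `k` chosen only so that the half-life falls in the seconds-to-minutes range mentioned above.

```python
import math


def fraction_gtp_bound(t_seconds, k_per_second):
    """Fraction of G-alpha still in the GTP form after t seconds,
    assuming first-order hydrolysis and no further nucleotide exchange."""
    return math.exp(-k_per_second * t_seconds)


def half_life(k_per_second):
    """Time for half of the GTP-bound G-alpha to be hydrolyzed."""
    return math.log(2) / k_per_second


k = 0.05  # hypothetical rate constant (per second), for illustration only
print(round(half_life(k), 1))                 # 13.9
print(round(fraction_gtp_bound(60.0, k), 3))  # 0.05
```

With this assumed rate constant, about 95% of the activated Gα would have reset itself within a minute, which is long enough to activate downstream components yet short enough to end the signal without any outside intervention.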




The hormone-bound activated receptor must be reset as well to prevent 
the continuous activation of G proteins. This resetting is accomplished by 
two processes (Figure 14.11). First, the hormone dissociates, returning the 
receptor to its initial, unactivated state. The likelihood that the receptor 




Figure 14.11 Signal termination. Signal transduction by the 7TM receptor is halted (1) by 
dissociation of the signal molecule from the receptor and (2) by phosphorylation of the 
cytoplasmic C-terminal tail of the receptor and the subsequent binding of β-arrestin.
Figure 14.10 Resetting Gα. On hydrolysis of the bound GTP by the intrinsic
GTPase activity of Gα, Gα reassociates with the βγ dimer to form the
heterotrimeric G protein, thereby terminating the activation of adenylate
cyclase.
remains in its unbound state depends on the extracellular concentration of
hormone. Second, the signaling cascade initiated by the hormone-receptor
complex activates a kinase that phosphorylates serine and threonine
residues in the carboxyl-terminal tail of the receptor. These phosphorylation
events result in the deactivation of the receptor. In the example under
consideration, β-adrenergic-receptor kinase (also called G-protein-coupled
receptor kinase 2, GRK2) phosphorylates the carboxyl-terminal tail of the
hormone-receptor complex but not the unoccupied receptor. Finally, the
molecule β-arrestin binds to the phosphorylated receptor and further
diminishes its ability to activate G proteins.
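
The first resetting process, hormone dissociation, depends on how much hormone remains outside the cell. A minimal way to see this is the single-site equilibrium binding relation, occupied fraction = [L] / (Kd + [L]). The Python sketch below assumes that simple model; the function name and the dissociation constant of 10 nM are hypothetical values chosen for illustration and are not taken from the text.

```python
def fraction_occupied(hormone_conc, kd):
    """Fraction of receptors with hormone bound at equilibrium,
    assuming one independent binding site per receptor.
    Both arguments must be in the same concentration units."""
    return hormone_conc / (kd + hormone_conc)


kd_nM = 10.0  # hypothetical dissociation constant, for illustration only
for conc_nM in (1.0, 10.0, 100.0):
    occupied = fraction_occupied(conc_nM, kd_nM)
    print(f"{conc_nM:6.1f} nM hormone: "
          f"occupied {occupied:.2f}, unoccupied {1 - occupied:.2f}")
```

As the extracellular hormone concentration falls below the assumed Kd, most receptors return to the unoccupied state, which is the behavior the paragraph above describes qualitatively.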


Some 7TM receptors activate the phosphoinositide cascade 


We now turn to another common second-messenger cascade, also employing
a 7TM receptor, that is used by many hormones to evoke a variety of
responses. The phosphoinositide cascade, like the cAMP cascade, converts 
extracellular signals into intracellular ones. The intracellular messengers 
formed by activation of this pathway arise from the cleavage of phosphati- 
dylinositol 4,5-bisphosphate (PIP2), a phospholipid present in cell mem- 
branes. An example of a signaling pathway based on the phosphoinositide 
cascade is the one triggered by the receptor for angiotensin II, a peptide 
hormone that controls blood pressure. 

Each type of 7TM receptor signals through a distinct G protein.
Whereas the β-adrenergic receptor activates the G protein Gαs, the
angiotensin II receptor activates a G protein called Gαq. In its GTP form,
Gαq binds to and activates the β isoform of the enzyme phospholipase C. This
enzyme catalyzes the cleavage of PIP2 into the two second messengers
inositol 1,4,5-trisphosphate (IP3) and diacylglycerol (DAG; Figure 14.12).

IP3 is soluble and diffuses away from the membrane. This second messenger
causes the rapid release of Ca2+ from the intracellular stores in the




Figure 14.12 Phospholipase C reaction. Phospholipase C cleaves the membrane lipid 
phosphatidylinositol 4,5-bisphosphate into two second messengers: diacylglycerol, which remains in 
the membrane, and inositol 1,4,5-trisphosphate, which diffuses away from the membrane. 


endoplasmic reticulum (ER), which accumulates a reservoir of Ca2+ through
the action of transporters such as Ca2+ ATPase (Section 13.2). On binding
IP3, specific IP3-gated Ca2+-channel proteins in the ER membrane open to
allow calcium ions to flow from the ER into the cytoplasm. Calcium ion is
itself a signaling molecule: it can bind proteins, including a ubiquitous
signaling protein called calmodulin and enzymes such as protein kinase C.
By such means, the elevated level of cytoplasmic Ca2+ triggers processes
such as smooth-muscle contraction, glycogen breakdown, and vesicle release.

DAG remains in the plasma membrane. There, it 
activates protein kinase C (PKC), a protein kinase that 
phosphorylates serine and threonine residues in many 
target proteins. To bind DAG, the specialized DAG- 
binding domains of this kinase require bound calcium. 
Note that diacylglycerol and IP3 work in tandem: IP3
increases the Ca2+ concentration, and Ca2+ facilitates the
DAG-mediated activation of protein kinase C. The
phosphoinositide cascade is summarized in Figure 14.13. 
Both IP3 and DAG act transiently because they are con- 
verted into other species by phosphorylation or other 
processes. 


Calcium ion is a widely used second messenger 


Calcium ion participates in many signaling processes in addition to the
phosphoinositide cascade. Several properties of this ion account for its
widespread use as an intracellular messenger. First, fleeting changes in
Ca2+ concentration are readily detected. At steady state, intracellular
levels of Ca2+ must be kept low to prevent the precipitation of
carboxylated and phosphorylated compounds, which form poorly soluble salts
with Ca2+. Calcium ion levels are kept low by transport systems that
extrude Ca2+ from the cytoplasm. Because of their action, the cytoplasmic
concentration of Ca2+ is approximately 100 nM, several orders of magnitude
lower than that of the extracellular medium. Given this low steady-state
level, transient increases in Ca2+ concentration produced by signaling
events can be readily sensed.

A second property of Ca2+ that makes it a highly suitable intracellular
messenger is that it can bind tightly to proteins and induce substantial
structural rearrangements. Calcium ions bind well to negatively charged
oxygen atoms (from the side chains of glutamate and aspartate) and
uncharged oxygen atoms (main-chain carbonyl groups and side-chain oxygen
atoms from glutamine and asparagine; Figure 14.14). The capacity of Ca2+
to be coordinated to multiple ligands (from six to eight oxygen atoms)
enables it to cross-link different segments of a protein and induce
significant conformational changes.

Our understanding of the role of Ca2+ in cellular processes has been
greatly enhanced by our ability to detect changes in Ca2+ concentrations
inside cells and even monitor these changes in real time. This ability
depends on the use of specially designed dyes such as Fura-2 that bind
Ca2+ and change their fluorescent properties on Ca2+ binding. Fura-2 binds
Ca2+ through appropriately positioned oxygen atoms (shown in red) within
its structure. When such a dye is introduced into cells, changes in
available Ca2+ concentration can be monitored with microscopes capable of
detecting changes in fluorescence (Figure 14.15). Probes for sensing other
second messengers such as cAMP also have been developed. These
molecular-imaging agents are greatly enhancing our understanding of
signal-transduction processes.

Figure 14.13 Phosphoinositide cascade. The cleavage of
phosphatidylinositol 4,5-bisphosphate (PIP2) into diacylglycerol (DAG)
and inositol 1,4,5-trisphosphate (IP3) results in the release of calcium
ions (owing to the opening of the IP3 receptor ion channels) and the
activation of protein kinase C (owing to the binding of protein kinase C
to free DAG in the membrane). Calcium ions bind to protein kinase C
and help facilitate its activation.

Figure 14.14 Calcium-binding site. In one common mode of binding, calcium
is coordinated to six oxygen atoms of a protein and one (top) of water.


Figure 14.15 Calcium imaging. (A) The fluorescence spectra of the
calcium-binding dye Fura-2 can be used to measure available calcium ion
concentrations in solution and in cells. (B) A series of images shows Ca2+
spreading across a cell. These images were obtained through the use of a
fluorescent calcium-binding dye. The images are false colored: red
represents high Ca2+ concentrations, and blue represents low Ca2+
concentrations. [(A) After S. J. Lippard and J. M. Berg, Principles of
Bioinorganic Chemistry (University Science Books, 1994), p. 193;
(B) courtesy of Dr. Masashi Isshiki, Department of Nephrology, University
of Tokyo, and Dr. G. W. Anderson, Department of Cell Biology, University of
Texas Southwestern Medical School.]


Calcium ion often activates the regulatory protein calmodulin 




Calmodulin (CaM), a 17-kd protein with four Ca2+-binding sites, serves as
a calcium sensor in nearly all eukaryotic cells. At cytoplasmic concentrations
above about 500 nM, Ca2+ binds to and activates calmodulin. Calmodulin is
a member of the EF-hand protein family. The EF hand is a Ca2+-binding
motif that consists of a helix, a loop, and a second helix. This motif,
originally discovered in the protein parvalbumin, was named the EF hand
because the two key helices designated E and F in parvalbumin are
positioned like the forefinger and thumb of the right hand (Figure 14.16).
These two helices and the intervening loop form the Ca2+-binding motif.
Seven oxygen atoms are coordinated to each Ca2+, six from the protein and
one from a bound water molecule. Calmodulin is made up of four EF-hand
motifs, each of which can bind a single Ca2+ ion.
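
To see why a sensor with several binding sites can respond sharply to a rise from the resting level of about 100 nM to concentrations above about 500 nM, a simple cooperative (Hill-type) model is sometimes used. The Python sketch below is only an illustration: the Hill form itself, the use of 500 nM as the half-activation point, and the Hill coefficient of 4 (borrowed from the number of binding sites) are assumptions made here, not values given in the text, and real calmodulin binding is more complex than this model.

```python
def fraction_active(ca_nM, k_half_nM=500.0, hill_n=4):
    """Fraction of sensor in the active state under a Hill-type model.

    k_half_nM and hill_n are illustrative assumptions, not measured values.
    """
    ratio = (ca_nM / k_half_nM) ** hill_n
    return ratio / (1.0 + ratio)


for ca in (100.0, 500.0, 1000.0):
    print(f"{ca:7.1f} nM Ca2+: fraction active = {fraction_active(ca):.3f}")
```

Under these assumptions the sensor is almost entirely inactive at the resting concentration and mostly active after a tenfold rise, which illustrates how a transient increase in Ca2+ can act as a switch.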

The binding of Ca2+ to calmodulin induces substantial conformational
changes in its EF hands, exposing hydrophobic surfaces that can be used to
bind other proteins. Using its two sets of two EF hands, calmodulin
clamps down around specific regions of target proteins, usually exposed
α helices with appropriately positioned hydrophobic and charged groups
(Figure 14.17). The Ca2+-calmodulin complex stimulates a wide variety of
enzymes, pumps, and other target proteins by inducing structural
rearrangements in these binding partners. An especially noteworthy set of
targets are several calmodulin-dependent protein kinases (CaM kinases) that
phosphorylate many different proteins and regulate fuel metabolism, ionic
permeability, neurotransmitter synthesis, and neurotransmitter release. We
see here a recurring theme in signal-transduction pathways: the
concentration of a second messenger is increased (in this case, Ca2+); the
signal is sensed by a second-messenger-binding protein (in this case,
calmodulin); and the second-messenger-binding protein acts to generate
changes in enzymes (in this case, calmodulin-dependent kinases) that
control effectors.


14.2 Insulin Signaling: Phosphorylation Cascades Are 
Central to Many Signal-Transduction Processes 


The signaling pathways that we have examined so far have activated a 
protein kinase as a downstream component of the pathway. We now turn to 
a class of signal-transduction pathways that are initiated by receptors that 
include protein kinases as part of their structures. The activation of these 
protein kinases sets in motion other processes that ultimately modify the 
effectors of these pathways. 

An example is the signal-transduction pathway initiated by insulin, the 
hormone released in response to increased blood-glucose levels after a meal. 
In all of its detail, this multifaceted pathway is quite complex. Hence, we 
will focus solely on the major branch, which leads to the mobilization of 
glucose transporters to the cell surface. These transporters allow the cell to 
take up the glucose that is plentiful in the bloodstream after a meal. 




Figure 14.17 Calmodulin binds to α helices. (A) An α helix (purple) in CaM kinase I is a
target for calmodulin. (B) On Ca2+ binding to the apo, or calcium-free, form of calmodulin (1), the
two halves of calmodulin clamp down around the target helix (2), binding it through hydrophobic
and ionic interactions. In CaM kinase I, this interaction allows the enzyme to adopt an active
conformation. [Drawn from 1A06, 1CFD, 1CLL, and 1CM1.pdb.]


Figure 14.16 EF hand. Formed by a helix-loop-helix unit, an EF hand is a
binding site for Ca2+ in many calcium-sensing proteins. Here, the E helix
is yellow, the F helix is blue, and calcium is represented by the
green sphere. Notice that the calcium ion is bound in a loop connecting
two nearly perpendicular helices. [Drawn from 1CLL.pdb.]
Figure 14.18 Insulin structure. Notice that insulin consists of two chains
(shown in blue and yellow) linked by two interchain disulfide bonds. The
α chain (blue) also has an intrachain disulfide bond. [Drawn from
1B2F.pdb.]




Figure 14.19 The insulin receptor. The receptor consists of two units,
each of which consists of an α subunit and a β subunit linked by a
disulfide bond. Two α subunits, which lie outside the cell, come together
to form a binding site for insulin. Each β subunit lies primarily inside
the cell and includes a protein kinase domain.


The insulin receptor is a dimer that closes around 
a bound insulin molecule 


Insulin is a peptide hormone that consists of two chains, linked by three
disulfide bonds (Figure 14.18). Its receptor has a quite different structure
from that of the β-AR. The insulin receptor is a dimer of two identical
units. Each unit consists of one α chain and one β chain linked to one
another by a single disulfide bond (Figure 14.19). Each α subunit lies
completely outside the cell, whereas each β subunit lies primarily inside
the cell, spanning the membrane with a single transmembrane segment. The
two α subunits move together to form a binding site for a single insulin
molecule, a surprising occurrence because two different surfaces on the
insulin molecule must interact with the two identical insulin-receptor
chains. The moving together of the dimeric units in the presence of an
insulin molecule sets the signaling pathway in motion. The closing up of an
oligomeric receptor or the oligomerization of monomeric receptors around a
bound ligand is a strategy used by many receptors to initiate a signal,
particularly by those containing a protein kinase.

Each β subunit consists primarily of a protein kinase domain, homologous
to protein kinase A. However, this kinase differs from protein kinase A
in two important ways. First, the insulin-receptor kinase is a tyrosine kinase;
that is, it catalyzes the transfer of a phosphoryl group from ATP to the
hydroxyl group of tyrosine, rather than serine or threonine.


[Reaction scheme: tyrosine + ATP → phosphotyrosine + ADP, catalyzed by
tyrosine kinase]

Because this tyrosine kinase is a component of the receptor itself, the insulin 
receptor is referred to as a receptor tyrosine kinase. Second, the insulin recep- 
tor kinase is in an inactive conformation when the domain is not covalently 
modified. The kinase is rendered inactive by the position of an unstructured 
loop (called the activation loop) that lies in the center of the structure. 


Insulin binding results in the cross-phosphorylation 
and activation of the insulin receptor 


When the two a subunits move together to surround an insulin molecule, 
the two protein kinase domains on the inside of the cell also are drawn 
together. Importantly, as they come together, the flexible activation loop of 
one kinase subunit is able to fit into the active site of the other kinase
subunit within the dimer. With the two β subunits forced together, the kinase
domains catalyze the addition of phosphoryl groups from ATP to tyrosine 
residues in the activation loops. When these tyrosine residues are phospho- 
rylated, a striking conformational change takes place (Figure 14.20). The 
rearrangement of the activation loop converts the kinase into an active 
conformation. Thus, insulin binding on the outside of the cell results in the 
activation of a membrane-associated kinase within the cell. 


The activated insulin-receptor kinase initiates a kinase cascade 


On phosphorylation, the insulin-receptor tyrosine kinase is activated. 
Because the two units of the receptor are held in close proximity to one 




Figure 14.20 Activation of the insulin receptor by phosphorylation. The activation loop
is shown in red in this model of the protein kinase domain of the β subunit of the insulin receptor.
The unphosphorylated structure on the left is not catalytically active. Notice that, when three
tyrosine residues in the activation loop are phosphorylated, the activation loop swings across the
structure and the kinase structure adopts a more compact conformation. This conformation is
catalytically active. [Drawn from 1IRK.pdb and 1IR3.pdb.]


another, additional sites within the receptor also are phosphorylated. These 
phosphorylated sites act as docking sites for other substrates, including 
a class of molecules referred to as insulin-receptor substrates (IRS; 
Figure 14.21). IRS-1 and IRS-2 are two homologous proteins with a com- 
mon modular structure (Figure 14.22). The amino-terminal part includes a 
Figure 14.21 Insulin signaling. The binding of insulin results in the
cross-phosphorylation and activation of the insulin receptor.
Phosphorylated sites on the receptor act as binding sites for
insulin-receptor substrates such as IRS-1. The lipid kinase
phosphoinositide 3-kinase binds to phosphorylated sites on IRS-1 through
its regulatory domain, then converts PIP2 into PIP3. Binding to PIP3
activates PIP3-dependent protein kinase, which phosphorylates and
activates kinases such as Akt1. Activated Akt1 can then diffuse throughout
the cell to continue the signal-transduction pathway.
Figure 14.23 Structure of the SH2 domain. The domain is shown bound to a
phosphotyrosine-containing peptide. Notice at the top that the negatively
charged phosphotyrosine residue interacts with two arginine residues that
are conserved in essentially all SH2 domains. [Drawn from 1SPS.pdb.]




Figure 14.22 The modular structure of insulin-receptor substrates IRS-1 and IRS-2. This 
schematic view represents the amino acid sequence common to IRS-1 and IRS-2. Each protein 
contains a pleckstrin homology domain (which binds phosphoinositide lipids), a phosphotyrosine- 
binding domain, and four sequences that approximate Tyr-X-X-Met (YXXM). The four sequences are 
phosphorylated by the insulin-receptor tyrosine kinase. 


pleckstrin homology domain, which binds phosphoinositide, and a phospho- 
tyrosine-binding domain. These domains act together to anchor the IRS 
protein to the insulin receptor and the associated membrane. Each IRS 
protein contains four sequences that approximate the form Tyr-X-X-Met. 
These sequences are also substrates for the activated insulin-receptor 
kinase. When the tyrosine residues within these sequences are phosphory- 
lated to become phosphotyrosine residues, IRS molecules can act as adaptor 
proteins: they are not enzymes but serve to tether the downstream compo- 
nents of this signaling pathway to the membrane. 
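
The Tyr-X-X-Met pattern described above lends itself to a simple sequence search. The following is a minimal sketch, assuming one-letter amino acid codes; the function name `find_yxxm` and the toy sequence are hypothetical and are not taken from IRS-1 or IRS-2.

```python
import re

# Tyr-X-X-Met in one-letter code: Y, any two residues, then M.
# The lookahead lets overlapping candidate sites be reported as well.
YXXM = re.compile(r"(?=(Y[A-Z]{2}M))")


def find_yxxm(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) for every YXXM site in the sequence."""
    return [(m.start() + 1, m.group(1)) for m in YXXM.finditer(sequence.upper())]


# Hypothetical toy sequence for illustration only (not a real IRS fragment).
toy_sequence = "MASPYMPMSGDYVNMAAYICMGG"
for position, motif in find_yxxm(toy_sequence):
    print(position, motif)
```

A match only flags a candidate site; whether a given tyrosine is actually phosphorylated by the insulin-receptor kinase is an experimental question.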

Phosphotyrosine residues, such as those in the IRS proteins, are recognized
most often by Src homology 2 (SH2) domains (Figure 14.23). These
domains, present in many signal-transduction proteins, bind to stretches of
polypeptide that contain phosphotyrosine residues. Each specific SH2
domain shows a binding preference for phosphotyrosine in a particular
sequence context. Which proteins contain SH2 domains that bind to
phosphotyrosine-containing sequences in the IRS proteins? The most
important of them are in a class of lipid kinases, called phosphoinositide
3-kinases (PI3Ks), that add a phosphoryl group to the 3-position of inositol
in phosphatidylinositol 4,5-bisphosphate (PIP2; Figure 14.24). These
enzymes are heterooligomers that consist of 110-kd catalytic subunits and
85-kd regulatory subunits. Through SH2 domains in the regulatory subunits,
these enzymes bind to the IRS proteins and are drawn to the membrane,
where they can phosphorylate PIP2 to form phosphatidylinositol
3,4,5-trisphosphate (PIP3). PIP3, in turn, activates a protein kinase, PDK1,
by virtue of a pleckstrin homology domain present in this kinase that is
specific for PIP3. The activated PDK1 phosphorylates and activates Akt,
another protein kinase. Akt is not membrane anchored and moves through
the cell to phosphorylate targets that include components that control the
trafficking of the glucose transporter GLUT4 to the cell surface as well as
enzymes that stimulate glycogen synthesis (Section 21.4).

Figure 14.24 Action of a lipid kinase in insulin signaling. Phosphorylated IRS-1 and IRS-2
activate the enzyme phosphatidylinositide 3-kinase, an enzyme that converts PIP2 into PIP3.


The cascade initiated by the binding of insulin to the insulin receptor is 
summarized in Figure 14.25. The signal is amplified at several stages 
along this pathway. Because the activated insulin receptor itself is a protein 
kinase, each activated receptor can phosphorylate multiple IRS molecules. 
Activated enzymes further amplify the signal in at least two of the subse- 
quent steps. Thus, a small increase in the concentration of circulating insu- 
lin can produce a robust intracellular response. Note that, although the 
insulin pathway described here may seem complicated, it is substantially 
less elaborate than the full signaling network initiated by insulin. 
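
The amplification argument can be made concrete with a small calculation. This is only a sketch: it assumes that each stage multiplies the number of activated molecules by a fixed gain, and the gains below are hypothetical round numbers, not measured values.

```python
from math import prod


def cascade_output(active_receptors: int, stage_gains: list[float]) -> float:
    """Activated end-stage molecules if every stage multiplies the signal by its gain."""
    return active_receptors * prod(stage_gains)


# Hypothetical gains for three amplifying steps (illustrative assumptions only).
stage_gains = {
    "receptor kinase -> phosphorylated IRS": 10,
    "PI3K -> PIP3": 100,
    "PDK1 -> activated Akt": 10,
}

print(cascade_output(1, list(stage_gains.values())))
```

With these assumed gains, a single activated receptor would account for 10,000 activated molecules at the end of the cascade, which illustrates why a small change in circulating insulin can produce a large intracellular response.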


Insulin signaling is terminated by the action of phosphatases 


We have seen that the activated G protein promotes its own inactivation by 
the release of a phosphoryl group from GTP. In contrast, proteins phospho- 
rylated on serine, threonine, or tyrosine residues are extremely stable 
kinetically. Specific enzymes, called protein phosphatases, are required to 
hydrolyze these phosphorylated proteins and return them to their initial 
states. Similarly, lipid phosphatases are required to remove phosphoryl 
groups from inositol lipids that had been activated by lipid kinases. In insu- 
lin signaling, three classes of enzymes are of particular importance in shut- 
ting off the signaling pathway: protein tyrosine phosphatases that remove 
phosphoryl groups from tyrosine residues on the insulin receptor and the 
IRS adaptor proteins, lipid phosphatases that hydrolyze PIP3 to PIP2, and
protein serine phosphatases that remove phosphoryl groups from activated 
protein kinases such as Akt. Many of these phosphatases are activated or 
recruited as part of the response to insulin. Thus, the binding of the initial 
signal sets the stage for the eventual termination of the response. 


14.3 EGF Signaling: Signal-Transduction Pathways 
Are Poised to Respond 


Our consideration of the signal-transduction cascades initiated by epineph- 
rine and insulin included examples of how components of signal-transduc- 
tion pathways are poised for action, ready to be activated by minor modifi- 
cations. For example, G-protein subunits require only the binding of GTP 
in exchange for GDP to transmit a signal. This exchange reaction is
thermodynamically favorable, but it is quite slow in the absence of an appropriate
activated 7TM receptor. Similarly, the tyrosine kinase domains of the
dimeric insulin receptor are ready for phosphorylation and activation but
require insulin bound between two α subunits to draw the activation loop of
one tyrosine kinase into the active site of a partner tyrosine kinase to initiate 
this process. 

We now examine a signal-transduction pathway that reveals another 
clear example of how these signaling cascades are poised to respond. This 
pathway is activated by the signal molecule epidermal growth factor (EGF). 
Like that of the insulin receptor, the initiator of this pathway is a receptor 
tyrosine kinase. Both the extracellular and the intracellular domains of this 
receptor are ready for action, held in check only by a specific structure that 
prevents receptors from coming together. Furthermore, in the EGF pathway,
we will encounter several additional classes of signaling components
that participate in many other signaling networks. 

Figure 14.25 Insulin signaling pathway.
Key steps in the signal-transduction pathway
initiated by the binding of insulin to the insulin
receptor.

Figure 14.26 Structure of epidermal
growth factor. Notice that three intrachain
disulfide bonds stabilize the compact three-
dimensional structure of the growth factor.
[Drawn from 1EGF.pdb.]


EGF binding results in the dimerization of the EGF receptor 


Epidermal growth factor is a 6-kd polypeptide that stimulates the growth of 
epidermal and epithelial cells (Figure 14.26). The EGF receptor (EGFR), 


Figure 14.27 Modular structure of the EGF receptor. This schematic view of the amino acid 
sequence of the EGF receptor shows the EGF-binding domain that lies outside the cell, a single 
transmembrane helix-forming region, the intracellular tyrosine kinase domain, and the tyrosine-rich 
domain at the carboxyl terminus. 


like the insulin receptor, is a dimer of two identical subunits. Each subunit 
contains an intracellular protein tyrosine kinase domain that participates in 
cross-phosphorylation reactions (Figure 14.27). Unlike those of the insulin 
receptor, however, these units exist as monomers until they bind EGF. 
Moreover, each EGF receptor monomer binds a single molecule of EGF in 
its extracellular domain (Figure 14.28). Thus the dimer binds two ligand 
molecules, in contrast with the insulin-receptor dimer, which binds only one 
ligand. Note that each EGF molecule lies far away from the dimer interface. 
This interface includes a so-called dimerization arm from each monomer that 
reaches out and inserts into a binding pocket on the other monomer. 


Figure 14.28 EGF receptor dimerization. The structure of the extracellular region of the
EGF receptor is shown bound to EGF. Notice that the structure is dimeric with one EGF molecule
bound to each receptor molecule and that the dimerization is mediated by a dimerization arm that
extends from each receptor molecule. [Drawn from 1IVO.pdb.]


Although this structure nicely reveals the interactions that support the 
formation of a receptor dimer favoring cross-phosphorylation, it raises 
another question: Why doesn’t the receptor dimerize and signal in the 
absence of EGF? This question has been addressed by examining the struc- 
ture of the EGF receptor in the absence of bound ligand (Figure 14.29). 
This structure is, indeed, monomeric and each monomer is in a confor- 
mation that is quite different from that observed in the ligand-bound dimer. 
In particular, the dimerization arm binds to a domain within the same 
monomer that holds the receptor in a closed configuration. In essence, the 


receptor is poised in a spring-loaded conformation held in position by the 
contact between the interaction loop and another part of the structure, ready 
to bind ligand and change into a conformation active for dimerization and 
signaling. 

This observation suggests that a receptor that exists in the extended 
conformation even in the absence of bound ligand would be constitutively 
active. Remarkably, such a receptor exists. This receptor, Her2, is approxi- 
mately 50% identical in amino acid sequence with the EGF receptor and has 
the same domain structure. Her2 does not bind any known ligand, yet 
crystallographic studies reveal that it adopts an extended structure very 
similar to that observed for the ligand-bound EGF receptor. Under normal 
conditions, Her2 forms heterodimers with the EGF receptor and other 
members of the EGF receptor family and participates in cross-phosphory- 
lation reactions with these receptors. Her2 is overexpressed in some cancers, 
presumably contributing to tumor growth by forming homodimers that 
signal even in the absence of ligand. We will return to Her2 when we con- 
sider approaches to cancer treatment based on knowledge of signaling path- 
ways (Section 14.5). 


The EGF receptor undergoes phosphorylation of its 
carboxyl-terminal tail 


Like the insulin receptor, the EGF receptor undergoes cross-phosphory- 
lation of one unit by another unit within a dimer. However, unlike that of 
the insulin receptor, the site of this phosphorylation is not within the activa- 
tion loop of the kinase, but rather in a region that lies on the C-terminal 
side of the kinase domain. As many as five tyrosine residues in this
region are phosphorylated. The dimerization of the EGF receptor brings 
the C-terminal region on one receptor into the active site of its partner’s 
kinase. The kinase itself is in an active conformation without phosphory- 
lation, revealing again how this signaling system is poised to respond. 


EGF signaling leads to the activation of Ras, a small G protein 


The phosphotyrosines on the EGF receptors act as docking sites for SH2 
domains on other proteins. The intracellular signaling cascade begins with 
the binding of Grb-2, a key adaptor protein that contains one SH2 domain 
and two Src homology 3 (SH3) domains. On phosphorylation of the receptor,
the SH2 domain of Grb-2 binds to the phosphotyrosine residues of the
receptor tyrosine kinase. Through its two SH3 domains, Grb-2 then binds
polyproline-rich polypeptides within a protein called Sos. Sos, in turn,
binds to Ras and activates it. A very prominent signal-transduction
component, Ras is a member of a class of proteins called the small G
proteins. Like the G proteins described in Section 14.1, the small G
proteins contain bound GDP in their unactivated forms. Sos opens up the
nucleotide-binding pocket of Ras, allowing GDP to escape and GTP to enter
in its place. Because of its effect on Ras, Sos is referred to as a
guanine-nucleotide-exchange factor (GEF). Thus, the binding of EGF to its
receptor leads to the conversion of Ras into its GTP form through the
intermediacy of Grb-2 and Sos (Figure 14.30).

Figure 14.29 Structure of the
unactivated EGF receptor. The extracellular
domain of the EGF receptor is shown in the
absence of bound EGF. Notice that the
dimerization arm is bound to a part of the
receptor that makes it unavailable for
interaction with the other receptor. [Drawn
from 1NQL.pdb.]

Figure 14.30 Ras activation mechanism. The dimerization of the EGF
receptor due to EGF binding leads to the phosphorylation of the C-terminal
tails of the receptor, the subsequent recruitment of Grb-2 and Sos, and the
exchange of GTP for GDP in Ras. This signal-transduction pathway results in the
conversion of Ras into its activated GTP-bound form.

Figure 14.31 EGF signaling pathway. The
key steps in the pathway initiated by EGF
binding to the EGF receptor. A kinase cascade
leads to the phosphorylation of transcription
factors and concomitant changes in gene
expression.


Activated Ras initiates a protein kinase cascade


Ras changes conformation when it is transformed from its GDP into its 
GTP form. In the GTP form, Ras binds other proteins, including a protein 
kinase termed Raf. When bound to Ras, Raf undergoes a conformational 
change that activates the Raf protein kinase domain. Both Ras and Raf are 
anchored to the membrane through covalently bound isoprene lipids. 
Activated Raf then phosphorylates other proteins, including protein kinases 
termed MEKs. In turn, MEKs activate kinases called extracellular signal- 
regulated kinases (ERKs). ERKs then phosphorylate numerous substrates, 
including transcription factors in the nucleus as well as other protein 
kinases. The complete flow of information from the arrival of EGF at the 
cell surface to changes in gene expression is summarized in Figure 14.31. 


Small G proteins, or small GTPases, constitute a large superfamily of
proteins (grouped into subfamilies called Ras, Rho, Arf, Rab, and
Ran) that play a major role in a host of cell functions including growth,
differentiation, cell motility, cytokinesis, and the transport of materials
throughout the cell (Table 14.2). As with the heterotrimeric G proteins, the
small G proteins cycle between an active GTP-bound form and an inactive
GDP-bound form. They differ from the heterotrimeric G proteins in being
smaller (20–25 kd versus 30–35 kd) and monomeric. Nonetheless, the two
families are related by divergent evolution, and small G proteins have many
key mechanistic and structural motifs in common with the Gα subunit of
the heterotrimeric G proteins.


EGF signaling is terminated by protein phosphatases and the intrinsic 
GTPase activity of Ras 


Because so many components of the EGF signal-transduction pathway are 
activated by phosphorylation, we can expect protein phosphatases to play 
key roles in the termination of EGF signaling. Indeed, crucial phosphatases 
remove phosphoryl groups from tyrosine residues on the EGF receptor and
from serine, threonine, and tyrosine residues in the protein kinases that 
participate in the signaling cascade. The signaling process itself sets in 
motion the events that activate many of these phosphatases. Consequently, 
signal activation also initiates signal termination. 

Like the G proteins activated by 7TM receptors, Ras possesses intrin- 
sic GTPase activity. Thus, the activated GTP form of Ras spontaneously 
converts into the inactive GDP form. The rate of conversion can be acceler- 
ated in the presence of GTPase-activating proteins (GAPs), proteins that 
interact with small G proteins in the GTP form and facilitate GTP hydro- 
lysis. Thus, the lifetime of activated Ras is regulated by accessory proteins 
in the cell. The GTPase activity of Ras is crucial for shutting off signals 
leading to cell growth, and so it is not surprising that mutations in Ras are 
found in many types of cancer, as will be discussed in Section 14.5. 
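
The balance between nucleotide exchange and GTP hydrolysis can be illustrated with a two-state model. This is a simplified sketch, not a description of measured Ras kinetics: it assumes first-order exchange and hydrolysis, and the rate constants and the GAP speed-up factor are hypothetical values in arbitrary units.

```python
def ras_gtp_fraction(k_exchange: float, k_hydrolysis: float) -> float:
    """Steady-state fraction of Ras in the GTP form for a two-state cycle.

    Setting k_exchange * (1 - f) - k_hydrolysis * f = 0 gives
    f = k_exchange / (k_exchange + k_hydrolysis).
    """
    if k_exchange < 0 or k_hydrolysis < 0:
        raise ValueError("rate constants must be non-negative")
    total = k_exchange + k_hydrolysis
    if total == 0:
        raise ValueError("at least one rate constant must be positive")
    return k_exchange / total


# Illustrative rate constants in arbitrary units (assumptions, not measurements).
K_EXCHANGE = 1.0     # GEF-stimulated exchange of GDP for GTP
K_INTRINSIC = 1.0    # intrinsic GTP hydrolysis by Ras
GAP_SPEEDUP = 100.0  # hypothetical acceleration of hydrolysis by a GAP

basal = ras_gtp_fraction(K_EXCHANGE, K_INTRINSIC)
with_gap = ras_gtp_fraction(K_EXCHANGE, K_INTRINSIC * GAP_SPEEDUP)
gtpase_dead = ras_gtp_fraction(K_EXCHANGE, 0.0)  # mutant unable to hydrolyze GTP
print(basal, with_gap, gtpase_dead)
```

In this toy model a GAP drives the active fraction toward zero, whereas a mutant with no GTPase activity accumulates entirely in the GTP form, the "on" state associated with many tumors.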


Table 14.2 Ras superfamily of GTPases

Subfamily   Function
Ras         Regulates cell growth through serine-threonine protein kinases
Rho         Reorganizes cytoskeleton through serine-threonine protein kinases
Arf         Activates the ADP-ribosyltransferase of the cholera toxin A subunit; regulates
            vesicular trafficking pathways; activates phospholipase D
Rab         Plays a key role in secretory and endocytotic pathways
Ran         Functions in the transport of RNA and protein into and out of the nucleus


14.4 Many Elements Recur with Variation in Different 
Signal-Transduction Pathways 


We can begin to make sense of the complexity of signal-transduction path- 
ways by taking note of several common themes that have appeared consis- 
tently in the pathways described in this chapter and underlie many addi- 
tional signaling pathways not considered herein. 


1. Protein kinases are central to many signal-transduction pathways. Protein 
kinases are central to all three signal-transduction pathways described in 
this chapter. In the epinephrine-initiated pathway, cAMP-dependent pro- 
tein kinase (PKA) lies at the end of the pathway, transducing information 
represented by an increase in cAMP concentration into covalent modifica- 
tions that alter the activity of key metabolic enzymes. In the insulin- and 
EGF-initiated pathways, the receptors themselves are protein kinases and 
several additional protein kinases participate downstream in the pathways. 
Signal amplification due to protein kinase cascades is a feature common to 
all three pathways. Although not presented in this chapter, protein kinases 
often phosphorylate multiple substrates and are thus able to generate a 
diversity of responses. 


2. Second messengers participate in many signal-transduction pathways. We have 
encountered several second messengers, including cAMP, Ca2+, IP3, and
the lipid DAG. Because second messengers are activated by enzymes or by 
the opening of ion channels, their concentrations can be tremendously 
amplified compared with the signals that lead to their generation. Specialized 
proteins sense the concentrations of these second messengers and continue 
the flow of information along signal-transduction pathways. 

The second messengers that we have seen recur in many additional 
signal-transduction pathways. For example, in a consideration of the 
sensory systems in Chapter 33, we will see how Ca2+-based signaling and
cyclic nucleotide-based signaling play key roles in vision and olfaction. 


3. Specialized domains that mediate specific interactions are present in many 
signaling proteins. The “wiring” of many signal-transduction pathways is 
based on particular protein domains that mediate the interactions between 
protein components of a particular signaling cascade. We have encountered 
several of them, including: pleckstrin homology domains, which facilitate 
protein interactions with the lipid PIP3; SH2 domains, which mediate inter- 
actions with polypeptides containing phosphorylated tyrosine residues; and 
SH3 domains, which interact with peptide sequences that contain multiple 
proline residues. Many other such domain families exist. In many cases, 
individual members of each domain family have unique features that allow 
them to bind to their targets only within a particular sequence context, mak- 
ing them specific for a given signaling pathway and avoiding unwanted 
cross-talk. Signal-transduction pathways have evolved in large part by the 
incorporation of DNA fragments encoding these domains into genes encoding 
pathway components. 

The presence of these domains is tremendously helpful to scientists try- 
ing to unravel signal-transduction pathways. When a protein in a signal- 
transduction pathway is identified, its amino acid sequence can be analyzed 
for the presence of these specialized domains by the methods described in 
Chapter 6. If one or more domains of known function is found, it is often 
possible to develop clear hypotheses about potential binding partners and 
signal-transduction mechanisms. 
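
The reasoning from identified domains to candidate binding partners can be sketched as a simple lookup. The table below contains only the three domain types discussed in this chapter; the function name and the domain labels used as keys are hypothetical conventions chosen for illustration.

```python
# Domain types from this chapter and the targets they recognize.
DOMAIN_TARGETS = {
    "PH": "phosphoinositide lipids such as PIP3",
    "SH2": "polypeptides containing phosphotyrosine residues",
    "SH3": "peptide sequences containing multiple proline residues",
}


def binding_hypotheses(domains: list[str]) -> list[str]:
    """Turn a list of identified domains into candidate binding hypotheses."""
    return [
        f"{domain}: may bind {DOMAIN_TARGETS[domain]}"
        for domain in domains
        if domain in DOMAIN_TARGETS
    ]


# Grb-2 is described in the text as having one SH2 domain and two SH3 domains.
for hypothesis in binding_hypotheses(["SH2", "SH3", "SH3"]):
    print(hypothesis)
```

Such a lookup only generates hypotheses; the sequence context around each target determines which partners a particular domain actually binds.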


14.5 Defects in Signal-Transduction Pathways Can Lead to
Cancer and Other Diseases 


In light of their complexity, it comes as no surprise that signal-transduction
pathways occasionally fail, leading to pathological or
disease states. Cancer, a set of diseases characterized by uncontrolled or 
inappropriate cell growth, is strongly associated with defects in signal- 
transduction proteins. Indeed, the study of cancer, particularly cancers 
caused by certain viruses, has contributed greatly to our understanding of 
signal-transduction proteins and pathways. 

For example, Rous sarcoma virus is a retrovirus that causes sarcoma 
(a cancer of tissues of mesodermal origin such as muscle or connective 
tissue) in chickens. In addition to the genes necessary for viral replication, 
this virus carries a gene termed v-src. The v-src gene is an oncogene; it 
leads to the generation of cancerlike characteristics in susceptible cell 
types. The protein encoded by the v-src gene, v-Src, is a protein tyrosine 
kinase that includes SH2 and SH3 domains. The v-Src protein is similar in 
amino acid sequence to a protein normally found in chicken-muscle cells 
referred to as c-Src (for cellular Src; Figure 14.32A). The c-src gene does 
not induce cell transformation and is termed a proto-oncogene, referring to 
the fact that this gene, when mutated, can be converted into an oncogene. 
The protein that it encodes is a signal-transduction protein that regulates 
cell growth. 

Why is the biological activity of the v-Src protein so different from that 
of c-Src? c-Src contains a key tyrosine residue near its C-terminal end that,
when phosphorylated, is bound intramolecularly by the upstream SH2 
domain (Figure 14.32B). This interaction maintains the kinase 
domain in an inactive conformation. However, in v-Src, the 
C-terminal 19 amino acids of c-Src are replaced by a completely dif- 
ferent stretch of 11 amino acids that lacks this critical tyrosine resi- 
due. Thus, v-Src is always active and can promote unregulated cell 
growth. Since the discovery of Src, many other mutated protein 
kinases have been identified as oncogenes. 

The gene encoding Ras, a component of the EGF-initiated pathway,
is one of the genes most commonly mutated in human tumors.
Mammalian cells contain three 21-kd Ras proteins (H-, K-, and
N-Ras), each of which cycles between inactive GDP and active
GTP forms. The most common mutations in tumors lead to a loss
of the ability to hydrolyze GTP. Thus, the Ras protein is trapped in
the “on” position and continues to stimulate cell growth, even in the
absence of a continuing signal.

Other genes can contribute to cancer development only when
both copies of the gene normally present in a cell are deleted or otherwise
damaged. Such genes are called tumor-suppressor genes. For
example, genes for some of the phosphatases that participate in the
termination of EGF signaling are tumor suppressors. Without any
functional phosphatase present, EGF signaling persists once initiated,
stimulating inappropriate cell growth.

Figure 14.32 Src structure. (A) Cellular Src
includes an SH3 domain, an SH2 domain, a protein
kinase domain, and a carboxyl-terminal tail that includes
a key tyrosine residue. (B) Structure of c-Src in an
inactivated form with the key tyrosine residue
phosphorylated. Notice how the three domains work
together to keep the enzyme in an inactive
conformation: the phosphotyrosine residue is bound in the
SH2 domain and the linker between the SH2 domain
and the protein kinase domain is bound by the SH3
domain. [Drawn from 2PTK.pdb.]


Monoclonal antibodies can be used to inhibit signal- 
transduction pathways activated in tumors 


Mutated or overexpressed receptor tyrosine kinases are frequently
observed in tumors. For instance, the epidermal-growth-factor
receptor (EGFR) is overexpressed in some human
epithelial cancers, including breast, ovarian, and colorectal cancer.
Because some small amount of the receptor can dimerize and activate the
signaling pathway even without binding to EGF, overexpression of the
receptor increases the likelihood that a “grow and divide” signal will be
inappropriately sent to the cell. This understanding of cancer-related
signal-transduction pathways has led to a therapeutic approach that targets the
EGFR. The strategy is to produce monoclonal antibodies to the extracellular
domains of the offending receptors. One such antibody, cetuximab (Erbitux),
has effectively targeted the EGFR in colorectal cancers. Cetuximab inhibits
the EGFR by competing with EGF for the binding site on the receptor.
Because the antibody sterically blocks the change in conformation that
exposes the dimerization arm, the antibody itself cannot induce dimerization.
The result is that the EGFR-controlled pathway is not initiated.

Cetuximab is not the only monoclonal antibody that has been developed 
to target a receptor tyrosine kinase. Trastuzumab (Herceptin) inhibits 
another EGFR family member, Her2, that is overexpressed in approxi- 
mately 30% of breast cancers. Recall that this protein can signal even in the 
absence of ligand; so it is especially likely that overexpression will stimulate 
cell proliferation. Breast-cancer patients are now being screened for Her2 
overexpression and treated with Herceptin as appropriate. Thus, this cancer 
treatment is tailored to the genetic characteristics of the tumor. 


Protein kinase inhibitors can be effective anticancer drugs 


The widespread occurrence of overactive protein kinases in cancer
cells suggests that molecules that inhibit these enzymes might act as 
antitumor agents. For example, more than 90% of patients with chronic 
myelogenous leukemia (CML) show a specific chromosomal defect in 
cancer cells (Figure 14.33). The translocation of genetic material between 
chromosomes 9 and 22 causes the c-abl gene, which encodes a tyrosine 
kinase of the Src family, to be inserted into the bcr gene on chromosome 22.
The result is the production of a fusion protein called Bcr-Abl that consists 
primarily of sequences for the c-Abl kinase. However, the bcr-abl gene is 
not regulated appropriately; it is expressed at higher levels than that of the 
gene encoding the normal c-Abl kinase, stimulating a growth-promoting 
pathway. Because of this overexpression, leukemia cells express a unique 
target for chemotherapy. A specific inhibitor of the Bcr-Abl kinase, Gleevec 
(STI-571, imatinib mesylate), has proved to be a highly effective treatment 
for patients suffering from CML. This approach to cancer chemotherapy is 
fundamentally distinct from most approaches, which target all rapidly 
growing cells, including normal ones. Thus, our understanding of signal- 
transduction pathways is leading to conceptually new disease treatments. 


Cholera and whooping cough are due to altered G-protein activity 


Although defects in signal-transduction pathways have been most
extensively studied in the context of cancer, such defects are important in
many other diseases. Cholera and whooping cough are two pathologies of
the G-protein-dependent signal pathways. Let us first consider the mechanism
of action of the cholera toxin, secreted by the intestinal bacterium
Vibrio cholerae. Cholera is a potentially life-threatening, acute diarrheal
disease transmitted through contaminated water and food. It causes the
voluminous secretion of electrolytes and fluids from the intestines of
infected persons. The cholera toxin, choleragen, is a protein composed of
two functional units: a B subunit that binds to GM1 gangliosides (p. 765)
of the intestinal epithelium and a catalytic A subunit that enters the cell.
The A subunit catalyzes the covalent modification of a Gαs protein: the α
subunit is modified by the attachment of an ADP-ribose to an arginine
residue. This modification stabilizes the GTP-bound form of Gαs, trapping
the molecule in its active conformation. The active G protein, in turn,
continuously activates protein kinase A. PKA opens a chloride channel and
inhibits sodium absorption by the Na+–H+ exchanger by phosphorylating
both the channel and the exchanger. The net result of the phosphorylation
is an excessive loss of NaCl and the loss of large amounts of water into the
intestine. Patients suffering from cholera may pass as much as twice their
body weight in fluid in 4 to 6 days. Treatment consists of rehydration with
a glucose-electrolyte solution.

Figure 14.33 Formation of the bcr-abl
gene by translocation. In chronic
myelogenous leukemia, parts of
chromosomes 9 and 22 are reciprocally
exchanged, causing the bcr and abl genes to
fuse. The protein kinase encoded by the
bcr-abl gene is expressed at higher levels in
cells having this translocation than is the c-abl
gene in normal cells.

Whereas cholera is a result of a G protein trapped in the active 
conformation, causing the signal-transduction pathway to be perpetually 
stimulated, pertussis, or whooping cough, is a result of the opposite 
situation. Pertussis toxin also adds an ADP-ribose moiety, in this case to
a Gαi protein, a Gα protein that inhibits adenylate cyclase, closes Ca2+
channels, and opens K+ channels. The effect of this modification, however,
is to lower the G protein’s affinity for GTP, effectively trapping it in the
“off” conformation. The pulmonary symptoms have not yet been traced to
a particular target of the Gαi protein. Pertussis toxin is secreted by Bordetella
pertussis, the bacterium responsible for whooping cough. 


Summary 


In human beings and other multicellular organisms, specific signal 
molecules are released from cells in one organ and are sensed by cells 
in other organs throughout the body. The message initiated by an 
extracellular ligand is converted into specific changes in metabolism or 
gene expression by means of often complex networks referred to as 
signal-transduction pathways. These pathways amplify the initial 
signal and lead to changes in the properties of specific effector 
molecules. 


14.1 Heterotrimeric G Proteins Transmit Signals and Reset Themselves 
Epinephrine binds to a cell-surface protein called the β-adrenergic
receptor. This receptor is a member of the seven-transmembrane-helix
receptor family, so named because each receptor has seven α helices
that span the cell membrane. When epinephrine binds to the
β-adrenergic receptor on the outside of the cell, the receptor undergoes
a conformational change that is sensed inside the cell by a signaling
protein termed a heterotrimeric G protein. The α subunit of the
G protein exchanges a bound GDP molecule for GTP and concomitantly
releases the heterodimer consisting of the β and γ subunits. The
α subunit in the GTP form then binds to adenylate cyclase and activates
it, leading to an increase in the concentration of the second messenger
cyclic AMP. This increase in cyclic AMP concentration, in turn,
activates protein kinase A. Other 7TM receptors also signal through 
heterotrimeric G proteins, although these pathways often include 
enzymes other than adenylate cyclase. One prominent pathway, the 
phosphoinositide pathway, leads to the activation of phospholipase C, 
which cleaves a membrane lipid to produce two secondary messen- 
gers, diacylglycerol and inositol 1,4,5-trisphosphate. An increased IP3 
concentration leads to the release of calcium ion, another important 
second messenger, into the cell. G-protein signaling is terminated by 
the hydrolysis of the bound GTP to GDP. 


14.2 Insulin Signaling: Phosphorylation Cascades Are Central to Many
Signal-Transduction Processes
Protein kinases are key components in many signal-transduction 
pathways, including some for which the protein kinase is an integral 
component of the initial receptor. An example of such a receptor is the 
membrane tyrosine kinase bound by insulin. Insulin binding causes 
one subunit within the dimeric receptor to phosphorylate specific 
tyrosine residues in the other subunit. The resulting conformational 
changes dramatically increase the kinase activity of the receptor. The 
activated receptor kinase initiates a kinase cascade that includes both 
lipid kinases and protein kinases. This cascade eventually leads to the 
mobilization of glucose transporters to the cell surface, increasing 
glucose uptake. Insulin signaling is terminated through the action of 
phosphatases. 


14.3 EGF Signaling: Signal-Transduction Systems Are Poised to Respond 
Only minor modifications are necessary to transform many signal- 
transduction proteins from their inactive into their active forms. 
Epidermal growth factor also signals through a receptor tyrosine 
kinase. EGF binding induces a conformational change that allows 
receptor dimerization and cross-phosphorylation. The phosphorylated 
receptor binds adaptor proteins that mediate the activation of Ras, a 
small G protein. Activated Ras initiates a protein kinase cascade that 
eventually leads to the phosphorylation of transcription factors and 
changes in gene expression. EGF signaling is terminated by the action 
of phosphatases and the hydrolysis of GTP by Ras. 


14.4 Many Elements Recur with Variation in Different
Signal-Transduction Pathways

Protein kinases are components of many signal-transduction pathways, 
both as components of receptors and in other roles. Second messen- 
gers, including cyclic nucleotides, calcium, and lipid derivatives, are 
common in many signaling pathways. The changes in the concentra- 
tions of second messengers are often much larger than the changes 
associated with the initial signal owing to amplification processes. 
Small domains that recognize phosphotyrosine residues or specific 
lipids are present in many signaling proteins and are essential to 
determining the specificity of interactions. 


14.5 Defects in Signal-Transduction Pathways Can Lead to
Cancer and Other Diseases

Genes encoding components of signal-transduction pathways that 
control cell growth are often mutated in cancer. Some genes can be 
mutated to forms called oncogenes that are active regardless of appro- 
priate signals. Monoclonal antibodies directed against cell-surface 
receptors that participate in signaling have been developed for use in 
cancer treatment. Our understanding of the molecular basis of cancer 
is leading to the development of anticancer drugs directed against spe- 
cific targets, such as the specific kinase inhibitor Gleevec. 


Key Terms 
primary messenger (p. 402)

ligand (p. 402)

second messenger (p. 402)

cross talk (p. 403)

β-adrenergic receptor (β-AR) (p. 404)

seven-transmembrane-helix (7TM) receptor (p. 404)

rhodopsin (p. 404)

G protein (p. 405)


G-protein-coupled receptor (GPCR) 
(p. 406) 

adenylate cyclase (p. 406) 

protein kinase A (PKA) (p. 406) 

β-adrenergic receptor kinase (p. 408)

phosphoinositide cascade (p. 408) 

phosphatidylinositol 4,5-bisphosphate 
(PIP2) (p. 408) 

phospholipase C (p. 408) 


calmodulin-dependent protein kinase 
(CaM kinase) (p. 411) 


insulin (p. 411) 

insulin receptor (p. 412) 

tyrosine kinase (p. 412) 

receptor tyrosine kinase (p. 412) 
insulin-receptor substrate (IRS) (p. 413)
adaptor protein (p. 414) 

Src homology 2 (SH2) domain (p. 414) 
epidermal growth factor (EGF) (p. 415) 


Src homology 3 (SH3) domain (p. 417) 
Ras (p. 417) 

small G protein (p. 417) 
guanine-nucleotide-exchange factor
(GEF) (p. 417)


extracellular signal-regulated kinase 
(ERK) (p. 418) 


GTPase-activating protein (GAP) 
(p. 418) 


oncogene (p. 420) 


protein kinase C (PKC) (p. 409) 
calmodulin (CaM) (p. 410) 
EF hand (p. 410) 


EGF receptor (EGFR) (p. 415) 
dimerization arm (p. 416) 


proto-oncogene (p. 420) 
tumor-suppressor gene (p. 420) 


Problems 


1. Active mutants. Some protein kinases are inactive unless 
they are phosphorylated on key serine or threonine resi- 
dues. In some cases, active enzymes can be generated by 
mutating these serine or threonine residues to glutamate. 
Explain. 


2. In the pocket. SH2 domains bind phosphotyrosine resi- 
dues in deep pockets on their surfaces. Would you expect 
SH2 domains to bind phosphoserine or phosphothreonine 
with high affinity? Why or why not? 


3. On-off. Why is the GTPase activity of G proteins crucial 
to the proper functioning of a cell? Why have G proteins not 
evolved to catalyze GTP hydrolysis more efficiently? 


4. Viva la différence. Why is the fact that a monomeric hormone
binds to two identical receptor molecules, thus promoting
the formation of a dimer of the receptor, considered
remarkable?


5. Antibodies mimicking hormones. Antibodies have two 
identical antigen-binding sites. Remarkably, antibodies to 
the extracellular parts of growth-factor receptors often lead 
to the same cellular effects as does exposure to growth 
factors. Explain this observation. 


6. Facile exchange. A mutated form of the α subunit of the
heterotrimeric G protein has been identified; this form
readily exchanges nucleotides even in the absence of an
activated receptor. What would be the effect on a signaling
pathway containing the mutated α subunit?


7. Making connections. Suppose that you were investigat- 
ing a newly discovered growth-factor signal-transduction 
pathway. You found that, if you added GTPγS, a nonhydrolyzable
analog of GTP, the duration of the hormonal
response increased. What can you conclude? 


8. Diffusion rates. Normally, rates of diffusion vary
inversely with molecular weights; so smaller molecules
diffuse faster than do larger ones. In cells, however, calcium
ion diffuses more slowly than does cAMP. Propose a possible
explanation.


9. Negativity abounds. Fura-2 is not effective for the study 
of calcium levels in intact, living cells. On the basis of how 
Fura-2 is depicted on p. 410, why is it ineffective? 


10. Awash with glucose. Glucose is mobilized for ATP gen- 
eration in muscle in response to epinephrine, which activates 
Gαs. Cyclic AMP phosphodiesterase is an enzyme that converts
cAMP into AMP. How would inhibitors of cAMP
phosphodiesterase affect glucose mobilization in muscle? 


11. Getting it started. The insulin receptor, on dimeriza- 
tion, cross-phosphorylates the activation loop of the other 
receptor molecule, leading to activation of the kinase. 
Propose how this phosphorylation event can take place if 
the kinase starts in an inactive conformation. 


12. Many defects. Considerable effort has been directed 
toward determining the genes in which sequence variation 
contributes to the development of type 2 diabetes. 
Approximately 800 genes have been implicated. Propose an 
explanation for this observation. 


13. Growth-factor signaling. Human growth hormone binds 
to a cell-surface membrane protein that is not a receptor 
tyrosine kinase. The intracellular domain of the receptor 
can bind other proteins inside the cell. Furthermore, stud- 
ies indicate that the receptor is monomeric in the absence of 
hormone but dimerizes on hormone binding. Propose a 
possible mechanism for growth-hormone signaling. 


14. Receptor truncation. You prepare a cell line that overex- 
presses a mutant form of EGFR in which the entire intra- 
cellular region of the receptor has been deleted. Predict the 
effect of overexpression of this construct on EGF signaling 
in this cell line. 


15. Hybrid. Suppose that, through genetic manipulations, a 
chimeric receptor is produced that consists of the extracel- 
lular domain of the insulin receptor and the transmembrane 
and intracellular domains of the EGF receptor. Cells 
expressing this receptor are exposed to insulin, and the level 
of phosphorylation of the chimeric receptor is examined. 
What would you expect to observe and why? What would 
you expect to observe if these cells were exposed to EGF? 


16. Total amplification. Suppose that each β-adrenergic
receptor bound to epinephrine converts 100 molecules of
Gαs into their GTP forms and that each molecule of activated
adenylate cyclase produces 1000 molecules of cAMP
per second. With the assumption of a full response, how
many molecules of cAMP will be produced in 1 s after the
formation of a single complex between epinephrine and the
β-adrenergic receptor?


Chapter Integration Problems 


17. Nerve-growth-factor pathway. Nerve-growth factor 
(NGF) binds to a protein tyrosine kinase receptor. The 
amount of diacylglycerol in the plasma membrane increases 
in cells expressing this receptor when treated with NGF. 
Propose a simple signaling pathway and identify the iso- 
form of any participating enzymes. Would you expect the 
concentrations of any other common second messengers to 
increase on NGF treatment? 


18. Redundancy. Because of the high degree of genetic vari- 
ability in tumors, typically no single anticancer therapy is 
universally effective for all patients, even within a given 
tumor type. Hence, it is often desirable to inhibit a particu- 
lar pathway at more than one point in the signaling cascade. 
In addition to the EGFR-directed monoclonal antibody 
cetuximab, propose alternative strategies for targeting the 
EGF signaling pathway for antitumor drug development. 


Mechanism Problems 


19. Distant relatives. The structure of adenylate cyclase is 
similar to the structures of some types of DNA polymerases, 
suggesting that these enzymes derived from a common 
ancestor. Compare the reactions catalyzed by these two 
enzymes. In what ways are they similar? 


20. Kinase inhibitors as drugs. Functional and structural 
analysis indicates that Gleevec is an ATP-competitive 
inhibitor of the Bcr-Abl kinase. In fact, many kinase inhib- 
itors under investigation or currently marketed as drugs are 
ATP competitive. Can you suggest a potential drawback of 
drugs that utilize this particular mechanism of action? 


Data Interpretation Problems 


21. Establishing specificity. You wish to determine the
hormone-binding specificity of a newly identified membrane
receptor. Three different hormones, X, Y, and Z, were
mixed with the receptor in separate experiments, and the
percentage of binding capacity of the receptor was determined
as a function of hormone concentration, as shown in
graph A.


(a) What concentrations of each hormone yield 50%
maximal binding?


(b) Which hormone shows the highest binding affinity for 
the receptor? 


You next wish to determine whether the hormone-receptor 
complex stimulates the adenylate cyclase cascade. To do 
so, you measure adenylate cyclase activity as a function of 
hormone concentration, as shown in graph B. 


(c) What is the relation between the binding affinity of the
hormone-receptor complex and the ability of the hormone
to enhance adenylate cyclase activity? What can you conclude
about the mechanism of action of the hormone-receptor
complex?


(d) Suggest experiments that would determine whether
a Gαs protein is a component of the signal-transduction
pathway.


22. Binding issues. A scientist wishes to determine the number
of receptors specific for a ligand X, which he has in both
radioactive and nonradioactive form. In one experiment, he
adds increasing amounts of radioactive X and measures
how much of it is bound to the cells. The result is shown as
total activity in the following graph. Next, he performs the
same experiment, except that he includes a several hundredfold
excess of nonradioactive X. This result is shown as
nonspecific binding. The difference between the two curves
is the specific binding.


[Graph: amount of ligand bound plotted against ligand concentration, with curves for total binding, specific binding, and nonspecific binding.]


(a) Why is the total binding not an accurate representation 
of the number of receptors on the cell surface? 


(b) What is the purpose of performing the experiment in 
the presence of excess nonradioactive ligand? 


(c) What is the significance of the fact that specific binding 
attains a plateau? 


23. Counting receptors. With the use of experiments such as
those described in Problems 21 and 22, the number of
receptors in the cell membrane can be calculated. Suppose
that the specific activity of the ligand is 10¹² cpm per
millimole and that the maximal specific binding is 10⁴ cpm
per milligram of membrane protein. There are 10¹⁰ cells per
milligram of membrane protein. Assume that one ligand
binds per receptor. Calculate the number of receptor molecules
present per cell.


Metabolism: Basic Concepts 
and Design 


Hummingbirds are capable of prodigious feats of endurance. For instance, the
tiny ruby-throated hummingbird can store enough fuel to fly across the Gulf of
Mexico, a distance of some 500 miles, without resting. This achievement is possible
because of the ability to convert fuels into the cellular energy currency, ATP, represented
by the model at the right. [(Left) William Leaman/Alamy.]


The concepts of conformation and dynamics developed in Part I,
especially those dealing with the specificity and catalytic power of enzymes,
the regulation of their catalytic activity, and the transport of molecules and
ions across membranes, enable us to now ask questions fundamental to
biochemistry:


1. How does a cell extract energy and reducing power from its environment? 


2. How does a cell synthesize the building blocks of its macromolecules and 
then the macromolecules themselves? 


These processes are carried out by a highly integrated network of chemical 
reactions that are collectively known as metabolism or intermediary 
metabolism. 

Figure 15.1 Glucose metabolism. Glucose
is metabolized to pyruvate in 10 linked
reactions. Under anaerobic conditions,
pyruvate is metabolized to lactate and, under
aerobic conditions, to acetyl CoA. The
glucose-derived carbons of acetyl CoA are
subsequently oxidized to CO2.


More than a thousand chemical reactions take place in even as simple an
organism as Escherichia coli. The array of reactions may seem overwhelming
at first glance. However, closer scrutiny reveals that metabolism has a
coherent design containing many common motifs. These motifs include the use
of an energy currency and the repeated appearance of a limited number of
activated intermediates. In fact, a group of about 100 molecules play central
roles in all forms of life. Furthermore, although the number of reactions in
metabolism is large, the number of kinds of reactions is small and the mechanisms
of these reactions are usually quite simple. Metabolic pathways are
also regulated in common ways. The purpose of this chapter is to introduce
some general principles and motifs of metabolism to provide a foundation
for the more detailed studies to follow. These principles are:


1. Fuels are degraded and large molecules are constructed step by step in a
series of linked reactions called metabolic pathways. 


2. An energy currency common to all life forms, adenosine triphosphate 
(ATP), links energy-releasing pathways with energy-requiring pathways. 


3. The oxidation of carbon fuels powers the formation of ATP. 


4. Although there are many metabolic pathways, a limited number of 
types of reactions and particular intermediates are common to many 
pathways. 


5. Metabolic pathways are highly regulated. 


15.1 Metabolism Is Composed of Many Coupled, 
Interconnecting Reactions 


Living organisms require a continual input of free energy for three major 
purposes: (1) the performance of mechanical work in muscle contraction 
and cellular movements, (2) the active transport of molecules and ions, and 
(3) the synthesis of macromolecules and other biomolecules from simple 
precursors. The free energy used in these processes, which maintain an 
organism in a state that is far from equilibrium, is derived from the environ- 
ment. Photosynthetic organisms, or phototrophs, obtain this energy by trap- 
ping sunlight, whereas chemotrophs, which include animals, obtain energy 
through the oxidation of foodstuffs generated by phototrophs. 


Metabolism consists of energy-yielding and energy-requiring reactions 


Metabolism is essentially a linked series of chemical reactions that begins 
with a particular molecule and converts it into some other molecule or mol- 
ecules in a carefully defined fashion (Figure 15.1). There are many such 
defined pathways in the cell (Figure 15.2), and we will examine a few of 
them in some detail later. These pathways are interdependent, and their 
activity is coordinated by exquisitely sensitive means of communication in 
which allosteric enzymes are predominant (Section 10.1). We considered 
the principles of this communication in Chapter 14. 

We can divide metabolic pathways into two broad classes: (1) those that 
convert energy from fuels into biologically useful forms and (2) those that 
require inputs of energy to proceed. Although this division is often impre- 
cise, it is nonetheless a useful distinction in an examination of metabolism. 
Those reactions that transform fuels into cellular energy are called catabolic 
reactions or, more generally, catabolism. 


$$\text{Fuel (carbohydrates, fats)} \xrightarrow{\text{Catabolism}} \mathrm{CO_2} + \mathrm{H_2O} + \text{useful energy}$$


Those reactions that require energy—such as the synthesis of glucose, fats, 
or DNA—are called anabolic reactions or anabolism. The useful forms of 
energy that are produced in catabolism are employed in anabolism to 
generate complex structures from simple ones, or energy-rich states from 
energy-poor ones. 

Some pathways can be either anabolic or catabolic, depending on the energy
conditions in the cell. These pathways are referred to as amphibolic 
pathways. 

An important general principle of metabolism is that biosynthetic and 
degradative pathways are almost always distinct. This separation is necessary 
for energetic reasons, as will be evident in subsequent chapters. It also 
facilitates the control of metabolism. 


A thermodynamically unfavorable reaction can be driven by a 
favorable reaction 


How are specific pathways constructed from individual reactions? A path- 
way must satisfy minimally two criteria: (1) the individual reactions must be 
specific and (2) the entire set of reactions that constitute the pathway must 
be thermodynamically favored. A reaction that is specific will yield only one 
particular product or set of products from its reactants. As discussed in 
Chapter 8, a function of enzymes is to provide this specificity. The thermo- 
dynamics of metabolism is most readily approached in relation to free 
energy, which was also discussed in Chapter 8. A reaction can occur spontaneously
only if ΔG, the change in free energy, is negative. Recall that ΔG
for the formation of products C and D from substrates A and B is given by


$$\Delta G = \Delta G^{\circ\prime} + RT \ln \frac{[\mathrm{C}][\mathrm{D}]}{[\mathrm{A}][\mathrm{B}]}$$


Thus, the ΔG of a reaction depends on the nature of the reactants and products
(expressed by the ΔG°′ term, the standard free-energy change) and on
their concentrations (expressed by the second term).
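
The dependence of ΔG on concentration can be made concrete with a short calculation. The following is a minimal sketch, not part of the original text: the function name `delta_g`, the ΔG°′ value of +5.0 kJ mol⁻¹, and all concentrations are hypothetical choices made only for illustration, and a temperature of 25°C is assumed.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3  # gas constant, kJ mol^-1 K^-1


def delta_g(delta_g0_prime, products, reactants, temperature_k=298.15):
    """Actual free-energy change (kJ/mol) for reactants -> products.

    delta_g0_prime: standard free-energy change at pH 7, in kJ/mol.
    products, reactants: molar concentrations of each species.
    """
    mass_action_ratio = math.prod(products) / math.prod(reactants)
    return delta_g0_prime + R_KJ_PER_MOL_K * temperature_k * math.log(mass_action_ratio)


# Hypothetical reaction A + B -> C + D with an unfavorable standard value.
# With every species at 1 M the logarithmic term vanishes, so dG equals dG0'.
dg_standard = delta_g(5.0, products=[1.0, 1.0], reactants=[1.0, 1.0])

# Keeping products scarce (0.1 mM) relative to reactants (10 mM) makes dG negative.
dg_dilute_products = delta_g(5.0, products=[1e-4, 1e-4], reactants=[1e-2, 1e-2])

print(dg_standard, round(dg_dilute_products, 1))
```

Under these assumed concentrations the second value comes out near −18 kJ mol⁻¹, which illustrates how a reaction with a positive ΔG°′ can still proceed when products are kept at low concentration.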


Figure 15.2 Metabolic pathways. [From
the Kyoto Encyclopedia of Genes and
Genomes (www.genome.ad.jp/kegg).]


An important thermodynamic fact is that the overall free-energy change
for a chemically coupled series of reactions is equal to the sum of the free-energy
changes of the individual steps. Consider the following reactions:


$$\mathrm{A} \rightleftharpoons \mathrm{B} + \mathrm{C} \qquad \Delta G^{\circ\prime} = +21\ \mathrm{kJ\ mol^{-1}}\ (+5\ \mathrm{kcal\ mol^{-1}})$$

$$\mathrm{B} \rightleftharpoons \mathrm{D} \qquad \Delta G^{\circ\prime} = -34\ \mathrm{kJ\ mol^{-1}}\ (-8\ \mathrm{kcal\ mol^{-1}})$$

$$\mathrm{A} \rightleftharpoons \mathrm{C} + \mathrm{D} \qquad \Delta G^{\circ\prime} = -13\ \mathrm{kJ\ mol^{-1}}\ (-3\ \mathrm{kcal\ mol^{-1}})$$


Under standard conditions, A cannot be spontaneously converted into
B and C, because ΔG°′ is positive. However, the conversion of B into D
under standard conditions is thermodynamically feasible. Because free-energy
changes are additive, the conversion of A into C and D has a ΔG°′ of
−13 kJ mol⁻¹ (−3 kcal mol⁻¹), which means that it can occur spontaneously
under standard conditions. Thus, a thermodynamically unfavorable
reaction can be driven by a thermodynamically favorable reaction to which it is
coupled. In this example, the reactions are coupled by the shared chemical
intermediate B. Thus, metabolic pathways are formed by the coupling of
enzyme-catalyzed reactions such that the overall free energy of the pathway
is negative.
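
The additivity of free-energy changes can be checked with a few lines of arithmetic. This is only an illustrative sketch: the ΔG°′ values are the ones quoted in the example above, whereas the dictionary and function names are hypothetical.

```python
# Standard free-energy changes (kJ/mol) for the two steps quoted in the text.
STANDARD_DG = {
    "A -> B + C": 21.0,   # unfavorable on its own
    "B -> D": -34.0,      # favorable
}


def overall_delta_g0(step_values):
    """Sum the free-energy changes of chemically coupled steps."""
    return sum(step_values)


def is_spontaneous(delta_g0):
    """A reaction can proceed spontaneously only if its free-energy change is negative."""
    return delta_g0 < 0


overall = overall_delta_g0(STANDARD_DG.values())
print(overall, is_spontaneous(overall))
```

The sum is −13 kJ mol⁻¹, matching the value given for the net conversion of A into C and D, even though the first step alone has a positive ΔG°′.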


15.2 ATP Is the Universal Currency of Free Energy in 
Biological Systems 


Just as commerce is facilitated by the use of a common currency, the com- 
merce of the cell—metabolism—is facilitated by the use of a common 
energy currency, adenosine triphosphate (ATP). Part of the free energy 
derived from the oxidation of foodstuffs and from light is transformed 
into this highly accessible molecule, which acts as the free-energy donor 
in most energy-requiring processes such as motion, active transport, and 
biosynthesis. Indeed, most of catabolism consists of reactions that extract 
energy from fuels such as carbohydrates and fats and convert it into 
ATP.


ATP hydrolysis is exergonic 


ATP is a nucleotide consisting of adenine, a ribose, and a triphosphate unit
(Figure 15.3). The active form of ATP is usually a complex of ATP
with Mg²⁺ or Mn²⁺. In considering the role of ATP as an energy carrier, we
can focus on its triphosphate moiety. ATP is an energy-rich molecule because
its triphosphate unit contains two phosphoanhydride bonds. A large amount of
free energy is liberated when ATP is hydrolyzed to adenosine diphosphate
(ADP) and orthophosphate (Pᵢ) or when ATP is hydrolyzed to adenosine
monophosphate (AMP) and pyrophosphate (PPᵢ).


$$\mathrm{ATP} + \mathrm{H_2O} \rightleftharpoons \mathrm{ADP} + \mathrm{P_i} \qquad \Delta G^{\circ\prime} = -30.5\ \mathrm{kJ\ mol^{-1}}\ (-7.3\ \mathrm{kcal\ mol^{-1}})$$

$$\mathrm{ATP} + \mathrm{H_2O} \rightleftharpoons \mathrm{AMP} + \mathrm{PP_i} \qquad \Delta G^{\circ\prime} = -45.6\ \mathrm{kJ\ mol^{-1}}\ (-10.9\ \mathrm{kcal\ mol^{-1}})$$


The precise ΔG°′ for these reactions depends on the ionic strength of the
medium and on the concentrations of Mg²⁺ and other metal ions. Under
typical cellular concentrations, the actual ΔG for these hydrolyses is
approximately −50 kJ mol⁻¹ (−12 kcal mol⁻¹).


Figure 15.3 Structures of ATP, ADP, and AMP. These adenylates consist
of adenine (blue), a ribose (black), and a tri-, di-, or monophosphate unit
(red). The innermost phosphorus atom of ATP is designated Pα, the middle
one Pβ, and the outermost one Pγ.


The free energy liberated in the hydrolysis of ATP is harnessed to drive 
reactions that require an input of free energy, such as muscle contraction. In 
turn, ATP is formed from ADP and Pᵢ when fuel molecules are oxidized in
chemotrophs or when light is trapped by phototrophs. This ATP-ADP cycle 
is the fundamental mode of energy exchange in biological systems. 

Some biosynthetic reactions are driven by the hydrolysis of nucleoside 
triphosphates that are analogous to ATP, namely guanosine triphosphate
(GTP), uridine triphosphate (UTP), and cytidine triphosphate (CTP). The 
diphosphate forms of these nucleotides are denoted by GDP, UDP, and 
CDP, and the monophosphate forms are denoted by GMP, UMP, and 
CMP. Enzymes catalyze the transfer of the terminal phosphoryl group from 
one nucleotide to another. The phosphorylation of nucleoside monophos- 
phates is catalyzed by a family of nucleoside monophosphate kinases, as 
discussed in Section 9.4. The phosphorylation of nucleoside diphosphates 
is catalyzed by nucleoside diphosphate kinase, an enzyme with broad 
specificity. 


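$$\mathrm{NMP} + \mathrm{ATP} \overset{\text{nucleoside monophosphate kinase}}{\rightleftharpoons} \mathrm{NDP} + \mathrm{ADP}$$

$$\mathrm{NDP} + \mathrm{ATP} \overset{\text{nucleoside diphosphate kinase}}{\rightleftharpoons} \mathrm{NTP} + \mathrm{ADP}$$

Here NMP denotes a nucleoside monophosphate and NDP a nucleoside diphosphate.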

It is intriguing to note that, although all of the nucleoside triphosphates are
energetically equivalent, ATP is nonetheless the primary cellular energy
carrier. In addition, two important electron carriers, NAD⁺ and FAD, are
derivatives of ATP. The role of ATP in energy metabolism is paramount.


ATP hydrolysis drives metabolism by shifting the equilibrium 
of coupled reactions 


An otherwise unfavorable reaction can be made possible by coupling to 
ATP hydrolysis. Consider a chemical reaction that is thermodynamically 
unfavorable without an input of free energy, a situation common to 
many biosynthetic reactions. Suppose that the standard free energy of 
the conversion of compound A into compound B is +16.7 kJ mol⁻¹
(+4.0 kcal mol⁻¹):


$$\mathrm{A} \rightleftharpoons \mathrm{B} \qquad \Delta G^{\circ\prime} = +16.7\ \mathrm{kJ\ mol^{-1}}\ (+4.0\ \mathrm{kcal\ mol^{-1}})$$


The equilibrium constant K′eq of this reaction at 25°C is related to ΔG°′ (in
units of kilojoules per mole) by


$$K'_{\mathrm{eq}} = \frac{[\mathrm{B}]_{\mathrm{eq}}}{[\mathrm{A}]_{\mathrm{eq}}} = 10^{-\Delta G^{\circ\prime}/5.69} = 1.15 \times 10^{-3}$$


Thus, net conversion of A into B cannot take place when the molar ratio of
B to A is equal to or greater than 1.15 × 10⁻³. However, A can be converted
into B under these conditions if the reaction is coupled to the hydrolysis
of ATP. Under standard conditions, the ΔG°′ of hydrolysis is approximately
−30.5 kJ mol⁻¹ (−7.3 kcal mol⁻¹). The new overall reaction is


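$$\mathrm{A} + \mathrm{ATP} + \mathrm{H_2O} \rightleftharpoons \mathrm{B} + \mathrm{ADP} + \mathrm{P_i} \qquad \Delta G^{\circ\prime} = -13.8\ \mathrm{kJ\ mol^{-1}}\ (-3.3\ \mathrm{kcal\ mol^{-1}})$$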


Its free-energy change of −13.8 kJ mol⁻¹ (−3.3 kcal mol⁻¹) is the sum
of the value of ΔG°′ for the conversion of A into B [+16.7 kJ mol⁻¹
(+4.0 kcal mol⁻¹)] and the value of ΔG°′ for the hydrolysis of ATP
[−30.5 kJ mol⁻¹ (−7.3 kcal mol⁻¹)]. At pH 7, the equilibrium constant
of this coupled reaction is


$$K'_{\mathrm{eq}} = \frac{[\mathrm{B}]_{\mathrm{eq}}}{[\mathrm{A}]_{\mathrm{eq}}} \times \frac{[\mathrm{ADP}]_{\mathrm{eq}}[\mathrm{P_i}]_{\mathrm{eq}}}{[\mathrm{ATP}]_{\mathrm{eq}}} = 10^{13.8/5.69} = 2.67 \times 10^{2}$$


At equilibrium, the ratio of [B] to [A] is given by


$$\frac{[\mathrm{B}]_{\mathrm{eq}}}{[\mathrm{A}]_{\mathrm{eq}}} = K'_{\mathrm{eq}} \, \frac{[\mathrm{ATP}]_{\mathrm{eq}}}{[\mathrm{ADP}]_{\mathrm{eq}}[\mathrm{P_i}]_{\mathrm{eq}}}$$


which means that the hydrolysis of ATP enables A to be converted into B
until the [B]/[A] ratio reaches a value of 2.67 × 10². This equilibrium
ratio is strikingly different from the value of 1.15 × 10⁻³ for the reaction
A → B in the absence of ATP hydrolysis. In other words, coupling the
hydrolysis of ATP with the conversion of A into B under standard conditions
has changed the equilibrium ratio of B to A by a factor of about 10⁵.
If we were to use the ΔG of hydrolysis of ATP under cellular conditions
[−50.2 kJ mol⁻¹ (−12 kcal mol⁻¹)] in our calculations instead of ΔG°′,
the change in the equilibrium ratio would be even more dramatic, on the
order of 10⁸.
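
The two equilibrium constants in this example can be reproduced numerically. The sketch below assumes the relation between K′eq and ΔG°′ used in the text (a divisor of 5.69 kJ mol⁻¹ at 25°C); the function and variable names are hypothetical, and small differences from the quoted figures reflect rounding.

```python
RT_LN10 = 5.69  # kJ/mol at 25 °C, the constant used in the text


def k_eq(delta_g0_prime):
    """Equilibrium constant from a standard free-energy change in kJ/mol."""
    return 10 ** (-delta_g0_prime / RT_LN10)


DG_A_TO_B = 16.7          # kJ/mol, conversion of A into B
DG_ATP_HYDROLYSIS = -30.5  # kJ/mol, ATP + H2O -> ADP + Pi

k_uncoupled = k_eq(DG_A_TO_B)                      # about 1.2e-3
k_coupled = k_eq(DG_A_TO_B + DG_ATP_HYDROLYSIS)    # about 2.7e2
shift = k_coupled / k_uncoupled                    # about 2e5, i.e. of order 10^5

print(f"{k_uncoupled:.2e} {k_coupled:.2e} {shift:.1e}")
```

The ratio of the two constants depends only on the ΔG°′ of ATP hydrolysis, which is why the same shift of roughly 10⁵ applies to any reaction coupled to one ATP under standard conditions.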

We see here the thermodynamic essence of ATP’s action as an energy-coupling
agent. Cells maintain a high level of ATP by using oxidizable substrates
or light as sources of free energy for synthesizing the molecule. In the
cell, the hydrolysis of an ATP molecule in a coupled reaction then changes
the equilibrium ratio of products to reactants by a very large factor, of the
order of 10⁸. More generally, the hydrolysis of n ATP molecules changes the
equilibrium ratio of a coupled reaction (or sequence of reactions) by a factor
of 10⁸ⁿ. For example, the hydrolysis of three ATP molecules in a coupled
reaction changes the equilibrium ratio by a factor of 10²⁴. Thus, a thermodynamically
unfavorable reaction sequence can be converted into a favorable one
by coupling it to the hydrolysis of a sufficient number of ATP molecules in a new
reaction. It should also be emphasized that A and B in the preceding coupled
reaction may be interpreted very generally, not only as different chemical
species. For example, A and B may represent activated and unactivated
conformations of a protein that is activated by phosphorylation with ATP.
Through such changes in protein conformation, molecular motors such as
myosin, kinesin, and dynein convert the chemical energy of ATP into
mechanical energy (Chapter 34). Indeed, this conversion is the basis of
muscle contraction.

Alternatively, A and B may refer to the concentrations of an ion or molecule
on the outside and inside of a cell, as in the active transport of a nutrient.
The active transport of Na⁺ and K⁺ across membranes is driven by the
phosphorylation of the sodium-potassium pump by ATP and its subsequent
dephosphorylation (Section 13.2).
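
The scaling of the equilibrium shift with the number of ATP molecules hydrolyzed can be sketched as follows. This is an illustration under stated assumptions rather than a calculation from the text: it uses the cellular ΔG of −50.2 kJ mol⁻¹ and the 5.69 kJ mol⁻¹ divisor quoted earlier, and the function names are hypothetical. With these inputs the exponent per ATP comes out between 8 and 9, which the text rounds down to 8.

```python
import math

RT_LN10 = 5.69            # kJ/mol at 25 °C, the constant used in the text
DG_ATP_CELLULAR = -50.2   # kJ/mol, ATP hydrolysis under cellular conditions


def log10_shift(n_atp, dg_atp=DG_ATP_CELLULAR):
    """Base-10 exponent of the change in equilibrium ratio for n ATP hydrolyzed."""
    return -n_atp * dg_atp / RT_LN10


def shift_factor(n_atp, dg_atp=DG_ATP_CELLULAR):
    """Factor by which the equilibrium ratio of a coupled reaction changes."""
    return 10 ** log10_shift(n_atp, dg_atp)


# The exponent grows linearly with n, so the factor grows geometrically.
for n in (1, 2, 3):
    print(n, round(log10_shift(n), 1))
```

Because the exponents add, coupling a pathway to several ATP hydrolyses multiplies the individual factors, which is the reasoning behind the 10⁸ⁿ rule of thumb.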


The high phosphoryl potential of ATP results from structural differences 
between ATP and its hydrolysis products 


What makes ATP a particularly efficient phosphoryl-group donor? Let us 
compare the standard free energy of hydrolysis of ATP with that of a phos- 
phate ester, such as glycerol 3-phosphate: 


$$\mathrm{ATP} + \mathrm{H_2O} \rightleftharpoons \mathrm{ADP} + \mathrm{P_i} \qquad \Delta G^{\circ\prime} = -30.5\ \mathrm{kJ\ mol^{-1}}\ (-7.3\ \mathrm{kcal\ mol^{-1}})$$

$$\text{Glycerol 3-phosphate} + \mathrm{H_2O} \rightleftharpoons \text{glycerol} + \mathrm{P_i} \qquad \Delta G^{\circ\prime} = -9.2\ \mathrm{kJ\ mol^{-1}}\ (-2.2\ \mathrm{kcal\ mol^{-1}})$$


The magnitude of ΔG°′ for the hydrolysis of glycerol 3-phosphate is
much smaller than that of ATP, which means that ATP has a stronger tendency
to transfer its terminal phosphoryl group to water than does glycerol
3-phosphate. In other words, ATP has a higher phosphoryl-transfer potential
(phosphoryl-group-transfer potential) than does glycerol 3-phosphate.

The high phosphoryl-transfer potential of ATP can be explained by
features of the ATP structure. Because ΔG°′ depends on the difference in
free energies of the products and reactants, we need to examine the structures
of both ATP and its hydrolysis products, ADP and Pᵢ, to answer this
question. Three factors are important: resonance stabilization, electrostatic
repulsion, and stabilization due to hydration.


1. Resonance Stabilization. ADP and, particularly, Pᵢ have greater resonance
stabilization than does ATP. Orthophosphate has a number of resonance
forms of similar energy (Figure 15.4), whereas the γ-phosphoryl group of
ATP has a smaller number. Forms like that shown in Figure 15.5 are unfavorable
because a positively charged oxygen atom is adjacent to a positively
charged phosphorus atom, an electrostatically unfavorable juxtaposition.


Figure 15.4 Resonance structures of orthophosphate.


2. Electrostatic Repulsion. At pH 7, the triphosphate unit of ATP carries 
about four negative charges. These charges repel one another because they 
are in close proximity. The repulsion between them is reduced when ATP is 
hydrolyzed. 


3. Stabilization Due to Hydration. Water can bind more effectively to
ADP and Pᵢ than to the phosphoanhydride part of ATP, stabilizing
ADP and Pᵢ by hydration.


ATP is often called a high-energy phosphate compound, and its
phosphoanhydride bonds are referred to as high-energy bonds. Indeed, a
“squiggle” (~P) is often used to indicate such a bond. Nonetheless, there
is nothing special about the bonds themselves. They are high-energy bonds
in the sense that much free energy is released when they are hydrolyzed, for
the reasons listed in factors 1 through 3.


Figure 15.5 Improbable resonance
structure. The structure contributes little to
the terminal part of ATP, because two positive
charges are placed adjacent to each other.


Figure 15.6 Compounds with high
phosphoryl-transfer potential. These
compounds have a higher phosphoryl-transfer
potential than that of ATP and can be used to
phosphorylate ADP to form ATP.


Phosphoryl-transfer potential is an important form of cellular 
energy transformation 


The standard free energies of hydrolysis provide a convenient means of 
comparing the phosphoryl-transfer potential of phosphorylated com- 
pounds. Such comparisons reveal that ATP is not the only compound with a 
high phosphoryl-transfer potential. In fact, some compounds in biologi- 
cal systems have a higher phosphoryl-transfer potential than that of ATP. 
These compounds include phosphoenolpyruvate (PEP), 1,3-bisphospho- 
glycerate (1,3-BPG), and creatine phosphate (Figure 15.6). Thus, PEP can 
transfer its phosphoryl group to ADP to form ATP. Indeed, this transfer is 
one of the ways in which ATP is generated in the breakdown of sugars
(Chapter 16). It is significant that ATP has a phosphoryl-transfer potential 
that is intermediate among the biologically important phosphorylated 
molecules (Table 15.1). This intermediate position enables ATP to function 
efficiently as a carrier of phosphoryl groups. 
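The comparison can be made concrete with a short calculation. The sketch below, in Python, takes the standard free energies of hydrolysis from Table 15.1 and estimates the standard free-energy change for passing a phosphoryl group from a donor to ADP. The dictionary and function names are illustrative choices, not part of the text.

```python
# Standard free energies of hydrolysis (kJ/mol, pH 7), taken from Table 15.1.
DG_HYDROLYSIS = {
    "Phosphoenolpyruvate": -61.9,
    "1,3-Bisphosphoglycerate": -49.4,
    "Creatine phosphate": -43.1,
    "ATP (to ADP)": -30.5,
    "Glucose 1-phosphate": -20.9,
    "Glucose 6-phosphate": -13.8,
    "Glycerol 3-phosphate": -9.2,
}


def transfer_to_adp(donor: str) -> float:
    """Standard free-energy change (kJ/mol) for: donor + ADP -> product + ATP.

    The donor's hydrolysis is combined with the reverse of ATP hydrolysis.
    """
    return round(DG_HYDROLYSIS[donor] - DG_HYDROLYSIS["ATP (to ADP)"], 1)


def can_phosphorylate_adp(donor: str) -> bool:
    """True when the transfer to ADP is favorable under standard conditions."""
    return transfer_to_adp(donor) < 0


if __name__ == "__main__":
    for name in DG_HYDROLYSIS:
        if name != "ATP (to ADP)":
            print(f"{name:26s}{transfer_to_adp(name):+7.1f} kJ/mol")
```

Donors that lie above ATP in the table give a negative value and can form ATP from ADP; those below ATP give a positive value and are instead phosphorylated by ATP.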

The amount of ATP in muscle suffices to sustain contractile activity for 
less than a second. Creatine phosphate in vertebrate muscle serves as a res- 
ervoir of high-potential phosphoryl groups that can be readily transferred 
to ADP. Indeed, we use creatine phosphate to regenerate ATP from ADP 
every time that we exercise strenuously. This reaction is catalyzed by cre- 
atine kinase. 


$$\text{Creatine phosphate} + \text{ADP} \overset{\text{creatine kinase}}{\rightleftharpoons} \text{ATP} + \text{creatine}$$


At pH 7, the standard free energy of hydrolysis of creatine phosphate is
−43.1 kJ mol⁻¹ (−10.3 kcal mol⁻¹), compared with −30.5 kJ mol⁻¹ (−7.3
kcal mol⁻¹) for ATP. Hence, the standard free-energy change in forming
ATP from creatine phosphate is −12.6 kJ mol⁻¹ (−3.0 kcal mol⁻¹), which
corresponds to an equilibrium constant of 162.


$$K_{eq} = \frac{[\text{ATP}][\text{creatine}]}{[\text{ADP}][\text{creatine phosphate}]} = 10^{-\Delta G^{\circ\prime}/5.69} = 10^{12.6/5.69} = 162$$
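The arithmetic behind this equilibrium constant can be checked in a few lines. The following is a minimal sketch in Python, assuming the constant 5.69 kJ mol⁻¹ that appears in the equation; the function and variable names are hypothetical.

```python
def equilibrium_constant(delta_g_kj_per_mol: float, rt_ln10: float = 5.69) -> float:
    """Equilibrium constant from a standard free-energy change.

    Uses K = 10^(-dG/5.69), with dG in kJ/mol, as in the text's equation.
    """
    return 10 ** (-delta_g_kj_per_mol / rt_ln10)


# Standard free energies of hydrolysis quoted in the text (kJ/mol, pH 7).
DG_CREATINE_PHOSPHATE = -43.1
DG_ATP = -30.5

# Creatine phosphate + ADP -> ATP + creatine
dg_transfer = DG_CREATINE_PHOSPHATE - DG_ATP
k_eq = equilibrium_constant(dg_transfer)

if __name__ == "__main__":
    print(f"dG = {dg_transfer:.1f} kJ/mol, K_eq = {k_eq:.0f}")
```

The result comes out near 160, in line with the value of 162 given above; the small difference reflects rounding in the constants.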


Table 15.1 Standard free energies of hydrolysis of some phosphorylated compounds

Compound                   kJ mol⁻¹    kcal mol⁻¹
Phosphoenolpyruvate        −61.9       −14.8
1,3-Bisphosphoglycerate    −49.4       −11.8
Creatine phosphate         −43.1       −10.3
ATP (to ADP)               −30.5       −7.3
Glucose 1-phosphate        −20.9       −5.0
Pyrophosphate              −19.3       −4.6
Glucose 6-phosphate        −13.8       −3.3
Glycerol 3-phosphate       −9.2        −2.2

[Figure 15.7 plot: energy versus time (seconds, minutes, hours), with curves for ATP, creatine phosphate, anaerobic metabolism (Chapter 16), and aerobic metabolism (Chapters 17 and 18).]

In resting muscle, typical concentrations of these metabolites are [ATP] =
4 mM, [ADP] = 0.013 mM, [creatine phosphate] = 25 mM, and [creatine] =
13 mM. Because of its abundance and high phosphoryl-transfer potential
relative to that of ATP, creatine phosphate is a highly effective phosphoryl
buffer. Indeed, creatine phosphate is the major source of phosphoryl groups
for ATP regeneration for a runner during the first 4 seconds of a 100-meter
sprint. The fact that creatine phosphate can replenish ATP pools is the basis
of the use of creatine as a dietary supplement by athletes in sports requiring
short bursts of intense activity. After the creatine phosphate pool is depleted,
ATP must be generated through metabolism (Figure 15.7).
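One way to see why creatine phosphate buffers ATP so well is to compare the resting-muscle concentrations with the equilibrium constant of the creatine kinase reaction. The Python sketch below does this arithmetic. It assumes the standard free-energy change of −12.6 kJ mol⁻¹ and the constant 5.69 kJ mol⁻¹ used earlier in the chapter, and its function names are hypothetical.

```python
import math

# Resting-muscle concentrations quoted in the text (mM).
ATP, ADP = 4.0, 0.013
CREATINE_PHOSPHATE, CREATINE = 25.0, 13.0


def mass_action_ratio(atp, adp, creatine_phosphate, creatine):
    """[ATP][creatine] / ([ADP][creatine phosphate]) for the creatine kinase reaction."""
    return (atp * creatine) / (adp * creatine_phosphate)


def delta_g(q, dg_standard=-12.6, rt_ln10=5.69):
    """Free-energy change (kJ/mol) at mass-action ratio q: dG = dG_standard + 5.69*log10(q)."""
    return dg_standard + rt_ln10 * math.log10(q)


q = mass_action_ratio(ATP, ADP, CREATINE_PHOSPHATE, CREATINE)

if __name__ == "__main__":
    print(f"Q = {q:.0f}, dG = {delta_g(q):+.2f} kJ/mol")
```

With these figures the mass-action ratio is about 160, close to the equilibrium constant, so the free-energy change is near zero. On that reading the reaction sits close to equilibrium in resting muscle, and any fall in ATP would be offset promptly by transfer from creatine phosphate.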


15.3 The Oxidation of Carbon Fuels Is an Important Source 
of Cellular Energy 


ATP serves as the principal immediate donor of free energy in biological sys- 
tems rather than as a long-term storage form of free energy. In a typical cell, 
an ATP molecule is consumed within a minute of its formation. Although 
the total quantity of ATP in the body is limited to approximately 100 g, the 
turnover of this small quantity of ATP is very high. For example, a resting 
human being consumes about 40 kg of ATP in 24 hours. During strenuous 
exertion, the rate of utilization of ATP may be as high as 0.5 kg/minute. For 
a 2-hour run, 60 kg (132 pounds) of ATP is utilized. Clearly, having mech- 
anisms for regenerating ATP is vital. Motion, active transport, signal 
amplification, and biosynthesis can take place only if ATP is continually 
regenerated from ADP (Figure 15.8). The generation of ATP is one of the 
primary roles of catabolism. The carbon in fuel molecules, such as glucose
and fats, is oxidized to CO₂. The resulting electrons are captured and used
to regenerate ATP from ADP and Pᵢ.
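The turnover figures in this paragraph follow from simple arithmetic, sketched below in Python. The constants are the ones quoted in the text; the function names are illustrative.

```python
ATP_POOL_KG = 0.100              # total ATP in the body, about 100 g
RESTING_USE_KG_PER_DAY = 40.0    # consumption at rest over 24 hours
PEAK_USE_KG_PER_MIN = 0.5        # consumption during strenuous exertion


def recycles_per_day(pool_kg: float, use_kg_per_day: float) -> float:
    """Number of times the whole ATP pool must be regenerated in a day."""
    return use_kg_per_day / pool_kg


def atp_used_kg(rate_kg_per_min: float, minutes: float) -> float:
    """Mass of ATP consumed at a constant rate over a period of time."""
    return rate_kg_per_min * minutes


if __name__ == "__main__":
    print(recycles_per_day(ATP_POOL_KG, RESTING_USE_KG_PER_DAY))
    print(atp_used_kg(PEAK_USE_KG_PER_MIN, 120))
```

At rest, the 100 g pool is therefore regenerated roughly 400 times a day, and a 2-hour run at the peak rate accounts for the 60 kg mentioned above.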

In aerobic organisms, the ultimate electron acceptor in the oxidation of
carbon is O₂ and the oxidation product is CO₂. Consequently, the more
reduced a carbon is to begin with, the more free energy is released by its
oxidation. Figure 15.9 shows the ΔG°′ of oxidation for one-carbon compounds.

Figure 15.7 Sources of ATP during exercise. In the initial seconds, exercise is
powered by existing high-phosphoryl-transfer compounds (ATP and creatine
phosphate). Subsequently, the ATP must be regenerated by metabolic pathways.

Figure 15.8 ATP-ADP cycle. This cycle is the fundamental mode of energy
exchange in biological systems.

Figure 15.9 Free energy of oxidation of single-carbon compounds.

Although fuel molecules are more complex (Figure 15.10) than the
single-carbon compounds depicted in Figure 15.9, when a fuel is oxidized
the oxidation takes place one carbon at a time. The carbon-oxidation
energy is used in some cases to create a compound with high phosphoryl-transfer
potential and in other cases to create an ion gradient. In either case,
the end point is the formation of ATP.
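How reduced a carbon atom is can be expressed as a formal oxidation state. The Python sketch below applies the usual bookkeeping rule to a series of one-carbon compounds running from methane to carbon dioxide. This particular series is an assumption made for illustration, since the figure itself is not reproduced here, and the helper name is hypothetical.

```python
def carbon_oxidation_state(bonds_to_h: int, bonds_to_o: int) -> int:
    """Formal oxidation state of a carbon atom bonded only to H and O.

    Each C-H bond counts -1 and each C-O bond counts +1 (a C=O double
    bond counts as two bonds).
    """
    if bonds_to_h + bonds_to_o != 4:
        raise ValueError("carbon must have four bonds in total")
    return bonds_to_o - bonds_to_h


# (bonds to H, bonds to O) for an assumed one-carbon series.
ONE_CARBON_SERIES = {
    "methane (CH4)": (4, 0),
    "methanol (CH3OH)": (3, 1),
    "formaldehyde (H2CO)": (2, 2),
    "formic acid (HCOOH)": (1, 3),
    "carbon dioxide (CO2)": (0, 4),
}

if __name__ == "__main__":
    for name, (h, o) in ONE_CARBON_SERIES.items():
        print(f"{name:22s}{carbon_oxidation_state(h, o):+d}")
```

The lower the oxidation state at the start, the more oxidation steps remain before CO₂ is reached, which is the sense in which a more reduced carbon releases more free energy.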


Compounds with high phosphoryl-transfer potential can couple carbon 
oxidation to ATP synthesis 


How is the energy released in the oxidation of a carbon compound converted
into ATP? As an example, consider glyceraldehyde 3-phosphate (GAP),
which is a metabolite of glucose formed in the oxidation of that sugar. The
C-1 carbon is at the aldehyde-oxidation level and is not in its most oxidized
state. Oxidation of the aldehyde to an acid will release energy.

$$\text{Glyceraldehyde 3-phosphate} \xrightarrow{\text{oxidation}} \text{3-phosphoglyceric acid}$$


However, the oxidation does not take place directly. Instead, the carbon 
oxidation generates an acyl phosphate, 1,3-bisphosphoglycerate. The electrons
released are captured by NAD⁺, which we will consider shortly.


$$\text{Glyceraldehyde 3-phosphate (GAP)} + \text{NAD}^+ + \text{HPO}_4^{2-} \longrightarrow \text{1,3-bisphosphoglycerate (1,3-BPG)} + \text{NADH} + \text{H}^+$$


For reasons similar to those discussed for ATP, 1,3-bisphosphoglycerate 
has a high phosphoryl-transfer potential. Thus, the cleavage of 1,3-BPG 
can be coupled to the synthesis of ATP. 


$$\text{1,3-Bisphosphoglycerate} + \text{ADP} \longrightarrow \text{3-phosphoglyceric acid} + \text{ATP}$$


The energy of oxidation is initially trapped as a high-phosphoryl-transfer-potential
compound and then used to form ATP. The oxidation energy of a
carbon atom is transformed into phosphoryl-transfer potential, first as
1,3-bisphosphoglycerate and ultimately as ATP. We will consider these
reactions in mechanistic detail in Chapter 16.


Ion gradients across membranes provide an important
form of cellular energy that can be coupled to ATP 
synthesis 


As described in Chapter 13, electrochemical potential is 
an effective means of storing free energy. Indeed, the elec- 
trochemical potential of ion gradients across membranes, 
produced by the oxidation of fuel molecules or by photo- 
synthesis, ultimately powers the synthesis of most of the 
ATP in cells. In general, ion gradients are versatile means 
of coupling thermodynamically unfavorable reactions to 
favorable ones. Indeed, in animals, proton gradients gener- 
ated by the oxidation of carbon fuels account for more 
than 90% of ATP generation (Figure 15.11). This process 
is called oxidative phosphorylation (Chapter 18). ATP 
hydrolysis can then be used to form ion gradients of different
types and functions. The electrochemical potential of
a Na⁺ gradient, for example, can be tapped to pump Ca²⁺
out of cells or to transport nutrients such as sugars and 
amino acids into cells. 


Energy from foodstuffs is extracted in three stages 


Let us take an overall view of the processes of energy con- 
version in higher organisms before considering them in 
detail in subsequent chapters. Hans Krebs described three 
stages in the generation of energy from the oxidation of 
foodstuffs (Figure 15.12). 

In the first stage, large molecules in food are broken down 
into smaller units. This process is digestion. Proteins are 
hydrolyzed to their 20 different amino acids, polysaccha- 
rides are hydrolyzed to simple sugars such as glucose, and 
fats are hydrolyzed to glycerol and fatty acids. The degra- 
dation products are then absorbed by the cells of the intes- 
tine and distributed throughout the body. This stage is 
strictly a preparation stage; no useful energy is captured in 
this phase. 

In the second stage, these numerous small molecules are 
degraded to a few simple units that play a central role in 
metabolism. In fact, most of them—sugars, fatty acids, 
glycerol, and several amino acids—are converted into the 
acetyl unit of acetyl CoA. Some ATP is generated in this 
stage, but the amount is small compared with that obtained 
in the third stage. 

In the third stage, ATP is produced from the complete 
oxidation of the acetyl unit of acetyl CoA. The third stage 
consists of the citric acid cycle and oxidative phosphoryla- 
tion, which are the final common pathways in the oxidation of 
fuel molecules. Acetyl CoA brings acetyl units into the citric 
acid cycle [also called the tricarboxylic acid (TCA) cycle or 
Krebs cycle], where they are completely oxidized to CO₂.
Four pairs of electrons are transferred (three to NAD⁺ and
one to FAD) for each acetyl group that is oxidized. Then, a
proton gradient is generated as electrons flow from the reduced forms of
these carriers to O₂, and this gradient is used to synthesize ATP.

Figure 15.11 Proton gradients. The oxidation of fuels can power
the formation of proton gradients by the action of specific proton
pumps. These proton gradients can in turn drive the synthesis of ATP
when the protons flow through an ATP-synthesizing enzyme.

Figure 15.12 Stages of catabolism. The extraction of energy
from fuels can be divided into three stages.

Figure 15.13 Structures of the oxidized forms of nicotinamide-derived electron
carriers. Nicotinamide adenine dinucleotide (NAD⁺) and nicotinamide adenine
dinucleotide phosphate (NADP⁺) are prominent carriers of high-energy
electrons. In NAD⁺, R = H; in NADP⁺, R = PO₃²⁻.


15.4 Metabolic Pathways Contain Many Recurring Motifs 


At first glance, metabolism appears intimidating because of the sheer num- 
ber of reactants and reactions. Nevertheless, there are unifying themes that 
make the comprehension of this complexity more manageable. These unify- 
ing themes include common metabolites, reactions, and regulatory schemes 
that stem from a common evolutionary heritage. 


Activated carriers exemplify the modular design and economy of 
metabolism 


We have seen that phosphoryl transfer can be used to drive otherwise endergonic
reactions, alter the energy of conformation of a protein, or serve as a
signal to alter the activity of a protein. The phosphoryl-group donor in all of 
these reactions is ATP. In other words, ATP is an activated carrier of phos- 
phoryl groups because phosphoryl transfer from ATP is an exergonic process. 
The use of activated carriers is a recurring motif in biochemistry, and we 
will consider several such carriers here. Many such activated carriers func- 
tion as coenzymes: 


1. Activated Carriers of Electrons for Fuel Oxidation. In aerobic organisms,
the ultimate electron acceptor in the oxidation of fuel molecules is O₂.
However, electrons are not transferred directly to O₂. Instead, fuel molecules
transfer electrons to special carriers, which are either pyridine nucleotides
or flavins. The reduced forms of these carriers then transfer their
high-potential electrons to O₂.

Nicotinamide adenine dinucleotide is a major electron carrier in the oxidation
of fuel molecules (Figure 15.13). The reactive part of NAD⁺ is its
nicotinamide ring, a pyridine derivative synthesized from the vitamin niacin.
In the oxidation of a substrate, the nicotinamide ring of NAD⁺ accepts a
hydrogen ion and two electrons, which are equivalent to a hydride ion (H:⁻).
The reduced form of this carrier is called NADH. In the oxidized form, the
nitrogen atom carries a positive charge, as indicated by NAD⁺. NAD⁺ is
the electron acceptor in many reactions of the type


$$\mathrm{R{-}CH(OH){-}R'} + \mathrm{NAD^+} \rightleftharpoons \mathrm{R{-}CO{-}R'} + \mathrm{NADH} + \mathrm{H^+}$$


In this dehydrogenation, one hydrogen atom of the substrate is directly 
transferred to NAD⁺, whereas the other appears in the solvent as a proton.
Both electrons lost by the substrate are transferred to the nicotinamide 
ring. 

The other major electron carrier in the oxidation of fuel molecules is the
coenzyme flavin adenine dinucleotide (Figure 15.14). The abbreviations for
the oxidized and reduced forms of this carrier are FAD and FADH₂,
respectively. FAD is the electron acceptor in reactions of the type

$$\mathrm{R{-}CH_2{-}CH_2{-}R'} + \mathrm{FAD} \rightleftharpoons \mathrm{R{-}CH{=}CH{-}R'} + \mathrm{FADH_2}$$

The reactive part of FAD is its isoalloxazine ring, a derivative of the vitamin
riboflavin (Figure 15.15). FAD, like NAD⁺, can accept two electrons. In
doing so, FAD, unlike NAD⁺, takes up two protons. These carriers of high-potential
electrons as well as flavin mononucleotide (FMN), an electron carrier
similar to FAD but lacking the adenine nucleotide, will be considered further
in Chapter 18.


$$\text{FAD (oxidized form)} + 2\,\text{H}^+ + 2\,e^- \rightleftharpoons \text{FADH}_2\ \text{(reduced form)}$$


2. An Activated Carrier of Electrons for Reductive Biosynthesis. High- 
potential electrons are required in most biosyntheses because the precursors 
are more oxidized than the products. Hence, reducing power is needed in 
addition to ATP. For example, in the biosynthesis of fatty acids, the keto 
group of an added two-carbon unit is reduced to a methylene group in sev- 
eral steps. This sequence of reactions requires an input of four electrons. 


$$\mathrm{R{-}CH_2{-}CO{-}R'} + 4\,\mathrm{H^+} + 4\,e^- \longrightarrow \mathrm{R{-}CH_2{-}CH_2{-}R'} + \mathrm{H_2O}$$


The electron donor in most reductive biosyntheses is NADPH, the reduced
form of nicotinamide adenine dinucleotide phosphate (NADP⁺; see
Figure 15.13). NADPH differs from NADH in that the 2′-hydroxyl group
of its adenosine moiety is esterified with phosphate. NADPH carries electrons
in the same way as NADH. However, NADPH is used almost exclusively
for reductive biosyntheses, whereas NADH is used primarily for the
generation of ATP. The extra phosphoryl group on NADPH is a tag that
enables enzymes to distinguish between high-potential electrons to be used
in anabolism and those to be used in catabolism.

Figure 15.14 Structure of the oxidized form of flavin adenine dinucleotide
(FAD). This electron carrier consists of a flavin mononucleotide (FMN) unit
(shown in blue) and an AMP unit (shown in black).

Figure 15.15 Structures of the reactive parts of FAD and FADH₂. The electrons
and protons are carried by the isoalloxazine ring component of FAD and FADH₂.

Figure 15.16 Structure of coenzyme A (CoA-SH).

3. An Activated Carrier of Two-Carbon Fragments. Coenzyme A, another
central molecule in metabolism, is a carrier of acyl groups and is derived
from the vitamin pantothenate (Figure 15.16). Acyl groups are important
constituents both in catabolism, as in the oxidation of fatty acids, and in
anabolism, as in the synthesis of membrane lipids. The terminal sulfhydryl
group in CoA is the reactive site. Acyl groups are linked to CoA by thioester
bonds. The resulting derivative is called an acyl CoA. An acyl group often
linked to CoA is the acetyl unit; this derivative is called acetyl CoA. The
ΔG°′ for the hydrolysis of acetyl CoA has a large negative value:


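$$\text{Acetyl CoA} + \text{H}_2\text{O} \rightleftharpoons \text{acetate} + \text{CoA} + \text{H}^+$$
$$\Delta G^{\circ\prime} = -31.4\ \text{kJ mol}^{-1}\ (-7.5\ \text{kcal mol}^{-1})$$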
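As a rough worked example, the relation used for creatine phosphate earlier in the chapter can be applied to this hydrolysis as well. The figure below is an illustration computed here, not a value quoted in the text, and it assumes the same constant of 5.69 kJ mol⁻¹:

```latex
K'_{eq} = 10^{-\Delta G^{\circ\prime}/5.69} = 10^{31.4/5.69} \approx 3 \times 10^{5}
```

Under standard conditions, then, the equilibrium lies far on the side of hydrolysis.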


The hydrolysis of a thioester is thermodynamically more favorable than 
that of an oxygen ester because the electrons of the C=O bond cannot form 
resonance structures with the C—S bond that are as stable as those that they 
can form with the C—O bond. Consequently, acetyl CoA has a high acetyl- 
group-transfer potential because transfer of the acetyl group is exergonic. 
Acetyl CoA carries an activated acetyl group, just as ATP carries an 
activated phosphoryl group. 

The use of activated carriers illustrates two key aspects of metabolism.
First, NADH, NADPH, and FADH₂ react slowly with O₂ in the absence
of a catalyst. Likewise, ATP and acetyl CoA are hydrolyzed slowly (in times
of many hours or even days) in the absence of a catalyst. These molecules
are kinetically quite stable in the face of a large thermodynamic driving
force for reaction with O₂ (in regard to the electron carriers) and H₂O (for
ATP and acetyl CoA). The kinetic stability of these molecules in the absence
of specific catalysts is essential for their biological function because it enables
enzymes to control the flow of free energy and reducing power.

Second, most interchanges of activated groups in metabolism are accomplished
by a rather small set of carriers (Table 15.2). The existence of a recurring set
of activated carriers in all organisms is one of the unifying motifs of
biochemistry. Furthermore, it illustrates the modular design of metabolism. A
small set of molecules carries out a very wide range of tasks. Metabolism is
readily comprehended because of the economy and elegance of its underlying
design.

Table 15.2 Some activated carriers in metabolism

Carrier molecule in activated form    Group carried       Vitamin precursor
ATP                                   Phosphoryl
NADH and NADPH                        Electrons           Nicotinate (niacin)
FADH₂                                 Electrons           Riboflavin (vitamin B₂)
FMNH₂                                 Electrons           Riboflavin (vitamin B₂)
Coenzyme A                            Acyl                Pantothenate
Lipoamide                             Acyl
Thiamine pyrophosphate                Aldehyde            Thiamine (vitamin B₁)
Biotin                                CO₂                 Biotin
Tetrahydrofolate                      One-carbon units    Folate
S-Adenosylmethionine                  Methyl
Uridine diphosphate glucose           Glucose
Cytidine diphosphate diacylglycerol   Phosphatidate
Nucleoside triphosphates              Nucleotides

Note: Many of the activated carriers are coenzymes that are derived from water-soluble vitamins.

Table 15.3 The B vitamins

Vitamin                   Coenzyme                                  Typical reaction type                                     Consequences of deficiency
Thiamine (B₁)             Thiamine pyrophosphate                    Aldehyde transfer                                         Beriberi (weight loss, heart problems, neurological dysfunction)
Riboflavin (B₂)           Flavin adenine dinucleotide (FAD)         Oxidation-reduction                                       Cheilosis and angular stomatitis (lesions of the mouth), dermatitis
Pyridoxine (B₆)           Pyridoxal phosphate                       Group transfer to or from amino acids                     Depression, confusion, convulsions
Nicotinic acid (niacin)   Nicotinamide adenine dinucleotide (NAD⁺)  Oxidation-reduction                                       Pellagra (dermatitis, depression, diarrhea)
Pantothenic acid          Coenzyme A                                Acyl-group transfer                                       Hypertension
Biotin                    Biotin-lysine adducts (biocytin)          ATP-dependent carboxylation and carboxyl-group transfer   Rash about the eyebrows, muscle pain, fatigue (rare)
Folic acid                Tetrahydrofolate                          Transfer of one-carbon components; thymine synthesis      Anemia, neural-tube defects in development
B₁₂                       5′-Deoxyadenosyl cobalamin                Transfer of methyl groups; intramolecular rearrangements  Anemia, pernicious anemia, methylmalonic acidosis


Many activated carriers are derived from vitamins 


Almost all the activated carriers that act as coenzymes are derived from 
vitamins. Vitamins are organic molecules that are needed in small amounts 
in the diets of some higher animals. Table 15.3 lists the vitamins that act 
as coenzymes and Figure 15.17 shows the structures of some. This series 
of vitamins is known as the vitamin B group. Note that, in all cases, the 
vitamin must be modified before it can serve its function. We have 
already touched on the roles of niacin, riboflavin, and pantothenate. We 
will see these three and the other B vitamins many times in our study of 
biochemistry. 


Vitamins serve the same roles in nearly all forms of life, but higher
animals lost the capacity to synthesize them in the course of evolution.
For instance, whereas E. coli can thrive on glucose and organic salts,
human beings require at least 12 vitamins in their diet. The biosynthetic
pathways for vitamins can be complex; thus, it is biologically more efficient
to ingest vitamins than to synthesize the enzymes required to construct
them from simple molecules. This efficiency comes at the cost of dependence
on other organisms for chemicals essential for life. Indeed, vitamin
deficiency can generate diseases in all organisms requiring these molecules
(see Tables 15.3 and 15.4).

Figure 15.17 Structures of some of the B vitamins. Panels: vitamin B₅
(pantothenate), vitamin B₃ (niacin), and vitamin B₂ (riboflavin).

Figure 15.18 Structures of some vitamins that do not function as coenzymes.

Table 15.4 Noncoenzyme vitamins

Vitamin             Function                                         Deficiency
A                   Roles in vision, growth, reproduction            Night blindness, cornea damage, damage to respiratory and gastrointestinal tract
C (ascorbic acid)   Antioxidant                                      Scurvy (swollen and bleeding gums, subdermal hemorrhaging)
D                   Regulation of calcium and phosphate metabolism   Rickets (children): skeletal deformities, impaired growth; osteomalacia (adults): soft, bending bones
E                   Antioxidant                                      Inhibition of sperm production; lesions in muscles and nerves (rare)
K                   Blood coagulation                                Subdermal hemorrhaging

Not all vitamins function as coenzymes. Vitamins designated by the let- 
ters A, C, D, E, and K (Figure 15.18 and Table 15.4) have a diverse array of 
functions. Vitamin A (retinol) is the precursor of retinal, the light-sensitive 
group in rhodopsin and other visual pigments (Section 32.3), and retinoic 
acid, an important signaling molecule. A deficiency of this vitamin leads to 
night blindness. In addition, young animals require vitamin A for growth. 
Vitamin C, or ascorbate, acts as an antioxidant. A deficiency in vitamin C can 
lead to scurvy, a disease due to malformed collagen and characterized by skin 
lesions and blood-vessel fragility (Section 27.6). A metabolite of vitamin D is
a hormone that regulates the metabolism of calcium and phosphorus. A deficiency
in vitamin D impairs bone formation in growing animals. Infertility in
rats is a consequence of vitamin E (α-tocopherol) deficiency. This vitamin
reacts with reactive oxygen species such as hydroxyl radicals and inactivates
them before they can oxidize unsaturated membrane lipids, damaging cell
structures. Vitamin K is required for normal blood clotting (Section 10.4).

Table 15.5 Types of chemical reactions in metabolism

Type of reaction                           Description
Oxidation-reduction                        Electron transfer
Ligation requiring ATP cleavage            Formation of covalent bonds (i.e., carbon-carbon bonds)
Isomerization                              Rearrangement of atoms to form isomers
Group transfer                             Transfer of a functional group from one molecule to another
Hydrolytic                                 Cleavage of bonds by the addition of water
Addition or removal of functional groups   Addition of functional groups to double bonds or their removal to form double bonds


Key reactions are reiterated throughout metabolism 


Just as there is an economy of design in the use of activated carriers, so is 
there an economy of design in biochemical reactions. The thousands of 
metabolic reactions, bewildering at first in their variety, can be subdivided 
into just six types (Table 15.5). Specific reactions of each type appear 
repeatedly, reducing the number of reactions that a student needs to learn. 


1. Oxidation—reduction reactions are essential components of many path- 
ways. Useful energy is often derived from the oxidation of carbon com- 
pounds. Consider the following two reactions: 


$$\text{Succinate} + \text{FAD} \rightleftharpoons \text{fumarate} + \text{FADH}_2 \qquad (1)$$

$$\text{Malate} + \text{NAD}^+ \rightleftharpoons \text{oxaloacetate} + \text{NADH} + \text{H}^+ \qquad (2)$$


These two oxidation—reduction reactions are components of the citric acid 
cycle (Chapter 17), which completely oxidizes the activated two-carbon 
fragment of acetyl CoA to two molecules of CO₂. In reaction 1, FADH₂
carries the electrons, whereas, in reaction 2, electrons are carried by 
NADH. 


2. Ligation reactions form bonds by using free energy from ATP cleavage.
Reaction 3 illustrates the ATP-dependent formation of a carbon-carbon
bond, necessary to combine smaller molecules to form larger ones.
Oxaloacetate is formed from pyruvate and CO₂.


$$\text{Pyruvate} + \text{CO}_2 + \text{ATP} + \text{H}_2\text{O} \rightleftharpoons \text{oxaloacetate} + \text{ADP} + \text{P}_i + \text{H}^+ \qquad (3)$$

The oxaloacetate can be used in the citric acid cycle, or converted into
glucose or amino acids such as aspartic acid.


3. Isomerization reactions rearrange particular atoms within a molecule. 
Their role is often to prepare the molecule for subsequent reactions such as 
the oxidation—reduction reactions described in point 1. 


$$\text{Citrate} \rightleftharpoons \text{isocitrate} \qquad (4)$$


Reaction 4 is, again, a component of the citric acid cycle. This isomerization
prepares the molecule for subsequent oxidation and decarboxylation by 
moving the hydroxyl group of citrate from a tertiary to a secondary 
position. 


4. Group-transfer reactions play a variety of roles. Reaction 5 is representa- 
tive of such a reaction. A phosphoryl group is transferred from the activated 
phosphoryl-group carrier, ATP, to glucose, the initial step in glycolysis, a 
key pathway for extracting energy from glucose (Chapter 16). This reaction 
traps glucose in the cell so that further catabolism can take place. 


$$\text{Glucose} + \text{ATP} \rightleftharpoons \text{glucose 6-phosphate (G-6P)} + \text{ADP} \qquad (5)$$


As stated earlier, group-transfer reactions are used to synthesize ATP. We 
also saw examples of their use in signaling pathways (Chapter 14). 


5. Hydrolytic reactions cleave bonds by the addition of water. Hydrolysis is 
a common means employed to break down large molecules, either to facili- 
tate further metabolism or to reuse some of the components for biosyn- 
thetic purposes. Proteins are digested by hydrolytic cleavage (Chapters 9 
and 10). Reaction 6 illustrates the hydrolysis of a peptide to yield two 
smaller peptides. 


6. Functional groups may be added to double bonds to form single bonds or 
removed from single bonds to form double bonds. The enzymes that catalyze 
these types of reaction are classified as lyases. An important example, illus- 
trated in reaction 7, is the conversion of the six-carbon molecule fructose 
1,6-bisphosphate into two three-carbon fragments: dihydroxyacetone phos- 
phate and glyceraldehyde 3-phosphate. 


$$\text{Fructose 1,6-bisphosphate (F-1,6-BP)} \rightleftharpoons \text{dihydroxyacetone phosphate (DHAP)} + \text{glyceraldehyde 3-phosphate (GAP)} \qquad (7)$$


This reaction is a critical step in glycolysis (Chapter 16). Dehydrations to 
form double bonds, such as the formation of phosphoenolpyruvate (see 
Table 15.1) from 2-phosphoglycerate (reaction 8), are important reactions 
of this type. 


$$\text{2-Phosphoglycerate} \rightleftharpoons \text{phosphoenolpyruvate (PEP)} + \text{H}_2\text{O} \qquad (8)$$


The dehydration sets up the next step in the pathway, a group-transfer reac- 
tion that uses the high phosphoryl-transfer potential of the product PEP to 
form ATP from ADP. 

These six fundamental reaction types are the basis of metabolism. 
Remember that all six types can proceed in either direction, depending on 
the standard free energy for the specific reaction and the concentrations of 
the reactants and products inside the cell. An effective way to learn is to look 
for commonalities in the diverse metabolic pathways that we will be 
examining. There is a chemical logic that, when exposed, renders the com- 
plexity of the chemistry of living systems more manageable and reveals its 
elegance. 
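One way to keep the six types straight while studying is to file each numbered reaction of this section under its type. The Python sketch below does so; the enum and helper names are hypothetical, and the one-line reaction summaries are paraphrased from the text.

```python
from enum import Enum


class ReactionType(Enum):
    """The six reaction types listed in Table 15.5."""
    OXIDATION_REDUCTION = "electron transfer"
    LIGATION = "formation of covalent bonds, driven by ATP cleavage"
    ISOMERIZATION = "rearrangement of atoms to form isomers"
    GROUP_TRANSFER = "transfer of a functional group between molecules"
    HYDROLYTIC = "cleavage of bonds by the addition of water"
    ADDITION_OR_REMOVAL = "addition to double bonds, or removal to form them"


# Numbered reactions from this section, keyed by reaction number.
EXAMPLES = {
    1: ("succinate -> fumarate", ReactionType.OXIDATION_REDUCTION),
    2: ("malate -> oxaloacetate", ReactionType.OXIDATION_REDUCTION),
    3: ("pyruvate + CO2 -> oxaloacetate", ReactionType.LIGATION),
    4: ("citrate -> isocitrate", ReactionType.ISOMERIZATION),
    5: ("glucose + ATP -> glucose 6-phosphate + ADP", ReactionType.GROUP_TRANSFER),
    6: ("peptide + H2O -> two smaller peptides", ReactionType.HYDROLYTIC),
    7: ("fructose 1,6-bisphosphate -> DHAP + GAP", ReactionType.ADDITION_OR_REMOVAL),
    8: ("2-phosphoglycerate -> PEP + H2O", ReactionType.ADDITION_OR_REMOVAL),
}


def reactions_of_type(kind: ReactionType) -> list[int]:
    """Return the numbers of the example reactions that belong to one type."""
    return [number for number, (_, t) in EXAMPLES.items() if t is kind]


if __name__ == "__main__":
    for kind in ReactionType:
        print(f"{kind.name:22s}{reactions_of_type(kind)}")
```

The same table can be extended as new pathways are met in later chapters, which makes the recurrence of the six types easy to see.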


Metabolic processes are regulated in three principal ways 


It is evident that the complex network of metabolic reactions must be rigorously
regulated. At the same time, metabolic control must be flexible, to
adjust metabolic activity to the constantly changing external environments
of cells. Metabolism is regulated through control of (1) the amounts of
enzymes, (2) their catalytic activities, and (3) the accessibility of substrates.


Controlling the amounts of enzymes. The amount of a particular enzyme 
depends on both its rate of synthesis and its rate of degradation. The level of 
many enzymes is adjusted primarily by a change in the rate of transcription 
of the genes encoding them (Chapters 29 and 31). In E. coli, for example, 
the presence of lactose induces within minutes a more than 50-fold increase 
in the rate of synthesis of β-galactosidase, an enzyme required for the breakdown
of this disaccharide.


Controlling catalytic activity. The catalytic activity of enzymes is con- 
trolled in several ways. Reversible allosteric control is especially important. 
For example, the first reaction in many biosynthetic pathways is allosteri- 
cally inhibited by the ultimate product of the pathway. The inhibition of 
aspartate transcarbamoylase by cytidine triphosphate (Section 10.1) is a 
well-understood example of feedback inhibition. This type of control can be 
almost instantaneous. Another recurring mechanism is reversible covalent 
modification. For example, glycogen phosphorylase, the enzyme catalyzing 
the breakdown of glycogen, a storage form of sugar, is activated by the 
phosphorylation of a particular serine residue when glucose is scarce 
(Section 21.1). 

Hormones coordinate metabolic relations between different tissues, often 
by regulating the reversible modification of key enzymes. For instance, the 
hormone epinephrine triggers a signal-transduction cascade in muscle, 
resulting in the phosphorylation and activation of key enzymes and leading 
to the rapid degradation of glycogen to glucose, which is then used to supply 
ATP for muscle contraction. As described in Chapter 14, many hormones 
act through intracellular messengers, such as cyclic AMP and calcium ion, 
that coordinate the activities of many target proteins. 

Many reactions in metabolism are controlled by the energy status of the 
cell. One index of the energy status is the energy charge, which is propor- 
tional to the mole fraction of ATP plus half the mole fraction of ADP, given 
that ATP contains two anhydride bonds, whereas ADP contains one. 
Hence, the energy charge is defined as 


$$\text{Energy charge} = \frac{[\text{ATP}] + \tfrac{1}{2}[\text{ADP}]}{[\text{ATP}] + [\text{ADP}] + [\text{AMP}]}$$
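The definition translates directly into a small function. The Python sketch below is illustrative only: the function name is a hypothetical choice, and the sample concentrations are invented for the example rather than taken from the text.

```python
def energy_charge(atp: float, adp: float, amp: float) -> float:
    """Energy charge = ([ATP] + 0.5*[ADP]) / ([ATP] + [ADP] + [AMP]).

    Concentrations may be in any unit, provided all three share it.
    """
    total = atp + adp + amp
    if total <= 0:
        raise ValueError("total adenylate concentration must be positive")
    return (atp + 0.5 * adp) / total


# Hypothetical adenylate concentrations (mM), chosen only for illustration.
example = energy_charge(atp=4.0, adp=0.5, amp=0.05)

if __name__ == "__main__":
    print(f"all ATP: {energy_charge(1, 0, 0)}")
    print(f"all ADP: {energy_charge(0, 1, 0)}")
    print(f"all AMP: {energy_charge(0, 0, 1)}")
    print(f"example: {example:.2f}")
```

The limiting cases show why ADP is weighted by one-half: a pool made up entirely of ADP has an energy charge of 0.5, halfway between the all-AMP and all-ATP extremes.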


The energy charge can have a value ranging from 0 (all AMP) to 1 (all 
ATP). Daniel Atkinson showed that ATP-generating (catabolic) pathways 
are inhibited by a high energy charge, whereas ATP-utilizing (anabolic) path- 
ways are stimulated by a high energy charge. In plots of the reaction rates of 
such pathways versus the energy charge, the curves are steep near an energy 
charge of 0.9, where they usually intersect (Figure 15.19). It is evident that
the control of these pathways has evolved to maintain the energy charge
within rather narrow limits. In other words, the energy charge, like the pH of
a cell, is buffered. The energy charge of most cells ranges from 0.80 to 0.95.

Figure 15.19 Energy charge regulates metabolism. High concentrations of ATP
inhibit the relative rates of a typical ATP-generating (catabolic) pathway and
stimulate the typical ATP-utilizing (anabolic) pathway. [Plot of relative rate
versus energy charge (0 to 1) for an ATP-generating pathway and an
ATP-utilizing pathway.]

An alternative index of the energy status is the phosphorylation potential, 
which is defined as 


\[
\text{Phosphorylation potential} = \frac{[\mathrm{ATP}]}{[\mathrm{ADP}][\mathrm{P_i}]}
\]


The phosphorylation potential, in contrast with the energy charge, depends 
on the concentration of Pᵢ and is directly related to the free-energy storage
available from ATP. 
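
As a rough illustration of the energy-charge formula defined earlier in this section, the following Python sketch evaluates it for the two limiting cases named in the text and for one set of adenylate concentrations. The function name and the sample concentrations are hypothetical, chosen only so that the result falls in the range quoted for most cells.

```python
def energy_charge(atp: float, adp: float, amp: float) -> float:
    """Return ([ATP] + 0.5*[ADP]) / ([ATP] + [ADP] + [AMP]).

    Any concentration unit works as long as all three values share it.
    """
    total = atp + adp + amp
    if total <= 0:
        raise ValueError("total adenylate concentration must be positive")
    return (atp + 0.5 * adp) / total


# Limiting cases stated in the text: all AMP gives 0, all ATP gives 1.
all_amp = energy_charge(0.0, 0.0, 1.0)
all_atp = energy_charge(1.0, 0.0, 0.0)

# Hypothetical concentrations in mM (illustrative values, not measured data).
sample = energy_charge(atp=3.0, adp=0.8, amp=0.1)

print(f"all AMP: {all_amp}, all ATP: {all_atp}, sample: {sample:.2f}")
```

Because ADP is weighted by one-half, a pool consisting only of ADP would give an energy charge of 0.5, midway between the two extremes.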


Controlling the accessibility of substrates. In eukaryotes, metabolic regu- 
lation and flexibility are enhanced by compartmentalization. For example, 
fatty acid oxidation takes place in mitochondria, whereas fatty acid synthe- 
sis takes place in the cytoplasm. Compartmentalization segregates opposed 
reactions. 

Controlling the flux of substrates is another means of regulating metabo- 
lism. Glucose breakdown can take place in many cells only if insulin is pres- 
ent to promote the entry of glucose into the cell. The transfer of substrates 
from one compartment of a cell to another (e.g., from the cytoplasm to mito- 
chondria) can serve as a control point. 


Aspects of metabolism may have evolved from an RNA world 


How did the complex pathways that constitute metabolism evolve?
The current thinking is that RNA was an early biomolecule and that,
in an early RNA world, RNA served as catalysts and information-storage 
molecules. 

Why do activated carriers such as ATP, NADH, FADH₂, and coenzyme A
contain adenosine diphosphate units (Figure 15.20)? A possible
explanation is that these molecules evolved from the early RNA catalysts. 
Non-RNA units such as the isoalloxazine ring may have been recruited to 
serve as efficient carriers of activated electrons and chemical units, a func- 
tion not readily performed by RNA itself. We can picture the adenine ring 
of FADH₂ binding to a uracil unit in a niche of an RNA enzyme (ribozyme)
by base-pairing, whereas the isoalloxazine ring protrudes and functions as 
an electron carrier. When the more versatile proteins replaced RNA as the 
major catalysts, the ribonucleotide coenzymes stayed essentially unchanged 
because they were already well suited to their metabolic roles. The nicotin- 
amide unit of NADH, for example, can readily transfer electrons irrespec- 
tive of whether the adenine unit interacts with a base in an RNA enzyme or 
with amino acid residues in a protein enzyme. With the advent of protein 
enzymes, these important cofactors evolved as free molecules without losing 
the adenosine diphosphate vestige of their RNA-world ancestry. That mol- 
ecules and motifs of metabolism are common to all forms of life testifies to 
their common origin and to the retention of functioning modules through 
billions of years of evolution. Our understanding of metabolism, like that of 
other biological processes, is enriched by inquiry into how these beautifully 
integrated patterns of reactions came into being. 


Summary 


All cells transform energy. They extract energy from their environ- 
ment and use this energy to convert simple molecules into cellular 
components. 


Figure 15.20 Adenosine diphosphate 
(ADP) is an ancient module in 
metabolism. This fundamental building block 
is present in key molecules such as ATP, 
NADH, FAD, and coenzyme A. The adenine 
unit is shown in blue, the ribose unit in red, 
and the diphosphate unit in yellow. 






15.1 Metabolism Is Composed of Many Coupled, Interconnecting
Reactions

The process of energy transduction takes place through metabolism, a
highly integrated network of chemical reactions. Metabolism can be
subdivided into catabolism (reactions employed to extract energy
from fuels) and anabolism (reactions that use this energy for
biosynthesis). The most valuable thermodynamic concept for understanding
bioenergetics is free energy. A reaction can occur spontaneously only
if the change in free energy (ΔG) is negative. A thermodynamically
unfavorable reaction can be driven by a thermodynamically favorable
one, which is the hydrolysis of ATP in many cases.


15.2 ATP Is the Universal Currency of Free Energy in Biological Systems

The energy derived from catabolism is transformed into adenosine
triphosphate. ATP hydrolysis is exergonic and the energy released can
be used to power cellular processes, including motion, active transport,
and biosynthesis. Under cellular conditions, the hydrolysis of ATP
shifts the equilibrium of a coupled reaction by a factor of 10⁸. ATP, the
universal currency of energy in biological systems, is an energy-rich
molecule because it contains two phosphoanhydride bonds.


15.3 The Oxidation of Carbon Fuels Is an Important Source of
Cellular Energy

ATP formation is coupled to the oxidation of carbon fuels, either
directly or through the formation of ion gradients. Photosynthetic
organisms can use light to generate such gradients. ATP is consumed
in muscle contraction and other motions of cells, in active transport, in
signal-transduction processes, and in biosyntheses. The extraction of
energy from foodstuffs by aerobic organisms comprises three stages.
In the first stage, large molecules are broken down into smaller ones,
such as amino acids, sugars, and fatty acids. In the second stage, these
small molecules are degraded to a few simple units that have pervasive
roles in metabolism. One of them is the acetyl unit of acetyl CoA, a
carrier of activated acyl groups. The third stage of metabolism is the
citric acid cycle and oxidative phosphorylation, in which ATP is
generated as electrons flow to O₂, the ultimate electron acceptor, and fuels
are completely oxidized to CO₂.


15.4 Metabolic Pathways Contain Many Recurring Motifs

Metabolism is characterized by common motifs. A small number of 
recurring activated carriers, such as ATP, NADH, and acetyl CoA, 
transfer activated groups in many metabolic pathways. NADPH, 
which carries two electrons at a high potential, provides reducing power 
in the biosynthesis of cell components from more-oxidized precursors. 
Many activated carriers are derived from vitamins, small organic mol- 
ecules required in the diets of many higher organisms. Moreover, key 
reaction types are used repeatedly in metabolic pathways. 

Metabolism is regulated in a variety of ways. The amounts of some 
critical enzymes are controlled by regulation of the rate of synthesis 
and degradation. In addition, the catalytic activities of many enzymes 
are regulated by allosteric interactions (as in feedback inhibition) and 
by covalent modification. The movement of many substrates into cells 
and subcellular compartments also is controlled. The energy charge, 
which depends on the relative amounts of ATP, ADP, and AMP, 
plays a role in metabolic regulation. A high energy charge inhibits 
ATP-generating (catabolic) pathways, whereas it stimulates ATP- 
utilizing (anabolic) pathways. 


Key Terms


metabolism or intermediary metabolism (p. 427)
phototroph (p. 428)
chemotroph (p. 428)
catabolism (p. 428)
anabolism (p. 428)
amphibolic pathway (p. 429)
adenosine triphosphate (ATP) (p. 430)
phosphoryl-transfer potential (p. 433)
oxidative phosphorylation (p. 437)
activated carrier (p. 438)
vitamin (p. 441)
oxidation-reduction reaction (p. 443)
ligation reaction (p. 443)
isomerization reaction (p. 444)
group-transfer reaction (p. 444)
hydrolytic reaction (p. 444)
addition to or formation of double-bond reaction (p. 445)
lyase (p. 445)
energy charge (p. 446)
phosphorylation potential (p. 447)


Problems


1. Complex patterns. What is meant by intermediary
metabolism?


2. Opposites. Differentiate between anabolism and
catabolism.


3. Why bother to eat? What are the three primary uses for
cellular energy?


4. Match ’em.

1. Cellular energy currency                     a. NAD⁺
2. Anabolic electron carrier                    b. Coenzyme A
3. Phototroph                                   c. Precursor to coenzymes
4. Catabolic electron carrier                   d. Yields energy
5. Oxidation-reduction reaction                 e. Requires energy
6. Activated carrier of two-carbon fragments    f. ATP
7. Vitamin                                      g. Transfers electrons
8. Anabolism                                    h. NADP⁺
9. Amphibolic reaction                          i. Converts light energy to chemical energy
10. Catabolism                                  j. Used in anabolism and catabolism


5. Energy to burn. What factors account for the high
phosphoryl-transfer potential of nucleoside triphosphates?


6. Back in time. Account for the fact that ATP, and not
another nucleoside triphosphate, is the cellular energy
currency.


7. Currency issues. Why does it make good sense to have
a single nucleotide, ATP, function as the cellular energy
currency?


8. Environmental conditions. The standard free energy of
hydrolysis for ATP is −30.5 kJ mol⁻¹ (−7.3 kcal mol⁻¹).


ATP + H₂O ⇌ ADP + Pᵢ


What conditions might be changed to alter the free energy
of hydrolysis?


9. Brute force? Metabolic pathways frequently contain 
reactions with positive standard free energy values, yet the 
reactions still take place. How is this possible? 


10. Energy flow. What is the direction of each of the
following reactions when the reactants are initially present in
equimolar amounts? Use the data given in Table 15.1.

(a) ATP + creatine ⇌ creatine phosphate + ADP
(b) ATP + glycerol ⇌ glycerol 3-phosphate + ADP
(c) ATP + pyruvate ⇌ phosphoenolpyruvate + ADP
(d) ATP + glucose ⇌ glucose 6-phosphate + ADP


11. A proper inference. What information do the ΔG°′ data
given in Table 15.1 provide about the relative rates of
hydrolysis of pyrophosphate and acetyl phosphate?


12. A potent donor. Consider the following reaction: 


ATP + pyruvate ⇌ phosphoenolpyruvate + ADP


(a) Calculate ΔG°′ and K′eq at 25°C for this reaction by
using the data given in Table 15.1.


(b) What is the equilibrium ratio of pyruvate to
phosphoenolpyruvate if the ratio of ATP to ADP is 10?


13. Isomeric equilibrium. Calculate ΔG°′ for the isomerization
of glucose 6-phosphate to glucose 1-phosphate. What
is the equilibrium ratio of glucose 6-phosphate to glucose
1-phosphate at 25°C?


14. Activated acetate. The formation of acetyl CoA from 
acetate is an ATP-driven reaction: 


Acetate + ATP + CoA ⇌ acetyl CoA + AMP + PPᵢ


(a) Calculate ΔG°′ for this reaction by using data given in
this chapter.


(b) The PPᵢ formed in the preceding reaction is rapidly
hydrolyzed in vivo because of the ubiquity of inorganic
pyrophosphatase. The ΔG°′ for the hydrolysis of PPᵢ is
−19.2 kJ mol⁻¹ (−4.6 kcal mol⁻¹). Calculate the ΔG°′ for
the overall reaction, including pyrophosphate hydrolysis.
What effect does the hydrolysis of PPᵢ have on the formation
of acetyl CoA?


15. Acid strength. The pK of an acid is a measure of its
proton-group-transfer potential.

(a) Derive a relation between ΔG°′ and pK.

(b) What is the ΔG°′ for the ionization of acetic acid, which
has a pK of 4.8?

16. Raison d’être. The muscles of some invertebrates are
rich in arginine phosphate (phosphoarginine). Propose a
function for this amino acid derivative.

[Structure of arginine phosphate not reproduced.]


17. Recurring motif. What is the structural feature common
to ATP, FAD, NAD⁺, and CoA?


18. Ergogenic help or hindrance? Creatine is a popular, but 
untested, dietary supplement. 


(a) What is the biochemical rationale for the use of 
creatine? 


(b) What type of exercise would most benefit from creatine 
supplementation? 


19. Standard conditions versus real life 1. The enzyme
aldolase catalyzes the following reaction in the glycolytic
pathway:

Fructose 1,6-bisphosphate ⇌ dihydroxyacetone phosphate +
glyceraldehyde 3-phosphate (catalyzed by aldolase)

The ΔG°′ for the reaction is +23.8 kJ mol⁻¹ (+5.7 kcal
mol⁻¹), whereas the ΔG in the cell is −1.3 kJ mol⁻¹ (−0.3
kcal mol⁻¹). Calculate the ratio of reactants to products
under equilibrium and intracellular conditions. Using your
results, explain how the reaction can be endergonic under
standard conditions and exergonic under intracellular
conditions.


20. Standard conditions versus real life 2. On page 430, we
showed that a reaction, A ⇌ B, with a ΔG°′ = +16.7 kJ
mol⁻¹ (+4.0 kcal mol⁻¹) has a K′eq of 1.15 × 10⁻³. The
K′eq is increased to 2.67 × 10² if the reaction is coupled to
ATP hydrolysis under standard conditions. The ATP-generating
system of cells maintains the [ATP]/[ADP][Pᵢ]
ratio at a high level, typically of the order of 500 M⁻¹.
Calculate the ratio of B/A under cellular conditions.


21. Not all alike. The concentrations of ATP, ADP, and Pᵢ
differ with cell type. Consequently, the release of free
energy with the hydrolysis of ATP will vary with cell type.
Using the following table, calculate the ΔG for the hydrolysis
of ATP in liver, muscle, and brain cells. In which cell
type is the free energy of ATP hydrolysis most negative?


          ATP (mM)   ADP (mM)   Pᵢ (mM)
Liver     3.5        1.8        5.0
Muscle    8.0        0.9        8.0
Brain     2.6        0.7        2


22. Oxidation issues. Examine the pairs of molecules and
identify the more-reduced molecule in each pair.

[Structures not reproduced. Molecules labeled in the figure: ethanol,
acetaldehyde, succinate, oxalosuccinate, malate, oxaloacetate, pyruvate
(shown twice), and 2-phosphoglycerate.]


23. Running downhill. Glycolysis is a series of 10 linked 
reactions that convert one molecule of glucose into two mol- 
ecules of pyruvate with the concomitant synthesis of two 
molecules of ATP (Chapter 16). The ΔG°′ for this set of
reactions is −35.6 kJ mol⁻¹ (−8.5 kcal mol⁻¹), whereas the
ΔG is −76.6 kJ mol⁻¹ (−18.3 kcal mol⁻¹). Explain why the
free-energy release is so much greater under intracellular 
conditions than under standard conditions. 


24. Breakdown products. Digestion is the first stage in the 
extraction of energy from food, but no useful energy is 
acquired during this stage. Why is digestion considered a 
stage in energy extraction? 


25. High-energy electrons. What are the activated electron 
carriers for catabolism? For anabolism? 


26. Less reverberation. Thioesters, common in biochemis- 
try, are more unstable (energy-rich) than oxygen esters. 
Explain why this is the case. 


27. Classifying reactions. What are the six common types of 
reactions seen in biochemistry? 


28. Staying in control. What are the three principal means
of controlling metabolic reactions?


Chapter Integration Problems


29. Kinetic vs. thermodynamic. The reaction of NADH with
oxygen to produce NAD⁺ and H₂O is very exergonic, yet
the reaction of NADH and oxygen takes place very slowly.
Why does a thermodynamically favorable reaction not
occur rapidly?


30. Activated sulfate. Fibrinogen contains tyrosine-O- 
sulfate. Propose an activated form of sulfate that could react 
in vivo with the aromatic hydroxyl group of a tyrosine 
residue in a protein to form tyrosine-O-sulfate. 


Data Interpretation Problem 


31. Opposites attract. The following graph shows how the
ΔG for the hydrolysis of ATP varies as a function of the
Mg²⁺ concentration (pMg = −log[Mg²⁺]).

[Graph not reproduced: free energy of ATP hydrolysis plotted against pMg
(horizontal axis, 1 to 7).]

(a) How does decreasing [Mg²⁺] affect the ΔG of hydrolysis
for ATP?


(b) Explain this effect.


Glycolysis and Gluconeogenesis


CHAPTER 16 


Usain Bolt sprints through a world record in the 200-meter 
finals at the Olympics in Beijing in 2008. Glucose 
metabolism can generate the ATP to power muscle 
contraction. During a sprint, when the ATP needs outpace 
oxygen delivery, as would be the case for Bolt, glucose is 
metabolized to lactate. When oxygen delivery is adequate, 
glucose is metabolized more efficiently to carbon dioxide 
and water. [Reix-Liews/For Photo/Corbis.] 


[Figure panels: A. Low O₂ (last seconds of a sprint); B. Normal (long slow
run). Labels: glycolysis, glucose → pyruvate, lactate, ATP, mitochondrion,
cytoplasm, muscle fiber.]


The first metabolic pathway that we encounter is glycolysis, an ancient
pathway employed by a host of organisms. Glycolysis is the sequence of
reactions that metabolizes one molecule of glucose to two molecules of pyruvate
with the concomitant net production of two molecules of ATP. This process is
anaerobic (i.e., it does not require O₂) because it evolved before substantial
amounts of oxygen accumulated in the atmosphere. Pyruvate can be further
processed anaerobically to lactate (lactic acid fermentation) or ethanol
(alcoholic fermentation). Under aerobic conditions, pyruvate can be completely
oxidized to CO₂, generating much more ATP, as will be described in
Chapters 17 and 18. Figure 16.1 shows some possible fates of pyruvate
produced by glycolysis.

Because glucose is such a precious fuel, metabolic products, such as 
pyruvate and lactate, are salvaged to synthesize glucose in the process of 
gluconeogenesis. Although glycolysis and gluconeogenesis have some 
enzymes in common, the two pathways are not simply the reverse of each 
other. In particular, the highly exergonic, irreversible steps of glycolysis are 
bypassed in gluconeogenesis. The two pathways are reciprocally regulated 
so that glycolysis and gluconeogenesis do not take place simultaneously in 
the same cell to a significant extent. 

OUTLINE

16.1 Glycolysis Is an Energy-Conversion Pathway in Many Organisms

16.2 The Glycolytic Pathway Is Tightly Controlled

16.3 Glucose Can Be Synthesized from Noncarbohydrate Precursors

16.4 Gluconeogenesis and Glycolysis Are Reciprocally Regulated


Glycolysis
Derived from the Greek stem glyk-, “sweet,”
and the word lysis, “dissolution.”


Enzyme
A term coined by Friedrich Wilhelm Kühne
in 1878 to designate catalytically active
substances that had formerly been called
ferments. Derived from the Greek words en,
“in,” and zyme, “leaven.”


Figure 16.1 Some fates of glucose. [Diagram labels: glycolysis;
fermentation; complete oxidation (O₂).]


Our understanding of glucose metabolism, especially glycolysis, has a
rich history. Indeed, the development of biochemistry and the delineation of
glycolysis went hand in hand. A key discovery was made by Hans Buchner
and Eduard Buchner in 1897, quite by accident. The Buchners were inter- 
ested in manufacturing cell-free extracts of yeast for possible therapeutic 
use. These extracts had to be preserved without the use of antiseptics such 
as phenol, and so they decided to try sucrose, a commonly used preservative 
in kitchen chemistry. They obtained a startling result: sucrose was rapidly 
fermented into alcohol by the yeast juice. The significance of this finding 
was immense. The Buchners demonstrated for the first time that fermentation 
could take place outside living cells. The accepted view of their day, asserted 
by Louis Pasteur in 1860, was that fermentation is inextricably tied to living 
cells. The chance discovery by the Buchners refuted this dogma and opened 
the door to modern biochemistry. The Buchners’ discovery inspired the 
search for the biochemicals that catalyze the conversion of sucrose into 
alcohol. The study of metabolism became the study of chemistry. 

Studies of muscle extracts then showed that many of the reactions of 
lactic acid fermentation were the same as those of alcoholic fermentation. 
This exciting discovery revealed an underlying unity in biochemistry. The com- 
plete glycolytic pathway was elucidated by 1940, largely through the pio- 
neering contributions of Gustav Embden, Otto Meyerhof, Carl Neuberg, 
Jacob Parnas, Otto Warburg, Gerty Cori, and Carl Cori. Glycolysis is also 
known as the Embden—Meyerhof pathway. 


Glucose is generated from dietary carbohydrates 


We typically consume in our diets a generous amount of starch and a smaller 
amount of glycogen. These complex carbohydrates must be converted into 
simpler carbohydrates for absorption by the intestine and transport in the 
blood. Starch and glycogen are digested primarily by the pancreatic enzyme
α-amylase and to a lesser extent by salivary α-amylase. Amylase cleaves the
α-1,4 bonds of starch and glycogen, but not the α-1,6 bonds. The products
are the di- and trisaccharides maltose and maltotriose. The material not
digestible because of the α-1,6 bonds is called the limit dextrin.

Maltase cleaves maltose into two glucose molecules, whereas α-glucosidase
digests maltotriose and any other oligosaccharides that may have escaped
digestion by the amylase. α-Dextrinase further digests the limit dextrin.
Maltase and α-glucosidase are located on the surface of the intestinal
cells, as is sucrase, an enzyme that degrades the sucrose contributed by 
vegetables to fructose and glucose. The enzyme lactase is responsible for 
degrading the milk sugar lactose into glucose and galactose. The monosac- 
charides are transported into the cells lining the intestine and then into the 
bloodstream. 


Glucose is an important fuel for most organisms 


Glucose is a common and important fuel. In mammals, glucose is the
only fuel that the brain uses under nonstarvation conditions and
the only fuel that red blood cells can use at all. Indeed, almost all organisms 
use glucose, and most that do process it in a similar fashion. Recall from 
Chapter 11 that there are many carbohydrates. Why is glucose instead of 
some other monosaccharide such a prominent fuel? We can speculate on the 
reasons. First, glucose is one of several monosaccharides formed from form- 
aldehyde under prebiotic conditions, and so it may have been available as a 
fuel source for primitive biochemical systems. Second, glucose has a low 
tendency, relative to other monosaccharides, to nonenzymatically glycosy- 
late proteins. In their open-chain forms, monosaccharides contain carbonyl 
groups that can react with the amino groups of proteins to form Schiff 
bases, which rearrange to form a more stable amino-ketone linkage. Such 
nonspecifically modified proteins often do not function effectively. Glucose 
has a strong tendency to exist in the ring conformation and, consequently, 
relatively little tendency to modify proteins. Recall that all the hydroxyl 
groups in the ring conformation of β-glucose are equatorial, contributing to
the sugar’s high relative stability (Section 11.1). 


16.1 Glycolysis Is an Energy-Conversion Pathway 
in Many Organisms 


We now begin our consideration of the glycolytic pathway. This pathway is 
common to virtually all cells, both prokaryotic and eukaryotic. In eukary- 
otic cells, glycolysis takes place in the cytoplasm. This pathway can be 
thought of as comprising two stages (Figure 16.2). Stage 1 is the trapping 
and preparation phase. No ATP is generated in this stage. Stage 1 begins 
with the conversion of glucose into fructose 1,6-bisphosphate, which con- 
sists of three steps: a phosphorylation, an isomerization, and a second phos- 
phorylation reaction. The strategy of these initial steps in glycolysis is to trap 
the glucose in the cell and form a compound that can be readily cleaved into 
phosphorylated three-carbon units. Stage 1 is completed with the cleavage of 
the fructose 1,6-bisphosphate into two three-carbon fragments. These 
resulting three-carbon units are readily interconvertible. In stage 2, ATP is 
harvested when the three-carbon fragments are oxidized to pyruvate. 
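
The two-stage picture can be summarized as simple ATP bookkeeping. The sketch below is only illustrative: the stage 1 figure follows from the two ATP-dependent phosphorylations described for this stage, whereas the stage 2 yield of two ATP per three-carbon unit is an assumption taken from the standard pathway rather than from the paragraph above. All constant and function names are hypothetical.

```python
# ATP bookkeeping per molecule of glucose.
# Stage 1: two phosphorylation steps, each consuming one ATP.
STAGE_1_ATP_CONSUMED = 2
# Stage 1 ends with cleavage into two three-carbon units.
THREE_CARBON_UNITS_PER_GLUCOSE = 2
# Assumption (standard pathway, not stated in the paragraph above):
# stage 2 forms two ATP for each three-carbon unit oxidized to pyruvate.
ATP_FORMED_PER_THREE_CARBON_UNIT = 2


def net_atp_per_glucose() -> int:
    """ATP formed in stage 2 minus ATP invested in stage 1."""
    formed = THREE_CARBON_UNITS_PER_GLUCOSE * ATP_FORMED_PER_THREE_CARBON_UNIT
    return formed - STAGE_1_ATP_CONSUMED


net_atp = net_atp_per_glucose()
print(f"net ATP per glucose: {net_atp}")
```

Under these assumptions the ledger reproduces the net production of two molecules of ATP per glucose quoted at the start of the chapter.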


Hexokinase traps glucose in the cell and begins glycolysis 


Glucose enters cells through specific transport proteins (p. 477) and has one 
principal fate: it is phosphorylated by ATP to form glucose 6-phosphate. This 
step is notable for two reasons: (1) glucose 6-phosphate cannot pass through 
the membrane because it is not a substrate for the glucose transporters, and 
(2) the addition of the phosphoryl group acts to destabilize glucose, thus 
facilitating its further metabolism. The transfer of the phosphoryl group 
from ATP to the hydroxyl group on carbon 6 of glucose is catalyzed by 
hexokinase. 


Glucose + ATP → glucose 6-phosphate (G-6P) + ADP + H⁺
(catalyzed by hexokinase)

First stage of glycolysis. The first stage of glycolysis begins with the
phosphorylation of glucose by hexokinase and ends with the isomerization of
dihydroxyacetone phosphate to glyceraldehyde 3-phosphate.

Figure 16.2 Stages of glycolysis. The glycolytic pathway can be divided into two stages: (1)
glucose is trapped, destabilized, and cleaved into two interconvertible three-carbon molecules
generated by cleavage of six-carbon fructose; and (2) ATP is generated.


Phosphoryl transfer is a fundamental reaction in biochemistry. Kinases 
are enzymes that catalyze the transfer of a phosphoryl group from ATP to 
an acceptor. Hexokinase, then, catalyzes the transfer of a phosphoryl group 
from ATP to a variety of six-carbon sugars (hexoses), such as glucose and 
mannose. Hexokinase, like adenylate kinase (Section 9.4) and all other 
kinases, requires Mg²⁺ (or another divalent metal ion such as Mn²⁺) for activity.
The divalent metal ion forms a complex with ATP. 

X-ray crystallographic studies of yeast hexokinase revealed that the 
binding of glucose induces a large conformational change in the enzyme. 
Hexokinase consists of two lobes, which move toward each other when glu- 
cose is bound (Figure 16.3). On glucose binding, one lobe rotates 12 degrees 
with respect to the other, resulting in movements of the polypeptide
backbone of as much as 8 Å. The cleft between the lobes closes, and the bound
glucose becomes surrounded by protein, except for the hydroxyl group of 
carbon 6, which will accept the phosphoryl group from ATP. The closing of 
the cleft in hexokinase is a striking example of the role of induced fit in 
enzyme action (Section 8.3). 

The glucose-induced structural changes are significant in two respects. 
First, the environment around the glucose becomes more nonpolar, which 
favors reaction between the hydrophilic hydroxyl group of glucose and 
the terminal phosphoryl group of ATP. Second, the conformational changes 
enable the kinase to discriminate against H₂O as a substrate. The closing of
the cleft keeps water molecules away from the active site. If hexokinase were
rigid, a molecule of H₂O occupying the binding site for the -CH₂OH of
glucose could attack the γ phosphoryl group of ATP, forming ADP and Pᵢ.
In other words, a rigid kinase would likely also be an ATPase. It is interesting 
to note that other kinases taking part in glycolysis—phosphofructokinase, 
phosphoglycerate kinase, and pyruvate kinase—also contain clefts between 
lobes that close when substrate is bound, although the structures of these 
enzymes are different in other regards. Substrate-induced cleft closing is a 
general feature of kinases. 


Fructose 1,6-bisphosphate is generated from glucose 6-phosphate 


The next step in glycolysis is the isomerization of glucose 6-phosphate to 
fructose 6-phosphate. Recall that the open-chain form of glucose has an 
aldehyde group at carbon 1, whereas the open-chain form of fructose has a 
keto group at carbon 2. Thus, the isomerization of glucose 6-phosphate to 
fructose 6-phosphate is a conversion of an aldose into a ketose. The reaction 
catalyzed by phosphoglucose isomerase takes several steps because both 
glucose 6-phosphate and fructose 6-phosphate are present primarily in the 
cyclic forms. The enzyme must first open the six-membered ring of glucose 
6-phosphate, catalyze the isomerization, and then promote the formation of 
the five-membered ring of fructose 6-phosphate. 

Figure 16.3 Induced fit in hexokinase. As shown in blue, the two lobes
of hexokinase are separated in the absence of glucose. The conformation of
hexokinase changes markedly on binding glucose, as shown in red. Notice that
two lobes of the enzyme come together and surround the substrate, creating the
necessary environment for catalysis. [Courtesy of Dr. Thomas Steitz.]

A second phosphorylation reaction follows the isomerization step.
Fructose 6-phosphate is phosphorylated at the expense of ATP to fructose
1,6-bisphosphate (F-1,6-BP). The prefix bis- in bisphosphate means that two
separate monophosphoryl groups are present, whereas the prefix di- in 
diphosphate (as in adenosine diphosphate) means that two phosphoryl 
groups are present and are connected by an anhydride bond. 


Fructose 6-phosphate (F-6P) + ATP → fructose 1,6-bisphosphate (F-1,6-BP) + ADP + H⁺
(catalyzed by phosphofructokinase)


This reaction is catalyzed by phosphofructokinase (PFK), an allosteric 
enzyme that sets the pace of glycolysis. As we will learn, this enzyme plays 
a central role in the metabolism of many molecules in all parts of the body. 


The six-carbon sugar is cleaved into two three-carbon fragments 


The newly formed fructose 1,6-bisphosphate is cleaved into glyceraldehyde 
3-phosphate (GAP) and dihydroxyacetone phosphate (DHAP), completing 
stage 1 of glycolysis. The products of the remaining steps in glycolysis 
consist of three-carbon units rather than six-carbon units. 


Fructose 1,6-bisphosphate (F-1,6-BP) ⇌ dihydroxyacetone phosphate (DHAP)
    + glyceraldehyde 3-phosphate (GAP)
(catalyzed by aldolase)


This reaction, which is readily reversible, is catalyzed by aldolase. This 
enzyme derives its name from the nature of the reverse reaction, an aldol 
condensation. 

Glyceraldehyde 3-phosphate is on the direct pathway of glycolysis, 
whereas dihydroxyacetone phosphate is not. Unless a means exists to con- 
vert dihydroxyacetone phosphate into glyceraldehyde 3-phosphate, a three- 
carbon fragment useful for generating ATP will be lost. These compounds 
are isomers that can be readily interconverted: dihydroxyacetone phosphate 
is a ketose, whereas glyceraldehyde 3-phosphate is an aldose. The isomer- 
ization of these three-carbon phosphorylated sugars is catalyzed by triose 
phosphate isomerase (TPI, sometimes abbreviated TIM; Figure 16.4). 


Dihydroxyacetone phosphate ⇌ glyceraldehyde 3-phosphate
(catalyzed by triose phosphate isomerase)


This reaction is rapid and reversible. At equilibrium, 96% of the triose 
phosphate is dihydroxyacetone phosphate. However, the reaction proceeds 
readily from dihydroxyacetone phosphate to glyceraldehyde 3-phosphate 
because the subsequent reactions of glycolysis remove this product. 
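
The 96% figure can be converted into a standard free-energy change. The following is a minimal sketch, assuming a temperature of 25 °C (298.15 K) and the usual relation ΔG°′ = −RT ln K′eq; the function name and the temperature are illustrative assumptions, not values given in the text.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3  # gas constant in kJ mol^-1 K^-1


def standard_free_energy(k_eq: float, temperature_k: float = 298.15) -> float:
    """Return the standard free-energy change (kJ/mol) for an equilibrium constant."""
    return -R_KJ_PER_MOL_K * temperature_k * math.log(k_eq)


# At equilibrium 96% of the triose phosphate is DHAP, so 4% is GAP.
k_eq_dhap_to_gap = 0.04 / 0.96
dg_dhap_to_gap = standard_free_energy(k_eq_dhap_to_gap)
print(f"K'eq = {k_eq_dhap_to_gap:.4f}, dG = {dg_dhap_to_gap:+.1f} kJ/mol")
```

Under these assumptions the result is roughly +8 kJ mol⁻¹ in the direction of glyceraldehyde 3-phosphate, of the same order as the value listed for triose phosphate isomerase in Table 16.1. The positive sign is why the reaction depends on the removal of product by later steps.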

We now see the significance of the isomerization of glucose 6-phosphate
to fructose 6-phosphate and its subsequent phosphorylation to form 
fructose 1,6-bisphosphate. Had the aldol cleavage taken place in the aldose 
glucose, a two-carbon and a four-carbon fragment would have resulted. 
Two different metabolic pathways, one to process the two-carbon fragment 
and one for the four-carbon fragment, would have been required to extract 
energy. Isomerization to the ketose fructose followed by aldol cleavage 
yields two phosphorylated interconvertible three-carbon fragments that
will be oxidized in the later steps of glycolysis to capture energy in the form 
of ATP. 


Mechanism: Triose phosphate isomerase salvages a 
three-carbon fragment 


Much is known about the catalytic mechanism of triose phosphate isomerase. 
TPI catalyzes the transfer of a hydrogen atom from carbon 1 to carbon 2, an 
intramolecular oxidation-reduction. This isomerization of a ketose into an
aldose proceeds through an enediol intermediate (Figure 16.5). 

X-ray crystallographic and other studies showed that glutamate 165 
plays the role of a general acid-base catalyst: it abstracts a proton (H⁺) from
carbon 1 and then donates it to carbon 2. However, the carboxylate group of 
glutamate 165 by itself is not basic enough to pull a proton away from a 
carbon atom adjacent to a carbonyl group. Histidine 95 assists catalysis by 
donating a proton to stabilize the negative charge that develops on the C-2 
carbonyl group. 

Figure 16.4 Structure of triose phosphate isomerase. This enzyme consists
of a central core of eight parallel β strands (orange) surrounded by eight
α helices (blue). This structural motif, called an αβ barrel, is also found
in the glycolytic enzymes aldolase, enolase, and pyruvate kinase. Notice
that histidine 95 and glutamate 165, essential components of the active
site of triose phosphate isomerase, are located in the barrel. A loop (red)
closes off the active site on substrate binding. [Drawn from 2YPI.pdb.]

Figure 16.5 Catalytic mechanism of triose phosphate isomerase.
(1) Glutamate 165 acts as a general base by abstracting a proton (H⁺) from
carbon 1. Histidine 95, acting as a general acid, donates a proton to the
oxygen atom bonded to carbon 2, forming the enediol intermediate.
(2) Glutamic acid, now acting as a general acid, donates a proton to C-2
while histidine removes a proton from the OH group of C-1. (3) The product
is formed, and glutamate and histidine are returned to their ionized and
neutral forms, respectively.

Two features of this enzyme are noteworthy. First, TPI displays
great catalytic prowess. It accelerates isomerization by a factor of 10¹⁰
compared with the rate obtained with a simple base catalyst such as acetate
ion. Indeed, the kcat/KM ratio for the isomerization of glyceraldehyde
3-phosphate is 2 × 10⁸ M⁻¹ s⁻¹, which is close to the diffusion-controlled
limit. In other words, catalysis takes place every time that enzyme and
substrate meet. The diffusion-controlled encounter of substrate and enzyme
is thus the rate-limiting step in catalysis. TPI is an example of a kinetically
perfect enzyme (Section 8.4). Second, TPI suppresses an undesired side
reaction, the decomposition of the enediol intermediate into methyl glyoxal
and orthophosphate.

Enediol intermediate → methyl glyoxal + Pᵢ

In solution, this physiologically useless reaction is 100 times as fast as 
isomerization. Moreover, methyl glyoxal is a highly reactive compound that 
can modify the structure and function of a variety of biomolecules, including
proteins and DNA. The reaction of methyl glyoxal with a biomolecule is an
example of the deleterious reactions that form advanced glycation end products
(AGEs), discussed earlier (Section 11.1). Hence, TPI must prevent the
enediol from leaving the enzyme. This labile intermediate is trapped in the 
active site by the movement of a loop of 10 residues (see Figure 16.4). This 
loop serves as a lid on the active site, shutting it when the enediol is present 
and reopening it when isomerization is completed. We see here a striking 
example of one means of preventing an undesirable alternative reaction: the 
active site is kept closed until the desirable reaction takes place. 
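
To make the cost of releasing the enediol concrete, the sketch below estimates how much free intermediate would be lost to the side reaction. It is a simplified illustration that assumes two competing first-order routes whose rates differ by the factor of 100 quoted above; the function name is hypothetical.

```python
def fraction_lost_to_side_reaction(relative_rate: float) -> float:
    """Fraction of an intermediate consumed by a side reaction that runs
    `relative_rate` times as fast as the productive reaction."""
    return relative_rate / (1.0 + relative_rate)


# In solution, decomposition is about 100 times as fast as isomerization.
free_enediol_loss = fraction_lost_to_side_reaction(100.0)
print(f"{free_enediol_loss:.1%} of free enediol would decompose")
```

Under this assumption nearly all of the released enediol would end up as methyl glyoxal, which is why closing the loop over the active site matters so much.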

Thus, two molecules of glyceraldehyde 3-phosphate are formed from 
one molecule of fructose 1,6-bisphosphate by the sequential action of aldo- 
lase and triose phosphate isomerase. The economy of metabolism is evident 
in this reaction sequence. The isomerase funnels dihydroxyacetone phos- 
phate into the main glycolytic pathway; a separate set of reactions is not 
needed. 


The oxidation of an aldehyde to an acid powers the formation of a 
compound with high phosphoryl-transfer potential 


The preceding steps in glycolysis have transformed one molecule of glucose
into two molecules of glyceraldehyde 3-phosphate, but no energy has yet
been extracted. On the contrary, thus far, two molecules of ATP have been
invested. We come now to the second stage of glycolysis, a series of steps
that harvest some of the energy contained in glyceraldehyde 3-phosphate as
ATP. The initial reaction in this sequence is the conversion of glyceraldehyde
3-phosphate into 1,3-bisphosphoglycerate (1,3-BPG), a reaction catalyzed by
glyceraldehyde 3-phosphate dehydrogenase.


Glyceraldehyde 3-phosphate (GAP) + NAD⁺ + Pᵢ ⇌ 1,3-bisphosphoglycerate (1,3-BPG) + NADH + H⁺
(catalyzed by glyceraldehyde 3-phosphate dehydrogenase)


1,3-Bisphosphoglycerate is an acyl phosphate, which is a mixed anhydride
of phosphoric acid and a carboxylic acid. Such compounds have a high
phosphoryl-transfer potential; one of the phosphoryl groups of 1,3-BPG is
transferred to ADP in the next step in glycolysis.

The reaction catalyzed by glyceraldehyde 3-phosphate dehydrogenase 
can be viewed as the sum of two processes: the oxidation of the aldehyde to 
a carboxylic acid by NAD⁺ and the joining of the carboxylic acid and
orthophosphate to form the acyl-phosphate product. 


Oxidation:
Glyceraldehyde 3-phosphate + NAD⁺ + H₂O → 3-phosphoglycerate + NADH + H⁺

Acyl-phosphate formation (dehydration):
3-Phosphoglycerate + Pᵢ → 1,3-bisphosphoglycerate + H₂O


The first reaction is thermodynamically quite favorable, with a standard
free-energy change, ΔG°′, of approximately −50 kJ mol⁻¹ (−12 kcal mol⁻¹),
whereas the second reaction is quite unfavorable, with a standard free-energy
change of the same magnitude but the opposite sign. If these two reactions
simply took place in succession, the second reaction would have a very 
large activation energy and thus not take place at a biologically significant 
rate. These two processes must be coupled so that the favorable aldehyde 
oxidation can be used to drive the formation of the acyl phosphate. How 
are these reactions coupled? The key is an intermediate, formed as a result of 
the aldehyde oxidation, that is linked to the enzyme by a thioester bond. 
Thioesters are high-energy compounds found in many biochemical path- 
ways (Section 15.4). This intermediate reacts with orthophosphate to form 
the high-energy compound 1,3-bisphosphoglycerate. 
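
The thermodynamic bookkeeping behind this coupling can be sketched numerically. The example below is illustrative only: it uses the approximate values quoted above (−50 and +50 kJ mol⁻¹), assumes 25 °C, and relies on the additivity of standard free-energy changes for sequential reactions; the variable and function names are assumptions.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3  # gas constant in kJ mol^-1 K^-1
T_K = 298.15               # assumed temperature (25 degrees C)


def equilibrium_constant(dg_standard_kj: float) -> float:
    """Equilibrium constant implied by a standard free-energy change (kJ/mol)."""
    return math.exp(-dg_standard_kj / (R_KJ_PER_MOL_K * T_K))


dg_oxidation = -50.0       # aldehyde -> carboxylic acid (approximate, from the text)
dg_acyl_phosphate = +50.0  # acid + Pi -> acyl phosphate (approximate, from the text)

# Free-energy changes of sequential reactions add.
dg_overall = dg_oxidation + dg_acyl_phosphate
k_acyl_alone = equilibrium_constant(dg_acyl_phosphate)

print(f"overall dG = {dg_overall:+.1f} kJ/mol")
print(f"K for acyl-phosphate formation alone = {k_acyl_alone:.1e}")
```

The sum is close to zero, so the overall conversion is thermodynamically feasible, whereas acyl-phosphate formation taken on its own has an equilibrium constant on the order of 10⁻⁹. The calculation says nothing about rates; the kinetic barrier is what the thioester intermediate addresses.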

The thioester intermediate is higher in free energy than the free carboxylic 
acid is. The favorable oxidation and unfavorable phosphorylation reactions 
are coupled by the thioester intermediate, which preserves much of the free 
energy released in the oxidation reaction. We see here the use of a covalent 
enzyme-bound intermediate as a mechanism of energy coupling. A free-energy 
profile of the glyceraldehyde 3-phosphate dehydrogenase reaction, compared
with a hypothetical process in which the reaction proceeds without this
intermediate, reveals how this intermediate allows a favorable process to
drive an unfavorable one (Figure 16.6).

Figure 16.6 Free-energy profiles for glyceraldehyde oxidation followed by
acyl-phosphate formation. (A) A hypothetical case with no coupling between
the two processes. The second step must have a large activation barrier,
making the reaction very slow. (B) The actual case with the two reactions
coupled through a thioester intermediate.

Figure 16.7 Structure of glyceraldehyde 3-phosphate dehydrogenase. Notice
that the active site includes a cysteine residue and a histidine residue
adjacent to a bound NAD⁺ molecule. The sulfur atom of cysteine will link
with the substrate to form a transitory thioester intermediate. [Drawn from
1GAD.pdb.]


Mechanism: Phosphorylation is coupled to the oxidation of 
glyceraldehyde 3-phosphate by a thioester intermediate 


The active site of glyceraldehyde 3-phosphate dehydrogenase includes
a reactive cysteine residue, as well as NAD⁺ and a crucial histidine
(Figure 16.7). Let us consider in detail how these components cooperate in
the reaction mechanism (Figure 16.8). In step 1, the aldehyde substrate
reacts with the sulfhydryl group of cysteine 149 on the enzyme to form a
hemithioacetal. Step 2 is the transfer of a hydride ion to a molecule of NAD⁺
that is tightly bound to the enzyme and is adjacent to the cysteine residue.
This reaction is favored by the deprotonation of the hemithioacetal by
histidine 176. The products of this reaction are the reduced coenzyme NADH
and a thioester intermediate. This thioester intermediate has a free energy
close to that of the reactants (see Figure 16.6). In step 3, the NADH formed
from the aldehyde oxidation leaves the enzyme and is replaced by a second
molecule of NAD⁺. This step is important because the positive charge on
NAD⁺ polarizes the thioester intermediate to facilitate the attack by
orthophosphate. In step 4, orthophosphate attacks the thioester to form
1,3-BPG and free the cysteine residue. This example illustrates the essence
of energy transformations and of metabolism itself: energy released by
carbon oxidation is converted into high phosphoryl-transfer potential.

Figure 16.8 Catalytic mechanism of glyceraldehyde 3-phosphate
dehydrogenase. The reaction proceeds through a thioester intermediate,
which allows the oxidation of glyceraldehyde to be coupled to the
phosphorylation of 3-phosphoglycerate. (1) Cysteine reacts with the
aldehyde group of the substrate, forming a hemithioacetal. (2) An
oxidation takes place with the transfer of a hydride ion to NAD⁺, forming a
thioester. This reaction is facilitated by the transfer of a proton to histidine.
(3) The reduced NADH is exchanged for an NAD⁺ molecule. (4)
Orthophosphate attacks the thioester, forming the product 1,3-BPG.


ATP is formed by phosphoryl transfer from 1,3-bisphosphoglycerate 


1,3-Bisphosphoglycerate is an energy-rich molecule with a greater
phosphoryl-transfer potential than that of ATP (Section 15.2). Thus, 1,3-BPG
can be used to power the synthesis of ATP from ADP. Phosphoglycerate
kinase catalyzes the transfer of the phosphoryl group from the acyl
phosphate of 1,3-bisphosphoglycerate to ADP. ATP and 3-phosphoglycerate
are the products.

The formation of ATP in this manner is referred to as substrate-level
phosphorylation because the phosphate donor, 1,3-BPG, is a substrate with 
high phosphoryl-transfer potential. We will contrast this manner of ATP 
formation with the formation of ATP from ionic gradients in Chapters 18 
and 19. 

Thus, the outcomes of the reactions catalyzed by glyceraldehyde 
3-phosphate dehydrogenase and phosphoglycerate kinase are as follows: 


1. Glyceraldehyde 3-phosphate, an aldehyde, is oxidized to
3-phosphoglycerate, a carboxylic acid.

2. NAD⁺ is concomitantly reduced to NADH.

3. ATP is formed from Pᵢ and ADP at the expense of carbon-oxidation
energy.


In essence, the energy released during the oxidation of glyceraldehyde 
3-phosphate to 3-phosphoglycerate is temporarily trapped as 1,3-bisphos- 
phoglycerate. This energy powers the transfer of a phosphoryl group from 
1,3-bisphosphoglycerate to ADP to yield ATP. Keep in mind that, because 
of the actions of aldolase and triose phosphate isomerase, two molecules of 
glyceraldehyde 3-phosphate were formed and hence two molecules of ATP 
were generated. These ATP molecules make up for the two molecules of 
ATP consumed in the first stage of glycolysis. 


Additional ATP is generated with the formation of pyruvate 


In the remaining steps of glycolysis, 3-phosphoglycerate is converted into 
pyruvate, and a second molecule of ATP is formed from ADP.

The first reaction is a rearrangement. The position of the phosphoryl group
shifts in the conversion of 3-phosphoglycerate into 2-phosphoglycerate, a reac- 
tion catalyzed by phosphoglycerate mutase. In general, a mutase is an enzyme 
that catalyzes the intramolecular shift of a chemical group, such as a 
phosphoryl group. The phosphoglycerate mutase reaction has an interest- 
ing mechanism: the phosphoryl group is not simply moved from one carbon 
atom to another. This enzyme requires catalytic amounts of 2,3-bisphos- 
phoglycerate (2,3-BPG) to maintain an active-site histidine residue in a 
phosphorylated form. This phosphoryl group is transferred to 3-phospho- 
glycerate to re-form 2,3-bisphosphoglycerate. 


Enz-His-phosphate + 3-phosphoglycerate ⇌ Enz-His + 2,3-bisphosphoglycerate

The mutase then functions as a phosphatase: it converts
2,3-bisphosphoglycerate into 2-phosphoglycerate. The mutase retains the
phosphoryl group to regenerate the modified histidine.

Enz-His + 2,3-bisphosphoglycerate ⇌ Enz-His-phosphate + 2-phosphoglycerate

The sum of these reactions yields the mutase reaction:

3-Phosphoglycerate ⇌ 2-phosphoglycerate


In the next reaction, the dehydration of 2-phosphoglycerate introduces 
a double bond, creating an enol. Enolase catalyzes this formation of the enol 
phosphate phosphoenolpyruvate (PEP). This dehydration markedly elevates 
the transfer potential of the phosphoryl group. An enol phosphate has a 
high phosphoryl-transfer potential, whereas the phosphate ester of an 
ordinary alcohol, such as 2-phosphoglycerate, has a low one. The ΔG°′ of
the hydrolysis of a phosphate ester of an ordinary alcohol is −13 kJ mol⁻¹
(−3 kcal mol⁻¹), whereas that of phosphoenolpyruvate is −62 kJ mol⁻¹
(−15 kcal mol⁻¹).
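
A short calculation shows what these hydrolysis values imply for phosphoryl transfer to ADP. This is a sketch under stated assumptions: the two values above come from the text, whereas the ΔG°′ of ATP hydrolysis (−30.5 kJ mol⁻¹) is the commonly quoted textbook figure and is not given in this section; the dictionary and function names are hypothetical.

```python
# Standard free energies of hydrolysis in kJ/mol.
DG_HYDROLYSIS_KJ = {
    "phosphoenolpyruvate": -62.0,               # from the text
    "ordinary alcohol phosphate ester": -13.0,  # from the text
    "ATP": -30.5,                               # assumed textbook value
}


def transfer_to_adp(donor: str) -> float:
    """Standard free-energy change for donor-P + ADP -> donor + ATP.

    Obtained by subtracting ATP hydrolysis from donor hydrolysis.
    """
    return DG_HYDROLYSIS_KJ[donor] - DG_HYDROLYSIS_KJ["ATP"]


for donor in ("phosphoenolpyruvate", "ordinary alcohol phosphate ester"):
    print(f"{donor}: {transfer_to_adp(donor):+.1f} kJ/mol")
```

With these inputs, transfer from phosphoenolpyruvate is strongly favorable (about −31.5 kJ mol⁻¹), whereas transfer from an ordinary phosphate ester is unfavorable (about +17.5 kJ mol⁻¹). Only the enol phosphate can drive ATP synthesis.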

Why does phosphoenolpyruvate have such a high phosphoryl-transfer
potential? The phosphoryl group traps the molecule in its unstable enol
form. When the phosphoryl group has been donated to ADP to form ATP, the
enol undergoes a conversion into the more stable ketone, namely pyruvate.

Phosphoenolpyruvate → pyruvate (enol form) → pyruvate

Thus, the high phosphoryl-transfer potential of phosphoenolpyruvate arises
primarily from the large driving force of the subsequent enol-ketone conversion.
Hence, pyruvate is formed, and ATP is generated concomitantly. The 
virtually irreversible transfer of a phosphoryl group from phosphoenol- 
pyruvate to ADP is catalyzed by pyruvate kinase. What is the energy source 
for the formation of phosphoenolpyruvate? The answer to this question 
becomes clear when we compare the structures of 2-phosphoglycerate and 
pyruvate. The formation of pyruvate from 2-phosphoglycerate is, in 
essence, an internal oxidation-reduction reaction; carbon 3 takes electrons
from carbon 2 in the conversion of 2-phosphoglycerate into pyruvate. 
Compared with 2-phosphoglycerate, C-3 is more reduced in pyruvate, 
whereas C-2 is more oxidized. Once again, carbon oxidation powers the 
synthesis of a compound with high phosphoryl-transfer potential, phospho- 
enolpyruvate here and 1,3-bisphosphoglycerate earlier, which allows the 
synthesis of ATP. 

Because the molecules of ATP used in forming fructose 1,6-bisphosphate 
have already been regenerated, the two molecules of ATP generated from 
phosphoenolpyruvate are “profit.” 
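
The ATP accounting described so far can be written as a running ledger. The sketch below is a minimal illustration; the step labels follow the enzymes named in this chapter, and the data structure and function name are assumptions made for the example.

```python
# ATP change per molecule of glucose at each ATP-linked step.
ATP_LEDGER = [
    ("hexokinase", -1),
    ("phosphofructokinase", -1),
    ("phosphoglycerate kinase (2 trioses)", +2),
    ("pyruvate kinase (2 trioses)", +2),
]


def running_balance(ledger):
    """Return (step, cumulative ATP balance) after each entry."""
    balance = 0
    history = []
    for step, delta in ledger:
        balance += delta
        history.append((step, balance))
    return history


atp_history = running_balance(ATP_LEDGER)
for step, balance in atp_history:
    print(f"{step:<40} {balance:+d}")
```

The balance returns to zero after the phosphoglycerate kinase step and ends at +2 after pyruvate kinase, which is the "profit" referred to above.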


Two ATP molecules are formed in the conversion of 
glucose into pyruvate 


The net reaction in the transformation of glucose into pyruvate is 


Glucose + 2 Pᵢ + 2 ADP + 2 NAD⁺ →
2 pyruvate + 2 ATP + 2 NADH + 2 H⁺ + 2 H₂O


Thus, two molecules of ATP are generated in the conversion of glucose into 
two molecules of pyruvate. The reactions of glycolysis are summarized in 
Table 16.1. 
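
The net reaction can be checked by summing the ten steps of Table 16.1, counting steps 6 through 10 twice because each glucose yields two triose phosphates. The following sketch is illustrative; the abbreviations for the intermediates and the helper name are assumptions introduced for the example.

```python
from collections import Counter

# (reactants, products, times per glucose), following Table 16.1.
STEPS = [
    ({"glucose": 1, "ATP": 1}, {"G6P": 1, "ADP": 1, "H+": 1}, 1),
    ({"G6P": 1}, {"F6P": 1}, 1),
    ({"F6P": 1, "ATP": 1}, {"FBP": 1, "ADP": 1, "H+": 1}, 1),
    ({"FBP": 1}, {"DHAP": 1, "GAP": 1}, 1),
    ({"DHAP": 1}, {"GAP": 1}, 1),
    ({"GAP": 1, "Pi": 1, "NAD+": 1}, {"BPG": 1, "NADH": 1, "H+": 1}, 2),
    ({"BPG": 1, "ADP": 1}, {"3PG": 1, "ATP": 1}, 2),
    ({"3PG": 1}, {"2PG": 1}, 2),
    ({"2PG": 1}, {"PEP": 1, "H2O": 1}, 2),
    ({"PEP": 1, "ADP": 1, "H+": 1}, {"pyruvate": 1, "ATP": 1}, 2),
]


def net_reaction(steps):
    """Sum the steps; negative counts are consumed, positive are produced."""
    net = Counter()
    for reactants, products, times in steps:
        for species, n in reactants.items():
            net[species] -= n * times
        for species, n in products.items():
            net[species] += n * times
    # Intermediates cancel and are dropped.
    return {species: n for species, n in net.items() if n != 0}


glycolysis_net = net_reaction(STEPS)
print(glycolysis_net)
```

All intermediates cancel, leaving one glucose, two Pᵢ, two ADP, and two NAD⁺ consumed and two each of pyruvate, ATP, NADH, H⁺, and H₂O produced, in agreement with the net equation above.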

The energy released in the anaerobic conversion of glucose into two
molecules of pyruvate is about −96 kJ mol⁻¹ (−23 kcal mol⁻¹). We shall see
in Chapters 17 and 18 that much more energy can be released from glucose
in the presence of oxygen.

Table 16.1 Reactions of glycolysis

Step  Reaction
1     Glucose + ATP → glucose 6-phosphate + ADP + H⁺
2     Glucose 6-phosphate ⇌ fructose 6-phosphate
3     Fructose 6-phosphate + ATP → fructose 1,6-bisphosphate + ADP + H⁺
4     Fructose 1,6-bisphosphate ⇌ dihydroxyacetone phosphate + glyceraldehyde 3-phosphate
5     Dihydroxyacetone phosphate ⇌ glyceraldehyde 3-phosphate
6     Glyceraldehyde 3-phosphate + Pᵢ + NAD⁺ ⇌ 1,3-bisphosphoglycerate + NADH + H⁺
7     1,3-Bisphosphoglycerate + ADP ⇌ 3-phosphoglycerate + ATP
8     3-Phosphoglycerate ⇌ 2-phosphoglycerate
9     2-Phosphoglycerate ⇌ phosphoenolpyruvate + H₂O
10    Phosphoenolpyruvate + ADP + H⁺ → pyruvate + ATP

Note: ΔG, the actual free-energy change, has been calculated from ΔG°′ and known concentrations of
reactants under typical physiological conditions. Glycolysis can proceed only if the ΔG values of all reactions
are negative. The small positive ΔG values of three of the above reactions indicate that the concentrations
of metabolites in vivo in cells undergoing glycolysis are not precisely known.


NAD⁺ is regenerated from the metabolism of pyruvate


The conversion of glucose into two molecules of pyruvate has resulted in 
the net synthesis of ATP. However, an energy-converting pathway that 
stops at pyruvate will not proceed for long, because redox balance has 
not been maintained. As we have seen, the activity of glyceraldehyde 
3-phosphate dehydrogenase, in addition to generating a compound with 
high phosphoryl-transfer potential, of necessity leads to the reduction of 
NAD⁺ to NADH. In the cell, there are limited amounts of NAD⁺, which
is derived from the vitamin niacin, a dietary requirement for human beings.
Consequently, NAD⁺ must be regenerated for glycolysis to proceed. Thus,
the final process in the pathway is the regeneration of NAD⁺ through the
metabolism of pyruvate. 

The sequence of reactions from glucose to pyruvate is similar in most
organisms and most types of cells. In contrast, the fate of pyruvate is
variable. Three reactions of pyruvate are of primary importance: conversion
into ethanol, lactate, or carbon dioxide (Figure 16.9). The first two reactions
are fermentations that take place in the absence of oxygen. A fermentation is
an ATP-generating process in which organic compounds act both as the donor
and as the acceptor of electrons. In the presence of oxygen, the most common
situation in multicellular organisms and in many unicellular ones, pyruvate is
metabolized to carbon dioxide and water through the citric acid cycle and
the electron-transport chain with oxygen serving as the final electron
acceptor. We now take a closer look at these three possible fates of pyruvate.

Figure 16.9 Diverse fates of pyruvate. Ethanol and lactate can be
formed by reactions that include NADH. Alternatively, a two-carbon
unit from pyruvate can be coupled to coenzyme A (Chapter 17) to
form acetyl CoA.

Table 16.1 (continued) Enzymes and free-energy changes for steps 1 to 10

Step  Enzyme                                    Reaction type                         ΔG°′ in kJ mol⁻¹ (kcal mol⁻¹)  ΔG in kJ mol⁻¹ (kcal mol⁻¹)
1     Hexokinase                                Phosphoryl transfer                   −16.7 (−4.0)                   −33.5 (−8.0)
2     Phosphoglucose isomerase                  Isomerization                         +1.7 (+0.4)                    −2.5 (−0.6)
3     Phosphofructokinase                       Phosphoryl transfer                   −14.2 (−3.4)                   −22.2 (−5.3)
4     Aldolase                                  Aldol cleavage                        +23.8 (+5.7)                   −1.3 (−0.3)
5     Triose phosphate isomerase                Isomerization                         +7.5 (+1.8)                    +2.5 (+0.6)
6     Glyceraldehyde 3-phosphate dehydrogenase  Phosphorylation coupled to oxidation  +6.3 (+1.5)                    −1.7 (−0.4)
7     Phosphoglycerate kinase                   Phosphoryl transfer                   −18.8 (−4.5)                   +1.3 (+0.3)
8     Phosphoglycerate mutase                   Phosphoryl shift                      +4.6 (+1.1)                    +0.8 (+0.2)
9     Enolase                                   Dehydration                           +1.7 (+0.4)                    −3.3 (−0.8)
10    Pyruvate kinase                           Phosphoryl transfer                   −31.4 (−7.5)                   −16.7 (−4.0)

1. Ethanol is formed from pyruvate in yeast and several other
microorganisms. The first step is the decarboxylation of pyruvate. This
reaction is catalyzed by pyruvate decarboxylase, which requires the coenzyme
thiamine pyrophosphate. This coenzyme, derived from the vitamin thiamine
(B₁), also participates in reactions catalyzed by other enzymes
(Section 17.1). The second step is the reduction of acetaldehyde to ethanol
by NADH, in a reaction catalyzed by alcohol dehydrogenase. This process
regenerates NAD⁺.


Pyruvate + H⁺ → acetaldehyde + CO₂ (catalyzed by pyruvate decarboxylase)
Acetaldehyde + NADH + H⁺ → ethanol + NAD⁺ (catalyzed by alcohol dehydrogenase)


The active site of alcohol dehydrogenase contains a zinc ion that is coordi- 
nated to the sulfur atoms of two cysteine residues and a nitrogen atom of 
histidine (Figure 16.10). This zinc ion polarizes the carbonyl group of the 
substrate to favor the transfer of a hydride from NADH. 

The conversion of glucose into ethanol is an example of alcoholic 
fermentation. The net result of this anaerobic process is 


Glucose + 2 Pᵢ + 2 ADP + 2 H⁺ →
2 ethanol + 2 CO₂ + 2 ATP + 2 H₂O


Note that NAD⁺ and NADH do not appear in this equation, even though
they are crucial for the overall process. NADH generated by the oxidation
of glyceraldehyde 3-phosphate is consumed in the reduction of acetaldehyde
to ethanol. Thus, there is no net oxidation-reduction in the conversion
of glucose into ethanol (Figure 16.11). The ethanol formed in alcoholic
fermentation provides a key ingredient for brewing and winemaking. 


Figure 16.10 Active site of alcohol dehydrogenase. The active site contains
a zinc ion bound to two cysteine residues and one histidine residue. Notice
that the zinc ion binds the acetaldehyde substrate through its oxygen atom,
polarizing the substrate so that it more easily accepts a hydride from NADH.
Only the nicotinamide ring of NADH is shown.

Figure 16.11 Maintaining redox balance. The NADH produced by the
glyceraldehyde 3-phosphate dehydrogenase reaction must be reoxidized to
NAD⁺ for the glycolytic pathway to continue. In alcoholic fermentation,
alcohol dehydrogenase oxidizes NADH and generates ethanol. In lactic acid
fermentation (not shown), lactate dehydrogenase oxidizes NADH while
generating lactic acid.

2. Lactate is formed from pyruvate in a variety of microorganisms in a
process called lactic acid fermentation. The reaction also takes place in the
cells of higher organisms when the amount of oxygen is limiting, as in
muscle cells during intense activity. The reduction of pyruvate by NADH
to form lactate is catalyzed by lactate dehydrogenase.

Pyruvate + NADH + H⁺ → lactate + NAD⁺ (catalyzed by lactate dehydrogenase)


The overall reaction in the conversion of glucose into lactate is 
Glucose + 2 Pᵢ + 2 ADP → 2 lactate + 2 ATP + 2 H₂O


As in alcoholic fermentation, there is no net oxidation-reduction. The
NADH formed in the oxidation of glyceraldehyde 3-phosphate is consumed
in the reduction of pyruvate. The regeneration of NAD⁺ in the reduction
of pyruvate to lactate or ethanol sustains the continued process of glycolysis
under anaerobic conditions.
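
The redox balance of the two fermentations can be verified by adding the fate of pyruvate, taken twice, to the net reaction of glycolysis. The sketch below is illustrative only; the species labels and the helper name are assumptions, and the stoichiometries are those given in this section.

```python
from collections import Counter

# Net change per glucose for glycolysis (negative = consumed).
GLYCOLYSIS_NET = {"glucose": -1, "Pi": -2, "ADP": -2, "NAD+": -2,
                  "pyruvate": 2, "ATP": 2, "NADH": 2, "H+": 2, "H2O": 2}

# Net change per pyruvate for each fermentative fate.
LACTATE_FATE = {"pyruvate": -1, "NADH": -1, "H+": -1,
                "lactate": 1, "NAD+": 1}
ETHANOL_FATE = {"pyruvate": -1, "NADH": -1, "H+": -2,
                "ethanol": 1, "CO2": 1, "NAD+": 1}


def combine(base, fate, times):
    """Add `times` copies of a pyruvate fate to a base net reaction."""
    net = Counter(base)
    for species, n in fate.items():
        net[species] += n * times
    return {species: n for species, n in net.items() if n != 0}


lactic = combine(GLYCOLYSIS_NET, LACTATE_FATE, 2)
alcoholic = combine(GLYCOLYSIS_NET, ETHANOL_FATE, 2)
print("lactic acid fermentation:", lactic)
print("alcoholic fermentation:  ", alcoholic)
```

In both results NAD⁺ and NADH drop out entirely, which is the bookkeeping form of the statement that there is no net oxidation-reduction.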


3. Only a fraction of the energy of glucose is released in its anaerobic 
conversion into ethanol or lactate. Much more energy can be extracted 
aerobically by means of the citric acid cycle and the electron-transport 
chain. The entry point to this oxidative pathway is acetyl coenzyme A (acetyl 
CoA), which is formed inside mitochondria by the oxidative decarbox- 
ylation of pyruvate. 


Pyruvate + NAD⁺ + CoA → acetyl CoA + CO₂ + NADH + H⁺


This reaction, which is catalyzed by the pyruvate dehydrogenase complex, 
will be considered in detail in Chapter 17. The NAD⁺ required for this
reaction and for the oxidation of glyceraldehyde 3-phosphate is regenerated
when NADH ultimately transfers its electrons to O₂ through the
electron-transport chain in mitochondria.


Fermentations provide usable energy in the absence of oxygen 


Fermentations yield only a fraction of the energy available from the complete
combustion of glucose. Why is a relatively inefficient metabolic pathway so
extensively used? The fundamental reason is that oxygen is not required.
The ability to survive without oxygen affords a host of living accommodations
such as soils, deep water, and skin pores. Some organisms, called
obligate anaerobes, cannot survive in the presence of O₂, a highly reactive
compound. The bacterium Clostridium perfringens, the cause of gangrene, is
an example of an obligate anaerobe. Other pathogenic obligate anaerobes
are listed in Table 16.2. Skeletal muscles in most animals can function
anaerobically for short periods. For example, when animals perform bursts
of intense exercise, their ATP needs rise faster than the ability of the body
to provide oxygen to the muscle. The muscle functions anaerobically until
fatigue sets in, which is caused, in part, by lactate buildup.

Table 16.2 Examples of pathogenic obligate anaerobes

Bacterium                Result of infection
Clostridium tetani       Tetanus (lockjaw)
Clostridium botulinum    Botulism (an especially severe type of food poisoning)
Clostridium perfringens  Gas gangrene (gas is produced as an end point of the fermentation, distorting and destroying the tissue)
Bartonella henselae      Cat scratch fever (flu-like symptoms)
Bacteroides fragilis     Abdominal, pelvic, pulmonary, and blood infections

Although we have considered only lactic acid and alcoholic fermentation, 
microorganisms are capable of generating a wide array of molecules as end 
points of fermentation (Table 16.3). Indeed, many food products, including 
sour cream, yogurt, various cheeses, beer, wine, and sauerkraut, result from 
fermentation. 


The binding site for NAD⁺ is similar in many dehydrogenases


The three dehydrogenases (glyceraldehyde 3-phosphate dehydrogenase,
alcohol dehydrogenase, and lactate dehydrogenase) have quite
different three-dimensional structures. However, their NAD⁺-binding
domains are strikingly similar (Figure 16.12). This nucleotide-binding region
is made up of four α helices and a sheet of six parallel β strands. Moreover,
in all cases, the bound NAD⁺ displays nearly the same conformation. This
common structural domain was one of the first recurring structural domains
to be discovered. It is often called a Rossmann fold after Michael Rossmann,
who first recognized it. This fold likely represents a primordial
dinucleotide-binding domain that recurs in the dehydrogenases of glycolysis
and other enzymes because of their descent from a common ancestor.


Figure 16.12 NAD⁺-binding region in dehydrogenases. Notice that the
nicotinamide-binding half (yellow) is structurally similar to the
adenine-binding half (red). The two halves together form a structural motif
called a Rossmann fold. The NAD⁺ molecule binds in an extended
conformation. [Drawn from 3LDH.pdb.]

Table 16.3 Starting and ending points of various fermentations

Glucose → lactate
Lactate → acetate
Glucose → ethanol
Ethanol → acetate
Arginine → carbon dioxide
Pyrimidines → carbon dioxide
Purines → formate
Ethylene glycol → acetate
Threonine → propionate
Leucine → 2-alkylacetate
Phenylalanine → propionate

Note: The products of some fermentations are the substrates for others.


Fructose and galactose are converted into glycolytic
intermediates



Although glucose is the most widely used monosaccharide, others also are
important fuels. Let us consider how two abundant sugars, fructose and
galactose, can be funneled into the glycolytic pathway (Figure 16.13).
There are no catabolic pathways for metabolizing fructose or galactose, and
so the strategy is to convert these sugars into a metabolite of glucose.

Fructose can take one of two pathways to enter the glycolytic
pathway. Much of the ingested fructose is metabolized by
the liver, using the fructose 1-phosphate pathway (Figure 16.14).
The first step is the phosphorylation of fructose to fructose
1-phosphate by fructokinase. Fructose 1-phosphate is then
split into glyceraldehyde and dihydroxyacetone phosphate, an
intermediate in glycolysis. This aldol cleavage is catalyzed by a specific
fructose 1-phosphate aldolase. Glyceraldehyde is then phosphorylated to
glyceraldehyde 3-phosphate, a glycolytic intermediate, by triose kinase. In
other tissues, fructose can be phosphorylated to fructose 6-phosphate by
hexokinase.

Figure 16.13 Entry points in glycolysis for galactose and fructose.

Figure 16.14 Fructose metabolism. Fructose enters the glycolytic pathway
in the liver through the fructose 1-phosphate pathway.

Galactose is converted into glucose 6-phosphate in four steps. The first 
reaction in the galactose-glucose interconversion pathway is the phosphoryla- 
tion of galactose to galactose 1-phosphate by galactokinase. 


Galactose + ATP → galactose 1-phosphate + ADP + H⁺
(catalyzed by galactokinase)


Galactose 1-phosphate then acquires a uridyl group from uridine diphos- 
phate glucose (UDP-glucose), an activated intermediate in the synthesis of 
carbohydrates (Section 21.4). 


Galactose 1-phosphate + UDP-glucose ⇌ UDP-galactose + glucose 1-phosphate
(catalyzed by galactose 1-phosphate uridyl transferase)

UDP-galactose ⇌ UDP-glucose
(catalyzed by UDP-galactose 4-epimerase)


The products of this reaction, which is catalyzed by galactose 1-phosphate 
uridyl transferase, are UDP-galactose and glucose 1-phosphate. The 
galactose moiety of UDP-galactose is then epimerized to glucose. The con- 
figuration of the hydroxyl group at carbon 4 is inverted by UDP-galactose 
4-epimerase. 


The sum of the reactions catalyzed by galactokinase, the transferase, and 
the epimerase is 


Galactose + ATP → glucose 1-phosphate + ADP + H⁺


Note that UDP-glucose is not consumed in the conversion of galactose into 
glucose, because it is regenerated from UDP-galactose by the epimerase. 
This reaction is reversible, and the product of the reverse direction also is 
important. The conversion of UDP-glucose into UDP-galactose is essential 
for the synthesis of galactosyl residues in complex polysaccharides and glyco- 
proteins if the amount of galactose in the diet is inadequate to meet these 
needs. 
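
The bookkeeping behind this net reaction can be checked with a short sketch. The Python below is only an illustration: the dictionary representation, the species labels, and the helper name `net_reaction` are assumptions made for this example. Each reaction is written with negative coefficients for reactants and positive coefficients for products, and the three steps are then summed.

```python
from collections import Counter

# Each reaction maps species -> stoichiometric coefficient
# (negative = consumed, positive = produced). Labels are illustrative.
GALACTOKINASE = {
    "galactose": -1, "ATP": -1,
    "galactose 1-phosphate": 1, "ADP": 1, "H+": 1,
}
TRANSFERASE = {
    "galactose 1-phosphate": -1, "UDP-glucose": -1,
    "UDP-galactose": 1, "glucose 1-phosphate": 1,
}
EPIMERASE = {"UDP-galactose": -1, "UDP-glucose": 1}


def net_reaction(*reactions):
    """Sum the coefficients of several reactions and drop species that cancel."""
    total = Counter()
    for reaction in reactions:
        for species, coeff in reaction.items():
            total[species] += coeff
    return {species: coeff for species, coeff in total.items() if coeff != 0}


net = net_reaction(GALACTOKINASE, TRANSFERASE, EPIMERASE)
for species, coeff in sorted(net.items()):
    print(f"{species}: {coeff:+d}")
```

UDP-glucose and UDP-galactose do not appear in the result, which is the point made in the text: the epimerase regenerates the UDP-glucose consumed by the transferase.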

Finally, glucose 1-phosphate, formed from galactose, is isomerized to 
glucose 6-phosphate by phosphoglucomutase. We shall return to this reaction 
when we consider the synthesis and degradation of glycogen, which pro- 
ceeds through glucose 1-phosphate, in Chapter 21. 


Many adults are intolerant of milk because they are deficient in lactase 


Many adults are unable to metabolize the milk sugar lactose and 
experience gastrointestinal disturbances if they drink milk. Lactose 
intolerance, or hypolactasia, is most commonly caused by a deficiency of 
the enzyme lactase, which cleaves lactose into glucose and galactose. 


Lactose + H₂O → galactose + glucose
(catalyzed by lactase)


“Deficiency” is not quite the appropriate term, because a decrease in lactase 
is normal in the course of development in all mammals. As children are 
weaned and milk becomes less prominent in their diets, lactase activity nor- 
mally declines to about 5% to 10% of the level at birth. This decrease is not 
as pronounced with some groups of people, most notably Northern 
Europeans, and people from these groups can continue to ingest milk with- 
out gastrointestinal difficulties. With the appearance of milk-producing 
domesticated animals, an adult with active lactase would have a selective 
advantage in being able to consume calories from the readily available milk. 
Because dairy farming originated only about 10,000 years ago, the evolu- 
tionary selective pressure on lactase persistence must have been substantial, 
attesting to the biochemical value of being able to use milk as an energy 
source into adulthood. 

What happens to the lactose in the intestine of a lactase-deficient per- 
son? The lactose is a good energy source for microorganisms in the colon, 
and they ferment it to lactic acid while generating methane (CH4) and 
hydrogen gas (H2). The gas produced creates the uncomfortable feeling of 
gut distension and the annoying problem of flatulence. The lactate pro- 
duced by the microorganisms is osmotically active and draws water into the 
intestine, as does any undigested lactose, resulting in diarrhea. If severe 
enough, the gas and diarrhea hinder the absorption of other nutrients such 
as fats and proteins. The simplest treatment is to avoid the consumption of 
products containing much lactose. Alternatively, the enzyme lactase can be 
ingested with milk products. 

Scanning electron micrograph of 
Lactobacillus. The anaerobic bacterium 
Lactobacillus is shown here (artificially colored) 
at a magnification of 22,245×. As suggested by 
its name, this genus of bacteria ferments 
glucose into lactic acid and is widely used in the 
food industry. Lactobacillus is also a component 
of the normal human bacterial flora of the 
urogenital tract where, because of its ability to 
generate an acidic environment, it prevents the 
growth of harmful bacteria. [Dr. Dennis Kunkel/ 
PhotoTake.] 

Figure 16.15 Cataracts are evident as 
the clouding of the lens. (A) A healthy eye. 
(B) An eye with a cataract. [(A) © Imageafter; 
(B) SPL/Photo Researchers.] 


Galactose is highly toxic if the transferase is missing 


Less common than lactose intolerance are disorders that interfere 
with the metabolism of galactose. The disruption of galactose metabolism 
is referred to as galactosemia. The most common form, called classic 
galactosemia, is an inherited deficiency in galactose 1-phosphate uridyl 
transferase activity. Afflicted infants fail to thrive. They vomit or have 
diarrhea after consuming milk, and enlargement of the liver and jaundice 
are common, sometimes progressing to cirrhosis. Cataracts will form, and 
lethargy and retarded mental development also are common. The 
blood-galactose level is markedly elevated, and galactose is found in the 
urine. The absence of the transferase in red blood cells is a definitive 
diagnostic criterion. 

The most common treatment is to remove galactose (and lactose) from 
the diet. An enigma of galactosemia is that, although elimination of galac- 
tose from the diet prevents liver disease and cataract development, the 
majority of patients still suffer from central nervous system malfunction, 
most commonly a delayed acquisition of language skills. Female patients 
also display ovarian failure. 

Cataract formation is better understood. A cataract is the clouding of the 
normally clear lens of the eye (Figure 16.15). If the transferase is not active 
in the lens of the eye, the presence of aldose reductase causes the accumulat- 
ing galactose to be reduced to galactitol. 


Galactose + NADPH + H⁺ → galactitol + NADP⁺
(catalyzed by aldose reductase)


Galactitol is osmotically active, and water will diffuse into the lens, instigat- 
ing the formation of cataracts. In fact, there is a high incidence of cataract 
formation with age in populations that consume substantial amounts of 
milk into adulthood. 


16.2 The Glycolytic Pathway Is Tightly Controlled 


The glycolytic pathway has a dual role: it degrades glucose to generate ATP 
and it provides building blocks for synthetic reactions, such as the forma- 
tion of fatty acids. The rate of conversion of glucose into pyruvate is regu- 
lated to meet these two major cellular needs. In metabolic pathways, enzymes 
catalyzing essentially irreversible reactions are potential sites of control. In 
glycolysis, the reactions catalyzed by hexokinase, phosphofructokinase, and 
pyruvate kinase are virtually irreversible; hence, these enzymes would be 
expected to have regulatory as well as catalytic roles. In fact, each of them 
serves as a control site. These enzymes become more active or less so in 
response to the reversible binding of allosteric effectors or covalent modifi- 
cation. In addition, the amounts of these important enzymes are varied by 
the regulation of transcription to meet changing metabolic needs. The time 
required for reversible allosteric control, regulation by phosphorylation, 
and transcriptional control is measured typically in milliseconds, seconds, 
and hours, respectively. We will consider the control of glycolysis in two 
different tissues—skeletal muscle and liver. 


Glycolysis in muscle is regulated to meet the need for ATP 


Glycolysis in skeletal muscle provides ATP primarily to power contraction. 
Consequently, the primary control of muscle glycolysis is the energy charge 
of the cell—the ratio of ATP to AMP. Let us examine how each of the key 
regulatory enzymes responds to changes in the amounts of ATP and AMP 
present in the cell. 


Phosphofructokinase. Phosphofructokinase is the most important control 
site in the mammalian glycolytic pathway (Figure 16.16). High levels of ATP 
allosterically inhibit the enzyme (a 340-kd tetramer). ATP binds to a 
specific regulatory site that is distinct from the catalytic site. The binding of 
ATP lowers the enzyme’s affinity for fructose 6-phosphate. Thus, a 
high concentration of ATP converts the hyperbolic binding curve of 
fructose 6-phosphate into a sigmoidal one (Figure 16.17). AMP reverses 
the inhibitory action of ATP, and so the activity of the enzyme increases when 
the ATP/AMP ratio is lowered. In other words, glycolysis is stimulated as the 
energy charge falls. A decrease in pH also inhibits phosphofructokinase 
activity by augmenting the inhibitory effect of ATP. The pH might fall 
when muscle is functioning anaerobically, producing excessive quantities of 
lactic acid. The inhibitory effect protects the muscle from damage that 
would result from the accumulation of too much acid. 
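
The shift from a hyperbolic to a sigmoidal curve can be pictured with the Hill equation, a standard empirical description of cooperative kinetics. The sketch below is not a model of the real enzyme: the Hill coefficients (1 and 4), the half-saturation constants, and the function name `hill_velocity` are illustrative assumptions, chosen only to show that a sigmoidal curve gives much lower activity at low fructose 6-phosphate concentrations.

```python
def hill_velocity(s, k_half, n, v_max=1.0):
    """Hill equation: v = Vmax * s^n / (K^n + s^n)."""
    return v_max * s**n / (k_half**n + s**n)


# Assumed, illustrative parameters (arbitrary concentration units).
LOW_ATP = {"k_half": 1.0, "n": 1}    # hyperbolic curve
HIGH_ATP = {"k_half": 2.0, "n": 4}   # sigmoidal curve, lower apparent affinity

for s in (0.25, 0.5, 1.0, 2.0, 4.0):
    v_low = hill_velocity(s, **LOW_ATP)
    v_high = hill_velocity(s, **HIGH_ATP)
    print(f"[F6P] = {s:4.2f}  v(low ATP) = {v_low:.3f}  v(high ATP) = {v_high:.3f}")
```

In this picture, the sigmoidal curve stays near zero until the substrate concentration approaches the half-saturation value and then rises steeply, so the enzyme behaves more like a switch than the hyperbolic case does.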

Figure 16.16 Structure of phosphofructokinase. The structure of phosphofructokinase 
from E. coli comprises a tetramer of four identical subunits. Notice the separation of the catalytic 
and allosteric sites. Each subunit of the human liver enzyme consists of two domains that are 
similar to the E. coli enzyme. [Drawn from 1PFK.pdb.] 

Figure 16.17 Allosteric regulation of 
phosphofructokinase. A high level of ATP 
inhibits the enzyme by decreasing its affinity 
for fructose 6-phosphate. AMP diminishes and 
citrate enhances the inhibitory effect of ATP. 
[Plot: reaction velocity on the vertical axis; one curve is labeled "Low [ATP]".] 

Why is AMP and not ADP the positive regulator of phosphofruc- 
tokinase? When ATP is being utilized rapidly, the enzyme adenylate kinase 
(Section 9.4) can form ATP from ADP by the following reaction: 


ADP + ADP ⇌ ATP + AMP 


Thus, some ATP is salvaged from ADP, and AMP becomes the signal for 
the low-energy state. Moreover, the use of AMP as an allosteric regulator 
provides an especially sensitive control. We can understand why by considering, 
first, that the total adenylate pool ([ATP], [ADP], [AMP]) in a cell is 
constant over the short term and, second, that the concentration of ATP is 
greater than that of ADP and the concentration of ADP is, in turn, greater 
than that of AMP. Consequently, small-percentage changes in [ATP] result 
in larger-percentage changes in the concentrations of the other adenylate 
nucleotides. This magnification of small changes in [ATP] to larger changes 
in [AMP] leads to tighter control by increasing the range of sensitivity of 
phosphofructokinase. 
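
A numerical sketch makes the amplification concrete. The values below are assumptions chosen for illustration only: a total adenylate pool of 5.0 mM, an equilibrium constant of 0.44 for the adenylate kinase reaction, and a starting [ATP] of 4.5 mM. The function name `adenylate_pool` is likewise hypothetical. Given [ATP], the code solves for [ADP] and [AMP] under two constraints: a constant pool and adenylate kinase at equilibrium.

```python
import math


def adenylate_pool(atp, total, k_eq):
    """Return ([ADP], [AMP]) for a given [ATP].

    Constraints: [ATP] + [ADP] + [AMP] = total, and
    k_eq = [ATP][AMP] / [ADP]^2 for ADP + ADP <=> ATP + AMP.
    """
    rest = total - atp            # [ADP] + [AMP]
    a = k_eq / atp                # so that [AMP] = a * [ADP]^2
    # Positive root of a*[ADP]^2 + [ADP] - rest = 0.
    adp = (-1.0 + math.sqrt(1.0 + 4.0 * a * rest)) / (2.0 * a)
    amp = rest - adp
    return adp, amp


TOTAL = 5.0   # mM, assumed total adenylate pool
K_EQ = 0.44   # assumed equilibrium constant

adp_1, amp_1 = adenylate_pool(4.5, TOTAL, K_EQ)
adp_2, amp_2 = adenylate_pool(4.05, TOTAL, K_EQ)   # [ATP] lowered by 10%

print(f"[ATP] falls by {100 * (4.5 - 4.05) / 4.5:.0f}%")
print(f"[ADP] rises {adp_2 / adp_1:.1f}-fold")
print(f"[AMP] rises {amp_2 / amp_1:.1f}-fold")
```

Under these assumptions, a 10% fall in [ATP] raises [AMP] severalfold, which is why AMP is a more sensitive indicator of the energy state than ATP itself.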


Hexokinase. Phosphofructokinase is the most prominent regulatory 
enzyme in glycolysis, but it is not the only one. Hexokinase, the enzyme 
catalyzing the first step of glycolysis, is inhibited by its product, glucose 
6-phosphate. High concentrations of this molecule signal that the cell no 
longer requires glucose for energy or for the synthesis of glycogen, a storage 
form of glucose (Chapter 21), and the glucose will be left in the blood. A rise 
in glucose 6-phosphate concentration is a means by which phosphofruc- 
tokinase communicates with hexokinase. When phosphofructokinase is 
inactive, the concentration of fructose 6-phosphate rises. In turn, the level 
of glucose 6-phosphate rises because it is in equilibrium with fructose 
6-phosphate. Hence, the inhibition of phosphofructokinase leads to the inhibi- 
tion of hexokinase. 

Why is phosphofructokinase rather than hexokinase the pacemaker of 
glycolysis? The reason becomes evident on noting that glucose 6-phosphate 
is not solely a glycolytic intermediate. In muscle, glucose 6-phosphate can 
also be converted into glycogen. The first irreversible reaction unique to the 
glycolytic pathway, the committed step (Section 10.1), is the phosphorylation 
of fructose 6-phosphate to fructose 1,6-bisphosphate. Thus, it is highly 
appropriate for phosphofructokinase to be the primary control site in glyco- 
lysis. In general, the enzyme catalyzing the committed step in a metabolic 
sequence is the most important control element in the pathway. 


Pyruvate kinase. Pyruvate kinase, the enzyme catalyzing the third irre- 
versible step in glycolysis, controls the outflow from this pathway. This 
final step yields ATP and pyruvate, a central metabolic intermediate that 
can be oxidized further or used as a building block. ATP allosterically 
inhibits pyruvate kinase to slow glycolysis when the energy charge is high. 
Finally, alanine (synthesized in one step from pyruvate, Section 23.3) also 
allosterically inhibits pyruvate kinase—in this case, to signal that building 
blocks are abundant. When the pace of glycolysis increases, fructose 
1,6-bisphosphate, the product of the preceding irreversible step in glycoly- 
sis, activates the kinase to enable it to keep pace with the oncoming high flux 
of intermediates. A summary of the regulation of glycolysis in resting and 
active muscle is shown in Figure 16.18. 


The regulation of glycolysis in the liver illustrates the 
biochemical versatility of the liver 


The liver has more-diverse biochemical functions than muscle. Significantly, 
the liver maintains blood-glucose levels: it stores glucose as glycogen when 
glucose is plentiful, and it releases glucose when supplies are low. It also 
uses glucose to generate reducing power for biosynthesis (Section 20.3) as 
well as to synthesize a host of biochemicals. So, although the liver has many 
of the regulatory features of muscle glycolysis, the regulation of glycolysis 
in the liver is more complex. 

Figure 16.18 Regulation of glycolysis in muscle. At rest (left), glycolysis is not very active (thin 
arrows). The high concentration of ATP inhibits phosphofructokinase (PFK), pyruvate kinase, and 
hexokinase. Glucose 6-phosphate is converted into glycogen (Chapter 21). During exercise (right), 
the decrease in the ATP/AMP ratio resulting from muscle contraction activates phosphofructokinase 
and hence glycolysis. The flux down the pathway is increased, as represented by the thick arrows. 


Phosphofructokinase. Regulation with respect to ATP is the same in the 
liver as in muscle. Low pH is not a metabolic signal for the liver enzyme, 
because lactate is not normally produced in the liver. Indeed, as we will see, 
lactate is converted into glucose in the liver. 

Glycolysis also furnishes carbon skeletons for biosyntheses, and so a 
signal indicating whether building blocks are abundant or scarce should also 
regulate phosphofructokinase. In the liver, phosphofructokinase is inhibited 
by citrate, an early intermediate in the citric acid cycle (Chapter 17). A high 
level of citrate in the cytoplasm means that biosynthetic precursors are 
abundant, and so there is no need to degrade additional glucose for this 
purpose. Citrate inhibits phosphofructokinase by enhancing the inhibitory 
effect of ATP. 

One means by which glycolysis in the liver responds to changes in blood 
glucose is through the signal molecule fructose 2,6-bisphosphate (F-2,6-BP), 
a potent activator of phosphofructokinase (Figure 16.19). In the liver, the 
concentration of fructose 6-phosphate rises when blood-glucose concentration 
is high, and the abundance of fructose 6-phosphate accelerates the 
synthesis of F-2,6-BP (Figure 16.20). Hence, an abundance of fructose 
6-phosphate leads to a higher concentration of F-2,6-BP. The binding of 
fructose 2,6-bisphosphate increases the affinity of phosphofructokinase for 
fructose 6-phosphate and diminishes the inhibitory effect of ATP. Glycolysis 
is thus accelerated when glucose is abundant. Such a process is called 
feedforward stimulation. We will turn to the synthesis and degradation of this 
important regulatory molecule after we have considered gluconeogenesis. 

Figure 16.19 Regulation of 
phosphofructokinase by fructose 
2,6-bisphosphate. In high concentrations, 
fructose 6-phosphate (F-6P) activates the 
enzyme phosphofructokinase (PFK) through 
an intermediary, fructose 2,6-bisphosphate 
(F-2,6-BP). 

Figure 16.20 Activation of 
phosphofructokinase by fructose 
2,6-bisphosphate. (A) The sigmoidal 
dependence of velocity on substrate 
concentration becomes hyperbolic in the 
presence of 1 μM fructose 2,6-bisphosphate. 
(B) ATP, acting as a substrate, initially 
stimulates the reaction. As the concentration 
of ATP increases, it acts as an allosteric 
inhibitor. The inhibitory effect of ATP is 
reversed by fructose 2,6-bisphosphate. [After 
E. Van Schaftingen, M. F. Jett, L. Hue, and H. 
G. Hers. Proc. Natl. Acad. Sci. U. S. A. 
78:3483-3486, 1981.] 


Hexokinase. The hexokinase reaction in the liver is controlled as in the 
muscle. However, the liver, in keeping with its role as monitor of blood- 
glucose levels, possesses another specialized isozyme of hexokinase, called 
glucokinase, which is not inhibited by glucose 6-phosphate. Glucokinase 
phosphorylates glucose only when glucose is abundant because the affinity 
of glucokinase for glucose is about 50-fold lower than that of hexokinase. 
The role of glucokinase is to provide glucose 6-phosphate for the synthesis 
of glycogen and for the formation of fatty acids (Section 22.1). The low 
affinity of glucokinase for glucose in the liver gives the brain and muscles 
first call on glucose when its supply is limited, and it ensures that glucose 
will not be wasted when it is abundant. Glucokinase is also present in the 
β cells of the pancreas, where the increased formation of glucose 6-phosphate 
by glucokinase when blood-glucose levels are elevated leads to the secretion 
of the hormone insulin. Insulin signals the need to remove glucose from the 
blood for storage as glycogen or conversion into fat. 


Pyruvate kinase. Several isozymic forms of pyruvate kinase (a tetramer of 
57-kd subunits) encoded by different genes are present in mammals: the L type 
predominates in the liver, and the M type in muscle and the brain. The L and M 
forms of pyruvate kinase have many properties in common. Indeed, the liver 
enzyme behaves much like the muscle enzyme with regard to allosteric 
regulation. However, the isozymic forms differ in their susceptibility to 
covalent modification. The catalytic properties of the L form, but not of the 
M form, are also controlled by reversible phosphorylation (Figure 16.21). When 
the blood-glucose level is low, the glucagon-triggered cyclic AMP cascade 
(p. 487) leads to the phosphorylation of pyruvate kinase, which diminishes its 
activity. This hormone-triggered phosphorylation prevents the liver from 
consuming glucose when it is more urgently needed by the brain and muscle. We 
see here a clear-cut example of how isoenzymes contribute to the metabolic 
diversity of different organs. We will return to the control of glycolysis 
after considering gluconeogenesis. 

Figure 16.21 Control of the catalytic activity of pyruvate kinase. 
Pyruvate kinase is regulated by allosteric effectors and covalent modification. 
[Diagram labels: phosphoenolpyruvate; fructose 1,6-bisphosphate; ATP; alanine.] 


Table 16.4 Family of glucose transporters 

Name    Tissue location                 KM          Comments 
GLUT1   All mammalian tissues           1 mM        Basal glucose uptake 
GLUT2   Liver and pancreatic β cells    15-20 mM    In the pancreas, plays a role in the 
                                                    regulation of insulin. In the liver, 
                                                    removes excess glucose from the blood 
GLUT3   All mammalian tissues           1 mM        Basal glucose uptake 
GLUT4   Muscle and fat cells            5 mM        Amount in muscle plasma membrane 
                                                    increases with endurance training 
GLUT5   Small intestine                 -           Primarily a fructose transporter 


A family of transporters enables glucose to enter and leave animal cells 


Several glucose transporters mediate the thermodynamically downhill 
movement of glucose across the plasma membranes of animal cells. Each 
member of this protein family, named GLUT1 to GLUT5, consists of a 
single polypeptide chain about 500 residues long (Table 16.4). Each glucose 
transporter has a 12-transmembrane-helix structure similar to that of 
lactose permease (Section 13.3). 

The members of this family have distinctive roles: 


1. GLUT1 and GLUT3, present in nearly all mammalian cells, are 
responsible for basal glucose uptake. Their KM value for glucose is about 
1 mM, significantly less than the normal serum-glucose level, which typi- 
cally ranges from 4 mM to 8 mM. Hence, GLUT1 and GLUT3 continu- 
ally transport glucose into cells at an essentially constant rate. 


2. GLUT2, present in liver and pancreatic β cells, is distinctive in having 
a very high KM value for glucose (15-20 mM). Hence, glucose enters these 
tissues at a biologically significant rate only when there is much glucose 
in the blood. The pancreas can sense the glucose level and accordingly 
adjust the rate of insulin secretion. The high KM value of GLUT2 also 
ensures that glucose rapidly enters liver cells only in times of plenty. 


3. GLUT4, which has a KM value of 5 mM, transports glucose into muscle 
and fat cells. The number of GLUT4 transporters in the plasma mem- 
brane increases rapidly in the presence of insulin, which signals the fed 
state. Hence, insulin promotes the uptake of glucose by muscle and fat. 
Endurance exercise training increases the amount of this transporter pres- 
ent in muscle membranes. 


4. GLUT5, present in the small intestine, functions primarily as a fructose 
transporter. 
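
The consequences of these KM values can be sketched numerically. The Python below assumes that uptake follows simple Michaelis-Menten saturation, v/Vmax = [S]/(KM + [S]), which is a simplification of facilitated transport. The function name `fractional_rate` is hypothetical, and GLUT2 is taken at the low end of its 15-20 mM range. The glucose concentrations of 4 mM and 8 mM are the limits of the normal serum range quoted above.

```python
def fractional_rate(glucose_mm, km_mm):
    """Michaelis-Menten fractional velocity v/Vmax = [S] / (KM + [S])."""
    return glucose_mm / (km_mm + glucose_mm)


# KM values (mM) from Table 16.4; GLUT2 taken at the low end of 15-20 mM.
KM_MM = {"GLUT1/GLUT3": 1.0, "GLUT4": 5.0, "GLUT2": 15.0}

for name, km in KM_MM.items():
    low = fractional_rate(4.0, km)    # lower end of the serum range
    high = fractional_rate(8.0, km)   # upper end of the serum range
    print(f"{name:12s} v/Vmax at 4 mM = {low:.2f}, at 8 mM = {high:.2f}, "
          f"ratio = {high / low:.2f}")
```

Under this simplification, the low-KM transporters are already close to saturation across the normal range, so their rate barely changes, whereas the rate through GLUT2 responds most strongly to a doubling of blood glucose.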


This family of transporters vividly illustrates how isoforms of a single 
protein can significantly shape the metabolic character of cells and 
contribute to their diversity and functional specialization. The transporters 
are members of a superfamily of transporters called the major facilitator 
(MF) superfamily. Members of this family transport sugars in organisms as 
diverse as E. coli, Trypanosoma brucei (a parasitic protozoan that causes 
sleeping sickness), and human beings. An elegant solution to the problem of 
fuel transport evolved early and has been tailored to meet the needs of 
different organisms and even different tissues. 


Cancer and exercise training affect glycolysis in a similar fashion 


Tumors have been known for decades to display enhanced rates of 
glucose uptake and glycolysis. Indeed, rapidly growing tumor cells 
will metabolize glucose to lactate even in the presence of oxygen, a process 
called aerobic glycolysis or the “Warburg effect,” after Otto Warburg, the 
biochemist who first noted this characteristic of cancer cells in the 1920s. In 
fact, tumors with a high glucose uptake are particularly aggressive, and the 
cancer is likely to have a poor prognosis. A nonmetabolizable glucose 
analog, 2-¹⁸F-2-D-deoxyglucose, detectable by a combination of positron 
emission tomography (PET) and computer-aided tomography (CAT), 
easily visualizes tumors (Figure 16.22). 

What selective advantage does aerobic glycolysis offer the tumor over 
the energetically more efficient oxidative phosphorylation? Research is 
being actively pursued to answer the question, but we can speculate on the 
benefits. First, aerobic glycolysis generates lactic acid that is then secreted. 
Acidification of the tumor environment has been shown to facilitate tumor 
invasion and inhibit the immune system from attacking the tumor. Second, 
the increased uptake of glucose and formation of glucose 6-phosphate provides 
substrates for another metabolic pathway, the pentose phosphate 
pathway (Chapter 20), that generates biosynthetic reducing power. 
Moreover, the pentose phosphate pathway, in cooperation with glycolysis, 
produces precursors for biomolecules necessary for growth, such as nucleotides. 
Finally, cancer cells grow more rapidly than the blood vessels that 
nourish them; thus, as solid tumors grow, the oxygen concentration in their 
environment falls. In other words, they begin to experience hypoxia, a 
deficiency of oxygen. The use of aerobic glycolysis reduces the dependence of 
cell growth on oxygen. 

Figure 16.22 Tumors can be visualized with 2-¹⁸F-2-D-deoxyglucose (FDG) and positron 
emission tomography. (A) A nonmetabolizable glucose analog infused into a patient and 
detected by a combination of positron emission and computer-aided tomography reveals the 
presence of a malignant tumor (T). (B) After 4 weeks of treatment with a tyrosine kinase inhibitor 
(Section 14.5), the tumor shows no uptake of FDG, indicating decreased metabolism. Excess FDG, 
which is excreted in the urine, also visualizes the kidney (K) and bladder (B). [Images courtesy of 
A. D. Van den Abbeele, Dana-Farber Cancer Institute, Boston.] 

Hypoxia itself enhances tumor growth by activating a transcription 
factor, hypoxia-inducible transcription factor (HIF-1). HIF-1 increases the 
expression of most glycolytic enzymes and the glucose transporters 
GLUT1 and GLUT3 (Table 16.5). These adaptations by the cancer cells 
enable a tumor to survive until blood vessels can grow. HIF-1 also increases 
the expression of signal molecules, such as vascular endothelial growth 
factor (VEGF), that facilitate the growth of blood vessels that will provide 
nutrients to the cells (Figure 16.23). Without new blood vessels, a tumor 
would cease to grow and either die or remain harmlessly small. Efforts are 
underway to develop drugs that inhibit the growth of blood vessels in 
tumors. 

What biochemical alterations facilitate the switch to aerobic glycolysis? 
Again, the answers are not complete, but changes in gene expression of 
isozymic forms of two glycolytic enzymes may be crucial. Tumor cells 
express an isozyme of hexokinase that binds to mitochondria. There, the 
enzyme has ready access to any ATP generated by oxidative phospho- 
rylation and is no longer susceptible to feedback inhibition by its product, 
glucose 6-phosphate. An embryonic isozyme of pyruvate kinase also is 
expressed; it facilitates uses of glycolytic intermediates for biosynthetic 
reactions and is sensitive to growth-factor regulation. 

Interestingly, anaerobic exercise training activates HIF-1, producing the 
same effects as those seen in the tumor—enhanced ability to generate ATP 
anaerobically and a stimulation of blood-vessel growth. These biochemical 
effects account for the improved athletic performance that results from 
training and demonstrate how behavior can affect biochemistry. 


16.3 Glucose Can Be Synthesized from 
Noncarbohydrate Precursors 


We now turn to the synthesis of glucose from noncarbohydrate precursors, a 
process called gluconeogenesis. Maintaining levels of glucose is important 
because the brain depends on glucose as its primary fuel and red blood cells 
use glucose as their only fuel. The daily glucose requirement of the brain in 
a typical adult human being is about 120 g, which accounts for most of the 
160 g of glucose needed daily by the whole body. The amount of glucose 
present in body fluids is about 20 g, and that readily available from glycogen 
is approximately 190 g. Thus, the direct glucose reserves are sufficient to 
meet glucose needs for about a day. Gluconeogenesis is especially important 
during a longer period of fasting or starvation (Section 27.5). 
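
The estimate of about a day follows directly from the figures just quoted, as this short worked calculation shows. It assumes that all of the glucose in body fluids and all of the readily available glycogen can be drawn on at the whole-body rate of use.

```latex
\[
\frac{20\ \text{g} + 190\ \text{g}}{160\ \text{g day}^{-1}}
  = \frac{210\ \text{g}}{160\ \text{g day}^{-1}}
  \approx 1.3\ \text{days}
\]
```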

The gluconeogenic pathway converts pyruvate into glucose. Noncarbohydrate 
precursors of glucose are first converted into pyruvate or enter the 
pathway at later intermediates such as oxaloacetate and dihydroxyacetone 
phosphate (Figure 16.24). The major noncarbohydrate precursors 
are lactate, amino acids, and glycerol. Lactate is formed by active skeletal 
muscle when the rate of glycolysis exceeds the rate of oxidative metabolism. 
Lactate is readily converted into pyruvate by the action of lactate 
dehydrogenase (p. 468). Amino acids are derived from proteins in the diet and, 
during starvation, from the breakdown of proteins in skeletal muscle 
(Section 23.1). The hydrolysis of triacylglycerols (Section 22.2) in fat cells 
yields glycerol and fatty acids. Glycerol is a precursor of glucose, but 
animals cannot convert fatty acids into glucose, for reasons that will be given 
later. Glycerol may enter either the gluconeogenic or the glycolytic pathway 
at dihydroxyacetone phosphate. 


Table 16.5 Proteins in glucose metabolism encoded by genes regulated by 
hypoxia-inducible factor 

GLUT1 
GLUT3 
Hexokinase 
Phosphofructokinase 
Aldolase 
Glyceraldehyde 3-phosphate dehydrogenase 
Phosphoglycerate kinase 
Enolase 
Pyruvate kinase 


Figure 16.23 Alteration of gene 
expression in tumors owing to hypoxia. 
The hypoxic conditions inside a tumor mass 
lead to the activation of the hypoxia-inducible 
transcription factor (HIF-1), which induces 
metabolic adaptation (an increase in glycolytic 
enzymes) and activates angiogenic factors 
that stimulate the growth of new blood 
vessels. [After C. V. Dang and G. L. Semenza. 
Trends Biochem. Sci. 24:68-72, 1999.] 

Figure 16.24 Pathway of 
gluconeogenesis. The distinctive 
reactions and enzymes of this pathway 
are shown in red. The other reactions are 
common to glycolysis. The enzymes for 
gluconeogenesis are located in the 
cytoplasm, except for pyruvate 
carboxylase (in the mitochondria) and 
glucose 6-phosphatase (membrane 
bound in the endoplasmic reticulum). 
The entry points for lactate, glycerol, and 
amino acids are shown. 

The major site of gluconeogenesis is the liver, with a small amount also 
taking place in the kidney. Little gluconeogenesis takes place in the brain, 
skeletal muscle, or heart muscle. Rather, gluconeogenesis in the liver and 
kidney helps to maintain the glucose level in the blood so that the brain and 
muscle can extract sufficient glucose from it to meet their metabolic demands. 


Gluconeogenesis is not a reversal of glycolysis 


In glycolysis, glucose is converted into pyruvate; in gluconeogenesis, pyru- 
vate is converted into glucose. However, gluconeogenesis is not a reversal of 
glycolysis. Several reactions must differ because the equilibrium of glycolysis 
lies far on the side of pyruvate formation. The actual ΔG for the formation 
of pyruvate from glucose is about −84 kJ mol⁻¹ (−20 kcal mol⁻¹) under 
typical cellular conditions. Most of the decrease in free energy in glycolysis 
takes place in the three essentially irreversible steps catalyzed by hexoki- 
nase, phosphofructokinase, and pyruvate kinase. 


Hexokinase: 
Glucose + ATP → glucose 6-phosphate + ADP 
ΔG = −33 kJ mol⁻¹ (−8.0 kcal mol⁻¹) 

Phosphofructokinase: 
Fructose 6-phosphate + ATP → fructose 1,6-bisphosphate + ADP 
ΔG = −22 kJ mol⁻¹ (−5.3 kcal mol⁻¹) 

Pyruvate kinase: 
Phosphoenolpyruvate + ADP → pyruvate + ATP 
ΔG = −17 kJ mol⁻¹ (−4.0 kcal mol⁻¹) 
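
The claim that most of the free-energy decrease is concentrated in these three steps can be checked by simple addition. The sketch below uses only the values quoted in the text; the variable names and the conversion factor of 4.184 kJ per kcal are the only additions.

```python
# Free-energy changes (kJ/mol) quoted in the text for the three
# essentially irreversible steps of glycolysis.
DELTA_G_KJ = {
    "hexokinase": -33.0,
    "phosphofructokinase": -22.0,
    "pyruvate kinase": -17.0,
}
TOTAL_GLYCOLYSIS_KJ = -84.0   # glucose -> pyruvate, from the text
KJ_PER_KCAL = 4.184           # thermochemical calorie

irreversible_kj = sum(DELTA_G_KJ.values())
share = irreversible_kj / TOTAL_GLYCOLYSIS_KJ

print(f"Three irreversible steps: {irreversible_kj:.0f} kJ/mol "
      f"({irreversible_kj / KJ_PER_KCAL:.1f} kcal/mol)")
print(f"Share of the overall free-energy decrease: {share:.0%}")
```

With the values quoted above, the three steps sum to −72 kJ mol⁻¹, roughly 86% of the overall −84 kJ mol⁻¹. The remaining steps of glycolysis operate close to equilibrium and can run in reverse during gluconeogenesis.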


In gluconeogenesis, the following new steps bypass these virtually 
irreversible reactions of glycolysis: 


1. Phosphoenolpyruvate is formed from pyruvate by way of oxaloacetate 
through the action of pyruvate carboxylase and phosphoenolpyruvate 
carboxykinase. 


Pyruvate carboxylase: 
Pyruvate + CO₂ + ATP + H₂O ⇌ oxaloacetate + ADP + Pᵢ + 2 H⁺ 

Phosphoenolpyruvate carboxykinase: 
Oxaloacetate + GTP ⇌ phosphoenolpyruvate + GDP + CO₂ 


2. Fructose 6-phosphate is formed from fructose 1,6-bisphosphate by hydrolysis 
of the phosphate ester at carbon 1. Fructose 1,6-bisphosphatase catalyzes 
this exergonic hydrolysis. 


Fructose 1,6-bisphosphate + H₂O → fructose 6-phosphate + Pᵢ 


3. Glucose is formed by the hydrolysis of glucose 6-phosphate in a reaction 
catalyzed by glucose 6-phosphatase. 


Glucose 6-phosphate + H₂O → glucose + Pᵢ 


We will examine each of these steps in turn. 

Figure 16.25 Domain structure of 
pyruvate carboxylase. The ATP-grasp 
domain activates HCO₃⁻ and transfers CO₂ 
to the biotin-binding domain. From there, 
the CO₂ is transferred to pyruvate generated 
in the central domain. 


The conversion of pyruvate into phosphoenolpyruvate begins with the 
formation of oxaloacetate 


The first step in gluconeogenesis is the carboxylation of pyruvate to form 
oxaloacetate at the expense of a molecule of ATP. 

Then, oxaloacetate is decarboxylated and phosphorylated to yield 
phosphoenolpyruvate, at the expense of the high phosphoryl-transfer potential 
of GTP. 

The first of these reactions takes place inside the mitochondria. 

The first reaction is catalyzed by pyruvate carboxylase and the second 
by phosphoenolpyruvate carboxykinase (PEPCK). The sum of these reac- 
tions is 


Pyruvate + ATP + GTP + H₂O ⇌
    phosphoenolpyruvate + ADP + GDP + Pᵢ + 2 H⁺
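
The summation can be checked by simple bookkeeping. The sketch below is only an illustration: it writes each of the two reactions given above as a signed tally of species (negative for reactants, positive for products) and adds them, so that oxaloacetate and CO2 cancel. The names `pyruvate_carboxylase`, `pepck`, and `net_reaction` are invented for this example.

```python
from collections import Counter

# Signed stoichiometric coefficients: reactants negative, products positive.
pyruvate_carboxylase = {
    "pyruvate": -1, "CO2": -1, "ATP": -1, "H2O": -1,
    "oxaloacetate": +1, "ADP": +1, "Pi": +1, "H+": +2,
}
pepck = {
    "oxaloacetate": -1, "GTP": -1,
    "phosphoenolpyruvate": +1, "GDP": +1, "CO2": +1,
}

def net_reaction(*reactions):
    """Add reactions species by species and drop anything that cancels."""
    total = Counter()
    for reaction in reactions:
        for species, coeff in reaction.items():
            total[species] += coeff
    return {s: c for s, c in total.items() if c != 0}

net = net_reaction(pyruvate_carboxylase, pepck)
print(net)
```

Oxaloacetate and CO2 drop out of the tally because each is produced in one reaction and consumed in the other, which leaves the summed equation given in the text.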


Pyruvate carboxylase is of special interest because of its structural, catalytic, 
and allosteric properties. The N-terminal 300 to 350 amino acids form an 
ATP-grasp domain (Figure 16.25), which is an ATP-activating domain 
found in many enzymes, to be considered in more detail when we examine 
nucleotide biosynthesis (Chapter 25). The C-terminal 80 amino acids 
constitute a biotin-binding domain (Figure 16.26) that we will see again in 
fatty acid synthesis (Section 22.4). Biotin is a covalently attached prosthetic
group, which serves as a carrier of activated CO₂. The carboxylate group of
biotin is linked to the ε-amino group of a specific lysine residue by an amide
bond (Figure 16.27). Note that biotin is attached to pyruvate carboxylase
by a long, flexible chain.

The carboxylation of pyruvate takes place in three stages: 


HCO₃⁻ + ATP ⇌ HOCO₂-PO₃²⁻ + ADP
Biotin-enzyme + HOCO₂-PO₃²⁻ ⇌ CO₂-biotin-enzyme + Pᵢ
CO₂-biotin-enzyme + pyruvate ⇌ biotin-enzyme + oxaloacetate


Recall that, in aqueous solutions, CO₂ exists primarily as HCO₃⁻ with
the aid of carbonic anhydrase (Section 9.2). HCO₃⁻ is activated to
carboxyphosphate. This activated CO₂ is subsequently bonded to the N-1 atom
of the biotin ring to form the carboxybiotin-enzyme intermediate (see
Figure 16.27). The CO₂ attached to biotin is quite activated. The ΔG°′ for
its cleavage

CO₂-biotin-enzyme + H⁺ → CO₂ + biotin-enzyme

is −20 kJ mol⁻¹ (−4.7 kcal mol⁻¹). This negative ΔG°′ indicates that
carboxybiotin is able to transfer CO₂ to acceptors without the input of
additional free energy.

The activated carboxyl group is then transferred from carboxybiotin to 
pyruvate to form oxaloacetate. The long, flexible link between biotin and 
the enzyme enables this prosthetic group to rotate from one active site of the 
enzyme (the ATP-bicarbonate site) to the other (the pyruvate site). 

The first partial reaction of pyruvate carboxylase, the formation of 
carboxybiotin, depends on the presence of acetyl CoA. Biotin is not 
carboxylated unless acetyl CoA is bound to the enzyme. Acetyl CoA has no 
effect on the second partial reaction. The allosteric activation of pyruvate 
carboxylase by acetyl CoA is an important physiological control mechanism 
that will be discussed in Section 17.4. 


Oxaloacetate is shuttled into the cytoplasm and converted 
into phosphoenolpyruvate 


Pyruvate carboxylase is a mitochondrial enzyme, whereas the other enzymes 
of gluconeogenesis are present primarily in the cytoplasm. Oxaloacetate, 
the product of the pyruvate carboxylase reaction, must thus be transported 
to the cytoplasm to complete the pathway. Oxaloacetate is transported from 
a mitochondrion in the form of malate: oxaloacetate is reduced to malate 
inside the mitochondrion by an NADH-linked malate dehydrogenase. 
After malate has been transported across the mitochondrial membrane, it is 
reoxidized to oxaloacetate by an NAD⁺-linked malate dehydrogenase in the
cytoplasm (Figure 16.28). The formation of oxaloacetate from malate also
provides NADH for use in subsequent steps in gluconeogenesis. Finally,
oxaloacetate is simultaneously decarboxylated and phosphorylated by
phosphoenolpyruvate carboxykinase to generate phosphoenolpyruvate.
The phosphoryl donor is GTP. The CO₂ that was added to pyruvate by
pyruvate carboxylase comes off in this step.

Figure 16.26 Biotin-binding domain
of pyruvate carboxylase. This likely
structure is based on the structure of the
homologous domain of the enzyme acetyl
CoA carboxylase (Section 22.4). Notice that
the biotin is on a flexible tether, allowing it to
move between the ATP-bicarbonate site and
the pyruvate site. [Drawn from 1BDO.pdb.]

Figure 16.27 Structure of carboxybiotin.

Figure 16.28 Compartmental
cooperation. Oxaloacetate used in the
cytoplasm for gluconeogenesis is formed in
the mitochondrial matrix by the carboxylation
of pyruvate. Oxaloacetate leaves the
mitochondrion by a specific transport system
(not shown) in the form of malate, which is
reoxidized to oxaloacetate in the cytoplasm.

Why are a carboxylation and a decarboxylation required to form
phosphoenolpyruvate from pyruvate? Recall that, in glycolysis, the presence of a
phosphoryl group traps the unstable enol isomer of pyruvate as
phosphoenolpyruvate (p. 465). However, the addition of a phosphoryl group to
pyruvate is a highly unfavorable reaction: the ΔG°′ of the reverse of the
glycolytic reaction catalyzed by pyruvate kinase is +31 kJ mol⁻¹ (+7.5 kcal
mol⁻¹). In gluconeogenesis, the use of the carboxylation and decarboxylation
steps results in a much more favorable ΔG°′. The formation of
phosphoenolpyruvate from pyruvate in the gluconeogenic pathway has a ΔG°′
of +0.8 kJ mol⁻¹ (+0.2 kcal mol⁻¹). A molecule of ATP is used to power
the addition of a molecule of CO₂ to pyruvate in the carboxylation step.
That CO₂ is then removed to power the formation of phosphoenolpyruvate
in the decarboxylation step. Decarboxylations often drive reactions that are
otherwise highly endergonic. This metabolic motif is used in the citric acid
cycle (Chapter 17), the pentose phosphate pathway (Chapter 20), and fatty
acid synthesis (Section 22.4).
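
To see how much the two routes differ, the standard free-energy changes quoted above can be turned into equilibrium constants with the standard relation ΔG°′ = −RT ln K′eq. The following is a minimal sketch, assuming a temperature of 25 °C (298 K); the function name `k_eq` and the variable names are chosen for this example only.

```python
import math

R = 8.315e-3   # gas constant in kJ mol^-1 K^-1
T = 298.0      # assumed temperature: 25 degrees C, in kelvin

def k_eq(delta_g_standard_kj):
    """Equilibrium constant from a standard free-energy change in kJ/mol."""
    return math.exp(-delta_g_standard_kj / (R * T))

direct = k_eq(31.0)   # simple reversal of the pyruvate kinase reaction
bypass = k_eq(0.8)    # carboxylation followed by decarboxylation
print(f"direct reversal: K'eq = {direct:.1e}")
print(f"bypass route:    K'eq = {bypass:.2f}")
print(f"ratio bypass/direct = {bypass / direct:.1e}")
```

Under these assumptions, the equilibrium constant for the bypass is on the order of 10⁵ times as large as that for the direct reversal. That factor is the thermodynamic contribution of spending one additional nucleoside triphosphate.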


The conversion of fructose 1,6-bisphosphate into fructose 6-phosphate 
and orthophosphate is an irreversible step 


On formation, phosphoenolpyruvate is metabolized by the enzymes of 
glycolysis but in the reverse direction. These reactions are near equilibrium 
under intracellular conditions; so, when conditions favor gluconeogenesis, 
the reverse reactions will take place until the next irreversible step is 
reached. This step is the hydrolysis of fructose 1,6-bisphosphate to fructose 
6-phosphate and P;. 


Fructose 1,6-bisphosphatase
Fructose 1,6-bisphosphate + H₂O → fructose 6-phosphate + Pᵢ


The enzyme responsible for this step is fructose 1,6-bisphosphatase. Like 
its glycolytic counterpart, it is an allosteric enzyme that participates in the 
regulation of gluconeogenesis. We will return to its regulatory properties 
later in the chapter. 


The generation of free glucose is an important control point 


The fructose 6-phosphate generated by fructose 1,6-bisphosphatase is read- 
ily converted into glucose 6-phosphate. In most tissues, gluconeogenesis 
ends here. Free glucose is not generated; rather, the glucose 6-phosphate is 
processed in some other fashion, notably to form glycogen. One advantage 
to ending gluconeogenesis at glucose 6-phosphate is that, unlike free glu- 
cose, the molecule is not transported out of the cell. To keep glucose inside 
the cell, the generation of free glucose is controlled in two ways. First, the 
enzyme responsible for the conversion of glucose 6-phosphate into glucose, 
glucose 6-phosphatase, is regulated. Second, the enzyme is present only in 
tissues whose metabolic duty is to maintain blood-glucose homeostasis— 
tissues that release glucose into the blood. These tissues are the liver and to 
a lesser extent the kidney. 

This final step in the generation of glucose does not take place in the 
cytoplasm. Rather, glucose 6-phosphate is transported into the lumen of 
the endoplasmic reticulum, where it is hydrolyzed to glucose by glucose
6-phosphatase, which is bound to the membrane (Figure 16.29). An associated
Ca²⁺-binding stabilizing protein is essential for phosphatase activity.
Glucose and Pᵢ are then shuttled back to the cytoplasm by a pair of
transporters. The glucose transporter in the endoplasmic reticulum membrane
is like those found in the plasma membrane. It is striking that five proteins
are needed to transform cytoplasmic glucose 6-phosphate into glucose.


Six high-transfer-potential phosphoryl groups 
are spent in synthesizing glucose from pyruvate 


Figure 16.29 Generation of glucose from glucose 6-phosphate. Several
endoplasmic reticulum (ER) proteins play a role in the generation of glucose
from glucose 6-phosphate. T1 transports glucose 6-phosphate into the lumen of the
ER, whereas T2 and T3 transport Pᵢ and glucose, respectively, back into the cytoplasm.
Glucose 6-phosphatase is stabilized by a Ca²⁺-binding protein (SP). [After A. Buchell
and I. D. Waddel. Biochem. Biophys. Acta 1092:129-137, 1991.]

The formation of glucose from pyruvate is energetically unfavorable unless
it is coupled to reactions that are favorable. Compare the stoichiometry
of gluconeogenesis with that of the reverse of glycolysis.

The stoichiometry of gluconeogenesis is 


2 Pyruvate + 4 ATP + 2 GTP + 2 NADH + 6 H₂O →
    glucose + 4 ADP + 2 GDP + 6 Pᵢ + 2 NAD⁺ + 2 H⁺
ΔG°′ = −48 kJ mol⁻¹ (−11 kcal mol⁻¹)


In contrast, the stoichiometry for the reversal of glycolysis is 


2 Pyruvate + 2 ATP + 2 NADH + 2 H₂O →
    glucose + 2 ADP + 2 Pᵢ + 2 NAD⁺ + 2 H⁺
ΔG°′ = +84 kJ mol⁻¹ (+20 kcal mol⁻¹)


Note that six nucleoside triphosphate molecules are hydrolyzed to synthe- 
size glucose from pyruvate in gluconeogenesis, whereas only two molecules 
of ATP are generated in glycolysis in the conversion of glucose into pyru- 
vate. Thus, the extra cost of gluconeogenesis is four high-phosphoryl- 
transfer-potential molecules for each molecule of glucose synthesized from 
pyruvate. The four additional molecules having high phosphoryl-transfer 
potential are needed to turn an energetically unfavorable process (the rever- 
sal of glycolysis) into a favorable one (gluconeogenesis). Here we have a 
clear example of the coupling of reactions: NTP hydrolysis is used to power 
an energetically unfavorable reaction. The reactions of gluconeogenesis are 
summarized in Table 16.6. 
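
The accounting in this paragraph can be written out explicitly. The sketch below uses only the numbers quoted in the two stoichiometries above; the constant and function names are illustrative and not part of any standard notation.

```python
# Figures quoted in the text, per molecule of glucose.
NTP_SPENT_GLUCONEOGENESIS = 4 + 2   # 4 ATP + 2 GTP consumed
ATP_GAINED_GLYCOLYSIS = 2           # net ATP yield of glycolysis
DG_GLUCONEOGENESIS = -48.0          # standard free-energy change, kJ/mol
DG_REVERSED_GLYCOLYSIS = 84.0       # standard free-energy change, kJ/mol

def extra_cost():
    """Extra high-phosphoryl-transfer-potential molecules spent per glucose."""
    return NTP_SPENT_GLUCONEOGENESIS - ATP_GAINED_GLYCOLYSIS

def free_energy_shift():
    """Change in standard free energy bought by the extra molecules (kJ/mol)."""
    return DG_GLUCONEOGENESIS - DG_REVERSED_GLYCOLYSIS

print("extra NTP per glucose:", extra_cost())
print("shift in free energy (kJ/mol):", free_energy_shift())
print("shift per extra NTP (kJ/mol):", free_energy_shift() / extra_cost())
```

With these values, the four extra molecules shift the standard free-energy change by −132 kJ mol⁻¹, or about −33 kJ mol⁻¹ apiece. That is of the same order as the standard free energy of hydrolysis of ATP.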


Table 16.6 Reactions of gluconeogenesis

Step  Reaction

 1    Pyruvate + CO₂ + ATP + H₂O → oxaloacetate + ADP + Pᵢ + 2 H⁺
 2    Oxaloacetate + GTP ⇌ phosphoenolpyruvate + GDP + CO₂
 3    Phosphoenolpyruvate + H₂O ⇌ 2-phosphoglycerate
 4    2-Phosphoglycerate ⇌ 3-phosphoglycerate
 5    3-Phosphoglycerate + ATP ⇌ 1,3-bisphosphoglycerate + ADP
 6    1,3-Bisphosphoglycerate + NADH + H⁺ ⇌ glyceraldehyde 3-phosphate + NAD⁺ + Pᵢ
 7    Glyceraldehyde 3-phosphate ⇌ dihydroxyacetone phosphate
 8    Glyceraldehyde 3-phosphate + dihydroxyacetone phosphate ⇌ fructose 1,6-bisphosphate
 9    Fructose 1,6-bisphosphate + H₂O → fructose 6-phosphate + Pᵢ
10    Fructose 6-phosphate ⇌ glucose 6-phosphate
11    Glucose 6-phosphate + H₂O → glucose + Pᵢ

Figure 16.30 Reciprocal regulation of
gluconeogenesis and glycolysis in the 
liver. The level of fructose 2,6-bisphosphate 
is high in the fed state and low in starvation. 
Another important control is the inhibition of 
pyruvate kinase by phosphorylation during 
starvation. 


16.4 Gluconeogenesis and Glycolysis Are 
Reciprocally Regulated 


Gluconeogenesis and glycolysis are coordinated so that, within a cell, one 
pathway is relatively inactive while the other is highly active. If both sets of 
reactions were highly active at the same time, the net result would be the 
hydrolysis of four nucleoside triphosphates (two ATP molecules plus two 
GTP molecules) per reaction cycle. Both glycolysis and gluconeogenesis are 
highly exergonic under cellular conditions, and so there is no thermody- 
namic barrier to such simultaneous activity. However, the amounts and 
activities of the distinctive enzymes of each pathway are controlled so that 
both pathways are not highly active at the same time. The rate of glycolysis 
is also determined by the concentration of glucose, and the rate of gluconeo- 
genesis by the concentrations of lactate and other precursors of glucose. The 
basic premise of the reciprocal regulation is that, when energy is needed, 
glycolysis will predominate. When there is a surplus of energy, gluconeo- 
genesis will take over. 


Energy charge determines whether glycolysis or gluconeogenesis 
will be most active 


The first important regulation site is the interconversion of fructose 
6-phosphate and fructose 1,6-bisphosphate (Figure 16.30). Consider first a 
situation in which energy is needed. In this case, the concentration of AMP 
is high. Under this condition, AMP stimulates phosphofructokinase but 
inhibits fructose 1,6-bisphosphatase. Thus, glycolysis is turned on and 
gluconeogenesis is inhibited. Conversely, high levels of ATP and citrate 
indicate that the energy charge is high and that biosynthetic intermediates 
are abundant. ATP and citrate inhibit phosphofructokinase, whereas citrate
activates fructose 1,6-bisphosphatase. Under these conditions, glycolysis is 
nearly switched off and gluconeogenesis is promoted. Why does citrate take 
part in this regulatory scheme? As we will see in Chapter 17, citrate reports 
on the status of the citric acid cycle, the primary pathway for oxidizing fuels 
in the presence of oxygen. High levels of citrate indicate an energy-rich 
situation and the presence of precursors for biosynthesis. 

Glycolysis and gluconeogenesis are also reciprocally regulated at the 
interconversion of phosphoenolpyruvate and pyruvate in the liver. The 
glycolytic enzyme pyruvate kinase is inhibited by allosteric effectors ATP 
and alanine, which signal that the energy charge is high and that building 
blocks are abundant. Conversely, pyruvate carboxylase, which catalyzes the 
first step in gluconeogenesis from pyruvate, is inhibited by ADP. Likewise, 
ADP inhibits phosphoenolpyruvate carboxykinase. Pyruvate carboxylase is 
activated by acetyl CoA, which, like citrate, indicates that the citric acid 
cycle is producing energy and biosynthetic intermediates (Chapter 17). 
Hence, gluconeogenesis is favored when the cell is rich in biosynthetic 
precursors and ATP. 
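
The allosteric effects described in the preceding two paragraphs can be collected into a small lookup table. This is a sketch for study purposes only: the table lists just the effects named in the text, and the scoring rule in `favored_pathway` is a hypothetical simplification that ignores effector concentrations and enzyme kinetics.

```python
# +1 = activates, -1 = inhibits. Only effects stated in the text are listed.
EFFECTS = {
    "AMP": {"phosphofructokinase": +1, "fructose 1,6-bisphosphatase": -1},
    "ATP": {"phosphofructokinase": -1, "pyruvate kinase": -1},
    "citrate": {"phosphofructokinase": -1, "fructose 1,6-bisphosphatase": +1},
    "alanine": {"pyruvate kinase": -1},
    "ADP": {"pyruvate carboxylase": -1,
            "phosphoenolpyruvate carboxykinase": -1},
    "acetyl CoA": {"pyruvate carboxylase": +1},
}

# Enzymes in the table that belong to glycolysis; the rest are gluconeogenic.
GLYCOLYTIC = {"phosphofructokinase", "pyruvate kinase"}

def favored_pathway(effector):
    """Return the pathway that a high level of the effector favors."""
    score = 0
    for enzyme, sign in EFFECTS[effector].items():
        # Activating a glycolytic enzyme or inhibiting a gluconeogenic one
        # both count toward glycolysis.
        score += sign if enzyme in GLYCOLYTIC else -sign
    return "glycolysis" if score > 0 else "gluconeogenesis"

for name in EFFECTS:
    print(f"{name}: favors {favored_pathway(name)}")
```

In this tally, the signals of a low energy charge (AMP and ADP) favor glycolysis, whereas the signals of abundant energy and building blocks (ATP, citrate, alanine, and acetyl CoA) favor gluconeogenesis.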


The balance between glycolysis and gluconeogenesis in the liver is 
sensitive to blood-glucose concentration 


In the liver, rates of glycolysis and gluconeogenesis are adjusted to maintain 
blood-glucose levels. The signal molecule fructose 2,6-bisphosphate strongly 
stimulates phosphofructokinase (PFK) and inhibits fructose 1,6-bisphosphatase
(p. 475). When blood glucose is low, fructose 2,6-bisphosphate loses a phos- 
phoryl group to form fructose 6-phosphate, which no longer binds to PFK. 
How is the concentration of fructose 2,6-bisphosphate controlled to rise and 
fall with blood-glucose levels? Two enzymes regulate the concentration of 
this molecule: one phosphorylates fructose 6-phosphate and the other 
dephosphorylates fructose 2,6-bisphosphate. Fructose 2,6-bisphosphate is 
formed in a reaction catalyzed by phosphofructokinase 2 (PFK2), a different 
enzyme from phosphofructokinase. Fructose 6-phosphate is formed through 
the hydrolysis of fructose 2,6-bisphosphate by a specific phosphatase, fruc- 
tose bisphosphatase 2 (FBPase2). The striking finding is 
that both PFK2 and FBPase2 are present in a single 55-kd
polypeptide chain (Figure 16.31). This bifunctional enzyme 
contains an N-terminal regulatory domain, followed by a 
kinase domain and a phosphatase domain. PFK2 resembles 
adenylate kinase in having a P-loop NTPase domain 
(Section 9.4), whereas FBPase2 resembles phosphoglycer- 
ate mutase (p. 464). Recall that the mutase is essentially a 
phosphatase. In the bifunctional enzyme, the phosphatase 
activity evolved to become specific for F-2,6-BP. The 
bifunctional enzyme itself probably arose by the fusion of 
genes encoding the kinase and phosphatase domains. 

What controls whether PFK2 or FBPase2 dominates the bifunctional enzyme’s
activities in the liver? The activities of PFK2 and FBPase2 are reciprocally
controlled by phosphorylation of a single serine residue. When glucose is
scarce, such as during a night’s fast, a rise in the blood level of the
hormone glucagon triggers a cyclic AMP signal cascade (Section 14.1), leading
to the phosphorylation of this bifunctional enzyme by protein kinase A
(Figure 16.32). This covalent modification activates FBPase2 and inhibits
PFK2, lowering the level of F-2,6-BP. Gluconeogenesis predominates. Glucose
formed by the liver under these conditions is essential for the viability of
the brain. Glucagon stimulation of protein kinase A also inactivates pyruvate
kinase in the liver (p. 476).

Figure 16.31 Domain structure of the bifunctional enzyme
phosphofructokinase 2. The kinase domain (purple) is fused to
the phosphatase domain (red). The kinase domain is a P-loop NTP
hydrolase domain, as indicated by the purple shading (Section 9.4).
The bar represents the amino acid sequence of the enzyme.
[Drawn from 1BIF.pdb.]

Figure 16.32 Control of the synthesis
and degradation of fructose
2,6-bisphosphate. A low blood-glucose level
as signaled by glucagon leads to the
phosphorylation of the bifunctional enzyme
and hence to a lower level of fructose
2,6-bisphosphate, slowing glycolysis. High
levels of fructose 6-phosphate accelerate the
formation of fructose 2,6-bisphosphate by
facilitating the dephosphorylation of the
bifunctional enzyme.

Figure 16.33 The promoter of the
phosphoenolpyruvate carboxykinase
gene. This promoter is approximately 500 bp
in length and contains regulatory sequences
(response elements) that mediate the action
of several hormones. Abbreviations: IRE,
insulin response element; GRE, glucocorticoid
response element; TRE, thyroid hormone
response element; CREI and CREII, cAMP
response elements. [After M. M. McGrane, J.
S. Jun, Y. M. Patel, and R. W. Hanson. Trends
Biochem. Sci. 17:40-44, 1992.]

Conversely, when blood-glucose levels are high, such as after a meal, 
gluconeogenesis is not needed. Insulin is secreted and initiates a signal path- 
way that activates a protein phosphatase, which removes the phosphoryl 
group from the bifunctional enzyme. This covalent modification activates 
PFK2 and inhibits FBPase2. The resulting rise in the level of F-2,6-BP 
accelerates glycolysis. The coordinated control of glycolysis and gluconeo- 
genesis is facilitated by the location of the kinase and phosphatase domains 
on the same polypeptide chain as the regulatory domain. 
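
The hormonal switch just described is a two-state system, which the following sketch makes explicit. It is a qualitative summary of the text, not a kinetic model; the function names and the string labels are invented for this example.

```python
def bifunctional_enzyme_state(phosphorylated):
    """Qualitative state of the liver PFK2/FBPase2 enzyme, as in the text."""
    if phosphorylated:
        # Glucagon -> cyclic AMP cascade -> protein kinase A
        return {"PFK2": "inhibited", "FBPase2": "active",
                "F-2,6-BP": "falls", "favored": "gluconeogenesis"}
    # Insulin -> protein phosphatase removes the phosphoryl group
    return {"PFK2": "active", "FBPase2": "inhibited",
            "F-2,6-BP": "rises", "favored": "glycolysis"}

def liver_response(blood_glucose):
    """Map 'low' or 'high' blood glucose onto the enzyme state."""
    if blood_glucose not in ("low", "high"):
        raise ValueError("blood_glucose must be 'low' or 'high'")
    return bifunctional_enzyme_state(phosphorylated=(blood_glucose == "low"))

for level in ("low", "high"):
    print(level, liver_response(level))
```

Because a single phosphoryl group sets both activities at once, the kinase and the phosphatase cannot both be switched on by the same signal. This is the advantage of having the two domains on one polypeptide chain.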

The hormones insulin and glucagon also regulate the amounts of essen- 
tial enzymes. These hormones alter gene expression primarily by changing 
the rate of transcription. Insulin levels rise subsequent to eating, when there 
is plenty of glucose for glycolysis. To encourage glycolysis, insulin stimu- 
lates the expression of phosphofructokinase, pyruvate kinase, and the 
bifunctional enzyme that makes and degrades F-2,6-BP. Glucagon rises 
during fasting, when gluconeogenesis is needed to replace scarce glucose. To 
encourage gluconeogenesis, glucagon inhibits the expression of the three 
regulated glycolytic enzymes and stimulates instead the production of two 
key gluconeogenic enzymes, phosphoenolpyruvate carboxykinase and 
fructose 1,6-bisphosphatase. Transcriptional control in eukaryotes is much 
slower than allosteric control, taking hours or days instead of seconds to 
minutes. The richness and complexity of hormonal control are graphically 
displayed by the promoter of the phosphoenolpyruvate carboxykinase gene, 
which contains regulatory sequences that respond to insulin, glucagon 
(through the cAMP response elements), glucocorticoids, and thyroid 
hormone (Figure 16.33). 


Substrate cycles amplify metabolic signals and produce heat


A pair of reactions such as the phosphorylation of fructose 6-phosphate to 
fructose 1,6-bisphosphate and its hydrolysis back to fructose 6-phosphate 
is called a substrate cycle. As already mentioned, both reactions are not 
simultaneously fully active in most cells, because of reciprocal allosteric 
controls. However, isotope-labeling studies have shown that some fructose 
6-phosphate is phosphorylated to fructose 1,6-bisphosphate even during 
gluconeogenesis. There also is a limited degree of cycling in other pairs of 
opposed irreversible reactions. This cycling was regarded as an imperfec- 
tion in metabolic control, and so substrate cycles have sometimes been 
called futile cycles. Indeed, there are pathological conditions, such as malig- 
nant hyperthermia, in which control is lost and both pathways proceed 
rapidly. One result is the rapid, uncontrolled hydrolysis of ATP, which 
generates heat. 

Despite such extraordinary circumstances, substrate cycles now seem 
likely to be biologically important. One possibility is that substrate cycles 
amplify metabolic signals. Suppose that the rate of conversion of A into B is 
100 and of B into A is 90, giving an initial net flux of 10. Assume that an 
allosteric effector increases the A → B rate by 20% to 120 and reciprocally
decreases the B → A rate by 20% to 72. The new net flux is 48, and so a 20%
change in the rates of the opposing reactions has led to a 380% increase in 
the net flux. In the example shown in Figure 16.34, this amplification is 
made possible by the rapid hydrolysis of ATP. The flux down the glyco- 
lytic pathway has been suggested to increase as much as 1000-fold at the 
initiation of intense exercise. Because the allosteric activation of enzymes 
alone seems unlikely to explain this increased flux, the existence of substrate 
cycles may partly account for the rapid rise in the rate of glycolysis. 
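
The arithmetic of this example is easy to reproduce. The sketch below recomputes the net flux for the rates given in the text and, for comparison, for a hypothetical one-way reaction with the same starting net flux but no opposing reaction. The function names are chosen for this illustration.

```python
def net_flux(forward, reverse):
    """Net rate of conversion of A into B."""
    return forward - reverse

def amplification(forward, reverse, change):
    """Return the new net flux and its percent change when the forward rate
    rises and the reverse rate falls by the same fraction `change`."""
    before = net_flux(forward, reverse)
    after = net_flux(forward * (1 + change), reverse * (1 - change))
    return after, 100.0 * (after - before) / before

# Substrate cycle from the text: A -> B at 100, B -> A at 90.
cycle_flux, cycle_percent = amplification(100.0, 90.0, 0.20)

# Hypothetical comparison: the same initial net flux of 10 with no cycling.
plain_flux, plain_percent = amplification(10.0, 0.0, 0.20)

print(f"with cycle:    net flux {cycle_flux:.0f}, change {cycle_percent:.0f}%")
print(f"without cycle: net flux {plain_flux:.0f}, change {plain_percent:.0f}%")
```

The same 20% change in enzyme rates raises the net flux by 380% when the opposing reaction is running, but by only 20% when it is not. The price of this sensitivity is the ATP hydrolyzed by the cycling itself.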

The other potential biological role of substrate cycles is the generation of 
heat produced by the hydrolysis of ATP. In European bumblebees, cycling 
is used for both signal amplification and heat generation. Phosphofruc- 
tokinase and fructose 1,6-bisphosphatase in a bee’s flight muscle are simul- 
taneously active. The cycling augments other means of thermogenesis, 
such as shivering, and amplifies the flux down the glycolytic pathway in 
preparation for the transition from rest to flight. 


Lactate and alanine formed by contracting muscle are 
used by other organs 


Lactate produced by active skeletal muscle and erythrocytes is a source of 
energy for other organs. Erythrocytes lack mitochondria and can never oxi- 
dize glucose completely. In contracting skeletal muscle during vigorous 
exercise, the rate at which glycolysis produces pyruvate exceeds the rate at 
which the citric acid cycle oxidizes it. In these cells, lactate dehydrogenase 
reduces excess pyruvate to lactate to restore redox balance (p. 466). However, 
lactate is a dead end in metabolism. It must be converted back into pyruvate 
before it can be metabolized. Both pyruvate and lactate diffuse out of these 
cells through carriers into the blood. In contracting skeletal muscle, the 
formation and release of lactate lets the muscle generate ATP in the absence of 
oxygen and shifts the burden of metabolizing lactate from muscle to other 
organs. The pyruvate and lactate in the bloodstream have two fates. In one 
fate, the plasma membranes of some cells, particularly cells in cardiac 
muscle, contain carriers that make the cells highly permeable to lactate and 
pyruvate. These molecules diffuse from the blood into such permeable cells. 
Once inside these well-oxygenated cells, lactate can be converted back into
pyruvate and metabolized through the citric acid cycle and oxidative
phosphorylation to generate ATP. The use of lactate in place of glucose by these
cells makes more circulating glucose available to the active muscle cells. In
the other fate, excess lactate enters the liver and is converted first into
pyruvate and then into glucose by the gluconeogenic pathway. Contracting
skeletal muscle supplies lactate to the liver, which uses it to synthesize and
release glucose. Thus, the liver restores the level of glucose necessary for
active muscle cells, which derive ATP from the glycolytic conversion of glucose
into lactate. These reactions constitute the Cori cycle (Figure 16.35).

Figure 16.34 Substrate cycle. This
ATP-driven cycle operates at two different
rates. A small change in the rates of the two
opposing reactions results in a large change
in the net flux of product B.

Figure 16.35 The Cori cycle. Lactate formed by active muscle is
converted into glucose by the liver. This cycle shifts part of the
metabolic burden of active muscle to the liver.

Figure 16.36 PATHWAY INTEGRATION:
Cooperation between glycolysis and
gluconeogenesis during a sprint.
Glycolysis and gluconeogenesis are
coordinated, in a tissue-specific fashion, to
ensure that the energy needs of all cells are
met. Consider a sprinter. In skeletal leg
muscle, glucose will be metabolized
aerobically to CO₂ and H₂O or, more likely
(thick arrows) during a sprint, anaerobically to
lactate. In cardiac muscle, the lactate can be
converted into pyruvate and used as a fuel,
along with glucose, to power the heartbeats to
keep the sprinter’s blood flowing.
Gluconeogenesis, a primary function of the
liver, will be taking place rapidly (thick arrows)
to ensure that enough glucose is present in
the blood for skeletal and cardiac muscle, as
well as for other tissues. Glycogen, glycerol,
and amino acids are other sources of energy
that we will learn about in later chapters.

Studies have shown that alanine, like lactate, is a major 
precursor of glucose in the liver. The alanine is generated in 
muscle when the carbon skeletons of some amino acids are 
used as fuels. The nitrogens from these amino acids are 
transferred to pyruvate to form alanine; the reverse reaction 
takes place in the liver. This process also helps maintain 
nitrogen balance. The interplay between glycolysis and gluconeogenesis
is summarized in Figure 16.36, which shows how these pathways
help meet the energy needs of different cell types.


Isozymic forms of lactate dehydrogenase in different tissues catalyze
the interconversions of pyruvate and lactate (Section 10.2). Lactate
dehydrogenase is a tetramer of two kinds of 35-kd subunits encoded by
similar genes: the H type predominates in the heart, and the homologous
M type in skeletal muscle and the liver. These subunits associate to form
five types of tetramers: H₄, H₃M₁, H₂M₂, H₁M₃, and M₄. The H₄ isozyme
(type 1) has a higher affinity for substrates than does the M₄ isozyme
(type 5) and, unlike M₄, is allosterically inhibited by high levels of
pyruvate. The other isozymes have intermediate properties, depending on
the ratio of the two kinds of chains. The H₄ isozyme oxidizes lactate to
pyruvate, which is then used as a fuel by the heart through aerobic
metabolism. Indeed, heart muscle never functions anaerobically. In contrast, M₄
is optimized to operate in the reverse direction, to convert pyruvate into
lactate to allow glycolysis to proceed under anaerobic conditions. We see
here an example of how gene duplication and divergence generate a series of
homologous enzymes that foster metabolic cooperation between organs.

Glycolysis and gluconeogenesis are evolutionarily intertwined 


The metabolism of glucose has ancient origins. Organisms living in the
early biosphere depended on the anaerobic generation of energy until
significant amounts of oxygen began to accumulate 2 billion years ago.
Glycolytic enzymes were most likely derived independently rather than by
gene duplication, because glycolytic enzymes with similar properties do not
have similar amino acid sequences. Although there are four kinases and two
isomerases in the pathway, neither sequence nor structural comparisons
suggest that these sets of enzymes are related to one another by divergent
evolution. The common dinucleotide-binding domain found in the
dehydrogenases (see Figure 16.12) and the αβ barrels are the only major
recurring elements.

We can speculate on the relationship between glycolysis and gluconeo- 
genesis if we think of glycolysis as consisting of two segments: the metabo- 
lism of hexoses (the upper segment) and the metabolism of trioses (the 
lower segment). The enzymes of the upper segment are different in some 
species and are missing entirely in some archaea, whereas enzymes of the 
lower segment are quite conserved. In fact, four enzymes of the lower 
segment are present in all species. This lower part of the pathway is common 
to glycolysis and gluconeogenesis. This common part of the two pathways 
may be the oldest part, constituting the core to which the other steps 
were added. The upper part would have varied according to the sugars that 
were available to evolving organisms in particular niches. Interestingly, this 
core part of carbohydrate metabolism can generate triose precursors for 
ribose sugars, a component of RNA and a critical requirement for the RNA 
world. Thus, we are left with the unanswered question, Was the original 
core pathway used for energy conversion or biosynthesis? 


16.1 Glycolysis Is an Energy-Conversion Pathway in Many Organisms
Glycolysis is the set of reactions that converts glucose into pyruvate. 
The 10 reactions of glycolysis take place in the cytoplasm. In the first 
stage, glucose is converted into fructose 1,6-bisphosphate by a phos- 
phorylation, an isomerization, and a second phosphorylation reaction. 
Fructose 1,6-bisphosphate is then cleaved by aldolase into dihydroxy- 
acetone phosphate and glyceraldehyde 3-phosphate, which are readily 
interconvertible. Two molecules of ATP are consumed per molecule 
of glucose in these reactions, which are the prelude to the net synthesis 
of ATP. In the second stage, ATP is generated. Glyceraldehyde 
3-phosphate is oxidized and phosphorylated to form 1,3-bisphospho- 
glycerate, an acyl phosphate with a high phosphoryl-transfer poten- 
tial. This molecule transfers a phosphoryl group to ADP to form ATP 
and 3-phosphoglycerate. A phosphoryl shift and a dehydration form 
phosphoenolpyruvate, a second intermediate with a high phosphoryl- 
transfer potential. Another molecule of ATP is generated as phospho- 
enolpyruvate is converted into pyruvate. There is a net gain of two 
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molecules of ATP in the formation of two molecules of pyruvate from 
one molecule of glucose. 

The electron acceptor in the oxidation of glyceraldehyde 3-phosphate
is NAD+, which must be regenerated for glycolysis to continue.
In aerobic organisms, the NADH formed in glycolysis transfers its
electrons to O2 through the electron-transport chain, which thereby
regenerates NAD+. Under anaerobic conditions and in some
microorganisms, NAD+ is regenerated by the reduction of pyruvate to
lactate. In other microorganisms, NAD+ is regenerated by the
reduction of pyruvate to ethanol. These two processes are examples of
fermentations. 
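
The need to regenerate NAD+ can be made concrete by tallying NADH across each route. The sketch below is a minimal illustration under the stoichiometry stated above (two glyceraldehyde 3-phosphate oxidized per glucose, two pyruvate reduced in either fermentation); the list and function names are hypothetical.

```python
# Redox bookkeeping per molecule of glucose.
# Each entry is (description, change in NADH); names are illustrative only.
GLYCOLYSIS = [
    ("2 glyceraldehyde 3-phosphate oxidized, 2 NAD+ reduced", +2),
]
LACTIC_FERMENTATION = GLYCOLYSIS + [
    ("2 pyruvate reduced to lactate", -2),
]
ALCOHOLIC_FERMENTATION = GLYCOLYSIS + [
    ("2 pyruvate converted into ethanol", -2),
]


def net_nadh(steps):
    """Sum the change in NADH over a list of (description, delta) steps."""
    return sum(delta for _, delta in steps)


for name, steps in [
    ("glycolysis alone", GLYCOLYSIS),
    ("lactic acid fermentation", LACTIC_FERMENTATION),
    ("alcoholic fermentation", ALCOHOLIC_FERMENTATION),
]:
    print(f"{name}: net NADH = {net_nadh(steps)}")
```

A net value of zero means that the NAD+ consumed in glycolysis has been fully restored, so the pathway can continue without oxygen.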


16.2 The Glycolytic Pathway Is Tightly Controlled 


The glycolytic pathway has a dual role: it degrades glucose to generate 
ATP, and it provides building blocks for the synthesis of cellular 
components. The rate of conversion of glucose into pyruvate is regu- 
lated to meet these two major cellular needs. Under physiological 
conditions, the reactions of glycolysis are readily reversible except for 
those catalyzed by hexokinase, phosphofructokinase, and pyruvate 
kinase. Phosphofructokinase, the most important control element in 
glycolysis, is inhibited by high levels of ATP and citrate, and it is 
activated by AMP and fructose 2,6-bisphosphate. In the liver, this 
bisphosphate signals that glucose is abundant. Hence, phosphofruc- 
tokinase is active when either energy or building blocks are needed. 
Hexokinase is inhibited by glucose 6-phosphate, which accumulates 
when phosphofructokinase is inactive. ATP and alanine allosterically 
inhibit pyruvate kinase, the other control site, and fructose 1,6-bis- 
phosphate activates the enzyme. Consequently, pyruvate kinase is 
maximally active when the energy charge is low and glycolytic 
intermediates accumulate. 


16.3 Glucose Can Be Synthesized from Noncarbohydrate Precursors 


Gluconeogenesis is the synthesis of glucose from noncarbohydrate 
sources, such as lactate, amino acids, and glycerol. Several of the reac- 
tions that convert pyruvate into glucose are common to glycolysis. 
Gluconeogenesis, however, requires four new reactions to bypass the 
essential irreversibility of three reactions in glycolysis. In two of the 
new reactions, pyruvate is carboxylated in mitochondria to oxaloace- 
tate, which in turn is decarboxylated and phosphorylated in the 
cytoplasm to phosphoenolpyruvate. Two molecules having high 
phosphoryl-transfer potential are consumed in these reactions, which 
are catalyzed by pyruvate carboxylase and phosphoenolpyruvate 
carboxykinase. Pyruvate carboxylase contains a biotin prosthetic 
group. The other distinctive reactions of gluconeogenesis are the 
hydrolyses of fructose 1,6-bisphosphate and glucose 6-phosphate, 
which are catalyzed by specific phosphatases. The major raw materials 
for gluconeogenesis by the liver are lactate and alanine produced from 
pyruvate by active skeletal muscle. The formation of lactate during 
intense muscular activity buys time and shifts part of the metabolic 
burden from muscle to the liver. 


16.4 Gluconeogenesis and Glycolysis Are Reciprocally Regulated 


Gluconeogenesis and glycolysis are reciprocally regulated so that one
pathway is relatively inactive while the other is highly active.
Phosphofructokinase and fructose 1,6-bisphosphatase are key control
points. Fructose 2,6-bisphosphate, an intracellular signal molecule
present at higher levels when glucose is abundant, activates glycolysis
and inhibits gluconeogenesis by regulating these enzymes. Pyruvate
kinase and pyruvate carboxylase are regulated by other effectors so that
both are not maximally active at the same time. Allosteric regulation
and reversible phosphorylation, which are rapid, are complemented by
transcriptional control, which takes place in hours or days. 


Key Terms 


glycolysis (p. 453) substrate-level 
lactic acid fermentation (p. 453) 
alcoholic fermentation (p. 453) 


Problems 


1. Gross versus net. The gross yield of ATP from the metab- 
olism of glucose to two molecules of pyruvate is four mole- 
cules of ATP. However, the net yield is only two molecules 
of ATP. Why are the gross and net values different? 


2. Who takes? Who gives? Lactic acid fermentation and 
alcoholic fermentation are oxidation-reduction reactions. 
Identify the ultimate electron donor and electron acceptor. 


3. ATP yield. Each of the following molecules is processed 
by glycolysis to lactate. How much ATP is generated from 
each molecule? 


(a) Glucose 6-phosphate 

(b) Dihydroxyacetone phosphate 

(c) Glyceraldehyde 3-phosphate 

(d) Fructose 

(e) Sucrose 

4. Enzyme redundancy? Why is it advantageous for the liver 
to have both hexokinase and glucokinase to phosphorylate 
glucose? 


5. Corporate sponsors. Some of the early research on glyco- 
lysis was supported by the brewing industry. Why would 
the brewing industry be interested in glycolysis? 


6. Recommended daily allowance. The recommended daily 
allowance for the vitamin niacin is 15 mg per day. How 
would glycolysis be affected by niacin deficiency? 


7. Who’s on first? Although both hexokinase and phospho- 
fructokinase catalyze irreversible steps in glycolysis and the 
hexokinase-catalyzed step is first, phosphofructokinase is 
nonetheless the pacemaker of glycolysis. What does 
this information tell you about the fate of the glucose 
6-phosphate formed by hexokinase? 


8. The tortoise and the hare. Why is the regulation of phos- 
phofructokinase by energy charge not as important in the 
liver as it is in muscle? 


9. Running in reverse. Why can’t the reactions of the 
glycolytic pathway simply be run in reverse to synthesize 
glucose? 


10. Road blocks. What reactions of glycolysis are not 
readily reversible under intracellular conditions? 


11. No pickling. Why is it in the muscle’s best interest to 
export lactic acid into the blood during intense exercise? 


12. Après vous. Why is it physiologically advantageous for
the pancreas to use GLUT 2, with a high KM, as the
transporter that allows glucose entry into β cells? 


13. Bypass. In the liver, fructose can be converted into glyc- 
eraldehyde 3-phosphate and dihydroxyacetone phosphate 
without passing through the phosphofructokinase-regulat- 
ed reaction. Show the reactions that make this conversion 
possible. Why might ingesting high levels of fructose have 
deleterious physiological effects? 


14. Trouble ahead. Suppose that a microorganism that was 
an obligate anaerobe suffered a mutation that resulted in the 
loss of triose phosphate isomerase activity. How would this 
loss affect the ATP yield of fermentation? Could such an 
organism survive? 


15. Kitchen chemistry. Sucrose is commonly used to preserve 
fruits. Why is glucose not suitable for preserving foods? 


16. Tracing carbon atoms 1. Glucose labeled with ¹⁴C at C-1
is incubated with the glycolytic enzymes and necessary
cofactors. 

(a) What is the distribution of ¹⁴C in the pyruvate that is
formed? (Assume that the interconversion of glyceraldehyde
3-phosphate and dihydroxyacetone phosphate is very
rapid compared with the subsequent step.) 

(b) If the specific activity of the glucose substrate is 10 mCi
mmol⁻¹ (millicuries per millimole, a measure of radioactivity
per mole), what is the specific activity of the pyruvate that
is formed? 


17. Lactic acid fermentation. (a) Write a balanced equation
for the conversion of glucose into lactate. (b) Calculate the
standard free-energy change of this reaction by using the
data given in Table 16.1 and the fact that ΔG°′ is −25 kJ
mol⁻¹ (−6 kcal mol⁻¹) for the following reaction: 


Pyruvate + NADH + H+ ⇌ lactate + NAD+ 


What is the free-energy change (ΔG, not ΔG°′) of this
reaction when the concentrations of reactants are: glucose,
5 mM; lactate, 0.05 mM; ATP, 2 mM; ADP, 0.2 mM; and
Pi, 1 mM? 


18. High potential. What is the equilibrium ratio of
phosphoenolpyruvate to pyruvate under standard conditions
when [ATP]/[ADP] = 10? 


19. Hexose-triose equilibrium. What are the equilibrium 
concentrations of fructose 1,6-bisphosphate, dihydroxyac- 
etone phosphate, and glyceraldehyde 3-phosphate when 
1 mM fructose 1,6-bisphosphate is incubated with aldolase 
under standard conditions? 


20. Double labeling. 3-Phosphoglycerate labeled uniformly
with ¹⁴C is incubated with 1,3-BPG labeled with ³²P at
C-1. What is the radioisotope distribution of the 2,3-BPG
that is formed on addition of BPG mutase? 


21. An informative analog. Xylose has the same structure as 
that of glucose except that it has a hydrogen atom at C-5 in 
place of a hydroxymethyl group. The rate of ATP hydroly- 
sis by hexokinase is markedly enhanced by the addition of 
xylose. Why? 


22. Distinctive sugars. The intravenous infusion of fructose 
into healthy volunteers leads to a two- to fivefold increase in 
the level of lactate in the blood, a far greater increase 
than that observed after the infusion of the same amount of 
glucose. 


(a) Why is glycolysis more rapid after the infusion of 
fructose? 

(b) Fructose has been used in place of glucose for intrave- 
nous feeding. Why is this use of fructose unwise? 


23. It is not hard to meet expenses. They are everywhere. 
What energetic barrier prevents glycolysis from simply 
running in reverse to synthesize glucose? What is the 
energetic cost to overcome this barrier? 


24. Waste not, want not. Why is the conversion of lactic 
acid from the blood into glucose in the liver in an organism’s 
best interest? 


25. Road blocks bypassed. How are the irreversible reactions 
of glycolysis bypassed in gluconeogenesis? 


26. Pointlessness averted. What are the regulatory means 
that prevent high levels of activity in glycolysis and gluco- 
neogenesis simultaneously? 


27. Different needs. Liver is primarily a gluconeogenic 
tissue, whereas muscle is primarily glycolytic. Why does 
this division of labor make good physiological sense? 


28. Metabolic mutants. What would be the effect on an 
organism’s ability to use glucose as an energy source if a 
mutation inactivated glucose 6-phosphatase in the liver? 


29. Never let me go. Why does the lack of glucose 6-phos- 
phatase activity in the brain and muscle make good physi- 
ological sense? 


30. Counting high-energy compounds 1. How many NTP 
molecules are required for the synthesis of one molecule of 
glucose from two molecules of pyruvate? How many NADH 
molecules? 


31. Counting high-energy compounds 2. How many NTP 
molecules are required to synthesize glucose from each of 
the following compounds? 


(a) Glucose 6-phosphate 

(b) Fructose 1,6-bisphosphate 

(c) Two molecules of oxaloacetate 

(d) Two molecules of dihydroxyacetone phosphate 


32. Lending a hand. How might enzymes that remove 
amino groups from alanine and aspartate contribute to 
gluconeogenesis? 


33. More metabolic mutants. Predict the effect of each of the 
following mutations on the pace of glycolysis in liver cells: 


(a) Loss of the allosteric site for ATP in phosphofructo- 
kinase 

(b) Loss of the binding site for citrate in phosphofructo- 
kinase 

(c) Loss of the phosphatase domain of the bifunctional 
enzyme that controls the level of fructose 2,6-bisphosphate 
(d) Loss of the binding site for fructose 1,6-bisphosphate 
in pyruvate kinase 


34. Yet another metabolic mutant. What are the likely con- 
sequences of a genetic disorder rendering fructose 1,6-bis- 
phosphatase in the liver less sensitive to regulation by 
fructose 2,6-bisphosphate? 


35. Biotin snatcher. Avidin, a 70-kd protein in egg white,
has very high affinity for biotin. In fact, it is a highly
specific inhibitor of biotin enzymes. Which of the following
conversions would be blocked by the addition of avidin
to a cell homogenate? 


(a) Glucose → pyruvate 

(b) Pyruvate → glucose 

(c) Oxaloacetate → glucose 

(d) Malate → oxaloacetate 

(e) Pyruvate → oxaloacetate 

(f) Glyceraldehyde 3-phosphate → fructose 1,6-bisphosphate 


36. Tracing carbon atoms 2. If cells synthesizing glucose
from lactate are exposed to CO2 labeled with ¹⁴C, what
will be the distribution of label in the newly synthesized
glucose? 


37. Arsenate poisoning. Arsenate (AsO₄³⁻) closely resembles
Pi in structure and reactivity. In the reaction catalyzed 
by glyceraldehyde 3-phosphate dehydrogenase, arsenate 
can replace phosphate in attacking the energy-rich thioester 
intermediate. The product of this reaction, 1-arseno-3- 
phosphoglycerate, is unstable. It and other acyl arsenates 
are rapidly and spontaneously hydrolyzed. What is the 
effect of arsenate on energy generation in a cell? 


38. Reduce, reuse, recycle. In the conversion of glucose into 
two molecules of lactate, the NADH generated earlier in the 
pathway is oxidized to NAD+. Why is it not to the cell’s
advantage to simply make more NAD+ so that the
regeneration would not be necessary? After all, the cell would 
save much energy because it would no longer need to syn- 
thesize lactic acid dehydrogenase. 


39. Adenylate kinase again. Adenylate kinase, an enzyme 
considered in great detail in Chapter 9, is responsible for 
interconverting the adenylate nucleotide pool: 


ADP + ADP ⇌ ATP + AMP 


The equilibrium constant for this reaction is close to 1, 
inasmuch as the number of phosphoanhydride bonds is 
the same on each side of the equation. Using the equation 
for the equilibrium constant for this reaction, show why 
changes in [AMP] are a more effective indicator of the 
adenylate pool than [ATP]. 


40. Working at cross-purposes? Gluconeogenesis takes place 
during intense exercise, which seems counterintuitive. Why 
would an organism synthesize glucose and at the same time 
use glucose to generate energy? 


41. Powering pathways. Compare the stoichiometries of 
glycolysis and gluconeogenesis. Recall that the input of one 
ATP equivalent changes the equilibrium constant of a
reaction by a factor of about 10⁸ (Section 15.2). By what factor
do the additional high-phosphoryl-transfer compounds
alter the equilibrium constant of gluconeogenesis? 


Mechanism Problem 


42. Argument by analogy. Propose a mechanism for the con- 
version of glucose 6-phosphate into fructose 6-phosphate 
by phosphoglucose isomerase based on the mechanism of 
triose phosphate isomerase. 


Chapter Integration Problems 


43. Not just for energy. People with galactosemia display 
central nervous system abnormalities even if galactose is 
eliminated from the diet. The precise reason for it is not 
known. Suggest a plausible explanation. 


44. State function. Fructose 2,6-bisphosphate is a potent 
stimulator of phosphofructokinase. Explain how fructose 
2,6-bisphosphate might function in the concerted model for 
allosteric enzymes. 


Data Interpretation Problems 


45. Now, that’s unusual. Phosphofructokinase has recently 
been isolated from the hyperthermophilic archaeon Pyro- 
coccus furiosus. It was subjected to standard biochemical 
analysis to determine basic catalytic parameters. The pro- 
cesses under study were of the form 


Fructose 6-phosphate + (x–Pi) → fructose 1,6-bisphosphate + (x) 


The assay measured the increase in fructose 1,6-bisphosphate.
Selected results are shown in the adjoining graph.
[Data from J. E. Tuininga et al. J. Biol. Chem. 274:21023-21028, 1999.] 


(a) How does the P. furiosus phosphofructokinase differ 
from the phosphofructokinase considered in this chapter? 
(b) What effects do AMP and ATP have on the reaction 
with ADP? 


46. Cool bees. In principle, a futile cycle that includes
phosphofructokinase and fructose 1,6-bisphosphatase could be
used to generate heat. The heat could be used to warm
tissues. For instance, certain bumblebees have been reported
to use such a futile cycle to warm their flight muscles on
cool mornings. 

[Figure: enzyme activity (units g⁻¹ thorax) of PFK and FBPase in
several bumblebee species. After J. F. Staples, E. L. Koen, and
T. M. Laverty, J. Exp. Biol. 207:749-754, 2004, p. 751.] 

Scientists undertook a series of experiments to deter- 
mine if a number of species of bumblebee use this futile 
cycle. Their approach was to measure the activity of PFK 
and F-1,6-BPase in flight muscle. 


(a) What was the rationale for comparing the activities of 
these two enzymes? 

(b) The data at the left show the activities of both enzymes
for a variety of bumblebee species (genera Bombus and
Psithyrus). Do these results support the notion that
bumblebees use futile cycles to generate heat? Explain. 

(c) In which species might futile cycling take place? Explain
your reasoning. 

(d) Do these results prove that futile cycling does not 
participate in heat generation? 


The Citric Acid Cycle 


The metabolism of glucose to pyruvate in glycolysis, an anaerobic process,
harvests but a fraction of the ATP available from glucose. Most of the 
ATP generated in metabolism is provided by the aerobic processing of glu- 
cose. This process starts with the complete oxidation of glucose derivatives 
to carbon dioxide. This oxidation takes place in a series of reactions called 
the citric acid cycle, also known as the tricarboxylic acid (TCA) cycle or the 
Krebs cycle. The citric acid cycle is the final common pathway for the oxida- 
tion of fuel molecules—carbohydrates, fatty acids, and amino acids. Most 
fuel molecules enter the cycle as acetyl coenzyme A. 

Roundabouts, or traffic circles, function as hubs to
facilitate traffic flow. The citric acid cycle is the
biochemical hub of the cell, oxidizing carbon fuels,
usually in the form of acetyl CoA, as well as serving as a
source of precursors for biosynthesis. [(Left) Lynn
Saville/Getty Images.] 

Figure 17.1 Mitochondrion. The double
membrane of the mitochondrion is evident in
this electron micrograph. The numerous
invaginations of the inner mitochondrial
membrane are called cristae. The oxidative
decarboxylation of pyruvate and the sequence
of reactions in the citric acid cycle take place
within the matrix. [(Left) Omikron/Photo
Researchers.] 

Figure 17.2 Overview of the citric acid
cycle. The citric acid cycle oxidizes
two-carbon units, producing two molecules
of CO2, one molecule of ATP, and high-energy
electrons in the form of NADH and FADH2. 

Under aerobic conditions, the pyruvate generated from glucose is
oxidatively decarboxylated to form acetyl CoA. In eukaryotes, the
reactions of the citric acid cycle take place inside mitochondria
(Figure 17.1), in contrast with those of glycolysis, which take place in
the cytoplasm. 


The citric acid cycle harvests high-energy electrons 


The citric acid cycle is the central metabolic hub of the cell. It is the gateway 
to the aerobic metabolism of any molecule that can be transformed into an 
acetyl group or a component of the citric acid cycle. The cycle is also an 
important source of precursors for the building blocks of many other mol- 
ecules such as amino acids, nucleotide bases, and porphyrin (the organic 
component of heme). The citric acid cycle component, oxaloacetate, is also 
an important precursor to glucose (Section 16.3). 

What is the function of the citric acid cycle in transforming fuel mole- 
cules into ATP? Recall that fuel molecules are carbon compounds that are 
capable of being oxidized—that is, of losing electrons (Chapter 15). The 
citric acid cycle includes a series of oxidation-reduction reactions that result 
in the oxidation of an acetyl group to two molecules of carbon dioxide. This 
oxidation generates high-energy electrons that will be used to power the 
synthesis of ATP. The function of the citric acid cycle is the harvesting of high- 
energy electrons from carbon fuels. 

The overall pattern of the citric acid cycle is shown in Figure 17.2. A 
four-carbon compound (oxaloacetate) condenses with a two-carbon acetyl 
unit to yield a six-carbon tricarboxylic acid. The six-carbon compound 
releases CO2 twice in two successive oxidative decarboxylations that yield 
high-energy electrons. A four-carbon compound remains. This four-carbon 
compound is further processed to regenerate oxaloacetate, which can initiate 
another round of the cycle. Two carbon atoms enter the cycle as an acetyl 
unit and two carbon atoms leave the cycle in the form of two molecules 
of CO2. 
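
The carbon accounting in this paragraph can be followed step by step. The Python sketch below is a simple illustration, not a model of the individual enzymatic reactions; the function name and the step labels are our own.

```python
def trace_carbons():
    """Follow the carbon count of the cycle intermediate through one turn."""
    carbons = 4  # oxaloacetate, the four-carbon acceptor
    co2_released = 0
    history = [("oxaloacetate", carbons)]

    carbons += 2  # condensation with a two-carbon acetyl unit
    history.append(("six-carbon tricarboxylic acid", carbons))

    # Two successive oxidative decarboxylations, each releasing one CO2.
    for n in (1, 2):
        carbons -= 1
        co2_released += 1
        history.append((f"after decarboxylation {n}", carbons))

    # The remaining four-carbon compound is processed back to
    # oxaloacetate with no further change in carbon count.
    history.append(("oxaloacetate regenerated", carbons))
    return history, co2_released


history, co2 = trace_carbons()
for label, count in history:
    print(f"{label}: {count} C")
print("CO2 released:", co2)
```

The count returns to four carbons at the end of the turn, which is why one molecule of oxaloacetate can be used again for the next acetyl unit.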

Note that the citric acid cycle itself neither generates a large amount of 
ATP nor includes oxygen as a reactant (Figure 17.3). Instead, the citric acid 
cycle removes electrons from acetyl CoA and uses these electrons to form 
NADH and FADH2. Three hydride ions (hence, six electrons) are
transferred to three molecules of nicotinamide adenine dinucleotide (NAD+),
and one pair of hydrogen atoms (hence, two electrons) is transferred to one
molecule of flavin adenine dinucleotide (FAD). These electron carriers
yield nine molecules of ATP when they are oxidized by O2 in oxidative
phosphorylation (Chapter 18). Electrons released in the reoxidation of
NADH and FADH2 flow through a series of membrane proteins (referred 
to as the electron-transport chain) to generate a proton gradient across the 
membrane. These protons then flow through ATP synthase to generate 
ATP from ADP and inorganic phosphate. 
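
The figure of nine ATP can be reproduced with simple arithmetic. The sketch below assumes yields of about 2.5 ATP per NADH and 1.5 ATP per FADH2; these values are not stated in this section and are taken here as working assumptions from the treatment of oxidative phosphorylation. The function and constant names are illustrative.

```python
# Assumed ATP yields per electron carrier (not given in this section).
ATP_PER_NADH = 2.5
ATP_PER_FADH2 = 1.5


def per_turn(nadh=3, fadh2=1):
    """Return (electrons harvested, ATP formed) for one turn of the cycle.

    Each NADH and each FADH2 carries two electrons.
    """
    electrons = 2 * nadh + 2 * fadh2
    atp = nadh * ATP_PER_NADH + fadh2 * ATP_PER_FADH2
    return electrons, atp


electrons, atp = per_turn()
print(f"electrons harvested per acetyl unit: {electrons}")
print(f"ATP from oxidative phosphorylation: {atp}")
```

With three NADH and one FADH2 per acetyl unit, eight electrons are harvested, and the assumed yields give nine ATP, in agreement with the text.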

The citric acid cycle, in conjunction with oxidative phosphorylation, 
provides the vast preponderance of energy used by aerobic cells (in human
beings, greater than 90%). It is highly efficient because the oxidation of a
limited number of citric acid cycle molecules can generate large amounts of
NADH and FADH2. Note in Figure 17.2 that the four-carbon molecule, 
oxaloacetate, that initiates the first step in the citric acid cycle is regenerated 
at the end of one passage through the cycle. Thus, one molecule of oxaloac- 
etate is capable of participating in the oxidation of many acetyl molecules. 


17.1 Pyruvate Dehydrogenase Links Glycolysis to 
the Citric Acid Cycle 


Carbohydrates, most notably glucose, are processed by glycolysis into 
pyruvate (Chapter 16). Under anaerobic conditions, the pyruvate is 
converted into lactate or ethanol, depending on the organism. Under aero- 
bic conditions, the pyruvate is transported into mitochondria by a specific 
carrier protein embedded in the mitochondrial membrane. In the mito- 
chondrial matrix, pyruvate is oxidatively decarboxylated by the pyruvate 
dehydrogenase complex to form acetyl CoA. 


Pyruvate + CoA + NAD+ → acetyl CoA + CO2 + NADH + H+ 


This irreversible reaction is the link between glycolysis and the citric acid cycle 
(Figure 17.4). Note that the pyruvate dehydrogenase complex produces 
CO2 and captures high-transfer-potential electrons in the form of NADH. 
Thus, the pyruvate dehydrogenase reaction has many of the key features of 
the reactions of the citric acid cycle itself. 

The pyruvate dehydrogenase complex is a large, highly integrated
complex of three distinct enzymes (Table 17.1). Pyruvate dehydrogenase
complex is a member of a family of homologous complexes that include the
citric acid cycle enzyme α-ketoglutarate dehydrogenase complex (p. 507).
These complexes are giant, larger than ribosomes, with molecular masses
ranging from 4 million to 10 million daltons (Figure 17.5). As we will see,
their elaborate structures allow groups to travel from one active site to
another, connected by tethers to the core of the structure. 


Table 17.1 Pyruvate dehydrogenase complex of E. coli 

Enzyme                            Abbreviation  Number of chains  Prosthetic group  Reaction catalyzed
Pyruvate dehydrogenase component  E1            24                TPP               Oxidative decarboxylation of pyruvate
Dihydrolipoyl transacetylase      E2            24                Lipoamide         Transfer of acetyl group to CoA
Dihydrolipoyl dehydrogenase       E3            12                FAD               Regeneration of the oxidized form of lipoamide


Figure 17.3 Cellular respiration. The citric
acid cycle constitutes the first stage in cellular
respiration, the removal of high-energy
electrons from carbon fuels in the form of
NADH and FADH2 (left). These electrons
reduce O2 to generate a proton gradient (red
pathway), which is used to synthesize ATP
(green pathway). The reduction of O2 and the
synthesis of ATP constitute oxidative
phosphorylation. 

Figure 17.4 The link between glycolysis
and the citric acid cycle. Pyruvate produced
by glycolysis is converted into acetyl CoA, the
fuel of the citric acid cycle. 

Figure 17.5 Electron micrograph
of the pyruvate dehydrogenase
complex from E. coli. [Courtesy of
Dr. Lester Reed.] 


Mechanism: The synthesis of acetyl coenzyme A from pyruvate requires 
three enzymes and five coenzymes 


The mechanism of the pyruvate dehydrogenase reaction is wonderfully 
complex, more so than is suggested by its simple stoichiometry. The reaction 
requires the participation of the three enzymes of the pyruvate dehydroge- 
nase complex and five coenzymes. The coenzymes thiamine pyrophosphate 
(TPP), lipoic acid, and FAD serve as catalytic cofactors, and CoA and 
NAD+ are stoichiometric cofactors, cofactors that function as substrates. 

The conversion of pyruvate into acetyl CoA consists of three steps:
decarboxylation, oxidation, and transfer of the resultant acetyl group to
CoA. These steps must be coupled to preserve the free energy derived from
the decarboxylation step to drive the formation of NADH and acetyl CoA. 


1. Decarboxylation. Pyruvate combines with TPP and is then
decarboxylated to yield hydroxyethyl-TPP (Figure 17.6). 

[Reaction scheme: carbanion of TPP + pyruvate → hydroxyethyl-TPP] 


This reaction is catalyzed by the pyruvate dehydrogenase component (E1) of
the multienzyme complex. TPP is the prosthetic group of the pyruvate
dehydrogenase component. 


2. Oxidation. The hydroxyethyl group attached to TPP is oxidized to form 
an acetyl group while being simultaneously transferred to lipoamide, a 
derivative of lipoic acid that is linked to the side chain of a lysine residue by 
an amide linkage. Note that this transfer results in the formation of an 
energy-rich thioester bond. 

The oxidant in this reaction is the disulfide group of lipoamide, which is
reduced to its disulfhydryl form. This reaction, also catalyzed by the
pyruvate dehydrogenase component E1, yields acetyllipoamide. 


3. Formation of Acetyl CoA. The acetyl group is transferred from
acetyllipoamide to CoA to form acetyl CoA. 

Dihydrolipoyl transacetylase (E2) catalyzes this reaction. The energy-rich 
thioester bond is preserved as the acetyl group is transferred to CoA. Recall 
that CoA serves as a carrier of many activated acyl groups, of which acetyl 
is the simplest (Section 15.3). Acetyl CoA, the fuel for the citric acid cycle, 
has now been generated from pyruvate. 

The pyruvate dehydrogenase complex cannot complete another cata- 
lytic cycle until the dihydrolipoamide is oxidized to lipoamide. In a fourth 
step, the oxidized form of lipoamide is regenerated by dihydrolipoyl
dehydrogenase (E3). Two electrons are transferred to an FAD prosthetic
group of the enzyme and then to NAD+. 

Figure 17.6 Mechanism of the E1
decarboxylation reaction. E1 is the
pyruvate dehydrogenase component of the
pyruvate dehydrogenase complex. A key
feature of the prosthetic group, TPP, is that
the carbon atom between the nitrogen and
sulfur atoms in the thiazole ring is much more
acidic than most =CH- groups, with a pKa
value near 10. (1) This carbon center ionizes
to form a carbanion. (2) The carbanion
readily adds to the carbonyl group of pyruvate.
(3) This addition is followed by the
decarboxylation of pyruvate. The positively
charged ring of TPP acts as an electron sink
that stabilizes the negative charge that is
transferred to the ring as part of the
decarboxylation. (4) Protonation yields
hydroxyethyl-TPP. 

Figure 17.7 Schematic representation of
the pyruvate dehydrogenase complex.
The transacetylase core (E2) is shown in red,
the pyruvate dehydrogenase component (E1)
in yellow, and the dihydrolipoyl dehydrogenase
(E3) in green. 


Figure 17.8 Structure of the
transacetylase (E2) core. Each red ball
represents a trimer of three E2 subunits.
Notice that each subunit consists of three
domains: a lipoamide-binding domain, a small
domain for interaction with E3, and a large
transacetylase catalytic domain. The
transacetylase domain has three identical
subunits, with one depicted in red and the
others in white in the ribbon representation. 

This electron transfer from FAD to NAD+ is unusual because the common
role for FAD is to receive electrons from NADH. The electron-transfer
potential of FAD is increased by its chemical environment within the
enzyme, enabling it to transfer electrons to NAD+. Proteins tightly
associated with FAD or flavin mononucleotide (FMN) are called flavoproteins. 
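
The distinction between catalytic and stoichiometric cofactors can be checked by summing the partial reactions described above. The following Python sketch is a bookkeeping illustration only: the step list is our own summary of the text, protons are deliberately not tracked, and the regeneration of lipoamide is split into two entries (reduction of FAD, then transfer of the electrons to NAD+).

```python
from collections import Counter

# Each step is (reactants, products). Protons are omitted for simplicity.
STEPS = [
    # Decarboxylation (E1)
    ({"pyruvate": 1, "TPP": 1}, {"hydroxyethyl-TPP": 1, "CO2": 1}),
    # Oxidation and transfer to lipoamide (E1)
    ({"hydroxyethyl-TPP": 1, "lipoamide": 1}, {"TPP": 1, "acetyllipoamide": 1}),
    # Formation of acetyl CoA (E2)
    ({"acetyllipoamide": 1, "CoA": 1}, {"acetyl CoA": 1, "dihydrolipoamide": 1}),
    # Regeneration of lipoamide (E3)
    ({"dihydrolipoamide": 1, "FAD": 1}, {"lipoamide": 1, "FADH2": 1}),
    # Reoxidation of FADH2 by NAD+ (E3)
    ({"FADH2": 1, "NAD+": 1}, {"FAD": 1, "NADH": 1}),
]


def net_reaction(steps):
    """Sum partial reactions and return (consumed, formed) species."""
    balance = Counter()
    for reactants, products in steps:
        balance.subtract(reactants)  # reactants count as negative
        balance.update(products)     # products count as positive
    consumed = {s: -n for s, n in balance.items() if n < 0}
    formed = {s: n for s, n in balance.items() if n > 0}
    return consumed, formed


consumed, formed = net_reaction(STEPS)
print("consumed:", consumed)
print("formed:", formed)
```

TPP, lipoamide, and FAD cancel in the sum, as expected for catalytic cofactors, whereas CoA and NAD+ appear among the species consumed, as expected for stoichiometric cofactors.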


Flexible linkages allow lipoamide to move between 
different active sites 


The structures of all of the component enzymes of the pyruvate dehydroge- 
nase complex are known, albeit from different complexes and species. Thus, 
it is now possible to construct an atomic model of the complex to under- 
stand its activity (Figure 17.7). 

The core of the complex is formed by the transacetylase component E2.
Transacetylase consists of eight catalytic trimers assembled to form a
hollow cube. Each of the three subunits forming a trimer has three major
domains (Figure 17.8). At the amino terminus is a small domain that
contains a bound flexible lipoamide cofactor attached to a lysine
residue. This domain is homologous to biotin-binding domains such as that
of pyruvate carboxylase (see Figure 16.26). The lipoamide domain is
followed by a small domain that interacts with E3 within the complex. A
larger transacetylase domain completes an E2 subunit. E1 is an α2β2
tetramer, and E3 is an αβ dimer. Multiple copies of E1 and E3 surround
the E2 core. How do the three distinct active sites work in concert
(Figure 17.9)? The key is the long, flexible lipoamide arm of the E2
subunit, which carries substrate from active site to active site. 


1. Pyruvate is decarboxylated at the active site of E1, forming the
hydroxyethyl-TPP intermediate, and CO2 leaves as the first product. This
active site lies deep within the E1 complex, connected to the enzyme
surface by a 20-Å-long hydrophobic channel. 


2. E2 inserts the lipoamide arm of the lipoamide domain into the deep
channel in E1 leading to the active site. 


3. E1 catalyzes the transfer of the acetyl group to the lipoamide. The
acetylated arm then leaves E1 and enters the E2 cube to visit the active
site of E2, located deep in the cube at the subunit interface. 


4. The acetyl moiety is then transferred to CoA, and the second product, 
acetyl CoA, leaves the cube. The reduced lipoamide arm then swings to the 
active site of the E3 flavoprotein. 


5. At the E3 active site, the lipoamide is oxidized by coenzyme FAD. The 
reactivated lipoamide is ready to begin another reaction cycle. 


6. The final product, NADH, is produced with the reoxidation of FADH2 
to FAD. 


The structural integration of three kinds of enzymes and the long, flexible 
lipoamide arm make the coordinated catalysis of a complex reaction possible. 
The proximity of one enzyme to another increases the overall reaction rate 
and minimizes side reactions. All the intermediates in the oxidative decarboxylation
of pyruvate remain bound to the complex throughout the reaction
sequence and are readily transferred as the flexible arm of E2 calls on
each active site in turn.
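
The six numbered steps above can be traced as a simple state sequence for the lipoamide arm. The sketch below is only a bookkeeping aid, not a kinetic or structural model: the names `CYCLE_STEPS` and `run_cycle` and the state labels are hypothetical, chosen here for illustration.

```python
# Toy bookkeeping model of one catalytic cycle of the pyruvate dehydrogenase
# complex, following the six numbered steps in the text. Every name here
# (CYCLE_STEPS, run_cycle, the state strings) is an illustrative assumption.

CYCLE_STEPS = [
    # (active site, event, state of lipoamide arm afterward, product released)
    ("E1", "pyruvate decarboxylated; hydroxyethyl-TPP formed", "oxidized", "CO2"),
    ("E1", "E2 inserts the lipoamide arm into the E1 channel", "oxidized", None),
    ("E1", "acetyl group transferred to lipoamide", "acetylated", None),
    ("E2", "acetyl group transferred to CoA", "reduced", "acetyl CoA"),
    ("E3", "lipoamide reoxidized by FAD", "oxidized", None),
    ("E3", "FADH2 reoxidized to FAD by NAD+", "oxidized", "NADH"),
]


def run_cycle(steps=CYCLE_STEPS):
    """Walk the arm through one cycle; return its final state and the products."""
    arm_state = "oxidized"  # the arm starts each cycle in its oxidized form
    products = []
    for site, event, arm_after, product in steps:
        arm_state = arm_after
        if product is not None:
            products.append(product)
    return arm_state, products


final_state, released = run_cycle()
print(final_state, released)
```

The arm ends the cycle in the same oxidized state in which it began, which is what allows the complex to turn over repeatedly.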


17.2 The Citric Acid Cycle Oxidizes Two-Carbon Units 


The conversion of pyruvate into acetyl CoA by the pyruvate dehydrogenase 
complex is the link between glycolysis and cellular respiration because 
acetyl CoA is the fuel for the citric acid cycle. Indeed, all fuels are ultimately 
metabolized to acetyl CoA or components of the citric acid cycle. 


Figure 17.9 Reactions of the pyruvate
dehydrogenase complex. At the top (left),
the enzyme (represented by a yellow, a
green, and two red spheres) is unmodified
and ready for a catalytic cycle. (1) Pyruvate is
decarboxylated to form hydroxyethyl-TPP.
(2) The lipoamide arm of E2 moves into the
active site of E1. (3) E1 catalyzes the transfer
of the two-carbon group to the lipoamide
group to form the acetyl-lipoamide complex.
(4) E2 catalyzes the transfer of the acetyl
moiety to CoA to form the product acetyl CoA.
The dihydrolipoamide arm then swings to the
active site of E3. E3 catalyzes (5) the oxidation
of the dihydrolipoamide and (6) the
transfer of the protons and electrons to NAD⁺
to complete the reaction cycle.
Citrate synthase forms citrate from oxaloacetate and acetyl coenzyme A


The citric acid cycle begins with the condensation of a four-carbon unit, 
oxaloacetate, and a two-carbon unit, the acetyl group of acetyl CoA. 
Oxaloacetate reacts with acetyl CoA and H₂O to yield citrate and CoA.

This reaction, which is an aldol condensation followed by a hydrolysis, is
catalyzed by citrate synthase. Oxaloacetate first condenses with acetyl CoA
to form citryl CoA, a molecule that is energy rich because it contains the
thioester bond that originated in acetyl CoA. The hydrolysis of citryl CoA
thioester to citrate and CoA drives the overall reaction far in the direction
of the synthesis of citrate. In essence, the hydrolysis of the thioester powers
the synthesis of a new molecule from two precursors.


Synthase

An enzyme catalyzing a synthetic reaction in
which two units are joined usually without
the direct participation of ATP (or another
nucleoside triphosphate).


Mechanism: The mechanism of citrate synthase prevents 
undesirable reactions 


Because the condensation of acetyl CoA and oxaloacetate initiates the citric 
acid cycle, it is very important that side reactions, notably the hydrolysis of 
acetyl CoA to acetate and CoA, be minimized. Let us briefly consider how 
the citrate synthase prevents the wasteful hydrolysis of acetyl CoA. 

Mammalian citrate synthase is a dimer of identical 49-kd subunits. Each 
active site is located in a cleft between the large and the small domains of a 
subunit, adjacent to the subunit interface. X-ray crystallographic studies of 
citrate synthase and its complexes with several substrates and inhibitors 
revealed that the enzyme undergoes large conformational changes in the 
course of catalysis. Citrate synthase exhibits sequential, ordered kinetics: 
oxaloacetate binds first, followed by acetyl CoA. The reason for the ordered 
binding is that oxaloacetate induces a major structural rearrangement leading 
to the creation of a binding site for acetyl CoA. The binding of oxaloacetate 
converts the open form of the enzyme into a closed form (Figure 17.10). In 
each subunit, the small domain rotates 19 degrees relative to the large 
domain. Movements as large as 15 Å are produced by the rotation of α helices
elicited by quite small shifts of side chains around bound oxaloacetate. These 
structural changes create a binding site for acetyl CoA. This conforma- 
tional transition is reminiscent of the cleft closure in hexokinase induced by 
the binding of glucose (Section 16.1). 

Citrate synthase catalyzes the condensation reaction by bringing the 
substrates into close proximity, orienting them, and polarizing certain 
bonds (Figure 17.11). The donation and removal of protons transform
acetyl CoA into an enol intermediate. The enol attacks oxaloacetate to form
a carbon-carbon bond linking acetyl CoA and oxaloacetate. The
newly formed citryl CoA induces additional structural changes in the 
enzyme, causing the active site to become completely enclosed. The 
enzyme cleaves the citryl CoA thioester by hydrolysis. CoA leaves the 
enzyme, followed by citrate, and the enzyme returns to the initial open 
conformation. 


We can now understand how the wasteful hydrolysis of acetyl CoA
is prevented. Citrate synthase is well suited to hydrolyze citryl CoA but
not acetyl CoA. How is this discrimination accomplished? First, acetyl
CoA does not bind to the enzyme until oxaloacetate is bound and ready for
condensation. Second, the catalytic residues crucial for the hydrolysis of the
thioester linkage are not appropriately positioned until citryl CoA is formed.
As with hexokinase and triose phosphate isomerase (Section 16.1), induced
fit prevents an undesirable side reaction.


Figure 17.10 Conformational changes in citrate synthase on binding
oxaloacetate. The small domain of each subunit of the homodimer is shown in
yellow; the large domains are shown in blue. (Left) Open form of enzyme alone.
(Right) Closed form of the liganded enzyme. [Drawn from 5CSC.pdb and 4CTS.pdb.]


[Figure 17.11 panels, left to right: Substrate complex, Enol intermediate, Citryl CoA complex.]


Figure 17.11 Mechanism of synthesis of citryl CoA by citrate synthase. (1) In the substrate 
complex (left), His 274 donates a proton to the carbonyl oxygen of acetyl CoA to promote the 
removal of a methyl proton by Asp 375 to form the enol intermediate (center). (2) Oxaloacetate is 
activated by the transfer of a proton from His 320 to its carbonyl carbon atom. (3) Simultaneously, 
the enol of acetyl CoA attacks the carbonyl carbon of oxaloacetate to form a carbon-carbon 
bond linking acetyl CoA and oxaloacetate. His 274 is reprotonated. Citryl CoA is formed. His 
274 participates again as a proton donor to hydrolyze the thioester (not shown), yielding citrate 
and CoA. 
Figure 17.12 Binding of citrate to
the iron-sulfur complex of aconitase. A
4Fe-4S iron-sulfur cluster is a component of
the active site of aconitase. Notice
that one of the iron atoms of the cluster binds
to a COO⁻ group and an OH group of citrate.
[Drawn from 1C96.pdb.]


Citrate is isomerized into isocitrate 


The hydroxyl group is not properly located in the citrate molecule for the 
oxidative decarboxylations that follow. Thus, citrate is isomerized into 
isocitrate to enable the six-carbon unit to undergo oxidative decarboxyla- 
tion. The isomerization of citrate is accomplished by a dehydration step 
followed by a hydration step. The result is an interchange of an H and an 
OH. The enzyme catalyzing both steps is called aconitase because 
cis-aconitate is an intermediate.
Aconitase is an iron-sulfur protein, or nonheme-iron protein, in that it
contains iron that is not bonded to heme. Rather, its four iron atoms are
complexed to four inorganic sulfides and three cysteine sulfur atoms,
leaving one iron atom available to bind citrate through one of its COO⁻
groups and an OH group (Figure 17.12). This Fe-S cluster participates in
dehydrating and rehydrating the bound substrate. 


Isocitrate is oxidized and decarboxylated to alpha-ketoglutarate 


We come now to the first of four oxidation-reduction reactions in the citric
acid cycle. The oxidative decarboxylation of isocitrate is catalyzed by
isocitrate dehydrogenase.


Isocitrate + NAD⁺ → α-ketoglutarate + CO₂ + NADH


The intermediate in this reaction is oxalosuccinate, an unstable β-ketoacid.
While bound to the enzyme, it loses CO₂ to form α-ketoglutarate.

The rate of formation of α-ketoglutarate is important in determining the
overall rate of the cycle, as will be discussed on page 514. This oxidation
generates the first high-transfer-potential electron carrier, NADH, in the
cycle.


Succinyl coenzyme A is formed by the oxidative decarboxylation of 
alpha-ketoglutarate 


The conversion of isocitrate into α-ketoglutarate is followed by a second
oxidative decarboxylation reaction, the formation of succinyl CoA from
α-ketoglutarate.

This reaction is catalyzed by the α-ketoglutarate dehydrogenase complex,
an organized assembly of three kinds of enzymes that is homologous
to the pyruvate dehydrogenase complex. In fact, the oxidative decarboxylation
of α-ketoglutarate closely resembles that of pyruvate, also an
α-ketoacid.


Pyruvate dehydrogenase complex:

Pyruvate + CoA + NAD⁺ → acetyl CoA + CO₂ + NADH + H⁺

α-Ketoglutarate dehydrogenase complex:

α-Ketoglutarate + CoA + NAD⁺ → succinyl CoA + CO₂ + NADH


Both reactions include the decarboxylation of an α-ketoacid and the subsequent
formation of a thioester linkage with CoA that has a high transfer
potential. The reaction mechanisms are entirely analogous (p. 500).


A compound with high phosphoryl-transfer potential is generated from 
succinyl coenzyme A 


Succinyl CoA is an energy-rich thioester compound. The ΔG°′ for the
hydrolysis of succinyl CoA is about -33.5 kJ mol⁻¹ (-8.0 kcal mol⁻¹),
which is comparable to that of ATP (-30.5 kJ mol⁻¹, or -7.3 kcal mol⁻¹).
In the citrate synthase reaction, the cleavage of the thioester bond powers 
the synthesis of the six-carbon citrate from the four-carbon oxaloacetate 
and the two-carbon fragment. The cleavage of the thioester bond of succinyl 
CoA is coupled to the phosphorylation of a purine nucleoside diphosphate, usu- 
ally ADP. This reaction, which is readily reversible, is catalyzed by succinyl 
CoA synthetase (succinate thiokinase). 
This reaction is the only step in the citric acid cycle that directly yields a
compound with high phosphoryl-transfer potential. In mammals, there are 
two isozymic forms of the enzyme, one specific for ADP and one for GDP. 
In tissues that perform large amounts of cellular respiration, such as skeletal 
and heart muscle, the ADP-requiring isozyme predominates. In tissues that 
perform many anabolic reactions, such as the liver, the GDP-requiring 
enzyme is common. The GDP-requiring enzyme is believed to work in 
reverse of the direction observed in the TCA cycle; that is, GTP is used to 
power the synthesis of succinyl CoA, which is a precursor for heme synthe- 
sis. The E. coli enzyme uses either GDP or ADP as the phosphoryl-group 
acceptor. 

Note that the enzyme nucleoside diphosphokinase, which catalyzes the
following reaction,


GTP + ADP ⇌ GDP + ATP


allows the γ phosphoryl group to be readily transferred from GTP to form
ATP, thereby allowing the adjustment of the concentration of GTP or ATP
to meet the cell's need.

The mechanism of this reaction is a clear example of an energy transformation:
energy inherent in the thioester molecule is transformed into
phosphoryl-group-transfer potential (Figure 17.13). The first step is the
displacement of coenzyme A by orthophosphate, which generates another
energy-rich compound, succinyl phosphate. A histidine residue
plays a key role as a moving arm that detaches the phosphoryl
group, then swings over to a bound nucleoside diphosphate and
transfers the group to form the nucleoside triphosphate. The
participation of high-energy compounds in all the steps is attested
to by the fact that the reaction is readily reversible: ΔG°′ =
-3.4 kJ mol⁻¹ (-0.8 kcal mol⁻¹). The formation of ATP at the
expense of succinyl CoA is an example of substrate-level phosphorylation.


Figure 17.13 Reaction mechanism of succinyl CoA synthetase. The reaction
proceeds through a phosphorylated enzyme intermediate. (1) Orthophosphate
displaces coenzyme A, which generates another energy-rich compound, succinyl
phosphate. (2) A histidine residue removes the phosphoryl group. (3) The
phosphohistidine residue then swings over to a bound nucleoside diphosphate,
and (4) the phosphoryl group is transferred to form the nucleoside
triphosphate.


Succinyl CoA synthetase is an α₂β₂ heterodimer; the
functional unit is one αβ pair. The enzyme mechanism
shows that a phosphoryl group is transferred first to succinyl
CoA bound in the α subunit and then to a nucleoside diphosphate
bound in the β subunit. Examination of the three-dimensional
structure of succinyl CoA synthetase reveals that
each subunit comprises two domains (Figure 17.14). The amino-terminal
domains of the two subunits have different structures,
each characteristic of its role in the mechanism. The
amino-terminal domain of the α subunit forms a Rossmann
fold (Section 16.1), which binds the ADP component of succinyl
CoA. The amino-terminal domain of the β subunit
is an ATP-grasp domain, found in many enzymes, which here
binds and activates ADP. Succinyl CoA synthetase has evolved
by adopting these domains and harnessing them to capture the
energy associated with succinyl CoA cleavage, which is used to
drive the generation of a nucleoside triphosphate.


Oxaloacetate is regenerated by the oxidation of succinate 
Figure 17.14 Structure of succinyl CoA synthetase.
The enzyme is composed of two subunits. The α subunit
contains a Rossmann fold that binds the ADP component
of CoA, and the β subunit contains a nucleotide-activating
region called the ATP-grasp domain. The ATP-grasp domain
is shown here binding a molecule of ADP. Notice that the
histidine residue is between the CoA and the ADP. This
histidine residue picks up the phosphoryl group from near
the CoA and swings over to transfer it to the nucleotide
bound in the ATP-grasp domain. [Drawn from 1CGI.pdb.]


Reactions of four-carbon compounds constitute the final stage of the citric
acid cycle: the regeneration of oxaloacetate.

The reactions constitute a metabolic motif that we will see again in fatty acid
synthesis and degradation as well as in the degradation of some amino acids.
A methylene group (CH₂) is converted into a carbonyl group (C=O) in
three steps: an oxidation, a hydration, and a second oxidation reaction.
Oxaloacetate is thereby regenerated for another round of the cycle, and
more energy is extracted in the form of FADH₂ and NADH.


Succinate is oxidized to fumarate by succinate dehydrogenase. The hydrogen
acceptor is FAD rather than NAD⁺, which is used in the other three
oxidation reactions in the cycle. FAD is the hydrogen acceptor in this reaction
because the free-energy change is insufficient to reduce NAD⁺. FAD
is nearly always the electron acceptor in oxidations that remove two hydrogen
atoms from a substrate. In succinate dehydrogenase, the isoalloxazine
ring of FAD is covalently attached to a histidine side chain of the enzyme
(denoted E-FAD).


E-FAD + succinate ⇌ E-FADH₂ + fumarate
Succinate dehydrogenase, like aconitase, is an iron-sulfur protein.
Indeed, succinate dehydrogenase contains three different kinds of iron-sulfur
clusters: 2Fe-2S (two iron atoms bonded to two inorganic sulfides),
3Fe-4S, and 4Fe-4S. Succinate dehydrogenase, which consists of a 70-kd
and a 27-kd subunit, differs from other enzymes in the citric acid cycle in
being embedded in the inner mitochondrial membrane. In fact, succinate
dehydrogenase is directly associated with the electron-transport chain, the link
between the citric acid cycle and ATP formation. FADH₂ produced by the
oxidation of succinate does not dissociate from the enzyme, in contrast with
NADH produced in other oxidation-reduction reactions. Rather, two electrons
are transferred from FADH₂ directly to iron-sulfur clusters of the
enzyme, which in turn passes the electrons to coenzyme Q (CoQ).
Coenzyme Q, an important member of the electron-transport chain, passes
electrons to the ultimate acceptor, molecular oxygen, as we shall see in
Chapter 18.

The next step is the hydration of fumarate to form L-malate. Fumarase
catalyzes a stereospecific trans addition of H⁺ and OH⁻. The OH⁻ group
adds to only one side of the double bond of fumarate; hence, only the L isomer
of malate is formed.
Finally, malate is oxidized to form oxaloacetate. This reaction is catalyzed
by malate dehydrogenase, and NAD⁺ is again the hydrogen acceptor.


Malate + NAD⁺ ⇌ oxaloacetate + NADH + H⁺


The standard free energy for this reaction, unlike that for the other steps in
the citric acid cycle, is significantly positive (ΔG°′ = +29.7 kJ mol⁻¹, or
+7.1 kcal mol⁻¹). The oxidation of malate is driven by the use of the
products: oxaloacetate by citrate synthase and NADH by the electron-transport
chain.
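
To make "driven by the use of the products" concrete, the following minimal sketch converts the quoted standard free-energy change into an equilibrium constant and then evaluates the actual free-energy change for a low product-to-substrate ratio. It uses the standard relations K'eq = exp(-ΔG°′/RT) and ΔG = ΔG°′ + RT ln Q. The temperature of 25 °C and the example mass-action ratio of 10⁻⁷ are assumptions made here for illustration; they are not values given in the text, and the function names are hypothetical.

```python
import math

R_KJ = 8.314e-3      # gas constant in kJ mol^-1 K^-1
T_KELVIN = 298.15    # assumed temperature (25 degrees C); not stated in the text


def equilibrium_constant(delta_g0_prime):
    """K'eq from the standard free-energy change (kJ mol^-1)."""
    return math.exp(-delta_g0_prime / (R_KJ * T_KELVIN))


def actual_delta_g(delta_g0_prime, mass_action_ratio):
    """Delta G = Delta G0' + RT ln Q, in kJ mol^-1."""
    return delta_g0_prime + R_KJ * T_KELVIN * math.log(mass_action_ratio)


DG0_MALATE_DH = 29.7  # kJ mol^-1, the value quoted in the text

k_eq = equilibrium_constant(DG0_MALATE_DH)
print(f"K'eq = {k_eq:.2e}")

# Hypothetical ratio Q = [oxaloacetate][NADH] / ([malate][NAD+]) kept very low
# because citrate synthase and the electron-transport chain remove products.
print(f"Delta G at Q = 1e-7: {actual_delta_g(DG0_MALATE_DH, 1e-7):.1f} kJ/mol")
```

Under these assumptions the equilibrium constant is on the order of 10⁻⁶, so the reaction proceeds toward oxaloacetate only while the product-to-substrate ratio is held below that value.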


The citric acid cycle produces high-transfer-potential electrons,
ATP, and CO₂


The net reaction of the citric acid cycle is 


Acetyl CoA + 3 NAD⁺ + FAD + ADP + Pi + 2 H₂O →
2 CO₂ + 3 NADH + FADH₂ + ATP + 2 H⁺ + CoA


Let us recapitulate the reactions that give this stoichiometry (Figure 17.15 
and Table 17.2): 


1. Two carbon atoms enter the cycle in the condensation of an acetyl unit
(from acetyl CoA) with oxaloacetate. Two carbon atoms leave the cycle in
the form of CO₂ in the successive decarboxylations catalyzed by isocitrate
dehydrogenase and α-ketoglutarate dehydrogenase.


2. Four pairs of hydrogen atoms leave the cycle in four oxidation reactions.
Two NAD⁺ molecules are reduced in the oxidative decarboxylations of
isocitrate and α-ketoglutarate, one FAD molecule is reduced in the oxidation
of succinate, and one NAD⁺ molecule is reduced in the oxidation of malate.
Recall also that one NAD⁺ molecule is reduced in the oxidative decarboxylation
of pyruvate to form acetyl CoA.


3. One compound with high phosphoryl-transfer potential, usually ATP,
is generated from the cleavage of the thioester linkage in succinyl CoA.


4. Two water molecules are consumed: one in the synthesis of citrate by the
hydrolysis of citryl CoA and the other in the hydration of fumarate.


Table 17.2 Citric acid cycle

                                                                      Prosthetic                     ΔG°′
Step  Reaction                               Enzyme                   group                  Type*   kJ mol⁻¹  kcal mol⁻¹
1     Acetyl CoA + oxaloacetate + H₂O →      Citrate synthase                                a       -31.4     -7.5
        citrate + CoA + H⁺
2a    Citrate ⇌ cis-aconitate + H₂O          Aconitase                Fe-S                   b       +8.4      +2.0
2b    cis-Aconitate + H₂O ⇌ isocitrate       Aconitase                Fe-S                   c       -2.1      -0.5
3     Isocitrate + NAD⁺ ⇌                    Isocitrate                                      d + e   -8.4      -2.0
        α-ketoglutarate + CO₂ + NADH           dehydrogenase
4     α-Ketoglutarate + NAD⁺ + CoA ⇌         α-Ketoglutarate          Lipoic acid,           d + e   -30.1     -7.2
        succinyl CoA + CO₂ + NADH              dehydrogenase complex    FAD, TPP
5     Succinyl CoA + Pi + ADP ⇌              Succinyl CoA                                    f       -3.3      -0.8
        succinate + ATP + CoA                  synthetase
6     Succinate + FAD (enzyme-bound) ⇌       Succinate                FAD, Fe-S              e       0         0
        fumarate + FADH₂ (enzyme-bound)        dehydrogenase
7     Fumarate + H₂O ⇌ L-malate              Fumarase                                        c       -3.8      -0.9
8     L-Malate + NAD⁺ ⇌                      Malate                                          e       +29.7     +7.1
        oxaloacetate + NADH + H⁺               dehydrogenase

*Reaction type: (a) condensation; (b) dehydration; (c) hydration; (d) decarboxylation; (e) oxidation; (f) substrate-level phosphorylation.
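
The stoichiometry recapitulated above can be checked by adding up the individual steps of Table 17.2 and letting the intermediates cancel. The sketch below does this bookkeeping; the species names are plain strings chosen here for illustration, `net_reaction` is a hypothetical helper, and enzyme-bound FAD and FADH₂ are treated like free species for counting purposes.

```python
from collections import defaultdict

# (reactants, products) for each step of Table 17.2; every coefficient is 1.
STEPS = [
    (["acetyl CoA", "oxaloacetate", "H2O"], ["citrate", "CoA", "H+"]),          # 1
    (["citrate"], ["cis-aconitate", "H2O"]),                                    # 2a
    (["cis-aconitate", "H2O"], ["isocitrate"]),                                 # 2b
    (["isocitrate", "NAD+"], ["alpha-ketoglutarate", "CO2", "NADH"]),           # 3
    (["alpha-ketoglutarate", "NAD+", "CoA"], ["succinyl CoA", "CO2", "NADH"]),  # 4
    (["succinyl CoA", "Pi", "ADP"], ["succinate", "ATP", "CoA"]),               # 5
    (["succinate", "FAD"], ["fumarate", "FADH2"]),                              # 6
    (["fumarate", "H2O"], ["L-malate"]),                                        # 7
    (["L-malate", "NAD+"], ["oxaloacetate", "NADH", "H+"]),                     # 8
]


def net_reaction(steps):
    """Sum all steps; negative counts are consumed, positive counts are produced."""
    net = defaultdict(int)
    for reactants, products in steps:
        for species in reactants:
            net[species] -= 1
        for species in products:
            net[species] += 1
    # Intermediates that are made and used in equal amounts drop out.
    return {species: n for species, n in net.items() if n != 0}


for species, n in sorted(net_reaction(STEPS).items(), key=lambda item: item[1]):
    print(f"{n:+d} {species}")
```

Oxaloacetate and every other cycle intermediate cancel, leaving only the species that appear in the net reaction.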


Isotope-labeling studies revealed that the two carbon atoms that enter 
each cycle are not the ones that leave. The two carbon atoms that enter the 
cycle as the acetyl group are retained during the initial two decarboxylation 
reactions (see Figure 17.15) and then remain incorporated in the four- 
carbon acids of the cycle. Note that succinate is a symmetric molecule. 
Consequently, the two carbon atoms that enter the cycle can occupy any of 
the carbon positions in the subsequent metabolism of the four-carbon acids. 
The two carbons that enter the cycle as the acetyl group will be released as 
CO₂ in subsequent trips through the cycle. To understand why citrate is not
processed as a symmetric molecule, see Problems 27 and 28. 

Evidence is accumulating that the enzymes of the citric acid cycle are 
physically associated with one another. The close arrangement of enzymes 
enhances the efficiency of the citric acid cycle because a reaction product 
can pass directly from one active site to the next through connecting chan- 
nels, a process called substrate channeling. The word metabolon has been 
suggested as the name for such multienzyme complexes. 

As will be considered in Chapter 18, the electron-transport chain
oxidizes the NADH and FADH₂ formed in the citric acid cycle. The transfer
of electrons from these carriers to O₂, the ultimate electron acceptor,
leads to the generation of a proton gradient across the inner mitochondrial
membrane. This proton-motive force then powers the generation of ATP;
the net stoichiometry is about 2.5 ATP per NADH, and 1.5 ATP per
FADH₂. Consequently, nine high-transfer-potential phosphoryl groups are
generated when the electron-transport chain oxidizes 3 NADH molecules
and 1 FADH₂ molecule, and one high-transfer-potential phosphoryl group
is directly formed in one round of the citric acid cycle. Thus, one acetyl unit
generates approximately 10 molecules of ATP. In dramatic contrast, the
anaerobic glycolysis of 1 glucose molecule generates only 2 molecules of
ATP (and 2 molecules of lactate).
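
The arithmetic behind "approximately 10 molecules of ATP" can be written out explicitly. This is a minimal sketch using the approximate yields quoted in the paragraph above; the constant and function names are hypothetical.

```python
# Approximate ATP yields quoted in the text.
ATP_PER_NADH = 2.5
ATP_PER_FADH2 = 1.5


def atp_per_acetyl_unit(nadh=3, fadh2=1, substrate_level=1):
    """ATP equivalents from one turn of the citric acid cycle."""
    oxidative = nadh * ATP_PER_NADH + fadh2 * ATP_PER_FADH2
    return oxidative + substrate_level


total = atp_per_acetyl_unit()
print(total)        # ATP equivalents per acetyl unit
print(total / 2)    # ratio to the 2 ATP from anaerobic glycolysis of one glucose
```

The comparison in the last line is per acetyl unit against per glucose, as in the text; a full accounting of glucose oxidation would include two acetyl units plus the NADH and ATP formed upstream.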

Recall that molecular oxygen does not participate directly in the citric 
acid cycle. However, the cycle operates only under aerobic conditions 
because NAD⁺ and FAD can be regenerated in the mitochondrion only by
the transfer of electrons to molecular oxygen. Glycolysis has both an aerobic
and an anaerobic mode, whereas the citric acid cycle is strictly aerobic.
Glycolysis can proceed under anaerobic conditions because NAD⁺ is
regenerated in the conversion of pyruvate into lactate or ethanol. 


17.3 Entry to the Citric Acid Cycle and Metabolism 
Through It Are Controlled 


The citric acid cycle is the final common pathway for the aerobic oxidation 
of fuel molecules. Moreover, as we will see shortly (Section 17.4) and 
repeatedly elsewhere in our study of biochemistry, the cycle is an important 
source of building blocks for a host of important biomolecules. As befits its 
role as the metabolic hub of the cell, entry into the cycle and the rate of the 
cycle itself are controlled at several stages. 


The pyruvate dehydrogenase complex is regulated allosterically 
and by reversible phosphorylation 


As stated earlier, glucose can be formed from pyruvate (Section 16.3). 
However, the formation of acetyl CoA from pyruvate is an irreversible step in 
animals and thus they are unable to convert acetyl CoA back into glucose. The 
oxidative decarboxylation of pyruvate to acetyl CoA commits the carbon 
atoms of glucose to one of two principal fates: oxidation to CO₂ by the citric
acid cycle, with the concomitant generation of energy, or incorporation into
lipid (Figure 17.16). As expected of an enzyme at a critical branch point in
metabolism, the activity of the pyruvate dehydrogenase complex is stringently
controlled. High concentrations of reaction products inhibit the reaction:
acetyl CoA inhibits the transacetylase component (E2) by binding
directly, whereas NADH inhibits the dihydrolipoyl dehydrogenase (E3).
High concentrations of NADH and acetyl CoA inform the enzyme that the 
energy needs of the cell have been met or that fatty acids are being degraded 
to produce acetyl CoA and NADH. In either case, there is no need to 
metabolize pyruvate to acetyl CoA. This inhibition has the effect of sparing 
glucose, because most pyruvate is derived from glucose by glycolysis 
(Section 16.1). 

The key means of regulation of the complex in eukaryotes is covalent 
modification (Figure 17.17). Phosphorylation of the pyruvate dehydrogenase
component (E1) by pyruvate dehydrogenase kinase I (PDK) switches off the
activity of the complex. Deactivation is reversed by the pyruvate dehydrogenase
phosphatase (PDP). The kinase is associated with the transacetylase component
(E2), again highlighting the structural and mechanistic importance of
this core. Both the kinase and the phosphatase are regulated. To see how this
regulation works in biological conditions, consider muscle that is becoming
active after a period of rest (Figure 17.18). At rest, the muscle will not have
significant energy demands. Consequently, the NADH/NAD⁺, acetyl
CoA/CoA, and ATP/ADP ratios will be high. These high ratios promote 
phosphorylation and, hence, deactivation of the pyruvate dehydrogenase 
complex. In other words, high concentrations of immediate (acetyl CoA 
and NADH) and ultimate (ATP) products inhibit the activity. Thus, 
pyruvate dehydrogenase is switched off when the energy charge is high. 


Figure 17.16 From glucose to acetyl CoA.
The synthesis of acetyl CoA by the pyruvate
dehydrogenase complex is a key irreversible
step in the metabolism of glucose.


Figure 17.17 Regulation of the pyruvate dehydrogenase
complex. A specific kinase phosphorylates and inactivates pyruvate
dehydrogenase (PDH), and a phosphatase activates the dehydrogenase
by removing the phosphoryl group. The kinase and the phosphatase
also are highly regulated enzymes.


Figure 17.18 Response of the pyruvate dehydrogenase complex
to the energy charge. The pyruvate dehydrogenase complex is
regulated to respond to the energy charge of the cell. (A) High energy
charge. The complex is inhibited by its immediate products, NADH and
acetyl CoA, as well as by the ultimate product of cellular respiration, ATP.
(B) Low energy charge. The complex is activated by pyruvate and ADP,
which inhibit the kinase that phosphorylates PDH.


Figure 17.19 Control of the citric acid
cycle. The citric acid cycle is regulated primarily
by the concentration of ATP and NADH. The
key control points are the enzymes isocitrate
dehydrogenase and α-ketoglutarate
dehydrogenase.

As exercise begins, the concentrations of ADP and pyruvate will increase
as muscle contraction consumes ATP and glucose is converted into pyruvate
to meet the energy demands. Both ADP and pyruvate activate the
dehydrogenase by inhibiting the kinase. Moreover, the phosphatase is
stimulated by Ca²⁺, the same signal that initiates muscle contraction. A rise
in the cytoplasmic Ca²⁺ level (Section 35.2) elevates the mitochondrial
Ca²⁺ level. The rise in mitochondrial Ca²⁺ activates the phosphatase,
enhancing pyruvate dehydrogenase activity.
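
The qualitative logic of this covalent control in resting and exercising muscle can be summarized as a small truth-table sketch. This is a deliberately crude, all-or-none simplification made here for illustration: real regulation is graded, and the function name, the boolean inputs, and the rule that an active phosphatase always wins are assumptions, not statements from the text.

```python
def pdh_is_active(high_nadh_ratio, high_acetyl_coa_ratio, high_atp_ratio,
                  high_adp, high_pyruvate, high_calcium):
    """Very rough on/off model of pyruvate dehydrogenase (PDH) regulation."""
    # High NADH/NAD+, acetyl CoA/CoA, or ATP/ADP ratios promote phosphorylation
    # by the kinase; ADP and pyruvate inhibit the kinase.
    kinase_on = (
        (high_nadh_ratio or high_acetyl_coa_ratio or high_atp_ratio)
        and not (high_adp or high_pyruvate)
    )
    # Ca2+ stimulates the phosphatase, which removes the phosphoryl group.
    phosphatase_on = high_calcium
    # Assumption: PDH is active if it is being dephosphorylated or if the
    # kinase is not phosphorylating it.
    return phosphatase_on or not kinase_on


resting = pdh_is_active(True, True, True, False, False, False)
exercising = pdh_is_active(False, False, False, True, True, True)
print("resting muscle:", "active" if resting else "inactive")
print("exercising muscle:", "active" if exercising else "inactive")
```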

In some tissues, the phosphatase is regulated by hormones. In liver, 
epinephrine binds to the α-adrenergic receptor to initiate the phosphatidylinositol
pathway (Section 14.1), causing an increase in Ca²⁺ concentration
that activates the phosphatase. In tissues capable of fatty acid synthesis,
such as the liver and adipose tissue, insulin, the hormone that signifies the 
fed state, stimulates the phosphatase, increasing the conversion of pyruvate 
into acetyl CoA. Acetyl CoA is the precursor for fatty acid synthesis 
(Section 22.4). In these tissues, the pyruvate dehydrogenase complex is 
activated to funnel glucose to pyruvate and then to acetyl CoA and 
ultimately to fatty acids. 


In people with a phosphatase deficiency, pyruvate dehydrogenase is
always phosphorylated and thus inactive. Consequently, glucose is
processed to lactate rather than acetyl CoA. This condition results in unremitting
lactic acidosis (high blood levels of lactic acid). In such an acidic
environment, many tissues malfunction, most notably the central nervous 
system. 


The citric acid cycle is controlled at several points 


The rate of the citric acid cycle is precisely adjusted to meet an animal cell’s 
needs for ATP (Figure 17.19). The primary control points are the allosteric 
enzymes isocitrate dehydrogenase and α-ketoglutarate dehydrogenase, the
first two enzymes in the cycle to generate high-energy electrons. 

The first control site is isocitrate dehydrogenase. The enzyme is allosterically
stimulated by ADP, which enhances the enzyme's affinity for substrates.
The binding of isocitrate, NAD⁺, Mg²⁺, and ADP is mutually
cooperative. In contrast, ATP is inhibitory. The reaction product NADH
also inhibits isocitrate dehydrogenase by directly displacing NAD⁺. It is
important to note that several steps in the cycle require NAD⁺ or FAD,
which are abundant only when the energy charge is low.

A second control site in the citric acid cycle is α-ketoglutarate dehydrogenase.
Some aspects of this enzyme's control are like those of the pyruvate
dehydrogenase complex, as might be expected from the homology of the
two enzymes. α-Ketoglutarate dehydrogenase is inhibited by succinyl CoA
and NADH, the products of the reaction that it catalyzes. In addition,
α-ketoglutarate dehydrogenase is inhibited by a high energy charge. Thus,
the rate of the cycle is reduced when the cell has a high level of ATP.

The use of isocitrate dehydrogenase and α-ketoglutarate dehydrogenase
as control points integrates the citric acid cycle with other pathways and
highlights the central role of the citric acid cycle in metabolism. For instance,
the inhibition of isocitrate dehydrogenase leads to a buildup of citrate,
because the interconversion of isocitrate and citrate is readily reversible
under intracellular conditions. Citrate can be transported to the cytoplasm,
where it signals phosphofructokinase to halt glycolysis (Section 16.2) and
where it can serve as a source of acetyl CoA for fatty acid synthesis
(Section 22.4). The α-ketoglutarate that accumulates when α-ketoglutarate
dehydrogenase is inhibited can be used as a precursor for several amino
acids and the purine bases (Chapter 23 and Chapter 25).

In many bacteria, the funneling of two-carbon fragments into the cycle 
also is controlled. The synthesis of citrate from oxaloacetate and acetyl CoA 
carbon units is an important control point in these organisms. ATP is an allosteric
inhibitor of citrate synthase. The effect of ATP is to increase the value
of KM for acetyl CoA. Thus, as the level of ATP increases, less of this
enzyme is saturated with acetyl CoA and so less citrate is formed. 
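
The effect of a higher KM on saturation can be illustrated with the Michaelis-Menten expression for fractional saturation, [S] / (KM + [S]). The sketch below assumes simple Michaelis-Menten behavior with respect to acetyl CoA, and the concentration and KM values are arbitrary, hypothetical numbers chosen only to show the trend; none of them come from the text.

```python
def fraction_saturated(substrate, k_m):
    """Fraction of enzyme with substrate bound: [S] / (KM + [S])."""
    return substrate / (k_m + substrate)


ACETYL_COA = 10.0     # hypothetical concentration, arbitrary units
KM_LOW_ATP = 10.0     # hypothetical KM when ATP is scarce
KM_HIGH_ATP = 40.0    # hypothetical, larger KM when ATP is abundant

low_atp = fraction_saturated(ACETYL_COA, KM_LOW_ATP)
high_atp = fraction_saturated(ACETYL_COA, KM_HIGH_ATP)
print(f"saturation at low ATP:  {low_atp:.2f}")
print(f"saturation at high ATP: {high_atp:.2f}")
```

With the acetyl CoA concentration held fixed, raising KM lowers the fraction of enzyme that is saturated, and the rate of citrate formation falls in proportion.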


Defects in the citric acid cycle contribute to the development of cancer 


Three enzymes crucial to cellular respiration are known to contribute
to the development of cancer: succinate dehydrogenase, fumarase,
and pyruvate dehydrogenase kinase. Mutations that alter the activity of all
three of these enzymes enhance aerobic glycolysis. In aerobic glycolysis,
cancer cells preferentially metabolize glucose to lactate even in the presence
of oxygen. Defects in all of these enzymes share a common biochemical link:
the transcription factor hypoxia inducible factor 1 (HIF-1).

Normally, HIF-1 up-regulates the enzymes and transporters that 
enhance glycolysis only when oxygen concentration falls, a condition called 
hypoxia. Under normal conditions, HIF-1 is hydroxylated by prolyl 
hydroxylase 2 and is subsequently destroyed by the proteasome, a large 
complex of proteolytic enzymes (Chapter 23). The degradation of HIF-1 
prevents the stimulation of glycolysis. Prolyl hydroxylase 2 requires 
α-ketoglutarate, ascorbate, and oxygen for activity. Thus, when oxygen
concentration falls, the prolyl hydroxylase 2 is inactive, HIF-1 is not 
hydroxylated and not degraded, and the synthesis of proteins required for 
glycolysis is stimulated. As a result, the rate of glycolysis is increased. 

Defects in the enzymes of the citric acid cycle can significantly affect the
regulation of prolyl hydroxylase 2. When either succinate dehydrogenase or
fumarase is defective, succinate and fumarate accumulate in the
mitochondria and spill over into the cytoplasm. Both succinate and fumarate are
competitive inhibitors of prolyl hydroxylase 2. The inhibition of prolyl
hydroxylase 2 results in the stabilization of HIF-1, since HIF-1 is no longer
hydroxylated. Lactate, the end product of glycolysis, also appears to inhibit
prolyl hydroxylase 2 by interfering with the action of ascorbate. In addition
to increasing the amount of the proteins required for glycolysis, HIF-1 also
stimulates the production of pyruvate dehydrogenase kinase (PDK). The
kinase inhibits the pyruvate dehydrogenase complex, preventing the
conversion of pyruvate into acetyl CoA. The pyruvate remains in the
cytoplasm, further increasing the rate of aerobic glycolysis. Moreover,
mutations in PDK that lead to enhanced activity contribute to increased aerobic
glycolysis and the subsequent development of cancer. By enhancing
glycolysis and increasing the concentration of lactate, the mutations in PDK
result in the inhibition of prolyl hydroxylase 2 and the stabilization of HIF-1.

These observations linking citric acid cycle enzymes to cancer suggest 
that cancer is also a metabolic disease, not simply a disease of mutant growth 
factors and cell cycle control proteins. The realization that there is a 
metabolic component to cancer opens the door to new thinking about the 
control of cancer. Indeed, preliminary experiments suggest that if cancer 
cells undergoing aerobic glycolysis are forced by pharmacological 
manipulation to use oxidative phosphorylation, the cancer cells lose their 
malignant properties. It is also interesting to note that the citric acid cycle, 
which has been studied for decades, still has secrets to be revealed by future 
biochemists. 
Figure 17.21 PATHWAY INTEGRATION:
Pathways active during exercise after a 
night's rest. The rate of the citric acid cycle 
increases during exercise, requiring the 
replenishment of oxaloacetate and acetyl CoA. 
Oxaloacetate is replenished by its formation 
from pyruvate. Acetyl CoA may be produced 
from the metabolism of both pyruvate and 
fatty acids. 
Figure 17.20 Biosynthetic roles of the citric acid cycle. Intermediates are drawn off for
biosyntheses (shown by red arrows) when the energy needs of the cell are met. Intermediates are 
replenished by the formation of oxaloacetate from pyruvate. 


17.4 The Citric Acid Cycle Is a Source of 
Biosynthetic Precursors 


Thus far, discussion has focused on the citric acid cycle as the major
degradative pathway for the generation of ATP. As a major metabolic hub of
the cell, the citric acid cycle also provides intermediates for biosyntheses
(Figure 17.20). For example, most of the carbon atoms in porphyrins come
from succinyl CoA. Many of the amino acids are derived from α-ketoglutarate
and oxaloacetate. These biosynthetic processes will be considered in
subsequent chapters.


The citric acid cycle must be capable of being rapidly replenished 


The important point now is that citric acid cycle intermediates must be
replenished if any are drawn off for biosyntheses. Suppose that much oxaloacetate is
converted into amino acids for protein synthesis and, subsequently, the 
energy needs of the cell rise. The citric acid cycle will operate to a reduced 
extent unless new oxaloacetate is formed, because acetyl CoA cannot enter 
the cycle unless it condenses with oxaloacetate. Even though oxaloacetate is 
recycled, a minimal level must be maintained to allow the cycle to 
function. 

How is oxaloacetate replenished? Mammals lack the enzymes for the net
conversion of acetyl CoA into oxaloacetate or any other citric acid cycle
intermediate. Rather, oxaloacetate is formed by the carboxylation of
pyruvate, in a reaction catalyzed by the biotin-dependent enzyme pyruvate
carboxylase (Figure 17.21).


Pyruvate + CO₂ + ATP + H₂O → oxaloacetate + ADP + Pᵢ + 2 H⁺


Recall that this enzyme plays a crucial role in gluconeogenesis
(Section 16.3). It is active only in the presence of acetyl CoA, which signifies
the need for more oxaloacetate. If the energy charge is high, oxaloacetate
is converted into glucose. If the energy charge is low, oxaloacetate
replenishes the citric acid cycle. The synthesis of oxaloacetate by the carboxylation
of pyruvate is an example of an anaplerotic reaction (anaplerotic is of Greek
origin, meaning to “fill up”), a reaction that leads to the net synthesis, or
replenishment, of pathway components. Note that because the citric
acid cycle is a cycle, it can be replenished by the generation of any of the
intermediates.


The disruption of pyruvate metabolism is the cause of beriberi 
and poisoning by mercury and arsenic 


Beriberi, a neurologic and cardiovascular disorder, is caused by a
dietary deficiency of thiamine (also called vitamin B₁). The disease
has been and continues to be a serious health problem in the Far East
because rice, the major food, has a rather low content of thiamine. This
deficiency is partly ameliorated if the whole rice grain is soaked in water
before milling; some of the thiamine in the husk then leaches into the rice
kernel. The problem is exacerbated if the rice is polished (that is, converted
from brown to white rice), because only the outer layer contains significant
amounts of thiamine. Beriberi is also occasionally seen in alcoholics who are
severely malnourished and thus thiamine deficient. The disease is
characterized by neurologic and cardiac symptoms. Damage to the peripheral
nervous system is expressed as pain in the limbs, weakness of the
musculature, and distorted skin sensation. The heart may be enlarged and the
cardiac output inadequate.

Which biochemical processes might be affected by a deficiency of
thiamine? Thiamine is the precursor of the cofactor thiamine pyrophosphate (TPP).
This cofactor is the prosthetic group of three important enzymes: pyruvate
dehydrogenase, α-ketoglutarate dehydrogenase, and transketolase. Transketolase
functions in the pentose phosphate pathway, which will be considered in
Chapter 20. The common feature of enzymatic reactions utilizing TPP is the
transfer of an activated aldehyde unit. In beriberi, the levels of pyruvate and
α-ketoglutarate in the blood are higher than normal. The increase in the
level of pyruvate in the blood is especially pronounced after the ingestion of
glucose. A related finding is that the activities of the pyruvate and
α-ketoglutarate dehydrogenase complexes in vivo are abnormally low. The
low transketolase activity of red blood cells in beriberi is an easily measured
and reliable diagnostic indicator of the disease.

Why does TPP deficiency lead primarily to neurological disorders? The 
nervous system relies essentially on glucose as its only fuel. The product of 
glycolysis, pyruvate, can enter the citric acid cycle only through the pyruvate 
dehydrogenase complex. With that enzyme deactivated, the nervous system 
has no source of fuel. In contrast, most other tissues can use fats as a source 
of fuel for the citric acid cycle. 

Symptoms similar to those of beriberi appear in organisms exposed to
mercury or arsenite (AsO₃³⁻). Both materials have a high affinity for
neighboring sulfhydryls, such as those in the reduced dihydrolipoyl groups of the
E3 component of the pyruvate dehydrogenase complex (Figure 17.22). The
binding of mercury or arsenite to the dihydrolipoyl groups inhibits the
complex and leads to central nervous system pathologies. The proverbial phrase
“mad as a hatter” refers to the strange behavior of poisoned hat makers who
used mercury nitrate to soften and shape animal furs. This form of mercury
is absorbed through the skin. Similar symptoms afflicted the early
photographers, who used vaporized mercury to create daguerreotypes.

Treatment for these poisons is the administration of sulfhydryl reagents
with adjacent sulfhydryl groups to compete with the dihydrolipoyl residues
for binding with the metal ion. The reagent–metal complex is then excreted
in the urine. Indeed, 2,3-dimercaptopropanol (see Figure 17.22) was
developed after World War I as an antidote to lewisite, an arsenic-based chemical
weapon. This compound was initially called BAL, for British anti-lewisite.
Beriberi

A vitamin-deficiency disease first described in
1630 by Jacob Bonitus, a Dutch physician
working in Java:

“A certain very troublesome affliction, which
attacks men, is called by the inhabitants
Beriberi (which means sheep). I believe
those, whom this same disease attacks, with
their knees shaking and the legs raised up,
walk like sheep. It is a kind of paralysis, or
rather Tremor: for it penetrates the motion
and sensation of the hands and feet indeed
sometimes of the whole body.”

[The Granger Collection.]


Figure 17.22 Arsenite poisoning. Arsenite inhibits the pyruvate
dehydrogenase complex by inactivating the dihydrolipoamide
component of the transacetylase. Some sulfhydryl reagents, such as
2,3-dimercaptopropanol, relieve the inhibition by forming a complex
with the arsenite that can be excreted.
The manuscript proposing the citric acid
cycle was submitted for publication to 
Nature but was rejected in June 1937. That 
same year it was published in Enzymologia. 
Dr. Krebs proudly displayed the rejection 
letter throughout his career as 
encouragement for young scientists. 


“The editor of NATURE presents his 
compliments to Dr. H. A. Krebs and regrets 
that as he has already sufficient letters to fill 
the correspondence columns of NATURE for 
seven or eight weeks, it is undesirable to 
accept further letters at the present time on 
account of the time delay which must occur 
in their publication. 

If Dr. Krebs does not mind much delay 
the editor is prepared to keep the letter until 
the congestion is relieved in the hope of 
making use of it. 

He returns it now, in case Dr. Krebs 
prefers to submit it for early publication to 
another periodical.” 
The citric acid cycle may have evolved from preexisting pathways


How did the citric acid cycle come into being? Although definitive
answers are elusive, informed speculation is possible. We can perhaps
begin to comprehend how evolution might work at the level of biochemical
pathways.

The citric acid cycle was most likely assembled from preexisting reaction 
pathways. As noted earlier, many of the intermediates formed in the citric 
acid cycle are used in metabolic pathways for amino acids and porphyrins. 
Thus, compounds such as pyruvate, α-ketoglutarate, and oxaloacetate were
likely present early in evolution for biosynthetic purposes. The oxidative
decarboxylation of these α-ketoacids is quite favorable thermodynamically
and can be used to drive the synthesis of both acyl CoA derivatives and
NADH. These reactions almost certainly formed the core of processes that
preceded the citric acid cycle evolutionarily. Interestingly, α-ketoglutarate can
be directly converted into oxaloacetate by transamination of the respective 
amino acids by aspartate aminotransferase, another key biosynthetic enzyme. 
Thus, cycles comprising smaller numbers of intermediates used for a variety 
of biochemical purposes could have existed before the present form evolved. 


17.5 The Glyoxylate Cycle Enables Plants and Bacteria to 
Grow on Acetate 


Acetyl CoA that enters the citric acid cycle has but one fate: oxidation to
CO₂ and H₂O. Most organisms thus cannot convert acetyl CoA into
glucose, because although oxaloacetate, a key precursor to glucose, is formed in
the citric acid cycle, the two decarboxylations that take place before the
regeneration of oxaloacetate preclude the net conversion of acetyl CoA into
glucose.

In plants and in some microorganisms, there is a metabolic pathway that 
allows the conversion of acetyl CoA generated from fat stores into glucose.
This reaction sequence, called the glyoxylate cycle, is similar to the citric acid 
cycle but bypasses the two decarboxylation steps of the cycle. Another 
important difference is that two molecules of acetyl CoA enter per turn of 
the glyoxylate cycle, compared with one in the citric acid cycle. 

The glyoxylate cycle (Figure 17.23), like the citric acid cycle, begins with 
the condensation of acetyl CoA and oxaloacetate to form citrate, which is 
then isomerized to isocitrate. Instead of being decarboxylated, as in the 
citric acid cycle, isocitrate is cleaved by isocitrate lyase into succinate and 
glyoxylate. The ensuing steps regenerate oxaloacetate from glyoxylate.
First, acetyl CoA condenses with glyoxylate to form malate in a reaction 
catalyzed by malate synthase, and then malate is oxidized to oxaloacetate, as 
in the citric acid cycle. The sum of these reactions is 


2 Acetyl CoA + NAD⁺ + 2 H₂O →
succinate + 2 CoASH + NADH + 2 H⁺
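The way the individual steps add up to this net reaction can be checked with a small bookkeeping sketch. The reaction table below is written from the steps described in the text; protons are deliberately left out, because their count depends on the ionization states assumed for each species, and the dictionary layout and function name are illustrative choices only.

```python
from collections import Counter

# Species -> net coefficient (negative = consumed, positive = produced).
# Protons are omitted on purpose; only the carbon skeletons, water,
# CoASH, and the nicotinamide cofactor are tracked.
REACTIONS = {
    "citrate synthase": {"acetyl CoA": -1, "oxaloacetate": -1, "H2O": -1,
                         "citrate": 1, "CoASH": 1},
    "aconitase": {"citrate": -1, "isocitrate": 1},
    "isocitrate lyase": {"isocitrate": -1, "succinate": 1, "glyoxylate": 1},
    "malate synthase": {"acetyl CoA": -1, "glyoxylate": -1, "H2O": -1,
                        "malate": 1, "CoASH": 1},
    "malate dehydrogenase": {"malate": -1, "NAD+": -1,
                             "oxaloacetate": 1, "NADH": 1},
}


def net_reaction(reactions):
    """Sum the coefficients of every species and drop those that cancel."""
    total = Counter()
    for stoichiometry in reactions.values():
        for species, coefficient in stoichiometry.items():
            total[species] += coefficient
    return {species: c for species, c in total.items() if c != 0}


net = net_reaction(REACTIONS)
for species, coefficient in net.items():
    print(f"{species}: {coefficient:+d}")
```

Oxaloacetate, citrate, isocitrate, glyoxylate, and malate all cancel, leaving two acetyl CoA, two water molecules, and one NAD⁺ consumed for each succinate, two CoASH, and one NADH produced.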


In plants, these reactions take place in organelles called glyoxysomes. This 
cycle is especially prominent in oil-rich seeds, such as those from
sunflowers, cucumbers, and castor beans. Succinate, released midcycle, can be
converted into carbohydrates by a combination of the citric acid cycle and 
gluconeogenesis. The carbohydrates power seedling growth until the cell 
can begin photosynthesis. Thus, organisms with the glyoxylate cycle gain a 
metabolic versatility because they can use acetyl CoA as a precursor of 
glucose and other biomolecules. 


Figure 17.23 The glyoxylate pathway. The
glyoxylate cycle allows plants and some
microorganisms to grow on acetate because
the cycle bypasses the decarboxylation steps
of the citric acid cycle. The reactions of this
cycle are the same as those of the citric acid
cycle except for the ones catalyzed by
isocitrate lyase and malate synthase, which
are boxed in blue.


Summary


The citric acid cycle is the final common pathway for the oxidation
of fuel molecules. It also serves as a source of building blocks for
biosyntheses.


17.1 Pyruvate Dehydrogenase Links Glycolysis to the Citric Acid Cycle
Most fuel molecules enter the cycle as acetyl CoA. The link between
glycolysis and the citric acid cycle is the oxidative decarboxylation of
pyruvate to form acetyl CoA. In eukaryotes, this reaction and those of
the cycle take place inside mitochondria, in contrast with glycolysis,
which takes place in the cytoplasm.


17.2 The Citric Acid Cycle Oxidizes Two-Carbon Units
The cycle starts with the condensation of oxaloacetate (C₄) and acetyl
CoA (C₂) to give citrate (C₆), which is isomerized to isocitrate (C₆).
Oxidative decarboxylation of this intermediate gives α-ketoglutarate
(C₅). The second molecule of carbon dioxide comes off in the next
reaction, in which α-ketoglutarate is oxidatively decarboxylated to
succinyl CoA (C₄). The thioester bond of succinyl CoA is cleaved by
orthophosphate to yield succinate, and an ATP is concomitantly
generated. Succinate is oxidized to fumarate (C₄), which is then hydrated
to form malate (C₄). Finally, malate is oxidized to regenerate
oxaloacetate (C₄). Thus, two carbon atoms from acetyl CoA enter the cycle,
and two carbon atoms leave the cycle as CO₂ in the successive
decarboxylations catalyzed by isocitrate dehydrogenase and α-ketoglutarate
dehydrogenase. In the four oxidation–reduction reactions in the cycle,
three pairs of electrons are transferred to NAD⁺ and one pair to
FAD. These reduced electron carriers are subsequently oxidized by
the electron-transport chain to generate approximately 9 molecules
of ATP. In addition, 1 molecule of a compound having a high
phosphoryl-transfer potential is directly formed in the citric acid
cycle. Hence, a total of 10 molecules of compounds having high
phosphoryl-transfer potential are generated for each two-carbon
fragment that is completely oxidized to H₂O and CO₂.
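The tally of approximately 9 plus 1 can be reproduced with a short calculation. This is a sketch that assumes the approximate yields of 2.5 ATP per NADH and 1.5 ATP per FADH₂ quoted in Problem 20 of this chapter; the function and constant names are illustrative.

```python
ATP_PER_NADH = 2.5    # approximate yield per NADH oxidized (see Problem 20)
ATP_PER_FADH2 = 1.5   # approximate yield per FADH2 oxidized (see Problem 20)


def yield_per_acetyl_unit(nadh=3, fadh2=1, substrate_level=1):
    """Return (ATP from oxidative phosphorylation, total high-energy phosphates)."""
    oxidative = nadh * ATP_PER_NADH + fadh2 * ATP_PER_FADH2
    return oxidative, oxidative + substrate_level


oxidative, total = yield_per_acetyl_unit()
print(f"from reduced carriers: {oxidative}")
print(f"total per acetyl unit: {total}")
```

Three NADH contribute 7.5 and one FADH₂ contributes 1.5, which gives the 9 ATP from the electron-transport chain; the substrate-level phosphorylation at the succinyl CoA step brings the total to 10.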


17.3 Entry to the Citric Acid Cycle and Metabolism Through It Are Controlled
The citric acid cycle operates only under aerobic conditions because it
requires a supply of NAD⁺ and FAD. The irreversible formation of
acetyl CoA from pyruvate is an important regulatory point for the
entry of glucose-derived pyruvate into the citric acid cycle. The
activity of the pyruvate dehydrogenase complex is stringently controlled by
reversible phosphorylation. The electron acceptors are regenerated
when NADH and FADH₂ transfer their electrons to O₂ through the
electron-transport chain, with the concomitant production of ATP.
Consequently, the rate of the citric acid cycle depends on the need for
ATP. In eukaryotes, the regulation of two enzymes in the cycle also is
important for control. A high energy charge diminishes the activities
of isocitrate dehydrogenase and α-ketoglutarate dehydrogenase. These
mechanisms complement each other in reducing the rate of formation
of acetyl CoA when the energy charge of the cell is high and when
biosynthetic intermediates are abundant.


17.4 The Citric Acid Cycle Is a Source of Biosynthetic Precursors

When the cell has adequate energy available, the citric acid cycle 
can also provide a source of building blocks for a host of important 
biomolecules, such as nucleotide bases, proteins, and heme groups. 
This use depletes the cycle of intermediates. When the cycle again 
needs to metabolize fuel, anaplerotic reactions replenish the cycle 
intermediates. 


17.5 The Glyoxylate Cycle Enables Plants and Bacteria to Grow on Acetate
The glyoxylate cycle enhances the metabolic versatility of many plants 
and bacteria. This cycle, which uses some of the reactions of the citric 
acid cycle, enables these organisms to subsist on acetate because it 
bypasses the two decarboxylation steps of the citric acid cycle. 


Key Terms

citric acid (tricarboxylic acid, TCA; Krebs) cycle (p. 497)
acetyl CoA (p. 497)
oxidative phosphorylation (p. 498)
pyruvate dehydrogenase complex (p. 499)
flavoprotein (p. 502)
citrate synthase (p. 504)
iron-sulfur (nonheme iron) protein (p. 506)
isocitrate dehydrogenase (p. 506)
α-ketoglutarate dehydrogenase (p. 507)
metabolon (p. 512)
anaplerotic reaction (p. 516)
beriberi (p. 517)
glyoxylate cycle (p. 518)
isocitrate lyase (p. 518)
malate synthase (p. 519)
glyoxysome (p. 519)


Problems


1. Naming names. What are the five enzymes (including
regulatory enzymes) that constitute the pyruvate
dehydrogenase complex? Which reactions do they catalyze?


2. Coenzymes. What coenzymes are required by the
pyruvate dehydrogenase complex? What are their roles?


3. More coenzymes. Distinguish between catalytic
coenzymes and stoichiometric coenzymes in the pyruvate
dehydrogenase complex.


4. Joined at the hip. List some of the advantages of
organizing the enzymes that catalyze the formation of acetyl CoA
from pyruvate into a single large complex.


5. Flow of carbon atoms. What is the fate of the radioactive
label when each of the following compounds is added to a
cell extract containing the enzymes and cofactors of the
glycolytic pathway, the citric acid cycle, and the pyruvate
dehydrogenase complex? (The ¹⁴C label is printed in red.)

(a)–(d) [Structural formulas not reproduced: compounds drawn
with an H₃C–CO– skeleton, each carrying the ¹⁴C label at a
different position.]

(e) Glucose 6-phosphate labeled at C-1.


6. C₂ + C₂ → C₄.

(a) Which enzymes are required to get net synthesis of
oxaloacetate from acetyl CoA?

(b) Write a balanced equation for the net synthesis.

(c) Do mammalian cells contain the requisite enzymes?


7. Driving force. What is the ΔG°′ for the complete
oxidation of the acetyl unit of acetyl CoA by the citric acid cycle?


8. Acting catalytically. The citric acid cycle itself, which is
composed of enzymatically catalyzed steps, can be thought
of essentially as a supramolecular enzyme. Explain.


9. A potent inhibitor. Thiamine thiazolone pyrophosphate
binds to pyruvate dehydrogenase about 20,000 times as
strongly as does thiamine pyrophosphate, and it
competitively inhibits the enzyme. Why?

[Structures: TPP and the thiazolone analog of TPP.]


10. Lactic acidosis. Patients in shock often suffer from lactic
acidosis owing to a deficiency of O₂. Why does a lack of O₂
lead to lactic acid accumulation? One treatment for shock is
to administer dichloroacetate, which inhibits the kinase
associated with the pyruvate dehydrogenase complex. What
is the biochemical rationale for this treatment?


11. Energy rich. What are the thioesters in the reaction 
catalyzed by PDH complex? 


12. Alternative fates. Compare the regulation of the
pyruvate dehydrogenase complex in muscle and in liver.


13. Mutations. (a) Predict the effect of a mutation that 
enhances the activity of the kinase associated with the PDH 
complex. (b) Predict the effect of a mutation that reduces 
the activity of the phosphatase associated with the PDH 
complex. 


14. Flaking paint, green wallpaper. Clare Boothe Luce, 
ambassador to Italy in the 1950s (and Connecticut con- 
gressperson, playwright, editor of Vanity Fair, and the wife 
of Henry Luce, founder of Time magazine and Sports 
Illustrated) became ill when she was staying at the ambas- 
sadorial residence in Rome. The paint on the dining room 
ceiling, an arsenic-based paint, was flaking; the wallpaper 
of her bedroom in the ambassadorial residence was colored 
a mellow green owing to the presence of cupric arsenite in 
the pigment. Suggest a possible cause of Ambassador 
Luce’s illness. 


15. A hoax, perhaps? The citric acid cycle is part of aerobic 
respiration, but no O₂ is required for the cycle. Explain this
paradox. 
16. Coupling reactions. The oxidation of malate by NAD⁺
to form oxaloacetate is a highly endergonic reaction under
standard conditions [ΔG°′ = 29 kJ mol⁻¹ (7 kcal mol⁻¹)].
The reaction proceeds readily under physiological
conditions.

(a) Why?

(b) Assuming an [NAD⁺]/[NADH] ratio of 8 and a pH of
7, what is the lowest [malate]/[oxaloacetate] ratio at which
oxaloacetate can be formed from malate?


17. Synthesizing α-ketoglutarate. It is possible, with the use
of the reactions and enzymes considered in this chapter, to
convert pyruvate into α-ketoglutarate without depleting
any of the citric acid cycle components. Write a balanced
reaction scheme for this conversion, showing cofactors and 
identifying the required enzymes. 


18. Seven o’clock roadblock. Malonate is a competitive
inhibitor of succinate dehydrogenase. How will the
concentrations of citric acid cycle intermediates change
immediately after the addition of malonate? Why is malonate not a
substrate for succinate dehydrogenase?


⁻OOC–CH₂–COO⁻
Malonate


19. No signal, no activity. Why is acetyl CoA an especially 
appropriate activator for pyruvate carboxylase? 


20. Power differentials. As we will see in the next chapter,
when NADH reacts with oxygen, 2.5 ATP are generated.
When FADH₂ reduces oxygen, only 1.5 ATP are generated.
Why then does succinate dehydrogenase produce FADH₂
and not NADH when succinate is oxidized to fumarate?


21. Back to Orgo. Before any oxidation can occur in the 
citric acid cycle, citrate must be isomerized into isocitrate. 
Why is this the case? 


22. A nod is as good as a wink to a blind horse. Explain 
why a GTP molecule, or another nucleoside triphosphate, 
is energetically equivalent to an ATP molecule in 
metabolism. 


23. One from two. The synthesis of citrate from acetyl CoA 
and oxaloacetate is a biosynthetic reaction. What is the 
energy source that drives formation of citrate? 


Chapter Integration Problems 


24. Fats into glucose? Fats are usually metabolized into
acetyl CoA and then further processed through the citric
acid cycle. In Chapter 16, we saw that glucose can be
synthesized from oxaloacetate, a citric acid cycle
intermediate. Why, then, after a long bout of exercise depletes our
carbohydrate stores, do we need to replenish those stores by
eating carbohydrates? Why do we not simply replace them
by converting fats into carbohydrates?


25. Alternative fuels. As we will see (Chapter 22), fatty acid 
breakdown generates a large amount of acetyl CoA. What 
will be the effect of fatty acid breakdown on pyruvate
dehydrogenase complex activity? On glycolysis?


Mechanism Problems 


26. Theme and variation. Propose a reaction mechanism for
the condensation of acetyl CoA and glyoxylate in the
glyoxylate cycle of plants and bacteria.


27. Symmetry problems. In experiments carried out in 1941
to investigate the citric acid cycle, oxaloacetate labeled with
¹⁴C in the carboxyl carbon atom farthest from the keto
group was introduced to an active preparation of
mitochondria.


⁻OOC–CO–CH₂–COO⁻
Oxaloacetate


Analysis of the α-ketoglutarate formed showed that
none of the radioactive label had been lost. Decarboxylation
of α-ketoglutarate then yielded succinate devoid of
radioactivity. All the label was in the released CO₂. Why were the
early investigators of the citric acid cycle surprised that all
the label emerged in the CO₂?


28. Symmetric molecules reacting asymmetrically. The
interpretation of the experiments described in Problem 27 was
that citrate (or any other symmetric compound) cannot be
an intermediate in the formation of α-ketoglutarate, because
of the asymmetric fate of the label. This view seemed
compelling until Alexander Ogston incisively pointed out in
1948 that “it is possible that an asymmetric enzyme which
attacks a symmetrical compound can distinguish between its
identical groups [italics added].” For simplicity, consider a
molecule in which two hydrogen atoms, a group X, and a
different group Y are bonded to a tetrahedral carbon atom
as a model for citrate. Explain how a symmetric molecule
can react with an enzyme in an asymmetric way.


Data Interpretation Problem 


29. A little goes a long way. As will become clearer in
Chapter 18, the activity of the citric acid cycle can be
monitored by measuring the amount of O₂ consumed. The
greater the rate of O₂ consumption, the faster the rate of
the cycle. Hans Krebs used this assay to investigate the
cycle in 1937. He used as his experimental system minced
pigeon-breast muscle, which is rich in mitochondria. In one
set of experiments, Krebs measured the O₂ consumption in
the presence of carbohydrate only and in the presence of
carbohydrate and citrate. The results are shown in the
following table.


Effect of citrate on oxygen consumption by minced
pigeon-breast muscle

              Micromoles of oxygen consumed
Time (min)    Carbohydrate only    Carbohydrate plus 3 μmol of citrate
10            26                   28
60            43                   62
90            46                   77
150           49                   85


(a) How much O₂ would be absorbed if the added citrate
were completely oxidized to H₂O and CO₂?


(b) On the basis of your answer to part a, what do the 
results given in the table suggest? 


30. Arsenite poisoning. The effect of arsenite on the
experimental system of Problem 29 was then examined.
Experimental data (not presented here) showed that the amount of
citrate present did not change in the course of the
experiment in the absence of arsenite. However, if arsenite was
added to the system, different results were obtained, as
shown in the following table.


Disappearance of citric acid in pigeon-breast muscle in the
presence of arsenite

Micromoles of     Micromoles of citrate     Micromoles of
citrate added     found after 40 minutes    citrate used
22                0.6                       21
44                20.0                      24
90                56.0                      34


(a) What is the effect of arsenite on the disappearance of 
citrate? 


(b) How is the action of arsenite altered by the addition of 
more citrate? 


(c) What do these data suggest about the site of action of 
arsenite? 


31. Isocitrate lyase and tuberculosis. The bacterium
Mycobacterium tuberculosis, the cause of tuberculosis, can
invade the lungs and persist in a latent state for years.
During this time, the bacteria reside in granulomas, which are
nodular scars containing bacteria and host-cell debris in the
center and surrounded by immune cells. The granulomas
are lipid-rich, oxygen-poor environments. How these
bacteria manage to persist is something of a mystery. The
results of recent research suggest that the glyoxylate cycle
is required for the persistence. The following data show the
amount of bacteria [presented as colony-forming units
(cfu)] in mice lungs in the weeks after an infection.

In graph A, the black circles represent the results for
wild-type bacteria and the red circles represent the results
for bacteria from which the gene for isocitrate lyase was
deleted.
(A) [Graph: log₁₀ cfu versus weeks postinfection (0 to 16) for wild-type
bacteria and for bacteria with the isocitrate lyase gene deleted.]


(a) What is the effect of the absence of isocitrate lyase? 


The techniques described in Chapter 5 were used to
reinsert the gene encoding isocitrate lyase into bacteria from
which it had previously been deleted. 

In graph B, black circles represent bacteria into which 
the gene was reinserted and red circles represent bacteria in 
which the gene was still missing. 


(b) Do these results support those obtained in part a? 
(c) What is the purpose of the experiment in part b? 


(d) Why do these bacteria perish in the absence of the 
glyoxylate cycle? 


(B) [Graph: log₁₀ cfu versus weeks postinfection (0 to 16) for bacteria with
the isocitrate lyase gene restored and for bacteria with the gene deleted.]


[Data after McKinney et al., Nature 406(2000):735-738.]
Oxidative Phosphorylation


The amount of ATP that human beings require to go about their lives is
staggering. A sedentary male of 70 kg (154 lbs) requires about 8400 kJ
(2000 kcal) for a day’s worth of activity. To provide this much energy 
requires 83 kg of ATP. However, human beings possess only about 250 g of 
ATP at any given moment. The disparity between the amount of ATP that 
we have and the amount that we require is compensated by recycling ADP 
back to ATP. Each ATP molecule is recycled approximately 300 times per 
day. This recycling takes place primarily through oxidative phosphorylation. 
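The recycling figure follows from the two masses given above. A minimal arithmetic sketch, using only the 83 kg daily requirement and the 250 g pool quoted in the text (the variable and function names are illustrative):

```python
daily_atp_required_kg = 83.0   # daily ATP requirement quoted in the text
atp_pool_kg = 0.250            # about 250 g of ATP present at any moment


def turnovers_per_day(required_kg: float, pool_kg: float) -> float:
    """Number of times the ATP pool must be regenerated to meet the daily need."""
    return required_kg / pool_kg


cycles = turnovers_per_day(daily_atp_required_kg, atp_pool_kg)
print(f"each ATP is recycled about {cycles:.0f} times per day")
```

The ratio comes to a little over 330, consistent with the rounded figure of approximately 300 recyclings per day.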

We begin our study of oxidative phosphorylation by examining the 
oxidation–reduction reactions that allow the flow of electrons from NADH
and FADH₂ to oxygen. The electron flow takes place in four large protein
complexes that are embedded in the inner mitochondrial membrane, 
together called the respiratory chain or the electron-transport chain. 


NADH + ½ O₂ + H⁺ → H₂O + NAD⁺
ΔG°′ = −220.1 kJ mol⁻¹ (−52.6 kcal mol⁻¹)


The overall reaction is exergonic. Importantly, three of the complexes of 
the electron-transport chain use the energy released by the electron flow to 
pump protons from the mitochondrial matrix into the cytoplasm. In 
essence, energy is transformed. The resulting unequal distribution of 
protons generates a pH gradient and a transmembrane electrical potential 
that creates a proton-motive force. ATP is synthesized when protons flow 
back to the mitochondrial matrix through an enzyme complex. 


$$\mathrm{ADP} + \mathrm{P_i} + \mathrm{H^+} \longrightarrow \mathrm{ATP} + \mathrm{H_2O}$$
$$\Delta G^{\circ\prime} = +30.5\ \mathrm{kJ\,mol^{-1}}\ (+7.3\ \mathrm{kcal\,mol^{-1}})$$

Mitochondria, stained green, form a network inside a
fibroblast cell (left). Mitochondria oxidize carbon fuels to
form cellular energy in the form of ATP. [(Left) Courtesy of
Michael P. Yaffee, Department of Biology, University of
California at San Diego.]

Eukaryotic Oxidative Phosphorylation Takes Place in Mitochondria

Oxidative Phosphorylation Depends on Electron Transfer

The Respiratory Chain Consists of Four Complexes: Three Proton Pumps and a Physical Link to the Citric Acid Cycle

A Proton Gradient Powers the Synthesis of ATP

Many Shuttles Allow Movement Across the Mitochondrial Membranes

The Regulation of Cellular Respiration Is Governed Primarily by the Need for ATP


Figure 18.1 Overview of oxidative 
phosphorylation. Oxidation and ATP 
synthesis are coupled by transmembrane 
proton fluxes. Electrons flow from NADH and 
FADH2 through four protein complexes to 
reduce oxygen to water. Three of the 
complexes pump protons from the 
mitochondrial matrix to the exterior of the 
mitochondria. The protons return to the matrix 
by flowing through another protein complex, 
ATP synthase, powering the synthesis of ATP. 


Respiration 


An ATP-generating process in which an 
inorganic compound (such as molecular 
oxygen) serves as the ultimate electron 
acceptor. The electron donor can be either 
an organic compound or an inorganic one. 

Thus, the oxidation of fuels and the phosphorylation of ADP are coupled 
by a proton gradient across the inner mitochondrial membrane (Figure 18.1). 

Collectively, the generation of high-transfer-potential electrons by the 
citric acid cycle, their flow through the respiratory chain, and the accompa- 
nying synthesis of ATP is called respiration or cellular respiration. 


18.1 Eukaryotic Oxidative Phosphorylation Takes 
Place in Mitochondria 


Recall that the biochemical purpose of the citric acid cycle, which takes 
place in mitochondria, is to generate high-energy electrons. It is fitting, 
therefore, that oxidative phosphorylation, which will convert the energy of 
these electrons into ATP, also takes place in mitochondria. Mitochondria 
are oval-shaped organelles, typically about 2 μm in length and 0.5 μm in 
diameter, about the size of a bacterium. Eugene Kennedy and Albert 
Lehninger discovered more than a half-century ago that mitochondria con- 
tain the respiratory assembly, the enzymes of the citric acid cycle, and the 
enzymes of fatty acid oxidation. 


Mitochondria are bounded by a double membrane 


Electron microscopic studies by George Palade and Fritjof Sjöstrand 
revealed that mitochondria have two membrane systems: an outer membrane 
and an extensive, highly folded inner membrane. The inner membrane is 
folded into a series of internal ridges called cristae. Hence, there are two 
compartments in mitochondria: (1) the intermembrane space between the 
outer and the inner membranes and (2) the matrix, which is bounded by the 
inner membrane (Figure 18.2). The mitochondrial matrix is the site of most 
of the reactions of the citric acid cycle and fatty acid oxidation. In contrast, 
oxidative phosphorylation takes place in the inner mitochondrial mem- 
brane. The increase in surface area of the inner mitochondrial membrane 
provided by the cristae creates more sites for oxidative phosphorylation 
than would be the case with a simple, unfolded membrane. Humans contain 
an estimated 14,000 m² of inner mitochondrial membrane, which is the 
approximate equivalent of three football fields in the United States. 

The outer membrane is quite permeable to most small molecules and 
ions because it contains many copies of mitochondrial porin, a 30- to 35-kd 
pore-forming protein also known as VDAC, for voltage-dependent 
anion channel. VDAC, the most prevalent protein in the outer mitochon- 
drial membrane, plays a role in the regulated flux of metabolites—usually 
anionic species such as phosphate, chloride, organic anions, and the adenine 
nucleotides—across the outer membrane. In contrast, the inner membrane 
is impermeable to nearly all ions and polar molecules. A large family of 
transporters shuttles metabolites such as ATP, pyruvate, and citrate across 
the inner mitochondrial membrane. The two faces of this membrane will be 
referred to as the matrix side and the cytoplasmic side (the latter because it is 
freely accessible to most small molecules in the cytoplasm). They are also 
called the N and P sides, respectively, because the membrane potential is 
negative on the matrix side and positive on the cytoplasmic side. 

In prokaryotes, the electron-driven proton pumps and ATP-synthesizing 
complex are located in the cytoplasmic membrane, the inner of two 
membranes. The outer membrane of bacteria, like that of mitochondria, is 
permeable to most small metabolites because of the presence of porins. 


Mitochondria are the result of an endosymbiotic event 


Mitochondria are semiautonomous organelles that live in an endosymbiotic
relation with the host cell. These organelles contain their own 
DNA, which encodes a variety of different proteins and RNAs. Mitochondrial 
DNA is usually portrayed as being circular, but recent research suggests that 
the mitochondrial DNA of many organisms may be linear. The genomes of 
mitochondria range broadly in size across species. The mitochondrial 
genome of the protist Plasmodium falciparum consists of fewer than 6000 
base pairs (bp), whereas those of some land plants comprise more than 
200,000 bp (Figure 18.3). Human mitochondrial DNA comprises 16,569 bp 
and encodes 13 respiratory-chain proteins as well as the small and large ribo- 
somal RNAs and enough tRNAs to translate all codons. However, mito- 
chondria also contain many proteins encoded by nuclear DNA. Cells that 
contain mitochondria depend on these organelles for oxidative phosphoryla- 
tion, and the mitochondria in turn depend on the cell for their very existence. 
How did this intimate symbiotic relation come to exist? 

An endosymbiotic event is thought to have occurred whereby a free-living 
organism capable of oxidative phosphorylation was engulfed by another 
cell. The double membrane, circular DNA (with exceptions), and the 
mitochondrial-specific transcription and translation machinery all point to 
this conclusion. Thanks to the rapid accumulation of sequence data for 
mitochondrial and bacterial genomes, speculation on the origin of the 
“original” mitochondrion with some authority is now possible. The most 
mitochondrial-like bacterial genome is that of Rickettsia prowazekii, the 
cause of louse-borne typhus. The genome for this organism is more than
1 million base pairs in size and contains 834 protein-encoding genes.
Sequence data suggest that all extant mitochondria are derived from an
ancestor of R. prowazekii as the result of a single endosymbiotic event.

The evidence that modern mitochondria result from a single event comes
from examination of the most bacteria-like mitochondrial genome, that of
the protozoan Reclinomonas americana. Its genome contains 97 genes, of
which 62 specify proteins. The genes encoding these proteins include all
of the protein-coding genes found in all of the sequenced mitochondrial
genomes (Figure 18.4). Yet, this genome encodes less than 2% of the
protein-coding genes in the bacterium E. coli. In other words, a small
fraction of bacterial genes (2%) are found in all examined mitochondria.
How is it possible that all mitochondria have the same 2% of the bacterial
genome? It seems unlikely that mitochondrial genomes resulting from
several endosymbiotic events could have been independently reduced to the
same set of genes found in R. americana. Thus, the simplest explanation is
that the endosymbiotic event took place just once and all existing
mitochondria are descendants of that ancestor.

Note that transient engulfment of prokaryotic cells by larger cells is not
uncommon in the microbial world. In regard to mitochondria, such a
transient relation became permanent as the bacterial cell lost DNA, making
it incapable of independent living, and the host cell became dependent on
the ATP generated by its tenant.

Figure 18.2 Electron micrograph (A) and diagram (B) of a mitochondrion.
[(A) Courtesy of George Palade. (B) From Wolfe, Biology of the Cell, 2e,
© 1981 Brooks/Cole, a part of Cengage Learning, Inc. Reproduced by
permission. www.cengage.com/permissions.]

Figure 18.3 Sizes of mitochondrial genomes. The sizes of three
mitochondrial genomes compared with the genome of Rickettsia, a relative
of the presumed ancestor of all mitochondria. For genomes of more than
60 kbp, the DNA coding region for genes with known function is shown in
red.

Figure 18.4 Overlapping gene complements of mitochondria. The genes
present within each oval are those present within the organism
represented by the oval. Only rRNA- and protein-coding genes are shown.
The genome of Reclinomonas contains all the protein-coding genes found in
all the sequenced mitochondrial genomes. [After M. W. Gray, G. Burger, and
B. F. Lang. Science 283:1476-1481, 1999.]
[Ovals in the figure: Acanthamoeba (an ameba), Schizosaccharomyces,
Porphyra (a red alga), Reclinomonas (a protozoan), Marchantia (a moss),
and Plasmodium (a protozoan).]

Figure 18.5 Measurement of redox potential. Apparatus for the measurement
of the standard oxidation-reduction potential of a redox couple.
Electrons, but not X or X−, can flow through the agar bridge.
[Half-cell labels: solution of 1 M X and 1 M X−; 1 M H+ in equilibrium
with 1 atm H2 gas.]


18.2 Oxidative Phosphorylation Depends 
on Electron Transfer 


In Chapter 17, the primary function of the citric acid cycle was identified 
as the generation of NADH and FADH2 by the oxidation of acetyl CoA. In 
oxidative phosphorylation, electrons from NADH and FADH2 are used to 
reduce molecular oxygen to water. The highly exergonic reduction of 
molecular oxygen by NADH and FADH2 is accomplished by a number of 
electron-transfer reactions, which take place in a set of membrane proteins 
known as the electron-transport chain. 


The electron-transfer potential of an electron is 
measured as redox potential 


In oxidative phosphorylation, the electron-transfer potential of NADH or
FADH2 is converted into the phosphoryl-transfer potential of ATP. We need
quantitative expressions for these forms of free energy. The measure of
phosphoryl-transfer potential is already familiar to us: it is given by ΔG°′
for the hydrolysis of the activated phosphoryl compound. The corresponding
expression for the electron-transfer potential is E′0, the reduction
potential (also called the redox potential or oxidation-reduction potential).
The reduction potential is an electrochemical concept. Consider a substance
that can exist in an oxidized form X and a reduced form X−. Such a
pair is called a redox couple and is designated X:X−. The reduction potential
of this couple can be determined by measuring the electromotive force
generated by an apparatus called a sample half-cell connected to a standard
reference half-cell (Figure 18.5). The sample half-cell consists of an electrode
immersed in a solution of 1 M oxidant (X) and 1 M reductant (X−). The
standard reference half-cell consists of an electrode immersed in a 1 M H+
solution that is in equilibrium with H2 gas at 1 atmosphere (1 atm) of
pressure. The electrodes are connected to a voltmeter, and an agar bridge
allows ions to move from one half-cell to the other, establishing electrical
continuity between the half-cells. Electrons then flow from one half-cell to
the other through the wire connecting the two half-cells to the voltmeter. If
the reaction proceeds in the direction


$$\mathrm{X^-} + \mathrm{H^+} \longrightarrow \mathrm{X} + \tfrac{1}{2}\,\mathrm{H_2}$$


the reactions in the half-cells (referred to as half-reactions or couples) 
must be 


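$$\mathrm{X^-} \longrightarrow \mathrm{X} + e^- \qquad\qquad \mathrm{H^+} + e^- \longrightarrow \tfrac{1}{2}\,\mathrm{H_2}$$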


Thus, electrons flow from the sample half-cell to the standard reference
half-cell, and the sample-cell electrode is taken to be negative with respect
to the standard-cell electrode. The reduction potential of the X:X− couple is
the observed voltage at the start of the experiment (when X, X−, and H+ are
1 M with 1 atm of H2). The reduction potential of the H+:H2 couple is
defined to be 0 volts. In oxidation-reduction reactions, the donor of
electrons, in this case X−, is called the reductant or reducing agent, whereas
the acceptor of electrons, H+ here, is called the oxidant.

The meaning of the reduction potential is now evident. A negative
reduction potential means that the oxidized form of a substance has lower
affinity for electrons than does H2, as in the preceding example. A positive
reduction potential means that the oxidized form of a substance has higher
affinity for electrons than does H2. These comparisons refer to standard
conditions, namely 1 M oxidant, 1 M reductant, 1 M H+, and 1 atm H2.
Thus, a strong reducing agent (such as NADH) is poised to donate electrons
and has a negative reduction potential, whereas a strong oxidizing agent (such
as O2) is ready to accept electrons and has a positive reduction potential.
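
The rule of thumb above can be expressed as a simple comparison of potentials. The following is a minimal sketch, assuming standard conditions and using E′0 values from Table 18.1; the dictionary and function names are hypothetical, chosen only for illustration.

```python
# Standard reduction potentials E'0 in volts (values from Table 18.1).
E0_PRIME = {
    "NAD+/NADH": -0.32,
    "pyruvate/lactate": -0.19,
    "1/2 O2/H2O": +0.82,
}


def electrons_flow_spontaneously(donor: str, acceptor: str) -> bool:
    """Return True if, under standard conditions, electrons tend to move
    from the reduced form of `donor` to the oxidized form of `acceptor`,
    that is, if the acceptor couple has the more positive E'0."""
    return E0_PRIME[acceptor] > E0_PRIME[donor]


print(electrons_flow_spontaneously("NAD+/NADH", "1/2 O2/H2O"))  # NADH reduces O2
print(electrons_flow_spontaneously("1/2 O2/H2O", "NAD+/NADH"))  # the reverse is not favored
```

The comparison says nothing about rate or about nonstandard concentrations; it only reports the direction favored by the standard potentials.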

The reduction potentials of many biologically important redox couples
are known (Table 18.1). Table 18.1 is like those presented in chemistry
textbooks, except that a hydrogen ion concentration of 10⁻⁷ M (pH 7)
instead of 1 M (pH 0) is the standard state adopted by biochemists. This
difference is denoted by the prime in E′0. Recall that the prime in ΔG°′
denotes a standard free-energy change at pH 7.


Table 18.1 Standard reduction potentials of some reactions

Oxidant                   Reductant                 n    E′0 (V)
Succinate + CO2           α-Ketoglutarate           2    −0.67
Acetate                   Acetaldehyde              2    −0.60
Ferredoxin (oxidized)     Ferredoxin (reduced)      1    −0.43
2 H+                      H2                        2    −0.42
NAD+                      NADH + H+                 2    −0.32
NADP+                     NADPH + H+                2    −0.32
Lipoate (oxidized)        Lipoate (reduced)         2    −0.29
Glutathione (oxidized)    Glutathione (reduced)     2    −0.23
FAD                       FADH2                     2    −0.22
Acetaldehyde              Ethanol                   2    −0.20
Pyruvate                  Lactate                   2    −0.19
Fumarate                  Succinate                 2    +0.03
Cytochrome b (+3)         Cytochrome b (+2)         1    +0.07
Dehydroascorbate          Ascorbate                 2    +0.08
Ubiquinone (oxidized)     Ubiquinone (reduced)      2    +0.10
Cytochrome c (+3)         Cytochrome c (+2)         1    +0.22
Fe (+3)                   Fe (+2)                   1    +0.77
1/2 O2 + 2 H+             H2O                       2    +0.82

Note: E′0 is the standard oxidation-reduction potential (pH 7, 25°C) and n is the number of electrons
transferred. E′0 refers to the partial reaction written as Oxidant + e− → reductant.

The standard free-energy change ΔG°′ is related to the change in reduction
potential ΔE′0 by


$$\Delta G^{\circ\prime} = -nF\,\Delta E'_0$$


in which n is the number of electrons transferred, F is a proportionality
constant called the faraday [96.48 kJ mol⁻¹ V⁻¹ (23.06 kcal mol⁻¹ V⁻¹)],
ΔE′0 is in volts, and ΔG°′ is in kilojoules or kilocalories per mole.

The free-energy change of an oxidation-reduction reaction can be 
readily calculated from the reduction potentials of the reactants. For exam- 
ple, consider the reduction of pyruvate by NADH, catalyzed by lactate 
dehydrogenase. Recall that this reaction maintains redox balance in lactic 
acid fermentation (see Figure 16.11). 


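$$\text{Pyruvate} + \mathrm{NADH} + \mathrm{H^+} \rightleftharpoons \text{lactate} + \mathrm{NAD^+} \qquad \text{(A)}$$


The reduction potential of the NAD+:NADH couple, or half-reaction, is
−0.32 V, whereas that of the pyruvate:lactate couple is −0.19 V. By
convention, reduction potentials (as in Table 18.1) refer to partial reactions
written as reductions: oxidant + e− → reductant. Hence,


$$\text{Pyruvate} + 2\,\mathrm{H^+} + 2\,e^- \longrightarrow \text{lactate} \qquad E'_0 = -0.19\ \mathrm{V} \qquad \text{(B)}$$
$$\mathrm{NAD^+} + \mathrm{H^+} + 2\,e^- \longrightarrow \mathrm{NADH} \qquad E'_0 = -0.32\ \mathrm{V} \qquad \text{(C)}$$


To obtain reaction A from reactions B and C, we need to reverse the
direction of reaction C so that NADH appears on the left side of the arrow. In
doing so, the sign of E′0 must be changed.


$$\text{Pyruvate} + 2\,\mathrm{H^+} + 2\,e^- \longrightarrow \text{lactate} \qquad E'_0 = -0.19\ \mathrm{V} \qquad \text{(B)}$$
$$\mathrm{NADH} \longrightarrow \mathrm{NAD^+} + \mathrm{H^+} + 2\,e^- \qquad E'_0 = +0.32\ \mathrm{V} \qquad \text{(D)}$$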


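For reaction B, the free energy can be calculated with n = 2.


$$\Delta G^{\circ\prime} = -2 \times 96.48\ \mathrm{kJ\,mol^{-1}\,V^{-1}} \times (-0.19\ \mathrm{V}) = +36.7\ \mathrm{kJ\,mol^{-1}}\ (+8.8\ \mathrm{kcal\,mol^{-1}})$$


Likewise, for reaction D,


$$\Delta G^{\circ\prime} = -2 \times 96.48\ \mathrm{kJ\,mol^{-1}\,V^{-1}} \times (+0.32\ \mathrm{V}) = -61.8\ \mathrm{kJ\,mol^{-1}}\ (-14.8\ \mathrm{kcal\,mol^{-1}})$$


Thus, the free energy for reaction A is given by


$$\Delta G^{\circ\prime} = \Delta G^{\circ\prime}(\text{reaction B}) + \Delta G^{\circ\prime}(\text{reaction D}) = +36.7\ \mathrm{kJ\,mol^{-1}} + (-61.8\ \mathrm{kJ\,mol^{-1}}) = -25.1\ \mathrm{kJ\,mol^{-1}}\ (-6.0\ \mathrm{kcal\,mol^{-1}})$$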
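
The same result can be reached in one step by first taking the difference in reduction potential between the electron acceptor and the electron donor and then applying ΔG°′ = −nFΔE′0. The sketch below is illustrative only; the function and variable names are assumptions, and the numerical inputs are the ones used in the worked example above.

```python
FARADAY = 96.48  # kJ mol^-1 V^-1, the value of F used in the text


def delta_g_standard(n: int, delta_e: float) -> float:
    """Standard free-energy change (kJ/mol) for n electrons moving
    through a potential difference delta_e (V)."""
    return -n * FARADAY * delta_e


e_acceptor = -0.19  # pyruvate:lactate couple (V)
e_donor = -0.32     # NAD+:NADH couple (V)

# Potential difference seen by the electrons: acceptor minus donor.
delta_e = e_acceptor - e_donor
dg_reaction_a = delta_g_standard(2, delta_e)

print(f"delta E'0 = {delta_e:+.2f} V")
print(f"delta G   = {dg_reaction_a:+.1f} kJ/mol")
```

Working from ΔE′0 directly avoids rounding the two half-reaction free energies separately before adding them.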


A 1.14-volt potential difference between NADH and molecular 
oxygen drives electron transport through the chain and favors the 
formation of a proton gradient 


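The driving force of oxidative phosphorylation is the electron-transfer
potential of NADH or FADH2 relative to that of O2. How much energy is
released by the reduction of O2 with NADH? Let us calculate ΔG°′ for this
reaction. The pertinent half-reactions are


$$\tfrac{1}{2}\,\mathrm{O_2} + 2\,\mathrm{H^+} + 2\,e^- \longrightarrow \mathrm{H_2O} \qquad E'_0 = +0.82\ \mathrm{V} \qquad \text{(A)}$$
$$\mathrm{NAD^+} + \mathrm{H^+} + 2\,e^- \longrightarrow \mathrm{NADH} \qquad E'_0 = -0.32\ \mathrm{V} \qquad \text{(B)}$$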


The combination of the two half-reactions, as it proceeds in the electron- 
transport chain, yields 


$$\tfrac{1}{2}\,\mathrm{O_2} + \mathrm{NADH} + \mathrm{H^+} \longrightarrow \mathrm{H_2O} + \mathrm{NAD^+} \qquad \text{(C)}$$

The standard free energy for this reaction is then given by 


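$$\Delta G^{\circ\prime} = (-2 \times 96.48\ \mathrm{kJ\,mol^{-1}\,V^{-1}} \times +0.82\ \mathrm{V}) - (-2 \times 96.48\ \mathrm{kJ\,mol^{-1}\,V^{-1}} \times -0.32\ \mathrm{V})$$
$$= -158.2\ \mathrm{kJ\,mol^{-1}} - 61.9\ \mathrm{kJ\,mol^{-1}}$$
$$= -220.1\ \mathrm{kJ\,mol^{-1}}\ (-52.6\ \mathrm{kcal\,mol^{-1}})$$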


This release of free energy is substantial. Recall that ΔG°′ for the hydrolysis
of ATP is −30.5 kJ mol⁻¹ (−7.3 kcal mol⁻¹). The released energy is
initially used to generate a proton gradient that is then used for the
synthesis of ATP and the transport of metabolites across the mitochondrial
membrane.
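
To put the two numbers side by side, the sketch below recomputes the free energy released by a 1.14-V potential difference and compares it with the standard free energy of ATP hydrolysis quoted above. It is a thermodynamic comparison under standard conditions only, not a statement about how many ATP molecules are actually made; the variable names are illustrative.

```python
FARADAY = 96.48            # kJ mol^-1 V^-1
E_O2 = +0.82               # 1/2 O2 : H2O couple (V)
E_NADH = -0.32             # NAD+ : NADH couple (V)
DG_ATP_HYDROLYSIS = -30.5  # kJ/mol, standard value quoted in the text

delta_e = E_O2 - E_NADH                 # potential span of the chain (V)
dg_oxidation = -2 * FARADAY * delta_e   # two electrons per NADH

# Ratio of energy released per NADH to energy needed per ATP (standard states).
ratio = dg_oxidation / DG_ATP_HYDROLYSIS

print(f"delta E'0 = {delta_e:.2f} V")
print(f"delta G   = {dg_oxidation:.1f} kJ/mol")
print(f"ratio to ATP hydrolysis = {ratio:.1f}")
```

The computed free energy agrees with the value in the text to within rounding of the intermediate terms.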

How can the energy associated with a proton gradient be quantified?
Recall that the free-energy change for a species moving from one side of a
membrane where it is at concentration c1 to the other side where it is at a
concentration c2 is given by


$$\Delta G = RT \ln(c_2/c_1) + ZF\,\Delta V$$


in which Z is the electrical charge of the transported species and ΔV is the
potential in volts across the membrane (Section 13.1). Under typical
conditions for the inner mitochondrial membrane, the pH outside is 1.4 units
lower than inside [corresponding to log10(c2/c1) of 1.4, so that
ln(c2/c1) = 2.303 × 1.4] and the membrane potential is 0.14 V, the outside
being positive. Because Z = +1 for protons, the free-energy change is
(2.303 × 8.32 × 10⁻³ kJ mol⁻¹ K⁻¹ × 310 K × 1.4) +
(+1 × 96.48 kJ mol⁻¹ V⁻¹ × 0.14 V) = 21.8 kJ mol⁻¹ (5.2 kcal mol⁻¹).
Thus, each proton that is transported out of the matrix to the cytoplasmic
side corresponds to 21.8 kJ mol⁻¹ of free energy.
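
The calculation can be reproduced numerically. Because a pH difference is a base-10 logarithm of the concentration ratio, the sketch below converts it to a natural logarithm with the factor ln 10 (about 2.303) before applying the equation. The function name and default arguments are assumptions made for illustration; the constants are those used in the text.

```python
import math

R = 8.32e-3  # gas constant, kJ mol^-1 K^-1
F = 96.48    # faraday, kJ mol^-1 V^-1


def proton_transport_free_energy(delta_ph: float, delta_v: float,
                                 temp_k: float = 310.0, z: int = 1) -> float:
    """Free energy (kJ/mol) to move a proton against a pH difference
    delta_ph (inside minus outside) and a membrane potential delta_v (V)."""
    chemical = R * temp_k * math.log(10) * delta_ph  # concentration term
    electrical = z * F * delta_v                     # charge term
    return chemical + electrical


dg_proton = proton_transport_free_energy(delta_ph=1.4, delta_v=0.14)
print(f"{dg_proton:.1f} kJ/mol")
```

Under these conditions the electrical term is the larger of the two contributions.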


18.3 The Respiratory Chain Consists of Four Complexes: 
Three Proton Pumps and a Physical 
Link to the Citric Acid Cycle 


Electrons are transferred from NADH to O2 through a chain of three large
protein complexes called NADH-Q oxidoreductase, Q-cytochrome c
oxidoreductase, and cytochrome c oxidase (Figure 18.6 and Table 18.2). Electron
flow within these transmembrane complexes leads to the transport of protons
across the inner mitochondrial membrane. A fourth large protein complex,
called succinate-Q reductase, contains the succinate dehydrogenase that
generates FADH2 in the citric acid cycle. Electrons from this FADH2 enter
the electron-transport chain at Q-cytochrome c oxidoreductase. Succinate-Q
reductase, in contrast with the other complexes, does not pump protons.
NADH-Q oxidoreductase, succinate-Q reductase, Q-cytochrome c
oxidoreductase, and cytochrome c oxidase are also called Complex I, II, III,
and IV, respectively. Complexes I, III, and IV appear to be associated in a
supramolecular complex termed the respirasome. As we have seen before,
such supramolecular complexes facilitate the rapid transfer of substrate and
prevent the release of reaction intermediates.

Figure 18.6 Components of the electron-transport chain. Electrons flow
down an energy gradient from NADH to O2. The flow is catalyzed by four
protein complexes. Iron is a component of Complexes I, III, IV and
cytochrome c. [After D. Sadava et al., Life, 8th ed. (Sinauer, 2008), p. 150.]

Two special electron carriers ferry the electrons from one complex to the
next. The first is coenzyme Q (Q), also known as ubiquinone because it is a
ubiquitous quinone in biological systems. Ubiquinone is a hydrophobic
quinone that diffuses rapidly within the inner mitochondrial membrane.
Electrons are carried from NADH-Q oxidoreductase to Q-cytochrome
c oxidoreductase, the second complex of the chain, by the reduced form
of Q. Electrons from the FADH2 generated by the citric acid cycle are
transferred first to ubiquinone and then to the Q-cytochrome c
oxidoreductase complex.
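
One way to see how the overall drop in potential is divided among the three proton pumps is to apply ΔG°′ = −nFΔE′0 to each span between carriers. The following is a rough sketch that assumes standard conditions, two electrons per span, and the E′0 values listed in Table 18.1; the assignment of each span to a complex and all names in the code are illustrative assumptions.

```python
FARADAY = 96.48  # kJ mol^-1 V^-1

# Carriers in the order electrons reach them, with E'0 (V) from Table 18.1.
CARRIERS = [
    ("NADH", -0.32),
    ("ubiquinone", +0.10),
    ("cytochrome c", +0.22),
    ("O2", +0.82),
]
COMPLEXES = ["Complex I", "Complex III", "Complex IV"]

# Free energy released in each span for a pair of electrons.
span_dg = {}
for name, (donor, acceptor) in zip(COMPLEXES, zip(CARRIERS, CARRIERS[1:])):
    delta_e = acceptor[1] - donor[1]
    span_dg[name] = -2 * FARADAY * delta_e
    print(f"{name}: {donor[0]} -> {acceptor[0]}, "
          f"dE = {delta_e:+.2f} V, dG = {span_dg[name]:.1f} kJ/mol")

total_dg = sum(span_dg.values())
print(f"total: {total_dg:.1f} kJ/mol")
```

In this estimate the span through Complex III releases the least free energy of the three, and the three spans together account for the full NADH-to-oxygen value.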

Coenzyme Q is a quinone derivative with a long tail consisting of
five-carbon isoprene units that account for its hydrophobic nature. The number
of isoprene units in the tail depends on the species. The most common
mammalian form contains 10 isoprene units (coenzyme Q10). For simplicity,
the subscript will be omitted from this abbreviation because all varieties
function in an identical manner. Quinones can exist in three oxidation
states. In the fully oxidized state (Q), coenzyme Q has two keto groups
(Figure 18.7). The addition of one electron and one proton results in the
semiquinone form (QH·). The semiquinone can lose a proton to form a
semiquinone radical anion (Q·−). The addition of a second electron and
proton to the semiquinone generates ubiquinol (QH2), the fully reduced
form of coenzyme Q, which holds its protons more tightly. Thus, for
quinones, electron-transfer reactions are coupled to proton binding and release, a
property that is key to transmembrane proton transport. Because ubiquinone
is soluble in the membrane, a pool of Q and QH2 (the Q pool) is
thought to exist in the inner mitochondrial membrane.

In contrast with Q, the second special electron carrier is a protein.
Cytochrome c, a small soluble protein, shuttles electrons from Q-cytochrome
c oxidoreductase to cytochrome c oxidase, the final component in the chain
and the one that catalyzes the reduction of O2.


Table 18.2 Components of the mitochondrial electron-transport chain

                                                                        Oxidant or reductant
Enzyme complex                  Mass (kd)  Subunits  Prosthetic group   Matrix side  Membrane core  Cytoplasmic side
NADH-Q oxidoreductase           >900       46        FMN                NADH         Q
                                                     Fe-S
Succinate-Q reductase           140        4         FAD                Succinate    Q
                                                     Fe-S
Q-cytochrome c oxidoreductase   250        11        Heme bH                         Q              Cytochrome c
                                                     Heme bL
                                                     Heme c1
                                                     Fe-S
Cytochrome c oxidase            160        13        Heme a                                         Cytochrome c
                                                     Heme a3
                                                     CuA and CuB

Sources: J. W. DePierre and L. Ernster. Annu. Rev. Biochem. 46:215, 1977; Y. Hatefi. Annu. Rev. Biochem.
54:1015, 1985; and J. E. Walker. Q. Rev. Biophys. 25:253, 1992.


The high-potential electrons of NADH enter the respiratory 
chain at NADH-Q oxidoreductase 


The electrons of NADH enter the chain at NADH-Q oxidoreductase
(also called Complex I and NADH dehydrogenase), an enormous enzyme
(>900 kd) consisting of approximately 46 polypeptide chains. This proton
pump, like the other two in the respiratory chain, is encoded by genes
residing in both the mitochondria and the nucleus. NADH-Q oxidoreductase
is L-shaped, with a horizontal arm lying in the membrane and a vertical arm
that projects into the matrix.
The reaction catalyzed by this enzyme appears to be 


$$\mathrm{NADH} + \mathrm{Q} + 5\,\mathrm{H^+_{matrix}} \longrightarrow \mathrm{NAD^+} + \mathrm{QH_2} + 4\,\mathrm{H^+_{cytoplasm}}$$


The initial step is the binding of NADH and the transfer of its two
high-potential electrons to the flavin mononucleotide (FMN) prosthetic group of
this complex to give the reduced form, FMNH2 (Figure 18.8). The electron
acceptor of FMN, the isoalloxazine ring, is identical with that of FAD.
Electrons are then transferred from FMNH2 to a series of iron-sulfur
clusters, the second type of prosthetic group in NADH-Q oxidoreductase.
Fe-S clusters in iron-sulfur proteins (also called nonheme iron proteins) play a
critical role in a wide range of reduction reactions in biological systems.
Several types of Fe-S clusters are known (Figure 18.9). In the simplest kind,
a single iron ion is tetrahedrally coordinated to the sulfhydryl groups of four
cysteine residues of the protein. A second kind, denoted by 2Fe-2S, contains
two iron ions, two inorganic sulfides, and usually four cysteine residues. A
third type, designated 4Fe-4S, contains four iron ions, four inorganic
sulfides, and four cysteine residues. NADH-Q oxidoreductase contains both
2Fe-2S and 4Fe-4S clusters. Iron ions in these Fe-S complexes cycle
between Fe2+ (reduced) and Fe3+ (oxidized) states. Unlike quinones and
flavins, iron-sulfur clusters generally undergo oxidation-reduction reactions
without releasing or binding protons.

Figure 18.7 Oxidation states of quinones. The reduction of ubiquinone (Q)
to ubiquinol (QH2) proceeds through a semiquinone intermediate (QH·).

Figure 18.8 Oxidation states of flavins.

All of the redox reactions take place in the extramembranous part of
NADH-Q oxidoreductase. Although the details of electron transfer through
this complex remain the subject of ongoing investigation, NADH clearly
binds to a site in the extramembranous domain. NADH transfers its two
electrons to FMN. These electrons flow through a series of Fe-S centers and
then to coenzyme Q. The flow of two electrons from NADH to coenzyme Q
through NADH-Q oxidoreductase leads to the pumping of four hydrogen ions
out of the matrix of the mitochondrion. In accepting two electrons, Q takes up
two protons from the matrix as it is reduced to QH2 (Figure 18.10). The QH2
leaves the enzyme for the hydrophobic interior of the membrane.

It is important to note that the citric acid cycle is not the only source of
mitochondrial NADH. As we will see in Chapter 22, fatty acid degradation,
which also takes place in mitochondria, is another crucial source of NADH
for the electron-transport chain. Moreover, cytoplasmically generated
NADH can be transported into mitochondria for use by the
electron-transport chain (Section 18.5).

Figure 18.9 Iron-sulfur clusters. (A) A single iron ion bound by four cysteine residues.
(B) 2Fe-2S cluster with iron ions bridged by sulfide ions. (C) 4Fe-4S cluster. Each of these clusters
can undergo oxidation-reduction reactions.


Ubiquinol is the entry point for electrons from FADH2 
of flavoproteins 


FADH2 enters the electron-transport chain at the second protein complex
of the chain. Recall that FADH2 is formed in the citric acid cycle, in the
oxidation of succinate to fumarate by succinate dehydrogenase
(Section 17.2). Succinate dehydrogenase, a citric acid cycle enzyme, is
part of the succinate-Q reductase complex (Complex II), an integral
membrane protein of the inner mitochondrial membrane. FADH2 does not
leave the complex. Rather, its electrons are transferred to Fe-S centers and
then finally to Q to form QH2, which then is ready to transfer electrons
further down the electron-transport chain. The succinate-Q reductase
complex, in contrast with NADH-Q oxidoreductase, does not pump protons
from one side of the membrane to the other. Consequently, less ATP is
formed from the oxidation of FADH2 than from NADH.

Figure 18.10 Coupled electron-proton transfer reactions through NADH-Q
oxidoreductase. Electrons flow in Complex I from NADH through FMN and a
series of iron-sulfur clusters to ubiquinone (Q). The electron flow (red
arrows) results in the pumping of four protons and the uptake of two
protons from the mitochondrial matrix. [Based on U. Brandt et al. FEBS
Letters 545:9-17, 2003, Fig. 2.]

Two other enzymes that we will encounter later, glycerol phosphate
dehydrogenase (p. 551) and fatty acyl CoA dehydrogenase (Section 22.2),
likewise transfer their high-potential electrons from FADH2 to Q to form
ubiquinol (QH2), the reduced state of ubiquinone. These enzymes oxidize
glycerol and fats, respectively, providing electrons for oxidative
phosphorylation. These enzymes also do not pump protons.


Electrons flow from ubiquinol to cytochrome c through 
Q-cytochrome c oxidoreductase 


What is the fate of ubiquinol generated by Complexes I and II? The
electrons from QH2 are passed on to cytochrome c by the second of the three
proton pumps in the respiratory chain, Q-cytochrome c oxidoreductase (also
known as Complex III and as cytochrome reductase). The function of
Q-cytochrome c oxidoreductase is to catalyze the transfer of electrons from
QH2 to oxidized cytochrome c (Cyt c), a water-soluble protein, and
concomitantly pump protons out of the mitochondrial matrix. The flow of
a pair of electrons through this complex leads to the effective net transport
of 2 H+ to the cytoplasmic side, half the yield obtained with NADH-Q
oxidoreductase because of a smaller thermodynamic driving force.


$$\mathrm{QH_2} + 2\,\mathrm{Cyt}\ c_{\mathrm{ox}} + 2\,\mathrm{H^+_{matrix}} \longrightarrow \mathrm{Q} + 2\,\mathrm{Cyt}\ c_{\mathrm{red}} + 4\,\mathrm{H^+_{cytoplasm}}$$
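
The stoichiometries given for Complex I and Complex III can be checked for internal consistency by counting transferable hydrogens and charge on each side. The sketch below is a bookkeeping aid only: it assigns each species a hydrogen count and a formal charge relative to its oxidized partner (an assumption made for this illustration, not a statement of true molecular charges), and it does not distinguish matrix protons from cytoplasmic protons.

```python
# (transferable H atoms, formal charge), counted relative to the oxidized
# partner so that only the changes occurring in the reaction matter.
SPECIES = {
    "NADH": (1, 0), "NAD+": (0, +1),
    "Q": (0, 0), "QH2": (2, 0),
    "H+": (1, +1),
    "Cyt c (ox)": (0, +3), "Cyt c (red)": (0, +2),
}


def totals(side: dict) -> tuple:
    """Sum hydrogens and charge over one side of a reaction,
    given as {species name: stoichiometric coefficient}."""
    hydrogens = sum(coef * SPECIES[name][0] for name, coef in side.items())
    charge = sum(coef * SPECIES[name][1] for name, coef in side.items())
    return hydrogens, charge


def is_balanced(left: dict, right: dict) -> bool:
    return totals(left) == totals(right)


complex_i = ({"NADH": 1, "Q": 1, "H+": 5},
             {"NAD+": 1, "QH2": 1, "H+": 4})
complex_iii = ({"QH2": 1, "Cyt c (ox)": 2, "H+": 2},
               {"Q": 1, "Cyt c (red)": 2, "H+": 4})

print("Complex I balanced:", is_balanced(*complex_i))
print("Complex III balanced:", is_balanced(*complex_iii))
```

Both equations balance under this scheme, which also makes clear why Complex III releases four protons on the cytoplasmic side while taking up only two from the matrix: the other two come from QH2 itself.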


Q-cytochrome c oxidoreductase itself contains two types of cytochromes, 
named b and c₁ (Figure 18.11). A cytochrome is an electron-transferring 
protein that contains a heme prosthetic group. The iron ion of a cytochrome 
alternates between a reduced ferrous (+2) state and an oxidized ferric (+3) 
state during electron transport. The two cytochrome subunits of 
Q-cytochrome c oxidoreductase contain a total of three hemes: two hemes 
within cytochrome b, termed heme bL (L for low affinity) and heme bH 
(H for high affinity), and one heme within cytochrome c₁. The heme 
prosthetic group in cytochromes b, c₁, and c is iron-protoporphyrin IX, the 
same heme present in myoglobin and hemoglobin (Section 7.1). These identical 
hemes have different electron affinities because they are in different 
polypeptide environments. For example, heme bL, which is located in a 
cluster of helices near the cytoplasmic face of the membrane, has lower 
affinity for an electron than does heme bH, which is near the matrix side. 
Q-cytochrome c oxidoreductase is also known as cytochrome bc₁ after its 
cytochrome groups. 

Figure 18.11 Structure of Q-cytochrome c oxidoreductase (cytochrome bc₁). 
This enzyme is a homodimer with 11 distinct polypeptide chains. Notice that 
the major prosthetic groups, three hemes and a 2Fe-2S cluster, are located 
either near the cytoplasmic edge of the complex bordering the intermembrane 
space (top) or in the region embedded in the membrane (α helices represented 
by tubes). They are well positioned to mediate the electron-transfer 
reactions between quinones in the membrane and cytochrome c in the 
intermembrane space. [Drawn from 1BCC.pdb.] Labels in the figure: Rieske 
iron-sulfur center; Cys; His. 

In addition to the hemes, the enzyme contains an iron-sulfur protein 
with a 2Fe-2S center. This center, termed the Rieske center, is unusual in 
that one of the iron ions is coordinated by two histidine residues rather than 
two cysteine residues. This coordination stabilizes the center in its reduced 
form, raising its reduction potential so that it can readily accept electrons 
from QH₂. 


The Q cycle funnels electrons from a two-electron carrier to a 
one-electron carrier and pumps protons 


QH₂ passes two electrons to Q-cytochrome c oxidoreductase, but the acceptor 
of electrons in this complex, cytochrome c, can accept only one electron. 
How does the switch from the two-electron carrier ubiquinol to the 
one-electron carrier cytochrome c take place? The mechanism for the coupling 
of electron transfer from Q to cytochrome c to transmembrane proton 
transport is known as the Q cycle (Figure 18.12). Two QH₂ molecules bind to 
the complex consecutively, each giving up two electrons and two H⁺. These 
protons are released to the cytoplasmic side of the membrane. The first QH₂ 
to exit the Q pool binds to the first Q binding site (Qo), and its two 
electrons travel through the complex to different destinations. One electron 
flows, first, to the Rieske 2Fe-2S cluster; then, to cytochrome c₁; and, 
finally, to a molecule of oxidized cytochrome c, converting it into its 
reduced form. The reduced cytochrome c molecule is free to diffuse away from 
the enzyme to continue down the respiratory chain. 

Figure 18.12 Q cycle. The Q cycle takes place in Complex III, which is 
represented in outline form. In the first half of the cycle, two electrons 
of a bound QH₂ are transferred, one to cytochrome c and the other to a bound 
Q in a second binding site to form the semiquinone radical anion Q•⁻. The 
newly formed Q dissociates and enters the Q pool. In the second half of the 
cycle, a second QH₂ also gives up its electrons to Complex III, one to a 
second molecule of cytochrome c and the other to reduce Q•⁻ to QH₂. This 
second electron transfer results in the uptake of two protons from the 
matrix. The path of electron transfer is shown in red. Panel titles: First 
half of Q cycle; Second half of Q cycle. 

The second electron passes through two heme groups of cytochrome b 
to an oxidized ubiquinone in a second Q binding site (Qi). The Q in the 
second binding site is reduced to a semiquinone radical anion (Q•⁻) by the 
electron from the first QH₂. The now fully oxidized Q leaves the first Q 
site, free to reenter the Q pool. 

A second molecule of QH₂ binds to the Qo site of Q-cytochrome c 
oxidoreductase and reacts in the same way as the first. One of the electrons 
is transferred to cytochrome c. The second electron passes through the two 
heme groups of cytochrome b to partly reduced ubiquinone bound in the Qi 
binding site. On the addition of the electron from the second QH₂ molecule, 
this quinone radical anion takes up two protons from the matrix side to form 
QH₂. The removal of these two protons from the matrix contributes to the 
formation of the proton gradient. In sum, four protons are released on the 
cytoplasmic side, and two protons are removed from the mitochondrial matrix. 

$$2\,\mathrm{QH_2} + \mathrm{Q} + 2\,\mathrm{Cyt}\ c_{\mathrm{ox}} + 2\,\mathrm{H^+_{matrix}} \longrightarrow 2\,\mathrm{Q} + \mathrm{QH_2} + 2\,\mathrm{Cyt}\ c_{\mathrm{red}} + 4\,\mathrm{H^+_{cytoplasm}}$$


In one Q cycle, two QH₂ molecules are oxidized to form two Q molecules, 
and then one Q molecule is reduced to QH₂. The problem of how 
to efficiently funnel electrons from a two-electron carrier (QH₂) to a 
one-electron carrier (cytochrome c) is solved by the Q cycle. The 
cytochrome b component of the reductase is in essence a recycling device that 
enables both electrons of QH₂ to be used effectively. 
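The bookkeeping of the Q cycle can be checked with a short tally. The following Python sketch is only an illustration of the stoichiometry described above; the function name, the dictionary keys, and the string labels for the state of the second Q binding site are hypothetical and chosen for readability.

```python
from collections import Counter


def run_q_cycle():
    """Tally one full Q cycle (two QH2 oxidations) as described in the text."""
    tally = Counter()
    qi_site = "Q"  # oxidized ubiquinone bound at the second Q binding site

    for _ in range(2):  # first half, then second half of the cycle
        # QH2 binds at the first site and gives up two electrons and two H+.
        tally["QH2_oxidized"] += 1
        tally["H_released_cytoplasm"] += 2
        # Electron 1: Rieske 2Fe-2S cluster -> cytochrome c1 -> cytochrome c.
        tally["cyt_c_reduced"] += 1
        # Electron 2: through the two hemes of cytochrome b to the second site.
        if qi_site == "Q":
            qi_site = "semiquinone"  # first half: Q becomes the radical anion
        else:
            # Second half: the radical anion takes up 2 H+ from the matrix.
            tally["H_taken_from_matrix"] += 2
            tally["QH2_regenerated"] += 1
            qi_site = "QH2"
        # The fully oxidized Q leaves the first site and reenters the Q pool.
        tally["Q_released"] += 1
    return tally


tally = run_q_cycle()
net_qh2_consumed = tally["QH2_oxidized"] - tally["QH2_regenerated"]
print(dict(tally), "net QH2 consumed:", net_qh2_consumed)
```

The tally reproduces the summary in the text: two cytochrome c molecules reduced, four protons released on the cytoplasmic side, two protons removed from the matrix, and a net consumption of one QH₂ per cycle.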


Cytochrome c oxidase catalyzes the reduction of 
molecular oxygen to water 


The last of the three proton-pumping assemblies of the respiratory chain is 
cytochrome c oxidase (Complex IV). Cytochrome c oxidase catalyzes the 
transfer of electrons from the reduced form of cytochrome c to molecular 
oxygen, the final acceptor. 


$$4\,\mathrm{Cyt}\ c_{\mathrm{red}} + 8\,\mathrm{H^+_{matrix}} + \mathrm{O_2} \longrightarrow 4\,\mathrm{Cyt}\ c_{\mathrm{ox}} + 2\,\mathrm{H_2O} + 4\,\mathrm{H^+_{cytoplasm}}$$

Figure 18.13 Structure of cytochrome c oxidase. This enzyme consists of 13 polypeptide 
chains. Notice that most of the complex, as well as two major prosthetic groups (heme a and 
heme a₃-CuB) are embedded in the membrane (α helices represented by vertical tubes). Heme 
a₃-CuB is the site of the reduction of oxygen to water. The CuA/CuA prosthetic group is positioned 
near the intermembrane space to better accept electrons from cytochrome c. CO(bb) is a carbonyl 
group of the peptide backbone. [Drawn from 2OCC.pdb.] 


The requirement of oxygen for this reaction is what makes “aerobic” 
organisms aerobic. Obtaining oxygen for this reaction is the reason that 
human beings must breathe. Four electrons are funneled to O₂ to completely 
reduce it to H₂O, and, concomitantly, protons are pumped from the matrix to 
the cytoplasmic side of the inner mitochondrial membrane. This reaction is 
quite thermodynamically favorable. From the reduction potentials in 
Table 18.1, the standard free-energy change for this reaction is calculated to 
be ΔG°′ = −231.8 kJ mol⁻¹ (−55.4 kcal mol⁻¹). As much of this free 
energy as possible must be captured in the form of a proton gradient for 
subsequent use in ATP synthesis. 
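
The free-energy figure quoted above follows from the relation ΔG°′ = −nFΔE′₀. The Python sketch below illustrates that arithmetic. The two reduction potentials (+0.82 V for the O₂/H₂O couple and +0.22 V for cytochrome c) are assumed here as typical tabulated values, because Table 18.1 is not reproduced in this passage; the function and constant names are hypothetical.

```python
FARADAY_KJ_PER_V_MOL = 96.485  # Faraday constant in kJ mol^-1 V^-1


def standard_free_energy(n_electrons, e_acceptor_v, e_donor_v):
    """Return delta G (kJ/mol) from delta G = -n * F * delta E."""
    delta_e = e_acceptor_v - e_donor_v
    return -n_electrons * FARADAY_KJ_PER_V_MOL * delta_e


# Assumed standard reduction potentials (volts); not taken from this passage.
E_O2_TO_WATER = 0.82
E_CYTOCHROME_C = 0.22

# Four electrons pass from four cytochrome c molecules to one O2.
dg_oxidase = standard_free_energy(4, E_O2_TO_WATER, E_CYTOCHROME_C)
print(f"delta G for the oxidase reaction: {dg_oxidase:.1f} kJ/mol")
```

With these assumed inputs, the result lands within a fraction of a kilojoule of the −231.8 kJ mol⁻¹ quoted in the text; the small difference reflects rounding in the constants.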

Bovine cytochrome c oxidase is reasonably well understood at the struc- 
tural level (Figure 18.13). It consists of 13 subunits, 3 of which are encoded 
by the mitochondrion’s own genome. Cytochrome c oxidase contains two 
heme A groups and three copper ions, arranged as two copper centers, 
designated A and B. One center, CuA/CuA, contains two copper ions linked by 
two bridging cysteine residues. This center initially accepts electrons from 
reduced cytochrome c. The remaining copper ion, CuB, is coordinated by 
three histidine residues, one of which is modified by covalent linkage to a 
tyrosine residue. The copper centers alternate between the reduced Cu⁺ 
(cuprous) form and the oxidized Cu²⁺ (cupric) form as they accept and 
donate electrons. 

There are two heme A molecules, called heme a and heme a₃, in 
cytochrome c oxidase. Heme A differs from the heme in cytochrome c and c₁ in 
three ways: (1) a formyl group replaces a methyl group, (2) a C₁₇ 
hydrocarbon chain replaces one of the vinyl groups, and (3) the heme is not 
covalently attached to the protein. 


Heme A 


Heme a and heme a₃ have distinct redox potentials because they are located 
in different environments within cytochrome c oxidase. An electron flows 
from cytochrome c to CuA/CuA, to heme a to heme a₃ to CuB, and finally 
to O₂. Heme a₃ and CuB are directly adjacent. Together, heme a₃ and CuB 
form the active center at which O₂ is reduced to H₂O. 

Four molecules of cytochrome c bind consecutively to the enzyme and 
transfer an electron to reduce one molecule of O2 to H2O (Figure 18.14). 


Figure 18.14 Cytochrome c oxidase mechanism. The cycle begins and ends with 
all prosthetic groups in their oxidized forms (shown in blue). Reduced forms 
are in red. Four cytochrome c molecules donate four electrons, which, in 
allowing the binding and cleavage of an O₂ molecule, also make possible the 
import of four H⁺ from the matrix to form two molecules of H₂O, which are 
released from the enzyme to regenerate the initial state. Step labels in the 
figure: (1) Two molecules of cytochrome c sequentially transfer electrons to 
reduce CuB and heme a₃. (2) Reduced CuB and Fe in heme a₃ bind O₂, which 
forms a peroxide bridge. (3) The addition of two more electrons and two more 
protons cleaves the peroxide bridge. (4) The addition of two more protons 
leads to the release of water. 

Figure 18.15 Peroxide bridge. The oxygen bound to heme a₃ is reduced to 
peroxide by the presence of CuB. 

Figure 18.16 Proton transport by cytochrome c oxidase. Four protons are 
taken up from the matrix side to reduce one molecule of O₂ to two molecules 
of H₂O. These protons are called “chemical protons” because they participate 
in a clearly defined reaction with O₂. Four additional “pumped” protons are 
transported out of the matrix and released on the cytoplasmic side in the 
course of the reaction. The pumped protons double the efficiency of 
free-energy storage in the form of a proton gradient for this final step in 
the electron-transport chain. Labels in the figure: Cyt c reduced; Cyt c 
oxidized; 4 H⁺ pumped; 4 H⁺ chemical. 


1. Electrons from two molecules of reduced cytochrome c flow down an 
electron-transfer pathway within cytochrome c oxidase, one stopping at 
CuB and the other at heme a₃. With both centers in the reduced state, they 
together can now bind an oxygen molecule. 


2. As molecular oxygen binds, it abstracts an electron from each of the 
nearby ions in the active center to form a peroxide (O₂²⁻) bridge between 
them (Figure 18.15). 


3. Two more molecules of cytochrome c bind and release electrons that 
travel to the active center. The addition of an electron as well as H⁺ to 
each oxygen atom reduces the two ion-oxygen groups to CuB²⁺-OH 
and Fe³⁺-OH. 


4. Reaction with two more H⁺ ions allows the release of two molecules of 
H₂O and resets the enzyme to its initial, fully oxidized form. 


$$4\,\mathrm{Cyt}\ c_{\mathrm{red}} + 4\,\mathrm{H^+_{matrix}} + \mathrm{O_2} \longrightarrow 4\,\mathrm{Cyt}\ c_{\mathrm{ox}} + 2\,\mathrm{H_2O}$$


The four protons in this reaction come exclusively from the matrix. Thus, the 
consumption of these four protons contributes directly to the proton 
gradient. Recall that each proton contributes 21.8 kJ mol⁻¹ (5.2 kcal mol⁻¹) 
to the free energy associated with the proton gradient; so these four protons 
contribute 87.2 kJ mol⁻¹ (20.8 kcal mol⁻¹), an amount substantially less 
than the free energy available from the reduction of oxygen to water. What 
is the fate of this missing energy? Remarkably, cytochrome c oxidase uses this 
energy to pump four additional protons from the matrix to the cytoplasmic side 
of the membrane in the course of each reaction cycle for a total of eight protons 
removed from the matrix (Figure 18.16). The details of how these protons are 
transported through the protein are still under study. However, two effects 
contribute to the mechanism. First, charge neutrality tends to be 
maintained in the interior of proteins. Thus, the addition of an electron to a 
site inside a protein tends to favor the binding of H⁺ to a nearby site. Second, 
conformational changes take place, particularly around the heme a₃-CuB 
center, in the course of the reaction cycle. Presumably, in one conformation, 
protons may enter the protein exclusively from the matrix side, whereas, in 
another, they may exit exclusively to the cytoplasmic side. Thus, the overall 
process catalyzed by cytochrome c oxidase is 


$$4\,\mathrm{Cyt}\ c_{\mathrm{red}} + 8\,\mathrm{H^+_{matrix}} + \mathrm{O_2} \longrightarrow 4\,\mathrm{Cyt}\ c_{\mathrm{ox}} + 2\,\mathrm{H_2O} + 4\,\mathrm{H^+_{cytoplasm}}$$
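
The energy accounting in this passage can be laid out explicitly. The Python sketch below uses only the two figures given in the text (21.8 kJ mol⁻¹ per proton and 231.8 kJ mol⁻¹ available from the reduction of O₂); the variable and function names are hypothetical, and the "fraction captured" is simply the ratio of those numbers, not a measured efficiency.

```python
ENERGY_PER_PROTON_KJ = 21.8   # kJ/mol stored per proton moved, from the text
AVAILABLE_ENERGY_KJ = 231.8   # kJ/mol released by reducing O2, from the text


def gradient_energy(protons_removed_from_matrix):
    """Free energy (kJ/mol) stored in the gradient by removing protons."""
    return protons_removed_from_matrix * ENERGY_PER_PROTON_KJ


chemical_only = gradient_energy(4)        # four "chemical" protons
chemical_and_pumped = gradient_energy(8)  # plus four "pumped" protons
fraction_captured = chemical_and_pumped / AVAILABLE_ENERGY_KJ

print(f"chemical protons only: {chemical_only:.1f} kJ/mol")
print(f"chemical + pumped:     {chemical_and_pumped:.1f} kJ/mol")
print(f"fraction of available energy stored: {fraction_captured:.2f}")
```

On these numbers, the four chemical protons alone would store well under half of the available free energy, whereas all eight protons store roughly three-quarters of it.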


Figure 18.17 summarizes the flow of electrons from NADH and FADH₂ 
through the respiratory chain. This series of exergonic reactions is coupled 
to the pumping of protons from the matrix. As we will see shortly, the 
energy inherent in the proton gradient will be used to synthesize ATP. 


Toxic derivatives of molecular oxygen such as superoxide radical are 
scavenged by protective enzymes 


As discussed earlier, molecular oxygen is an ideal terminal electron accep- 
tor, because its high affinity for electrons provides a large thermodynamic 
driving force. However, danger lurks in the reduction of O₂. The transfer of 
four electrons leads to safe products (two molecules of H₂O), but partial 
reduction generates hazardous compounds. In particular, the transfer of a 
single electron to O₂ forms superoxide anion, whereas the transfer of two 
electrons yields peroxide. 

$$\mathrm{O_2} \xrightarrow{e^-} \mathrm{O_2^{\bullet -}} \xrightarrow{e^-} \mathrm{O_2^{2-}}$$

Figure 18.17 The electron-transport chain. High-energy electrons in the 
form of NADH and FADH₂ are generated by the citric acid cycle. These 
electrons flow through the respiratory chain, which powers proton pumping 
and results in the reduction of O₂. Labels in the figure: Acetyl CoA; Citric 
acid cycle. 


Both compounds are potentially destructive. The strategy for the safe 
reduction of O₂ is clear: the catalyst does not release partly reduced 
intermediates. Cytochrome c oxidase meets this crucial criterion by holding O₂ 
tightly between Fe and Cu ions. 

Although cytochrome c oxidase and other proteins that reduce O₂ are 
remarkably successful in not releasing intermediates, small amounts of 
superoxide anion and hydrogen peroxide are unavoidably formed. 
Superoxide, hydrogen peroxide, and species that can be generated from 
them such as OH• are collectively referred to as reactive oxygen species or 
ROS. Oxidative damage caused by ROS has been implicated in the aging 
process as well as in a growing list of diseases (Table 18.3). 

What are the cellular defense strategies against oxidative damage by 
ROS? Chief among them is the enzyme superoxide dismutase. This enzyme 
scavenges superoxide radicals by catalyzing the conversion of two of these 
radicals into hydrogen peroxide and molecular oxygen. 


Dismutation: A reaction in which a single reactant is converted into two 
different products. 


$$2\,\mathrm{O_2^{\bullet -}} + 2\,\mathrm{H^+} \xrightarrow{\text{Superoxide dismutase}} \mathrm{O_2} + \mathrm{H_2O_2}$$


Eukaryotes contain two forms of this enzyme, a manganese-containing 
version located in mitochondria and a copper- and zinc-dependent cytoplasmic 
form. These enzymes perform the dismutation reaction by a similar 
mechanism (Figure 18.18). The oxidized form of the enzyme is reduced by 
superoxide to form oxygen. The reduced form of the enzyme, formed in this 
reaction, then reacts with a second superoxide ion to form peroxide, which 
takes up two protons along the reaction path to yield hydrogen peroxide. 


Table 18.3 Pathological and other conditions that may entail free-radical injury 

Atherogenesis 
Emphysema; bronchitis 
Parkinson disease 
Duchenne muscular dystrophy 
Cervical cancer 
Alcoholic liver disease 
Diabetes 
Acute renal failure 
Down syndrome 
Retrolental fibroplasia (conversion of the retina into a fibrous mass in premature infants) 
Cerebrovascular disorders 
Ischemia; reperfusion injury 

Source: After D. B. Marks, A. D. Marks, and C. M. Smith, Basic Medical Biochemistry: A Clinical Approach 
(Williams & Wilkins, 1996), p. 331. 


Figure 18.18 Superoxide dismutase mechanism. The oxidized form of 
superoxide dismutase (Mox) reacts with one superoxide ion to form O₂ and 
generate the reduced form of the enzyme (Mred). The reduced form then reacts 
with a second superoxide and two protons to form hydrogen peroxide and 
regenerate the oxidized form of the enzyme. 

The hydrogen peroxide formed by superoxide dismutase and by other 
processes is scavenged by catalase, a ubiquitous heme protein that catalyzes 
the dismutation of hydrogen peroxide into water and molecular oxygen. 


$$2\,\mathrm{H_2O_2} \xrightarrow{\text{Catalase}} \mathrm{O_2} + 2\,\mathrm{H_2O}$$


Superoxide dismutase and catalase are remarkably efficient, performing 
their reactions at or near the diffusion-limited rate (Section 8.4). Glutathione 
peroxidase also plays a role in scavenging H₂O₂ (Section 20.5). Other 
cellular defenses against oxidative damage include the antioxidant vitamins, 
vitamins E and C. Because it is lipophilic, vitamin E is especially useful in 
protecting membranes from lipid peroxidation. 

A long-term benefit of exercise may be to increase the amount of super- 
oxide dismutase in the cell. The elevated aerobic metabolism during exer- 
cise causes more ROS to be generated. In response, the cell synthesizes 
more protective enzymes. The net effect is one of protection, because the 
increase in superoxide dismutase more effectively protects the cell during 
periods of rest. 

Despite the fact that reactive oxygen species are known hazards, recent 
evidence suggests that, under certain circumstances, the controlled 
generation of these molecules may be an important component of 
signal-transduction pathways. For instance, growth factors have been shown to 
increase ROS levels as part of their signaling pathway, and ROS regulate 
channels and transcription factors. The dual role of ROS is an excellent 
example of the wondrous complexity of the biochemistry of living systems: even 
potentially harmful substances can be harnessed to play useful roles. 


Electrons can be transferred between groups that are 
not in contact 


How are electrons transferred between electron-carrying groups of the 
respiratory chain? This question is intriguing because these groups are 
frequently buried in the interior of a protein and are not in direct contact 
with one another. Electrons can move through space, but the rate of transfer 
falls off rapidly as the donor and the acceptor move apart: in a vacuum, by 
a factor of 10 for every 0.8 Å. The protein environment provides 
more-efficient pathways for electron conduction: the rate of electron 
transfer typically decreases by a factor of 10 every 1.7 Å (Figure 18.19). 
For groups in contact, electron-transfer reactions can be quite fast, with 
rates of approximately 10¹³ s⁻¹. Within proteins in the electron-transport 
chain, electron-carrying groups are typically separated by 15 Å beyond their 
van der Waals contact distance. For such separations, we expect 
electron-transfer rates of approximately 10⁴ s⁻¹ (i.e., electron transfer in 
less than 1 ms), assuming that all other factors are optimal. Without the 
mediation of the protein, an electron transfer over this distance would take 
approximately 1 day. 

Figure 18.19 Distance dependence of electron-transfer rate. The rate of 
electron transfer decreases as the electron donor and the electron acceptor 
move apart. In a vacuum, the rate decreases by a factor of 10 for every 
increase of 0.8 Å. In proteins, the rate decreases more gradually, by a 
factor of 10 for every increase of 1.7 Å. This rate is only approximate 
because variations in the structure of the intervening protein medium can 
affect the rate. Horizontal axis: distance (Å) beyond van der Waals contact. 
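
The distance rule described here can be written as a one-line formula: the rate falls by a factor of 10 for each "decade length" of separation. The Python sketch below is a rough illustration under that assumption alone. It takes the contact rate of about 10¹³ s⁻¹ and the decade lengths of 1.7 Å (protein) and 0.8 Å (vacuum) from the text and Figure 18.19; the function and constant names are hypothetical, and real rates also depend on factors that this rule ignores.

```python
CONTACT_RATE_PER_S = 1e13      # approximate rate for groups in contact
DECADE_LENGTH_PROTEIN_A = 1.7  # angstroms per tenfold drop inside a protein
DECADE_LENGTH_VACUUM_A = 0.8   # angstroms per tenfold drop in a vacuum
SECONDS_PER_DAY = 86_400


def transfer_rate(distance_angstrom, decade_length_angstrom):
    """Estimated electron-transfer rate (s^-1) at a given separation."""
    decades_lost = distance_angstrom / decade_length_angstrom
    return CONTACT_RATE_PER_S * 10 ** (-decades_lost)


separation = 15.0  # angstroms beyond van der Waals contact
rate_protein = transfer_rate(separation, DECADE_LENGTH_PROTEIN_A)
rate_vacuum = transfer_rate(separation, DECADE_LENGTH_VACUUM_A)

print(f"protein: {rate_protein:.1e} per s, i.e. {1e3 / rate_protein:.2f} ms per transfer")
print(f"vacuum:  {1 / rate_vacuum / SECONDS_PER_DAY:.1f} days per transfer")
```

With these rounded parameters, the protein-mediated estimate is on the order of 10⁴ s⁻¹ (well under 1 ms per transfer), and the vacuum estimate comes out at a time scale of days, broadly in line with the figures quoted in the text.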

The case is more complicated when electrons must be transferred 
between two distinct proteins, such as when cytochrome c accepts electrons 
from Complex III or passes them on to Complex IV. A series of 
hydrophobic interactions brings the heme groups of cytochrome c and c₁ to 
within 4.5 Å of each other, with the iron atoms separated by 17.4 Å. This 
distance could allow cytochrome c reduction at a rate of 8.3 × 10⁶ s⁻¹. 


The conformation of cytochrome c has remained essentially constant 
for more than a billion years 


Cytochrome c is present in all organisms having mitochondrial respiratory 
chains: plants, animals, and eukaryotic microorganisms. This electron 
carrier evolved more than 1.5 billion years ago, before the divergence of 
plants and animals. Its function has been conserved throughout this period, 
as evidenced by the fact that the cytochrome c of any eukaryotic species reacts 
in vitro with the cytochrome c oxidase of any other species tested thus far. 
For example, wheat-germ cytochrome c reacts with human cytochrome 
c oxidase. Additionally, some prokaryotic cytochromes, such as cytochrome 
c₂ from the photosynthetic bacterium Rhodospirillum rubrum and 
cytochrome c₅₅₀ from the denitrifying bacterium Paracoccus denitrificans, 
closely resemble cytochrome c from tuna-heart mitochondria (Figure 18.20). 
This evidence attests to an efficient evolutionary solution to electron 
transfer bestowed by the structural and functional characteristics of 
cytochrome c. 


Rhodospirillum rubrum Paracoccus denitrificans 


The resemblance among cytochrome c molecules extends to the level of 
amino acid sequence. Because of the molecule’s small size and ubiquity, the 
amino acid sequences of cytochrome c from more than 80 widely ranging 
eukaryotic species have been determined by direct protein sequencing by 
Emil Smith, Emanuel Margoliash, and others. The striking finding is that 
21 of 104 residues have been invariant for more than one and a half billion 
years of evolution. A phylogenetic tree, constructed from the amino acid 
sequences of cytochrome c, reveals the evolutionary relationships between 
many animal species (Figure 18.21). 


18.4 A Proton Gradient Powers the Synthesis of ATP 


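Thus far, we have considered the flow of electrons from NADH to O₂, an 
exergonic process. 

$$\mathrm{NADH} + \tfrac{1}{2}\,\mathrm{O_2} + \mathrm{H^+} \rightleftharpoons \mathrm{H_2O} + \mathrm{NAD^+}$$

$$\Delta G^{\circ\prime} = -220.1\ \mathrm{kJ\ mol^{-1}}\ (-52.6\ \mathrm{kcal\ mol^{-1}})$$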




Figure 18.20 Conservation of the three-dimensional structure of 
cytochrome c. The side chains are shown for the 21 conserved amino acids and 
the heme. [Drawn from 3CYT.pdb, 3C2C.pdb, and 155C.pdb.] 

Figure 18.21 Evolutionary tree constructed from sequences of 
cytochrome c. Branch lengths are proportional to the number of amino acid 
changes that are believed to have occurred. This drawing is an adaptation of 
the work of Walter M. Fitch and Emanuel Margoliash. 

Some have argued that, along with the elucidation of the structure of DNA, 
the discovery that ATP synthesis is powered by a proton gradient is one of 
the two major advances in biology in the twentieth century. Mitchell's 
initial postulation of the chemiosmotic theory was not warmly received by 
all. Efraim Racker, one of the early investigators of ATP synthase, recalls 
that some thought of Mitchell as a court jester, whose work was of no 
consequence. Peter Mitchell was awarded the Nobel Prize in chemistry in 1978 
for his contributions to understanding oxidative phosphorylation. 


Figure 18.22 Chemiosmotic hypothesis. 
Electron transfer through the respiratory chain 
leads to the pumping of protons from the 
matrix to the cytoplasmic side of the inner 
mitochondrial membrane. The pH gradient 
and membrane potential constitute a proton- 
motive force that is used to drive ATP 
synthesis. 


Next, we consider how this process is coupled to the synthesis of ATP, an 
endergonic process. 


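$$\mathrm{ADP} + \mathrm{P_i} + \mathrm{H^+} \rightleftharpoons \mathrm{ATP} + \mathrm{H_2O}$$

$$\Delta G^{\circ\prime} = +30.5\ \mathrm{kJ\ mol^{-1}}\ (+7.3\ \mathrm{kcal\ mol^{-1}})$$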


A molecular assembly in the inner mitochondrial membrane carries out the 
synthesis of ATP. This enzyme complex was originally called the 
mitochondrial ATPase or F₁Fo ATPase because it was discovered through its catalysis 
of the reverse reaction, the hydrolysis of ATP. ATP synthase, its preferred 
name, emphasizes its actual role in the mitochondrion. It is also called 
Complex V. 

How is the oxidation of NADH coupled to the phosphorylation of 
ADP? Electron transfer was first suggested to lead to the formation of a 
covalent high-energy intermediate that serves as a compound having a high 
phosphoryl-transfer potential, analogous to the generation of ATP by the 
formation of 1,3-bisphosphoglycerate in glycolysis (Section 16.1). An alter- 
native proposal was that electron transfer aids the formation of an activated 
protein conformation, which then drives ATP synthesis. The search for 
such intermediates for several decades proved fruitless. 

In 1961, Peter Mitchell suggested a radically different mechanism, the 
chemiosmotic hypothesis. He proposed that electron transport and ATP 
synthesis are coupled by a proton gradient across the inner mitochondrial 
membrane. In his model, the transfer of electrons through the respiratory 
chain leads to the pumping of protons from the matrix to the cytoplasmic 
side of the inner mitochondrial membrane. The H⁺ concentration becomes 
lower in the matrix, and an electric field with the matrix side negative is 
generated (Figure 18.22). Protons then flow back into the matrix to equalize 
the distribution. Mitchell’s idea was that this flow of protons drives the 
synthesis of ATP by ATP synthase. The energy-rich unequal distribution 
of protons is called the proton-motive force. The proton-motive force can be 
thought of as being composed of two components: a chemical gradient and 
a charge gradient. The chemical gradient for protons can be represented as 
a pH gradient. The charge gradient is created by the positive charge on the 
unequally distributed protons forming the chemical gradient. Mitchell pro- 
posed that both components power the synthesis of ATP. 


Proton-motive force (Δp) = 
chemical gradient (ΔpH) + charge gradient (Δψ) 


Figure 18.22 labels: Outer mitochondrial membrane; Inner mitochondrial 
membrane; Intermembrane space; Matrix. Protons are pumped across the inner 
mitochondrial membrane as electrons flow through the respiratory chain. 


Figure 18.23 Testing the chemiosmotic hypothesis. ATP is synthesized when 
reconstituted membrane vesicles containing bacteriorhodopsin (a light-driven 
proton pump) and ATP synthase are illuminated. The orientation of ATP 
synthase in this reconstituted membrane is the reverse of that in the 
mitochondrion. Labels in the figure: Bacteriorhodopsin in synthetic vesicle; 
H⁺; ATP synthase. 

Mitchell’s highly innovative hypothesis that oxidation and phosphoryla- 
tion are coupled by a proton gradient is now supported by a wealth of evi- 
dence. Indeed, electron transport does generate a proton gradient across the 
inner mitochondrial membrane. The pH outside is 1.4 units lower than 
inside, and the membrane potential is 0.14 V, the outside being positive. As 
calculated on page 531, this membrane potential corresponds to a free energy 
of 21.8 kJ (5.2 kcal) per mole of protons. 
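
The 21.8 kJ figure combines the two components of the proton-motive force: the pH difference and the membrane potential. The Python sketch below illustrates the arithmetic using ΔG = 2.303 RT ΔpH + FΔψ for a singly charged proton. The temperature of 310 K (37 °C) is an assumption made here, because the calculation on page 531 is not reproduced in this passage; the function and constant names are hypothetical.

```python
import math

GAS_CONSTANT_KJ = 8.314e-3     # kJ mol^-1 K^-1
FARADAY_KJ_PER_V_MOL = 96.485  # kJ mol^-1 V^-1


def proton_gradient_energy(delta_ph, delta_psi_volts, temperature_k=310.0):
    """Return (chemical, electrical, total) free energy in kJ per mole of H+.

    temperature_k defaults to 310 K, an assumed physiological temperature.
    """
    chemical = math.log(10) * GAS_CONSTANT_KJ * temperature_k * delta_ph
    electrical = FARADAY_KJ_PER_V_MOL * delta_psi_volts  # charge of H+ is +1
    return chemical, electrical, chemical + electrical


# Values from the text: 1.4 pH units and a membrane potential of 0.14 V.
chemical, electrical, total = proton_gradient_energy(1.4, 0.14)
print(f"pH component:        {chemical:.1f} kJ/mol")
print(f"potential component: {electrical:.1f} kJ/mol")
print(f"total:               {total:.1f} kJ/mol")
```

Under these assumptions the membrane potential accounts for somewhat more than half of the total, and the sum agrees with the 21.8 kJ per mole of protons quoted above.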

An artificial system was created to elegantly demonstrate the basic prin- 
ciple of the chemiosmotic hypothesis. The role of the respiratory chain was 
played by bacteriorhodopsin, a membrane protein from halobacteria that 
pumps protons when illuminated. Synthetic vesicles containing 
bacteriorhodopsin and mitochondrial ATP synthase purified from beef heart were 
created (Figure 18.23). When the vesicles were exposed to light, ATP was 
formed. This key experiment clearly showed that the respiratory chain and 
ATP synthase are biochemically separate systems, linked only by a 
proton-motive force. 


ATP synthase is composed of a proton-conducting unit 
and a catalytic unit 


Two parts of the puzzle of how NADH oxidation is coupled to ATP synthe- 
sis are now evident: (1) electron transport generates a proton-motive force; 
(2) ATP synthesis by ATP synthase can be powered by a proton-motive 
force. How is the proton-motive force converted into the high phosphoryl- 
transfer potential of ATP? 

Biochemical, electron microscopic, and crystallographic studies of ATP 
synthase have revealed many details of its structure (Figure 18.24). It is a 
large, complex enzyme that looks like a ball on a stick. Much of the “stick” 
part, called the Fo subunit, is embedded in the inner mitochondrial 
membrane. The 85-Å-diameter ball, called the F₁ subunit, protrudes into the 
mitochondrial matrix. The F₁ subunit contains the catalytic activity of the 
synthase. In fact, isolated F₁ subunits display ATPase activity. 

The F₁ subunit consists of five types of polypeptide chains (α₃, β₃, γ, δ, 
and ε) with the indicated stoichiometry. The α and β subunits, which make 
up the bulk of the F₁, are arranged alternately in a hexameric ring; they are 
homologous to one another and are members of the P-loop NTPase family 
(Section 9.4). Both bind nucleotides but only the β subunits participate 
directly in catalysis. Beginning just below the α and β subunits is a central 
stalk consisting of the γ and ε proteins. The γ subunit includes a long helical 
coiled coil that extends into the center of the α₃β₃ hexamer. The γ subunit 
breaks the symmetry of the α₃β₃ hexamer: each of the β subunits is distinct by 
virtue of its interaction with a different face of γ. Distinguishing the three β 
subunits is crucial for understanding the mechanism of ATP synthesis. 

Figure 18.24 Structure of ATP synthase. A schematic structure is shown 
along with representations of the components for which structures have been 
determined to high resolution. The P-loop NTPase domains of the α and β 
subunits are indicated by purple shading. Notice that part of the enzyme 
complex is embedded in the inner mitochondrial membrane, whereas the 
remainder resides in the matrix. [Drawn from 1E79.pdb and 1COV.pdb.] 

The Fo subunit is a hydrophobic segment that spans the inner mito- 
chondrial membrane. Fo contains the proton channel of the complex. This 
channel consists of a ring comprising from 10 to 14 c subunits that are 
embedded in the membrane. A single a subunit binds to the outside of the 
ring. The Fo and F₁ subunits are connected in two ways: by the central γε 
stalk and by an exterior column. The exterior column consists of one a 
subunit, two b subunits, and the δ subunit. 


Proton flow through ATP synthase leads to the release of tightly bound 
ATP: The binding-change mechanism 


ATP synthase catalyzes the formation of ATP from ADP and orthophosphate. 
$$\mathrm{ADP^{3-}} + \mathrm{HPO_4^{2-}} + \mathrm{H^+} \rightleftharpoons \mathrm{ATP^{4-}} + \mathrm{H_2O}$$


The actual substrates are ADP and ATP complexed with Mg²⁺, as in 
all known phosphoryl-transfer reactions with these nucleotides. A 
terminal oxygen atom of ADP attacks the phosphorus atom of Pi to form a 
pentacovalent intermediate, which then dissociates into ATP and H₂O 
(Figure 18.25). 

Figure 18.25 ATP-synthesis mechanism. One of the oxygen atoms of ADP attacks the 
phosphorus atom of Pi to form a pentacovalent intermediate, which then forms ATP and releases a 
molecule of H₂O. 


How does the flow of protons drive the synthesis of ATP? 
Isotopic-exchange experiments unexpectedly revealed that enzyme-bound ATP 
forms readily in the absence of a proton-motive force. When ADP and Pi were 
added to ATP synthase in H₂¹⁸O, ¹⁸O became incorporated into Pi through 
the synthesis of ATP and its subsequent hydrolysis (Figure 18.26). The rate 
of incorporation of ¹⁸O into Pi showed that about equal amounts of bound 
ATP and ADP are in equilibrium at the catalytic site, even in the absence of 
a proton gradient. However, ATP does not leave the catalytic site unless 
protons flow through the enzyme. Thus, the role of the proton gradient is not 
to form ATP but to release it from the synthase. 

The fact that three β subunits are components of the F₁ moiety of the 
ATPase means that there are three active sites on the enzyme, each 
performing one of three different functions at any instant. The proton-motive 
force causes the three active sites to sequentially change functions as 
protons flow through the membrane-embedded component of the enzyme. 
Indeed, we can think of the enzyme as consisting of a moving part and a 
stationary part: (1) the moving unit, or rotor, consists of the c ring and 
the γε stalk and (2) the stationary unit, or stator, is composed of the 
remainder of the molecule. 

Figure 18.26 ATP forms without a proton-motive force but is not released. The results of 
isotopic-exchange experiments indicate that enzyme-bound ATP is formed from ADP and Pi in the 
absence of a proton-motive force. Labels in the figure: ATP; ADP; ¹⁸O-labeled Pi. 

How do the three active sites of ATP synthase respond to the flow of 
protons? A number of experimental observations suggested a binding- 
change mechanism for proton-driven ATP synthesis. This proposal states 
that a β subunit can perform each of three sequential steps in the synthesis 
of ATP by changing conformation. These steps are (1) ADP and Pi binding, 
(2) ATP synthesis, and (3) ATP release. As already noted, interactions with 
the γ subunit make the three β subunits nonequivalent (Figure 18.27). At 
any given moment, one β subunit will be in the L, or loose, conformation. 
This conformation binds ADP and Pi. A second subunit will be in the T, or 
tight, conformation. This conformation binds ATP with great avidity, so 
much so that it will convert bound ADP and Pi into ATP. Both the T and L 
conformations are sufficiently constrained that they cannot release bound 
nucleotides. The final subunit will be in the O, or open, form. This form has 
a more open conformation and can bind or release adenine nucleotides. 

The rotation of the γ subunit drives the interconversion of these three
forms (Figure 18.28). ADP and Pᵢ bound in the subunit in the T form are
transiently combining to form ATP. Suppose that the γ subunit is rotated
by 120 degrees in a counterclockwise direction (as viewed from the top).
This rotation converts the T-form site into an O-form site with the
nucleotide bound as ATP. Concomitantly, the L-form site is converted into a
T-form site, enabling the transformation of an additional ADP and Pᵢ into
ATP. The ATP in the O-form site can now depart from the enzyme to be
replaced by ADP and Pᵢ. An additional 120-degree rotation converts this
O-form site into an L-form site, trapping these substrates. Each subunit
progresses from the T to the O to the L form with no two subunits ever
present in the same conformational form. This mechanism suggests that
ATP can be synthesized and released by driving the rotation of the γ
subunit in the appropriate direction.
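
The cycling of the three sites can be made concrete with a small simulation. The following is only an illustrative sketch: it treats each 120-degree turn of the γ subunit as advancing every β subunit one step along T → O → L, and it assumes that one ATP leaves each site that has just opened. The names `CYCLE`, `rotate_once`, and `simulate` are hypothetical and chosen for this example.

```python
# Order in which one beta subunit visits its conformations during synthesis:
# Tight (makes ATP) -> Open (releases ATP, binds ADP + Pi) -> Loose (traps them).
CYCLE = ["T", "O", "L"]


def rotate_once(states):
    """Advance every subunit one step, modeling one 120-degree turn of gamma."""
    return [CYCLE[(CYCLE.index(s) + 1) % 3] for s in states]


def simulate(steps, start=("L", "O", "T")):
    """Return the conformations after 0..steps turns and the ATP released.

    Assumption for this sketch: a site that has just switched from T to O
    releases exactly one ATP.
    """
    states = list(start)
    history = [tuple(states)]
    atp_released = 0
    for _ in range(steps):
        states = rotate_once(states)
        atp_released += states.count("O")  # exactly one site opens per step
        history.append(tuple(states))
    return history, atp_released


history, atp = simulate(3)
for n, state in enumerate(history):
    print(f"after {n * 120:3d} degrees: subunits 1-3 = {state}")
print("ATP released per full turn:", atp)
```

Starting from L, O, and T for subunits 1, 2, and 3, the model returns to its starting arrangement after three steps, and at every step the three subunits occupy three different conformations, as the mechanism requires.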


Figure 18.27 ATP synthase nucleotide-binding sites are not equivalent.
The γ subunit passes through the center of the α₃β₃ hexamer and makes
the nucleotide-binding sites in the β subunits distinct from one another.
Notice that each α subunit contains bound ATP, but these nucleotides do not
participate in any reactions. The β subunits are colored to distinguish
them from one another.


Progressive alteration of the forms of the three active sites
of ATP synthase

Subunit 1    L → T → O → L → T → O ...
Subunit 2    O → L → T → O → L → T ...
Subunit 3    T → O → L → T → O → L ...


Rotational catalysis is the world’s smallest molecular motor


Is it possible to observe the proposed rotation directly? Elegant
experiments, using single-molecule techniques (Section 8.6), have demonstrated
the rotation through the use of a simple experimental system consisting
solely of cloned α₃β₃γ subunits (Figure 18.29). The β subunits were
engineered to contain amino-terminal polyhistidine tags, which have a high
affinity for nickel ions (Section 3.1). This property of the tags allowed the
α₃β₃ assembly to be immobilized on a glass surface that had been coated
with nickel ions. The γ subunit was linked to a fluorescently labeled actin
filament to provide a long segment that could be observed under a
fluorescence microscope. Remarkably, the addition of ATP caused the actin
filament to rotate unidirectionally in a counterclockwise direction. The γ
subunit was rotating, driven by the hydrolysis of ATP. Thus, the catalytic
activity of an individual molecule could be observed. The counterclockwise
rotation is consistent with the predicted mechanism for hydrolysis
because the molecule was viewed from below relative to the view shown in
Figure 18.29.

Figure 18.28 Binding-change mechanism for ATP synthase. The rotation of
the γ subunit interconverts the three β subunits. The subunit in the T
(tight) form interconverts ADP and Pᵢ and ATP but does not allow ATP to
be released. When the γ subunit is rotated by 120 degrees in a
counterclockwise (CCW) direction, the T-form subunit is converted into
the O form, allowing ATP release. ADP and Pᵢ can then bind to the O-form
subunit. An additional 120-degree rotation (not shown) traps these
substrates in an L-form subunit.

Figure 18.29 Direct observation of ATP-driven rotation in ATP synthase.
The α₃β₃ hexamer of ATP synthase is fixed to a surface, with the γ subunit
projecting upward and linked to a fluorescently labeled actin filament.
The addition and subsequent hydrolysis of ATP result in the
counterclockwise rotation of the γ subunit, which can be directly seen
under a fluorescence microscope.

Figure 18.30 Components of the proton-conducting unit of ATP synthase.
The c subunit consists of two α helices that span the membrane. An
aspartic acid residue in one of the helices lies at the center of the
membrane. The structure of the a subunit has not yet been directly
observed, but it appears to include two half-channels that allow
protons to enter and pass partway but not completely through the membrane.

More-detailed analysis in the presence of lower concentrations of ATP
revealed that the γ subunit rotates in 120-degree increments. Each
increment corresponds to the hydrolysis of a single ATP molecule. In addition,
from the results obtained by varying the length of the actin filament and
measuring the rate of rotation, the enzyme appears to operate near 100%
efficiency; that is, essentially all of the energy released by ATP hydrolysis is
converted into rotational motion.


Proton flow around the c ring powers ATP synthesis 


The direct observation of rotary motion of the γ subunit is strong evidence
for the rotational mechanism for ATP synthesis. The last remaining
question is: How does proton flow through Fo drive the rotation of the γ
subunit? Howard Berg and George Oster proposed an elegant mechanism that
provides a clear answer to this question. The mechanism depends on the
structures of the a and c subunits of Fo (Figure 18.30). The stationary a
subunit directly abuts the membrane-spanning ring formed by 10 to 14 c
subunits. Although the structure of the a subunit has not yet been
experimentally determined, a variety of evidence is consistent with a structure
that includes two hydrophilic half-channels that do not span the membrane (see
Figure 18.30). Thus, protons can pass into either of these channels, but they
cannot move completely across the membrane. The a subunit is positioned
such that each half-channel directly interacts with one c subunit.

The structure of the c subunit was determined both by NMR methods
and by x-ray crystallography. Each polypeptide chain forms a pair of α
helices that span the membrane. An aspartic acid residue (Asp 61) is found in
the middle of one of the helices. The key to proton movement across the
membrane is that, in a proton-rich environment, such as the cytoplasmic
side of the mitochondrial membrane, a proton will enter a channel and bind
the aspartate residue (Figure 18.31). The c subunit with the bound proton
then rotates through the membrane until the aspartic acid is in a
proton-poor environment of the other half-channel, where the proton is released.
The movement of protons through the half-channels from the high proton
concentration of the cytoplasm to the low proton concentration of the matrix
powers the rotation of the c ring. Its rotation is favored by the ability of the
newly protonated (neutralized) aspartic acid residue to occupy the
hydrophobic environment of the membrane. Thus, the c subunit with the newly
protonated aspartic acid moves from contact with the cytoplasmic
half-channel into the membrane, and the other c subunits move in unison. The
a unit remains stationary as the c ring rotates. Each proton that enters the
cytoplasmic half-channel of the a unit moves through the membrane by
riding around on the rotating c ring to exit through the matrix half-channel
into the proton-poor environment of the matrix (Figure 18.32).

Figure 18.31 Proton motion across the membrane drives rotation of the c ring. A proton
enters from the intermembrane space into the cytoplasmic half-channel to neutralize the charge on
an aspartate residue in a c subunit. With this charge neutralized, the c ring can rotate clockwise by
one c subunit, moving an aspartic acid residue out of the membrane into the matrix half-channel.
This proton can move into the matrix, resetting the system to its initial state.

How does the rotation of the c ring lead to the synthesis of ATP? The c
ring is tightly linked to the γ and ε subunits. Thus, as the c ring turns, the
γ and ε subunits are turned inside the α₃β₃ hexamer unit of F₁. The
rotation of the γ subunit in turn promotes the synthesis of ATP through the
binding-change mechanism. The exterior column formed by the two b chains
and the δ subunit prevents the α₃β₃ hexamer from rotating. Recall that the
number of c subunits in the c ring appears to range between 10 and 14. This
number is significant because it determines the number of protons that
must be transported to generate a molecule of ATP. Each 360-degree
rotation of the γ subunit leads to the synthesis and release of three molecules
of ATP. Thus, if there are 10 c subunits in the ring (as was observed in a
crystal structure of yeast mitochondrial ATP synthase), each ATP generated
requires the transport of 10/3 = 3.33 protons. For simplicity, we will
assume that three protons must flow into the matrix for each ATP formed,
but we must keep in mind that the true value may differ. As we will see, the
electrons from NADH pump enough protons to generate 2.5 molecules of
ATP, whereas those from FADH₂ yield 1.5 molecules of ATP.

Figure 18.32 Proton path through the membrane. Each proton enters the
cytoplasmic half-channel, follows a complete rotation of the c ring, and
exits through the other half-channel into the matrix.


A little goes a long way

Despite the various molecular machinations
and the vast numbers of ATPs synthesized
and protons pumped, a resting human
being requires surprisingly little power.
Approximately 116 watts, the energy
output of a typical light bulb, provides
enough energy to sustain a resting person.


Let us return for a moment to the example with which we began this
chapter. If a resting human being requires 85 kg of ATP per day for bodily
functions, then 3.3 × 10²⁵ protons must flow through the ATP synthase per
day, or 3.3 × 10²¹ protons per second. Figure 18.33 summarizes the process
of oxidative phosphorylation.

Figure 18.33 Overview of oxidative phosphorylation. The electron-transport
chain generates a proton gradient, which is used to synthesize ATP.
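
The dependence of the proton cost of ATP on the size of the c ring is simple arithmetic, sketched below. The sketch assumes, as the text does, that one proton is carried per c subunit per full turn and that each full turn releases three ATP. The function names `protons_per_atp` and `degrees_per_proton` are hypothetical and used only for illustration.

```python
from fractions import Fraction

ATP_PER_TURN = 3  # three beta subunits, one ATP released per 120-degree step


def protons_per_atp(c_subunits):
    """Protons transported per ATP, assuming one proton per c subunit per turn."""
    return Fraction(c_subunits, ATP_PER_TURN)


def degrees_per_proton(c_subunits):
    """Rotation of the c ring produced by the passage of a single proton."""
    return Fraction(360, c_subunits)


# Ring sizes in the range quoted in the text (10 to 14 c subunits).
for n in range(10, 15):
    cost = float(protons_per_atp(n))
    step = float(degrees_per_proton(n))
    print(f"c ring of {n}: {cost:.2f} protons per ATP, {step:.1f} degrees per proton")
```

For a ring of 10 subunits the result is 10/3, or about 3.33 protons per ATP, the value cited for the yeast enzyme. Larger rings make each ATP more expensive in protons.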


ATP synthase and G proteins have several common features 


The α and β subunits of ATP synthase are members of the P-loop
NTPase family of proteins. In Chapter 14, we learned that the signaling
properties of other members of this family, the G proteins, depend on
their ability to bind nucleoside triphosphates and diphosphates with great
tenacity. They do not exchange nucleotides unless they are stimulated to do
so by interaction with other proteins. The binding-change mechanism of
ATP synthase is a variation on this theme. The P-loop regions of the β
subunits will bind either ADP or ATP (or release ATP), depending on which
of three different faces of the γ subunit they interact with. The
conformational changes take place in an orderly way, driven by the rotation
of the γ subunit.


18.5 Many Shuttles Allow Movement Across 
Mitochondrial Membranes 


The inner mitochondrial membrane must be impermeable to most mole- 
cules, yet much exchange has to take place between the cytoplasm and 
the mitochondria. This exchange is mediated by an array of membrane- 
spanning transporter proteins (Section 13.4).

Figure 18.34 Glycerol 3-phosphate shuttle. Electrons from NADH can enter the mitochondrial
electron-transport chain by being used to reduce dihydroxyacetone phosphate to glycerol
3-phosphate. Glycerol 3-phosphate is reoxidized by electron transfer to an FAD prosthetic group in a
membrane-bound glycerol 3-phosphate dehydrogenase. Subsequent electron transfer to Q to form
QH₂ allows these electrons to enter the electron-transport chain.


Electrons from cytoplasmic NADH enter mitochondria by shuttles 


One function of the respiratory chain is to regenerate NAD⁺ for use in
glycolysis. How is cytoplasmic NADH reoxidized to NAD⁺ under aerobic
conditions? NADH cannot simply pass into mitochondria for oxidation
by the respiratory chain, because the inner mitochondrial membrane is
impermeable to NADH and NAD⁺. The solution is that electrons from
NADH, rather than NADH itself, are carried across the mitochondrial
membrane. One of several means of introducing electrons from NADH into
the electron-transport chain is the glycerol 3-phosphate shuttle (Figure 18.34).
The first step in this shuttle is the transfer of a pair of electrons from NADH
to dihydroxyacetone phosphate, a glycolytic intermediate, to form glycerol
3-phosphate. This reaction is catalyzed by a glycerol 3-phosphate
dehydrogenase in the cytoplasm. Glycerol 3-phosphate is reoxidized to
dihydroxyacetone phosphate on the outer surface of the inner mitochondrial
membrane by a membrane-bound isozyme of glycerol 3-phosphate
dehydrogenase. An electron pair from glycerol 3-phosphate is transferred
to an FAD prosthetic group in this enzyme to form FADH₂. This reaction
also regenerates dihydroxyacetone phosphate.

The reduced flavin transfers its electrons to the electron carrier Q, which
then enters the respiratory chain as QH₂. When cytoplasmic NADH
transported by the glycerol 3-phosphate shuttle is oxidized by the respiratory
chain, 1.5 rather than 2.5 molecules of ATP are formed. The yield is lower
because FAD rather than NAD⁺ is the electron acceptor in mitochondrial glycerol
3-phosphate dehydrogenase. The use of FAD enables electrons from
cytoplasmic NADH to be transported into mitochondria against an NADH
concentration gradient. The price of this transport is one molecule of ATP
per two electrons. This glycerol 3-phosphate shuttle is especially prominent
in muscle and enables it to sustain a very high rate of oxidative
phosphorylation. Indeed, some insects lack lactate dehydrogenase and are
completely dependent on the glycerol 3-phosphate shuttle for the regeneration
of cytoplasmic NAD⁺.

Figure 18.35 Malate-aspartate shuttle.

In the heart and liver, electrons from cytoplasmic NADH are brought
into mitochondria by the malate-aspartate shuttle, which is mediated by
two membrane carriers and four enzymes (Figure 18.35). Electrons are
transferred from NADH in the cytoplasm to oxaloacetate, forming malate,
which traverses the inner mitochondrial membrane in exchange for
α-ketoglutarate and is then reoxidized by NAD⁺ in the matrix to form
NADH in a reaction catalyzed by the citric acid cycle enzyme malate
dehydrogenase. The resulting oxaloacetate does not readily cross the inner
mitochondrial membrane and so a transamination reaction (Section 23.3) is
needed to form aspartate, which can be transported to the cytoplasmic side
in exchange for glutamate. Glutamate donates an amino group to
oxaloacetate, forming aspartate and α-ketoglutarate. In the cytoplasm, aspartate
is then deaminated to form oxaloacetate and the cycle is restarted.


The entry of ADP into mitochondria is coupled to the exit 
of ATP by ATP-ADP translocase 


The major function of oxidative phosphorylation is to generate ATP from 
ADP. ATP and ADP do not diffuse freely across the inner mitochondrial 
membrane. How are these highly charged molecules moved across the inner 
membrane into the cytoplasm? A specific transport protein, ATP-ADP 
translocase, enables these molecules to traverse this permeability barrier.
Most important, the flows of ATP and ADP are coupled. ADP enters the 
mitochondrial matrix only if ATP exits, and vice versa. This process is carried 
out by the translocase, an antiporter: 


$$\mathrm{ADP^{3-}_{cytoplasm} + ATP^{4-}_{matrix} \longrightarrow ADP^{3-}_{matrix} + ATP^{4-}_{cytoplasm}}$$


ATP-ADP translocase is highly abundant, constituting about 15% of
the protein in the inner mitochondrial membrane. The abundance is a
manifestation of the fact that human beings exchange the equivalent of
their weight in ATP each day. The 30-kd translocase contains a single
nucleotide-binding site that alternately faces the matrix and the
cytoplasmic sides of the membrane (Figure 18.36). ATP and ADP bind to the
translocase without Mg²⁺, and ATP has one more negative charge than that
of ADP. Thus, in an actively respiring mitochondrion with a positive
membrane potential, ATP transport out of the mitochondrial matrix and
ADP transport into the matrix are favored. This ATP-ADP exchange is
energetically expensive; about a quarter of the energy yield from electron
transfer by the respiratory chain is consumed to regenerate the membrane
potential that is tapped by this exchange process. The inhibition of this
process leads to the subsequent inhibition of cellular respiration as well
(p. 558).

Figure 18.36 Mechanism of mitochondrial ATP-ADP translocase. The translocase catalyzes
the coupled entry of ADP into the matrix and the exit of ATP from it. The binding of ADP (1) from
the cytoplasm favors eversion of the transporter (2) to release ADP into the matrix (3). Subsequent
binding of ATP from the matrix to the everted form (4) favors eversion back to the original
conformation (5), releasing ATP into the cytoplasm (6).


Mitochondrial transporters for metabolites have a 
common tripartite structure 


Examination of the amino acid sequence of the ATP-ADP translocase
revealed that this protein consists of three tandem repeats of a
100-amino-acid module, each of which appears to have two transmembrane segments.
This tripartite structure has recently been confirmed by the determination
of the three-dimensional structure of this transporter (Figure 18.37). The
transmembrane helices form a tepeelike structure with the
nucleotide-binding site (marked by a bound inhibitor) lying in the center. Each of the
three repeats adopts a similar structure.

Figure 18.37 Structure of mitochondrial transporters. The structure
of the ATP-ADP translocase is shown. Notice that this structure comprises
three similar units (shown in red, blue, and yellow) that come together to
form a binding site, here occupied by an inhibitor of this transporter.
Other members of the mitochondrial transporter family adopt similar
tripartite structures. [Drawn from 1OKC.pdb.]

ATP-ADP translocase is but one of many mitochondrial transporters
for ions and charged metabolites (Figure 18.38). The phosphate carrier,
which works in concert with ATP-ADP translocase, mediates the
electroneutral exchange of H₂PO₄⁻ for OH⁻. The combined action of these
two transporters leads to the exchange of cytoplasmic ADP and Pᵢ for
matrix ATP at the cost of the influx of one H⁺ (owing to the transport of
one OH⁻ out of the matrix). These two transporters, which provide ATP
synthase with its substrates, are associated with the synthase to form a large
complex called the ATP synthasome.

Figure 18.38 Mitochondrial transporters. Transporters (also called
carriers) are transmembrane proteins that carry specific ions and charged
metabolites across the inner mitochondrial membrane.

Other homologous carriers also are present in the inner mitochondrial
membrane. The dicarboxylate carrier enables malate, succinate, and
fumarate to be exported from the mitochondrial matrix in exchange for Pᵢ. The
tricarboxylate carrier exchanges citrate and H⁺ for malate. Pyruvate in the
cytoplasm enters the mitochondrial membrane in exchange for OH⁻ by
means of the pyruvate carrier. In all, more than 40 such carriers are encoded
in the human genome.


18.6 The Regulation of Cellular Respiration Is Governed 
Primarily by the Need for ATP 


Because ATP is the end product of cellular respiration, the ATP needs of 
the cell are the ultimate determinant of the rate of respiratory pathways and 
their components. 


The complete oxidation of glucose yields about 30 molecules of ATP 


We can now estimate how many molecules of ATP are formed when glucose
is completely oxidized to CO₂. The number of ATP (or GTP) molecules
formed in glycolysis and the citric acid cycle is unequivocally known
because it is determined by the stoichiometries of chemical reactions. In
contrast, the ATP yield of oxidative phosphorylation is less certain because
the stoichiometries of proton pumping, ATP synthesis, and
metabolite-transport processes need not be integer numbers or even have fixed values.
As stated earlier, the best current estimates for the number of protons
pumped out of the matrix by NADH-Q oxidoreductase, Q-cytochrome
c oxidoreductase, and cytochrome c oxidase per electron pair are four, two,
and four, respectively. The synthesis of a molecule of ATP is driven by the
flow of about three protons through ATP synthase. An additional proton is
consumed in transporting ATP from the matrix to the cytoplasm. Hence,
about 2.5 molecules of cytoplasmic ATP are generated as a result of the flow
of a pair of electrons from NADH to O₂. For electrons that enter at the level
of Q-cytochrome c oxidoreductase, such as those from the oxidation of
succinate or cytoplasmic NADH, the yield is about 1.5 molecules of ATP per
electron pair. Hence, as tallied in Table 18.4, about 30 molecules of ATP are
formed when glucose is completely oxidized to CO₂; this value supersedes the
traditional estimate of 36 molecules of ATP. Most of the ATP, 26 of 30
molecules formed, is generated by oxidative phosphorylation. Recall that
the anaerobic metabolism of glucose yields only 2 molecules of ATP. The
efficiency of cellular respiration is manifested in the fact that one of the
effects of endurance exercise, a practice that calls for much ATP for an
extended period of time, is to increase the number of mitochondria and
blood vessels in muscle and thus increase the extent of ATP generation by
oxidative phosphorylation.


Table 18.4 ATP yield from the complete oxidation of glucose

Reaction sequence                                                  ATP yield per
                                                                   glucose molecule

Glycolysis: Conversion of glucose into pyruvate (in the cytoplasm)
  Phosphorylation of glucose                                             -1
  Phosphorylation of fructose 6-phosphate                                -1
  Dephosphorylation of 2 molecules of 1,3-BPG                            +2
  Dephosphorylation of 2 molecules of phosphoenolpyruvate                +2
  2 molecules of NADH are formed in the oxidation
    of 2 molecules of glyceraldehyde 3-phosphate

Conversion of pyruvate into acetyl CoA (inside mitochondria)
  2 molecules of NADH are formed

Citric acid cycle (inside mitochondria)
  2 molecules of adenosine triphosphate are formed
    from 2 molecules of succinyl CoA                                     +2
  6 molecules of NADH are formed in the oxidation of 2 molecules
    each of isocitrate, α-ketoglutarate, and malate
  2 molecules of FADH₂ are formed in the oxidation of 2 molecules
    of succinate

Oxidative phosphorylation (inside mitochondria)
  2 molecules of NADH formed in glycolysis; each yields
    1.5 molecules of ATP (assuming transport of NADH by
    the glycerol 3-phosphate shuttle)                                    +3
  2 molecules of NADH formed in the oxidative decarboxylation
    of pyruvate; each yields 2.5 molecules of ATP                        +5
  2 molecules of FADH₂ formed in the citric acid cycle; each
    yields 1.5 molecules of ATP                                          +3
  6 molecules of NADH formed in the citric acid cycle; each
    yields 2.5 molecules of ATP                                         +15

Net Yield per Molecule of Glucose                                       +30

Source: The ATP yield of oxidative phosphorylation is based on values given in P. C. Hinkle, M. A. Kumar,
A. Resetar, and D. L. Harris. Biochemistry 30:3576, 1991.

Note: The current value of 30 molecules of ATP per molecule of glucose supersedes the earlier value of
36 molecules of ATP. The stoichiometries of proton pumping, ATP synthesis, and metabolite transport
should be regarded as estimates. About 2 more molecules of ATP are formed per molecule of glucose
oxidized when the malate-aspartate shuttle rather than the glycerol 3-phosphate shuttle is used.
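
The tally in Table 18.4 follows directly from the proton stoichiometries quoted above, and the arithmetic can be checked with a short script. This is a sketch under the text's own estimates (4, 2, and 4 protons pumped per electron pair; about 3 protons per ATP synthesized plus 1 for export), which the text stresses are approximate. The names `PUMPED`, `atp_per_electron_pair`, and `glucose_yield` are hypothetical.

```python
# Protons pumped out of the matrix per electron pair, in chain order.
PUMPED = {
    "NADH-Q oxidoreductase": 4,
    "Q-cytochrome c oxidoreductase": 2,
    "cytochrome c oxidase": 4,
}
# About 3 protons drive ATP synthase; 1 more is consumed exporting the ATP.
PROTONS_PER_ATP = 3 + 1


def atp_per_electron_pair(entry="NADH"):
    """ATP formed per electron pair entering as NADH or at the level of QH2."""
    complexes = list(PUMPED)
    if entry == "QH2":  # FADH2-linked electrons bypass the first proton pump
        complexes = complexes[1:]
    return sum(PUMPED[c] for c in complexes) / PROTONS_PER_ATP


def glucose_yield(shuttle="glycerol 3-phosphate"):
    """Net ATP per glucose for the chosen shuttle of cytoplasmic NADH."""
    nadh = atp_per_electron_pair("NADH")
    qh2 = atp_per_electron_pair("QH2")
    cytoplasmic = qh2 if shuttle == "glycerol 3-phosphate" else nadh
    substrate_level = -1 - 1 + 2 + 2 + 2  # glycolysis plus citric acid cycle
    oxidative = 2 * cytoplasmic + 2 * nadh + 2 * qh2 + 6 * nadh
    return substrate_level + oxidative


print("ATP per NADH electron pair:", atp_per_electron_pair("NADH"))
print("ATP per QH2 electron pair: ", atp_per_electron_pair("QH2"))
print("Glycerol 3-phosphate shuttle:", glucose_yield())
print("Malate-aspartate shuttle:    ", glucose_yield("malate-aspartate"))
```

With these inputs the script gives 2.5 and 1.5 ATP per electron pair, 30 ATP per glucose with the glycerol 3-phosphate shuttle, and 32 with the malate-aspartate shuttle, in agreement with the note to Table 18.4.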


The rate of oxidative phosphorylation is determined 
by the need for ATP 


How is the rate of the electron-transport chain controlled? Under most
physiological conditions, electron transport is tightly coupled to
phosphorylation. Electrons do not usually flow through the electron-transport chain
to O₂ unless ADP is simultaneously phosphorylated to ATP. When ADP
concentration rises, as would be the case in active muscle, the rate of oxidative
phosphorylation increases to meet the ATP needs of the muscle. The
regulation of the rate of oxidative phosphorylation by the ADP level is called
respiratory control or acceptor control. Experiments on isolated mitochondria
demonstrate the importance of ADP level (Figure 18.39). The rate of
oxygen consumption by mitochondria increases markedly when ADP is
added and then returns to its initial value when the added ADP has been
converted into ATP.

Figure 18.39 Respiratory control. Electrons are transferred to O₂ only if
ADP is concomitantly phosphorylated to ATP.

Figure 18.40 Energy charge regulates the use of fuels. The synthesis of
ATP from ADP and Pᵢ controls the flow of electrons from NADH and FADH₂
to oxygen. The availability of NAD⁺ and FAD in turn controls the rate of
the citric acid cycle (CAC).

The level of ADP likewise affects the rate of the citric acid cycle. At low
concentrations of ADP, as in a resting muscle, NADH and FADH₂ are not
consumed by the electron-transport chain. The citric acid cycle slows
because there is less NAD⁺ and FAD to feed the cycle. As the ADP level
rises and oxidative phosphorylation speeds up, NADH and FADH₂ are
oxidized, and the citric acid cycle becomes more active. Electrons do not
flow from fuel molecules to O₂ unless ATP needs to be synthesized. We see here
another example of the regulatory significance of the energy charge
(Figure 18.40).


Regulated uncoupling leads to the generation of heat


Some organisms possess the ability to uncouple oxidative phosphorylation from 
ATP synthesis to generate heat. Such uncoupling is a means to maintain body 
temperature in hibernating animals, in some newborn animals (including 
human beings), and in many adult mammals, especially those adapted to 
cold. The skunk cabbage uses an analogous mechanism to heat its floral 
spikes in early spring, increasing the evaporation of odoriferous molecules 
that attract insects to fertilize its flowers. In animals, the uncoupling is in 
brown adipose tissue (BAT), which is specialized tissue for the process of 
nonshivering thermogenesis. In contrast, white adipose tissue (WAT), which 
constitutes the bulk of adipose tissue, plays no role in thermogenesis but 
serves as an energy source and an endocrine gland (Chapters 26 and 27). 
Brown adipose tissue is very rich in mitochondria, often called brown 
fat mitochondria. The tissue appears brown from the combination of the 
greenish-colored cytochromes in the numerous mitochondria and the red 
hemoglobin present in the extensive blood supply, which helps to carry the 
heat through the body. The inner mitochondrial membrane of these 


mitochondria contains a large amount of uncoupling protein (UCP-1), or
thermogenin, a dimer of 33-kd subunits that resembles ATP-ADP translocase.
UCP-1 forms a pathway for the flow of protons from the cytoplasm to the
matrix. In essence, UCP-1 generates heat by short-circuiting the
mitochondrial proton battery. The energy of the proton gradient, normally
captured as ATP, is released as heat as the protons flow through UCP-1 to
the mitochondrial matrix. This dissipative proton pathway is activated when
the core body temperature begins to fall. In response to a temperature drop,
the release of hormones leads to the liberation of free fatty acids from
triacylglycerols that in turn activate thermogenin (Figure 18.41).

Figure 18.41 Action of an uncoupling protein. Uncoupling protein
(UCP-1) generates heat by permitting the influx of protons into the
mitochondria without the synthesis of ATP.

We can witness the effects of a lack of nonshivering 
thermogenesis by examining pig behavior. Pigs are 
unusual mammals in that they have large litters and are the only ungulates 
(hoofed animals) that build nests for birth. These behavioral characteristics 
appear to be the result of a biochemical deficiency. Pigs lack UCP-1 
and, hence, brown fat. Piglets must rely on other means of thermogenesis, 
such as nesting, large litter size, and shivering. 

Until recently, adult humans were believed to lack brown fat tissue. 
However, new studies have established that adults, women especially, have 
brown adipose tissue in the neck and upper chest regions that is activated by 
cold (Figure 18.42). Obesity leads to a decrease in brown adipose tissue. 


In addition to UCP-1, two other uncoupling proteins have been
identified. UCP-2, which is 56% identical in sequence with UCP-1, is
found in a wide variety of tissues. UCP-3 (57% identical with UCP-1 and
73% identical with UCP-2) is localized to skeletal muscle and brown fat.
This family of uncoupling proteins, especially UCP-2 and UCP-3, may
play a role in energy homeostasis. In fact, the genes for UCP-2 and UCP-3
map to regions of the human and mouse chromosomes that have been linked
to obesity, supporting the notion that they function as a means of regulating
body weight.


Figure 18.42 Brown adipose tissue is revealed on exposure to cold. The results of PET-CT
scanning show the uptake and distribution of ¹⁸F-fluorodeoxyglucose (¹⁸F-FDG) in adipose tissue.
The patterns of ¹⁸F-FDG uptake in the same subject are dramatically different under thermoneutral
conditions (left) and after exposure to cold (right). [Courtesy of Wouter van Marken Lichtenbelt.
Copyright 2009 Massachusetts Medical Society. All rights reserved.]

Figure 18.43 Sites of action of some inhibitors of electron transport.


Oxidative phosphorylation can be inhibited at many stages


Many potent and lethal poisons exert their effect by inhibiting oxidative 
phosphorylation at one of a number of different locations (Figure 18.43). 


1. Inhibition of the electron-transport chain. Rotenone, which is used as a fish 
and insect poison, and amytal, a barbiturate sedative, block electron transfer 
in NADH-Q oxidoreductase and thereby prevent the utilization of NADH 
as a substrate. Rotenone, as an electron-transport-chain inhibitor, may play 
a role, along with genetic susceptibility, in the development of Parkinson 
disease. In the presence of rotenone and amytal, electron flow resulting from 
the oxidation of succinate is unimpaired, because these electrons enter through 
QH₂, beyond the block. Antimycin A interferes with electron flow from
cytochrome b_H in Q-cytochrome c oxidoreductase. Furthermore, electron flow
in cytochrome c oxidase can be blocked by cyanide (CN⁻), azide (N₃⁻),
and carbon monoxide (CO). Cyanide and azide react with the ferric form of 
heme a3, whereas carbon monoxide inhibits the ferrous form. Inhibition of 
the electron-transport chain also inhibits ATP synthesis because the 
proton-motive force can no longer be generated. 


2. Inhibition of ATP synthase. Oligomycin, an antibiotic used as an antifun- 
gal agent, and dicyclohexylcarbodiimide (DCC) prevent the influx of pro- 
tons through ATP synthase. If actively respiring mitochondria are exposed 
to an inhibitor of ATP synthase, the electron-transport chain ceases to oper- 
ate. This observation clearly illustrates that electron transport and ATP syn- 
thesis are normally tightly coupled. 


3. Uncoupling electron transport from ATP synthesis. The tight coupling of 
electron transport and phosphorylation in mitochondria can be uncoupled 
by 2,4-dinitrophenol (DNP) and certain other acidic aromatic compounds. 
These substances carry protons across the inner mitochondrial membrane, 
down their concentration gradient. In the presence of these uncouplers, electron
transport from NADH to O₂ proceeds in a normal fashion, but ATP is
not formed by mitochondrial ATP synthase, because the proton-motive 
force across the inner mitochondrial membrane is continuously dissipated. 
This loss of respiratory control leads to increased oxygen consumption and 
oxidation of NADH. Indeed, in the accidental ingestion of uncouplers, large 
amounts of metabolic fuels are consumed, but no energy is captured as ATP. 
Rather, energy is released as heat. DNP is the active ingredient in some her- 
bicides and fungicides. Remarkably, some people consume DNP as a 
weight-loss drug, despite the fact that the FDA banned its use in 1938. There 
are also reports that Soviet soldiers were given DNP to keep them warm dur- 
ing the long Russian winters. Chemical uncouplers are nonphysiological, 
unregulated counterparts of uncoupling proteins. 


4. Inhibition of ATP export. ATP-ADP translocase is specifically inhibited
by very low concentrations of atractyloside (a plant glycoside) or bongkrekic 
acid (an antibiotic from a mold). Atractyloside binds to the translocase 
when its nucleotide site faces the cytoplasm, whereas bongkrekic acid binds 
when this site faces the mitochondrial matrix. Oxidative phosphorylation 
stops soon after either inhibitor is added, showing that ATP-ADP translocase
is essential for maintaining adequate amounts of ADP to accept the
energy associated with the proton-motive force. 
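
The reasoning in item 1, that a block in Complex I stops NADH oxidation but leaves succinate oxidation intact, can be sketched as a small lookup. This is only an illustrative model: the dictionary names, the function name, and the use of complex numbers as positions along the chain are assumptions made for the example, not part of the text.

```python
# Illustrative sketch (names and numeric encoding are assumptions).
# Each electron-transport inhibitor is mapped to the complex it blocks,
# as described in item 1 of the list above.
BLOCKED_COMPLEX = {
    "rotenone": 1,         # NADH-Q oxidoreductase (Complex I)
    "amytal": 1,           # NADH-Q oxidoreductase (Complex I)
    "antimycin A": 3,      # Q-cytochrome c oxidoreductase (Complex III)
    "cyanide": 4,          # cytochrome c oxidase (Complex IV)
    "azide": 4,            # cytochrome c oxidase (Complex IV)
    "carbon monoxide": 4,  # cytochrome c oxidase (Complex IV)
}

# Where each donor's electrons enter the chain. Succinate enters through
# Complex II and QH2, which lies beyond a block in Complex I.
ENTRY_COMPLEX = {"NADH": 1, "succinate": 2}


def electron_flow_blocked(inhibitor: str, donor: str) -> bool:
    """Return True if the inhibitor acts at or after the donor's entry point.

    Simplification: complex numbers are treated as positions on a line,
    which is enough to reproduce the cases discussed in the text.
    """
    return BLOCKED_COMPLEX[inhibitor] >= ENTRY_COMPLEX[donor]


if __name__ == "__main__":
    for inhibitor in BLOCKED_COMPLEX:
        for donor in ENTRY_COMPLEX:
            state = "blocked" if electron_flow_blocked(inhibitor, donor) else "unimpaired"
            print(f"{inhibitor:16s} {donor:10s} {state}")
```

Inhibitors of ATP synthase, uncouplers, and translocase inhibitors are deliberately left out of this table because they do not act on a single site within the chain itself.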


Mitochondrial diseases are being discovered 


The number of diseases that can be attributed to mitochondrial
mutations is steadily growing in step with our growing understanding
of the biochemistry and genetics of mitochondria. The prevalence of mitochondrial
diseases is estimated to be from 10 to 15 per 100,000 people,
roughly equivalent to the prevalence of the muscular dystrophies. The first 
mitochondrial disease to be understood was Leber hereditary optic neuropa- 
thy (LHON), a form of blindness that strikes in midlife as a result of muta- 
tions in Complex I. Some of these mutations impair NADH utilization, 
whereas others block electron transfer to Q. Mutations in Complex I are the
most frequent cause of mitochondrial diseases. The accumulation of muta- 
tions in mitochondrial genes in a span of several decades may contribute to 
aging, degenerative disorders, and cancer. 

A human egg harbors several hundred thousand molecules of mitochon- 
drial DNA, whereas a sperm contributes only a few hundred and thus has 
little effect on the mitochondrial genotype. Because the maternally inherited 
mitochondria are present in large numbers and not all of the mitochondria 
may be affected, the pathologies of mitochondrial mutants can be quite 
complex. Even within a single family carrying an identical mutation, chance 
fluctuations in the percentage of mitochondria with the mutation lead to 
large variations in the nature and severity of the symptoms of the patho- 
logical condition as well as the time of onset. As the percentage of defec- 
tive mitochondria increases, energy-generating capacity diminishes until, 
at some threshold, the cell can no longer function properly. Defects in 
cellular respiration are doubly dangerous. Not only does energy transduc- 
tion decrease, but also the likelihood that reactive oxygen species will be 
generated increases. Organs that are highly dependent on oxidative phos- 
phorylation, such as the nervous system and the heart, are most vulnerable 
to mutations in mitochondrial DNA. 


Mitochondria play a key role in apoptosis 


In the course of development or in cases of significant cell damage, indi- 
vidual cells within multicellular organisms undergo programmed cell death, 
or apoptosis. Mitochondria act as control centers regulating this process. 
Although the details have not yet been established, the outer membrane of 
damaged mitochondria becomes highly permeable, a process referred to as 
mitochondrial outer membrane permeabilization (MOMP). This permeabili- 
zation is instigated by a family of proteins (Bcl family) that were initially 
discovered because of their role in cancer. One of the most potent activators 
of apoptosis, cytochrome c, exits the mitochondria and interacts with 
apoptotic peptidase-activating factor 1 (APAF-1), which leads to the for- 
mation of the apoptosome. The apoptosome recruits and activates a prote- 
olytic enzyme called caspase 9, a member of the cysteine protease family 
(Section 9.1), that in turn activates a cascade of other caspases. Each caspase 
type destroys a particular target, such as the proteins that maintain cell 
structure. Another target is a protein that inhibits an enzyme that destroys 
DNA (an enzyme called caspase-activated DNAse or CAD), freeing CAD 
to cleave the genetic material. This cascade of proteolytic enzymes has been 
called “death by a thousand tiny cuts.” 


Power transmission by proton gradients is a central 
motif of bioenergetics 


The main concept presented in this chapter is that mitochondrial electron 
transfer and ATP synthesis are linked by a transmembrane proton gradi- 
ent. ATP synthesis in bacteria and chloroplasts also is driven by proton 
gradients. In fact, proton gradients power a variety of energy-requiring 
processes such as the active transport of calcium ions by mitochondria, the
entry of some amino acids and sugars into bacteria, the
rotation of bacterial flagella, and the transfer of electrons
from NADP⁺ to NADPH. Proton gradients can also be
used to generate heat, as in hibernation. It is evident that
proton gradients are a central interconvertible currency of
free energy in biological systems (Figure 18.44). Mitchell
noted that the proton-motive force is a marvelously
simple and effective store of free energy because it
requires only a thin, closed lipid membrane between two
aqueous phases.


Figure 18.44 The proton gradient is an 
interconvertible form of free energy. 


Summary 

18.1 Eukaryotic Oxidative Phosphorylation Takes Place in Mitochondria 
Mitochondria generate most of the ATP required by aerobic cells 
through a joint endeavor of the reactions of the citric acid cycle, which 
take place in the mitochondrial matrix, and oxidative phosphor- 
ylation, which takes place in the inner mitochondrial membrane. 
Mitochondria are descendants of a free-living bacterium that estab- 
lished a symbiotic relation with another cell. 


18.2 Oxidative Phosphorylation Depends on Electron Transfer 

In oxidative phosphorylation, the synthesis of ATP is coupled to the 
flow of electrons from NADH or FADH₂ to O₂ by a proton gradient
across the inner mitochondrial membrane. Electron flow through 
three asymmetrically oriented transmembrane complexes results in 
the pumping of protons out of the mitochondrial matrix and the gen- 
eration of a membrane potential. ATP is synthesized when protons 
flow back to the matrix through a channel in an ATP-synthesizing 
complex, called ATP synthase (also known as F₀F₁-ATPase).
Oxidative phosphorylation exemplifies a fundamental theme of bio- 
energetics: the transmission of free energy by proton gradients. 


18.3 The Respiratory Chain Consists of Four Complexes: Three Proton
Pumps and a Physical Link to the Citric Acid Cycle

The electron carriers in the respiratory assembly of the inner mito- 
chondrial membrane are quinones, flavins, iron-sulfur complexes, 
heme groups of cytochromes, and copper ions. Electrons from NADH 
are transferred to the FMN prosthetic group of NADH-Q oxidoreductase
(Complex I), the first of four complexes. This oxidoreductase
also contains Fe-S centers. The electrons emerge in QH₂,
the reduced form of ubiquinone (Q). The citric acid cycle enzyme succinate
dehydrogenase is a component of the succinate-Q reductase
complex (Complex II), which donates electrons from FADH₂ to Q to
form QH₂. This highly mobile hydrophobic carrier transfers its electrons
to Q-cytochrome c oxidoreductase (Complex III), a complex
that contains cytochromes b and c₁ and an Fe-S center. This complex
reduces cytochrome c, a water-soluble peripheral membrane protein.
Cytochrome c, like Q, is a mobile carrier of electrons, which it then
transfers to cytochrome c oxidase (Complex IV). This complex contains
cytochromes a and a₃ and three copper ions. A heme iron ion and
a copper ion in this oxidase transfer electrons to O₂, the ultimate
acceptor, to form H₂O.

18.4 A Proton Gradient Powers the Synthesis of ATP
The flow of electrons through Complexes I, III, and IV leads to the
transfer of protons from the matrix side to the cytoplasmic side of the 
inner mitochondrial membrane. A proton-motive force consisting of a 
pH gradient (matrix side basic) and a membrane potential (matrix side 
negative) is generated. The flow of protons back to the matrix side 
through ATP synthase drives ATP synthesis. The enzyme complex is 
a molecular motor made of two operational units: a rotating component
and a stationary component. The rotation of the γ subunit
induces structural changes in the β subunit that result in the synthesis
and release of ATP from the enzyme. Proton influx provides the force 
for the rotation. 

The flow of two electrons through NADH-Q oxidoreductase, 
Q-cytochrome c oxidoreductase, and cytochrome c oxidase gener- 
ates a gradient sufficient to synthesize 1, 0.5, and 1 molecule of ATP, 
respectively. Hence, 2.5 molecules of ATP are formed per molecule of 
NADH oxidized in the mitochondrial matrix, whereas only 1.5 molecules
of ATP are made per molecule of FADH₂ oxidized, because its
electrons enter the chain at QH₂, after the first proton-pumping site.
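
The arithmetic behind the 2.5 and 1.5 figures can be written out as a short sketch. The per-complex yields are the ones quoted in the paragraph above; the function and variable names are hypothetical and chosen only for this example.

```python
# Sketch of the ATP-yield arithmetic from the summary paragraph above.
# ATP made per electron pair passing through each proton-pumping complex,
# using the values quoted in the text (1, 0.5, and 1).
ATP_PER_SITE = {
    "Complex I": 1.0,    # NADH-Q oxidoreductase
    "Complex III": 0.5,  # Q-cytochrome c oxidoreductase
    "Complex IV": 1.0,   # cytochrome c oxidase
}

# Complexes traversed by the electrons of each carrier. FADH2 electrons
# enter at QH2, after the first proton-pumping site, so Complex I is skipped.
SITES_TRAVERSED = {
    "NADH": ["Complex I", "Complex III", "Complex IV"],
    "FADH2": ["Complex III", "Complex IV"],
}


def atp_per_carrier(carrier: str) -> float:
    """Sum the ATP yield over the pumping sites the electron pair passes."""
    return sum(ATP_PER_SITE[site] for site in SITES_TRAVERSED[carrier])


if __name__ == "__main__":
    for carrier in SITES_TRAVERSED:
        print(carrier, atp_per_carrier(carrier))
```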


18.5 Many Shuttles Allow Movement Across Mitochondrial Membranes 
Mitochondria employ a host of transporters, or carriers, to move 
molecules across the inner mitochondrial membrane. The electrons of 
cytoplasmic NADH are transferred into mitochondria by the glycerol 
phosphate shuttle to form FADH₂ from FAD or by the malate-aspartate
shuttle to form mitochondrial NADH. The entry of ADP
into the mitochondrial matrix is coupled to the exit of ATP by ATP- 
ADP translocase, a transporter driven by membrane potential. 


18.6 The Regulation of Oxidative Phosphorylation Is Governed
Primarily by the Need for ATP

About 30 molecules of ATP are generated when a molecule of glucose 
is completely oxidized to CO₂ and H₂O. Electron transport is normally
tightly coupled to phosphorylation. NADH and FADH₂ are
oxidized only if ADP is simultaneously phosphorylated to ATP, a 
form of regulation called acceptor or respiratory control. Proteins have 
been identified that uncouple electron transport and ATP synthesis 
for the generation of heat. Uncouplers such as DNP also can disrupt 
this coupling; they dissipate the proton gradient by carrying protons 
across the inner mitochondrial membrane. 
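
As a rough check on the figure of about 30 ATP per glucose, the tally can be sketched in a few lines. The carrier counts per glucose used here (2 ATP from glycolysis, 2 from the citric acid cycle, 2 cytoplasmic NADH, 8 matrix NADH, and 2 FADH₂) are the standard textbook values and are not stated in this summary, so treat them as assumptions of the example; the function name is hypothetical.

```python
# Sketch: ATP tally for the complete oxidation of one glucose molecule.
# Yields per carrier come from Section 18.4; the carrier counts below are
# standard textbook values assumed for this illustration.
ATP_PER_NADH = 2.5
ATP_PER_FADH2 = 1.5


def atp_per_glucose(shuttle: str = "glycerol phosphate") -> float:
    """Total ATP per glucose for a given cytoplasmic-NADH shuttle."""
    substrate_level = 2 + 2   # glycolysis + citric acid cycle (as GTP or ATP)
    matrix_nadh = 2 + 6       # pyruvate dehydrogenase + citric acid cycle
    matrix_fadh2 = 2          # succinate dehydrogenase
    cytoplasmic_nadh = 2      # glycolysis

    if shuttle == "glycerol phosphate":
        # Electrons of cytoplasmic NADH arrive in the chain as FADH2.
        shuttle_yield = cytoplasmic_nadh * ATP_PER_FADH2
    elif shuttle == "malate-aspartate":
        # Electrons of cytoplasmic NADH arrive as mitochondrial NADH.
        shuttle_yield = cytoplasmic_nadh * ATP_PER_NADH
    else:
        raise ValueError(f"unknown shuttle: {shuttle}")

    return (substrate_level
            + matrix_nadh * ATP_PER_NADH
            + matrix_fadh2 * ATP_PER_FADH2
            + shuttle_yield)


if __name__ == "__main__":
    print(atp_per_glucose("glycerol phosphate"))
    print(atp_per_glucose("malate-aspartate"))
```

The choice of shuttle shifts the total slightly, which is one reason the yield is quoted as "about 30".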


Key Terms

electron-transport chain
reduction (redox, oxidation-reduction, E′₀) potential
coenzyme Q (Q, ubiquinone)
Q pool
NADH-Q oxidoreductase (Complex I)
flavin mononucleotide (FMN)
iron-sulfur (nonheme iron) protein
Q-cytochrome c oxidoreductase (Complex III)
cytochrome c (Cyt c)
Rieske center
Q cycle
cytochrome c oxidase (Complex IV)
superoxide dismutase
catalase
ATP synthase (Complex V, F₁F₀ ATPase)
ATP-ADP translocase (adenine nucleotide translocase, ANT)
respiratory (acceptor) control
uncoupling protein (UCP)
programmed cell death (apoptosis)
mitochondrial outer membrane permeabilization (MOMP)
apoptosome
caspase

Problems


1. Breathe or ferment? Compare fermentation and respira- 
tion with respect to electron donors and electron acceptors. 


2. Reference states. The standard oxidation-reduction
potential for the reduction of O₂ to H₂O is given as 0.82 V
in Table 18.1. However, the value given in textbooks of 
chemistry is 1.23 V. Account for this difference. 


3. Less energetic electrons. Why are electrons carried by 
FADH₂ not as energy rich as those carried by NADH?
What is the consequence of this difference? 


4. Now prove it. Calculate the energy released by the reduction
of O₂ with FADH₂.


5. Thermodynamic constraint. Compare the ΔG°′ values for
the oxidation of succinate by NAD⁺ and by FAD. Use the
data given in Table 18.1 to find the E′₀ of the NAD⁺-NADH
and fumarate-succinate couples, and assume that
E′₀ for the FAD-FADH₂ redox couple is nearly 0.05 V.
Why is FAD rather than NAD⁺ the electron acceptor in
the reaction catalyzed by succinate dehydrogenase? 


6. The benefactor and beneficiary. Identify the oxidant and 
the reductant in the following reaction. 


Pyruvate + NADH + H⁺ ⇌ lactate + NAD⁺


7. Six of one, half dozen of the other. How is the redox
potential (ΔE′₀) related to the free-energy change of a
reaction (ΔG°′)?


8. Location, location, location. Iron is a component of many 
of the electron carriers of the electron-transport chain. How 
can it participate in a series of coupled redox reactions if the 
E′₀ value is +0.77 V, as seen in Table 18.1?


9. Line up. Place the following components of the electron- 
transport chain in their proper order: (a) cytochrome c; 
(b) Q-cytochrome c oxidoreductase; (c) NADH-Q reductase;
(d) cytochrome c oxidase; (e) ubiquinone. 


10. Match ’em. 

(a) Complex I ____      1. Q-cytochrome c oxidoreductase
(b) Complex II ____     2. Coenzyme Q
(c) Complex III ____    3. Succinate-Q reductase
(d) Complex IV ____     4. NADH-Q oxidoreductase
(e) Ubiquinone ____     5. Cytochrome c oxidase


11. Structural considerations. Explain why coenzyme Q is 
an effective mobile electron carrier in the electron-transport 
chain. 


12. Inhibitors. Rotenone inhibits electron flow through
NADH-Q oxidoreductase. Antimycin A blocks electron
flow between cytochromes b and c₁. Cyanide blocks electron
flow through cytochrome oxidase to O₂. Predict the
relative oxidation-reduction state of each of the following
respiratory-chain components in mitochondria that are
treated with each of the inhibitors: NAD⁺; NADH-Q
oxidoreductase; coenzyme Q; cytochrome c₁; cytochrome c;
cytochrome a.


13. Rumored to be a favorite of Elvis. Amytal is a barbitu- 
rate sedative that inhibits electron flow through Complex I. 
How would the addition of amytal to actively respiring 
mitochondria affect the relative oxidation—reduction states 
of the components of the electron-transport chain and the 
citric acid cycle? 


14. Efficiency. What is the advantage of having Complexes 
I, III, and IV associated with one another in the form of a 
respirasome? 


15. ROS, not ROUS. What are the reactive oxygen species 
and why are they especially dangerous to cells? 


16. Reclaim resources. Humans have only about 250 g of 
ATP, but even a couch potato needs about 83 kg of ATP to 
open the bag of chips and use the remote. How is this dis- 
crepancy between requirements and resources reconciled? 


17. Energy harvest. What is the yield of ATP when each of
the following substrates is completely oxidized to CO₂ by a
mammalian cell homogenate? Assume that glycolysis, the
citric acid cycle, and oxidative phosphorylation are fully
active.

(a) Pyruvate
(b) Lactate
(c) Fructose 1,6-bisphosphate
(d) Phosphoenolpyruvate
(e) Galactose
(f) Dihydroxyacetone phosphate


18. Potent poisons. What is the effect of each of the follow- 
ing inhibitors on electron transport and ATP formation by 
the respiratory chain? 
(a) Azide 

(b) Atractyloside 

(c) Rotenone 


(d) DNP 

(e) Carbon monoxide 

(f) Antimycin A 

19. A question of coupling. What is the mechanistic basis for 
the observation that the inhibitors of ATP synthase also 
lead to an inhibition of the electron-transport chain? 


20. A Brownian ratchet wrench. What causes the c subunits 
of ATP synthase to rotate? What determines the direction 
of rotation? 


21. Alternative routes. The most common metabolic sign of 
mitochondrial disorders is lactic acidosis. Why? 


22. Connections. How does the inhibition of ATP-ADP 
translocase affect the citric acid cycle? Glycolysis? 


23. O₂ consumption. Oxidative phosphorylation in mitochondria
is often monitored by measuring oxygen consumption.
When oxidative phosphorylation is proceeding
rapidly, the mitochondria will rapidly consume oxygen. If
there is little oxidative phosphorylation, only small amounts
of oxygen will be used. You are given a suspension of
isolated mitochondria and directed to add the following
compounds in the order from a to h. With the addition of
each compound, all of the previously added compounds
remain present. Predict the effect of each addition on oxygen
consumption by the isolated mitochondria.

(a) Glucose
(b) ADP + Pᵢ
(c) Citrate
(d) Oligomycin
(e) Succinate
(f) Dinitrophenol
(g) Rotenone
(h) Cyanide


24. P:O ratios. The number of molecules of inorganic
phosphate incorporated into organic form per atom of oxygen
consumed, termed the P:O ratio, was frequently used
as an index of oxidative phosphorylation.


(a) What is the relation of the P:O ratio to the ratio of the
number of protons translocated per electron pair (H⁺/2e⁻)
and the ratio of the number of protons needed to synthesize
ATP and transport it to the cytoplasm (P/H⁺)?


(b) What are the P:O ratios for electrons donated by 
matrix NADH and by succinate? 


25. Cyanide antidote. The immediate administration of 
nitrite is a highly effective treatment for cyanide poisoning. 
What is the basis for the action of this antidote? (Hint: 
Nitrite oxidizes ferrohemoglobin to ferrihemoglobin.) 


26. Runaway mitochondria 1. Suppose that the mitochon- 
dria of a patient oxidize NADH irrespective of whether 
ADP is present. The P:O ratio for oxidative phosphorylation
by these mitochondria is less than normal. Predict the
likely symptoms of this disorder. 


27. Recycling device. The cytochrome b component of 
Q-cytochrome c oxidoreductase enables both electrons of 
QH₂ to be effectively utilized in generating a proton-motive
force. Cite another recycling device in metabolism that 
brings a potentially dead end reaction product back into the 
mainstream. 


28. Crossover point. The precise site of action of a 
respiratory-chain inhibitor can be revealed by the crossover 
technique. Britton Chance devised elegant spectroscopic 
methods for determining the proportions of the oxidized 
and reduced forms of each carrier. This determination is 
feasible because the forms have distinctive absorption spec- 
tra, as illustrated in the adjoining graph for cytochrome c. 
You are given a new inhibitor and find that its addition to 
respiring mitochondria causes the carriers between NADH 
and QH₂ to become more reduced and those between cytochrome
c and O₂ to become more oxidized. Where does
your inhibitor act?

[Graph: absorbance coefficient (M⁻¹ cm⁻¹ × 10⁻⁵) versus wavelength (nm), 400 to 600 nm, for cytochrome c; the labeled curve is the reduced form.]


29. Runaway mitochondria 2. Years ago, uncouplers were 
suggested to make wonderful diet drugs. Explain why this 
idea was proposed and why it was rejected. Why might the 
producers of antiperspirants be supportive of the idea? 


30. Everything is connected. If actively respiring mitochon- 
dria are exposed to an inhibitor of ATP-ADP translocase, 
the electron-transport chain ceases to operate. Why? 


31. Identifying the inhibition. You are asked to determine 
whether a chemical is an electron-transport-chain inhibitor 
or an inhibitor of ATP synthase. Design an experiment to 
make this determination. 


32. To each according to its needs. It has been noted that the 
mitochondria of muscle cells often have more cristae than 
the mitochondria of liver cells. Provide an explanation for 
this observation. 


33. Opposites attract. An arginine residue (Arg 210) in the 
a subunit of ATP synthase is near the aspartate residue 
(Asp 61) in the matrix-side proton channel. How might Arg 
210 assist proton flow? 


34. Variable c subunits. Recall that the number of c sub- 
units in the c ring appears to range between 10 and 14. This 
number is significant because it determines the number of 
protons that must be transported to generate a molecule of 
ATP. Each 360-degree rotation of the γ subunit leads to the
synthesis and release of three molecules of ATP. Thus, if 
there are 10 c subunits in the ring (as was observed in a 
crystal structure of yeast mitochondrial ATP synthase), 
each ATP generated requires the transport of 10/3 = 3.33 
protons. How many protons are required to form ATP if 
the ring has 12 c subunits? 14? 
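
The ratio used in this problem generalizes to any ring size. A minimal sketch, assuming only what the problem states (one proton per c subunit per full turn, and three ATP per 360-degree rotation); the function name is hypothetical.

```python
# Sketch of the proton-cost ratio described in Problem 34.
from fractions import Fraction


def protons_per_atp(c_subunits: int, atp_per_rotation: int = 3) -> Fraction:
    """Protons transported per ATP for a c ring of the given size.

    One full rotation moves one proton per c subunit and releases
    `atp_per_rotation` molecules of ATP, so the cost per ATP is the ratio.
    """
    if c_subunits <= 0 or atp_per_rotation <= 0:
        raise ValueError("subunit and ATP counts must be positive")
    return Fraction(c_subunits, atp_per_rotation)


if __name__ == "__main__":
    # The yeast ring from the problem statement: 10 c subunits.
    cost = protons_per_atp(10)
    print(cost, round(float(cost), 2))
```

Using `Fraction` keeps the result exact, so ring sizes that are multiples of three give whole numbers without rounding error.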


35. Counterintuitive. Under some conditions, mitochon- 
drial ATP synthase has been observed to actually run in 
reverse. How would that situation affect the proton-motive 
force?

36. Etiology? What does that mean? What does the fact that
rotenone appears to increase the susceptibility to Parkinson 
disease indicate about the etiology of Parkinson disease? 


37. Exaggerating the difference. Why must ATP-ADP 
translocase (also called adenine nucleotide translocase or 
ANT) use Mg²⁺-free forms of ATP and ADP?


38. Respiratory control. The rate of oxygen consumption by 
mitochondria increases markedly when ADP is added and 
then returns to its initial value when the added ADP has 
been converted into ATP (see Figure 18.39). Why does the 
rate decrease? 


39. Same, but different. Why is the electroneutral exchange 
of H₂PO₄⁻ for OH⁻ indistinguishable from the electroneutral
symport of H₂PO₄⁻ and H⁺?


40. Multiple uses. Give an example of the use of the proton-motive
force in ways other than for the synthesis of ATP.


Chapter Integration Problems 


41. Just obeying the laws. Why do isolated F₁ subunits of
ATP synthase catalyze ATP hydrolysis? 


42. The right location. Some cytoplasmic kinases, enzymes 
that phosphorylate substrates at the expense of ATP, bind 
to voltage-dependent anion channels. What might the 
advantage of this binding be? 


43. No exchange. Mice that completely lack ATP-ADP 
translocase (ANT⁻/ANT⁻) can be made by using the
knockout technique. Remarkably, these mice are viable but 
have the following pathological conditions: (1) high serum 
levels of lactate, alanine, and succinate; (2) little electron 
transport; and (3) a six- to eightfold increase in the level of 
mitochondrial H2O2 compared with that in normal mice. 
Provide a possible biochemical explanation for each of these 
conditions. 


44. Maybe you shouldn’t take your vitamins. Exercise is 
known to increase insulin sensitivity and to ameliorate type 2 
diabetes (Chapter 27). Recent research suggests that taking 
antioxidant vitamins might mitigate the beneficial effects of 
exercise with respect to ROS protection. 


(a) What are the antioxidant vitamins? 

(b) How does exercise protect against ROS? 

(c) Explain why vitamins might counteract the effects of 
exercise. 


Data Interpretation Problem 


45. Mitochondrial disease. A mutation in a mitochondrial 
gene encoding a component of ATP synthase has been 
identified. People who have this mutation suffer from 
muscle weakness, ataxia, and retinitis pigmentosa. A tissue 
biopsy was performed on each of three patients having this
mutation, and submitochondrial particles were isolated that
were capable of succinate-sustained ATP synthesis. First,
the activity of the ATP synthase was measured on the addition
of succinate and the following results were obtained.

ATP synthase activity (nmol of ATP formed min⁻¹ mg⁻¹)

Controls     3.0
Patient 1    0.25
Patient 2    0.11
Patient 3    0.17

(a) What was the purpose of the addition of succinate? 
(b) What is the effect of the mutation on succinate- 
coupled ATP synthesis? 


Next, the ATPase activity of the enzyme was measured by 
incubating the submitochondrial particles with ATP in the 
absence of succinate. 


ATP hydrolysis (nmol of ATP hydrolyzed min⁻¹ mg⁻¹)

Controls     33
Patient 1    30
Patient 2    25
Patient 3    31

(c) Why was succinate omitted from the reaction? 

(d) What is the effect of the mutation on ATP hydrolysis? 
(e) What do these results, in conjunction with those 
obtained in the first experiment, tell you about the nature of 
the mutation? 


Mechanism Problem 


46. Chiral clue. ATPγS, a slowly hydrolyzed analog of
ATP, can be used to probe the mechanism of phosphoryl-transfer
reactions. Chiral ATPγS has been synthesized
containing ¹⁸O in a specific γ position and ordinary ¹⁶O
elsewhere in the molecule. The hydrolysis of this chiral
molecule by ATP synthase in ¹⁷O-enriched water yields
inorganic [¹⁶O,¹⁷O,¹⁸O]thiophosphate having the following
absolute configuration. In contrast, the hydrolysis of
this chiral ATPγS by a calcium-pumping ATPase from
muscle gives thiophosphate of the opposite configuration.
What is the simplest interpretation of these data?

[Reaction scheme: ATPγS + H₂¹⁷O yields ADP + H⁺ + thiophosphate; the structural drawings are not reproduced.]

The Light Reactions of Photosynthesis

[Figure labels: Photosystem I, Photosystem II, proton gradient, ADP, ATP.]

Chloroplasts (left) convert light energy into chemical energy. High-energy
electrons in chloroplasts are transported through two photosystems (right).
In this transit, which culminates in the generation of reducing power, ATP is
synthesized in a manner analogous to mitochondrial ATP synthesis. In
contrast with mitochondrial electron transport, however, electrons in
chloroplasts are energized by light. [(Left) Created by Kristian Peters/GNU
Free Documentation License.]


OUTLINE

19.1 Photosynthesis Takes Place in Chloroplasts
19.2 Light Absorption by Chlorophyll Induces Electron Transfer
19.3 Two Photosystems Generate a Proton Gradient and NADPH in Oxygenic Photosynthesis
19.4 A Proton Gradient Across the Thylakoid Membrane Drives ATP Synthesis
19.5 Accessory Pigments Funnel Energy into Reaction Centers
19.6 The Ability to Convert Light into Chemical Energy Is Ancient

On our planet are organisms capable of collecting the electromagnetic
energy of the visible spectrum and converting it into chemical energy.
Green plants are the most obvious of these organisms, though 60% of this
conversion is carried out by algae and bacteria. This transformation is perhaps
the most important of all of the energy transformations that we will see
in our study of biochemistry; without it, life on our planet as we know it
simply could not exist.

The process of converting electromagnetic radiation into chemical energy
is called photosynthesis, which uses light energy to convert carbon dioxide
and water into carbohydrates and oxygen.

$$\mathrm{CO_2 + H_2O} \xrightarrow{\text{light}} \mathrm{(CH_2O) + O_2}$$

In this equation, (CH₂O) represents carbohydrate, primarily sucrose and
starch. These carbohydrates provide not only the energy to run the biological
world, but also the carbon molecules to make a wide array of biomolecules.
Photosynthetic organisms are called autotrophs (literally, "self-feeders")
because they can synthesize chemical fuels such as glucose from carbon
dioxide and water by using sunlight as an energy source and then recover
some of this energy from the synthesized glucose through the glycolytic
pathway and aerobic metabolism. Organisms that obtain energy from
chemical fuels only are called heterotrophs, which ultimately depend on
autotrophs for their fuel.

Photosynthetic yield

"If a year's yield of photosynthesis were
amassed in the form of sugar cane, it would
form a heap over two miles high and with a
base 43 square miles."
- G. E. Fogg

If all of this sugar cane were converted into
sugar cubes (0.5 inch, or 1.27 cm, on a
side) and stacked end to end, the sugar
cubes would extend 1.6 × 10¹⁰ miles
(2.6 × 10¹⁰ kilometers) or to the planet
Pluto.

We can think of photosynthesis as comprising two parts: the light 
reactions and the dark reactions. In the light reactions, light energy is trans- 
formed into two forms of biochemical energy with which we are already 
familiar: reducing power and ATP. The products of the light reactions 
are then used in the dark reactions to drive the reduction of CO, and its 
conversion into glucose and other sugars. The dark reactions are also called 
the Calvin cycle or light-independent reactions and will be discussed in 
Chapter 20. 


Photosynthesis converts light energy into chemical energy 


The light reactions of photosynthesis closely resemble the events of oxidative
phosphorylation. In Chapters 17 and 18, we learned that cellular respiration
is the oxidation of glucose to CO₂ with the reduction of O₂ to water,
a process that generates ATP. In photosynthesis, this process must be
reversed: reducing CO₂ and oxidizing H₂O to synthesize glucose.


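Photosynthesis:

$$\text{Energy} + 6\,\mathrm{H_2O} + 6\,\mathrm{CO_2} \longrightarrow \mathrm{C_6H_{12}O_6} + 6\,\mathrm{O_2}$$

Cellular respiration:

$$\mathrm{C_6H_{12}O_6} + 6\,\mathrm{O_2} \longrightarrow 6\,\mathrm{CO_2} + 6\,\mathrm{H_2O} + \text{energy}$$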
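
That the two overall equations are exact chemical opposites can be checked by counting atoms on each side. The following is a small sketch of that bookkeeping; the data layout and function name are assumptions made for the example, and the formulas are entered by hand rather than parsed.

```python
# Sketch: atom-balance check for the overall photosynthesis equation,
# 6 H2O + 6 CO2 -> C6H12O6 + 6 O2 (respiration is the same equation reversed).
from collections import Counter

# Elemental composition of each species, entered by hand.
COMPOSITION = {
    "CO2": {"C": 1, "O": 2},
    "H2O": {"H": 2, "O": 1},
    "C6H12O6": {"C": 6, "H": 12, "O": 6},
    "O2": {"O": 2},
}


def atom_totals(side: dict) -> Counter:
    """Total atoms of each element for a {species: coefficient} mapping."""
    totals = Counter()
    for species, coefficient in side.items():
        for element, count in COMPOSITION[species].items():
            totals[element] += coefficient * count
    return totals


reactants = {"H2O": 6, "CO2": 6}
products = {"C6H12O6": 1, "O2": 6}

if __name__ == "__main__":
    print(dict(atom_totals(reactants)))
    print(dict(atom_totals(products)))
    print("balanced:", atom_totals(reactants) == atom_totals(products))
```

The check covers mass balance only; the energy term in each equation is what distinguishes the direction of the process.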


Although the processes of respiration and photosynthesis are chemically 
opposite each other, the biochemical principles governing the two processes 
are nearly identical. The key to both processes is the generation of high- 
energy electrons. The citric acid cycle oxidizes carbon fuels to CO₂ to generate
high-energy electrons. The flow of these high-energy electrons down an
electron-transport chain generates a proton-motive force. This proton-motive
force is then transduced by ATP synthase to form ATP. To synthesize
glucose from CO₂, high-energy electrons are required for two purposes:
(1) to provide reducing power in the form of NADPH to reduce CO₂ and
(2) to generate ATP to power the reduction. How can high-energy electrons
be generated without using a chemical fuel? Photosynthesis uses energy from
light to boost electrons from a low-energy state to a high-energy state. In the
high-energy, unstable state, nearby molecules can abscond with the excited
electrons. These electrons are used to produce reducing power, and they are
used indirectly through an electron-transport chain to generate a proton-motive
force across a membrane, which subsequently drives the synthesis of ATP.
The reactions that are powered by sunlight are called the light reactions 
(Figure 19.1). 


Figure 19.1 The light 
reactions of photosynthesis. 
Light is absorbed and the 
energy is used to drive 
electrons from water to 
generate NADPH and to drive 
protons across a membrane. 
These protons return through 
ATP synthase to make ATP. 




Photosynthesis in green plants is mediated by two kinds of light reactions. 
Photosystem I generates reducing power in the form of NADPH but, in the 
process, becomes electron deficient. Photosystem II oxidizes water and 
transfers the electrons to replenish the electrons lost by photosystem I. A 
side product of these reactions is O2. Electron flow from photosystem II to
photosystem I generates the transmembrane proton gradient, augmented by 
the protons released by the oxidation of water, that drives the synthesis of 
ATP. In keeping with the similarity of their principles of operation, both 
processes take place in double-membrane organelles: mitochondria for 
cellular respiration and chloroplasts for photosynthesis. 


19.1 Photosynthesis Takes Place in Chloroplasts 


Photosynthesis, the means of converting light into chemical energy, takes 
place in organelles called chloroplasts, typically 5 μm long. Like a
mitochondrion, a chloroplast has an outer membrane and an inner membrane, with
an intervening intermembrane space (Figure 19.2). The inner membrane 
surrounds a space called the stroma, which is the site of the dark reactions of 
photosynthesis (Section 20.1). In the stroma are membranous structures 
called thylakoids, which are flattened sacs, or discs. The thylakoid sacs are 
stacked to form a granum. Different grana are linked by regions of thylakoid 
membrane called stroma lamellae (Figure 19.3). The thylakoid membranes 
separate the thylakoid space from the stroma space. Thus, chloroplasts have 
three different membranes (outer, inner, and thylakoid membranes) and three 
separate spaces (intermembrane, stroma, and thylakoid spaces). In developing 
chloroplasts, thylakoids arise from budding of the inner membrane, and so 
they are analogous to the mitochondrial cristae. Like the mitochondrial 
cristae, they are the site of coupled oxidation-reduction reactions of the
light reactions that generate the proton-motive force.

Figure 19.2 Diagram of a chloroplast.


The primary events of photosynthesis take place 
in thylakoid membranes 


The thylakoid membranes contain the energy-transforming machinery: 
light-harvesting proteins, reaction centers, electron-transport chains, and 
ATP synthase. These membranes contain nearly equal amounts of lipids 
and proteins. The lipid composition is highly distinctive: about 40% of 
the total lipids are galactolipids and 4% are sulfolipids, whereas only 10% are 
phospholipids. The thylakoid membrane and the inner membrane, like
the inner mitochondrial membrane, are impermeable to most molecules and
ions. The outer membrane of a chloroplast, like that of a mitochondrion, is
highly permeable to small molecules and ions. The stroma contains the
soluble enzymes that utilize the NADPH and ATP synthesized by the
thylakoids to convert CO2 into sugar. Plant-leaf cells contain between 1 and
100 chloroplasts, depending on the species, cell type, and growth conditions.


Photosynthetic catastrophe

If photosynthesis were to cease, all higher
forms of life would be extinct in about
25 years. A milder version of such a
catastrophe ended the Cretaceous period
65.1 million years ago when a large asteroid
struck the Yucatan Peninsula of Mexico.
Enough dust was sent into the atmosphere
that photosynthetic capacity was greatly
diminished, which apparently led to the
disappearance of the dinosaurs and
allowed the mammals to rise to
prominence.


Figure 19.3 Electron micrograph of a
chloroplast from a spinach leaf. The
thylakoid membranes pack together to form
grana. [Courtesy of Dr. Kenneth Miller.]

Figure 19.4 Cyanobacteria. A colony of the
photosynthetic filamentous cyanobacterium
Anabaena shown at 450× magnification.
Ancestors of these bacteria are thought to
have evolved into present-day chloroplasts.
[James W. Richardson/Visuals Unlimited.]


Chloroplasts arose from an endosymbiotic event 


Chloroplasts contain their own DNA and the machinery for replicating
and expressing it. However, chloroplasts are not autonomous:
they also contain many proteins encoded by nuclear DNA. How did the 
intriguing relation between the cell and its chloroplasts develop? We now 
believe that, in a manner analogous to the evolution of mitochondria 
(Section 18.1), chloroplasts are the result of endosymbiotic events in which 
a photosynthetic microorganism, most likely an ancestor of a cyanobacte- 
rium (Figure 19.4), was engulfed by a eukaryotic host. Evidence suggests 
that chloroplasts in higher plants and green algae are derived from a single 
endosymbiotic event, whereas those in red and brown algae are derived 
from at least one additional event. 

The chloroplast genome is smaller than that of a cyanobacterium, but 
the two genomes have key features in common. Both are circular and have a
single start site for DNA replication, and the genes of both are arranged in 
operons—sequences of functionally related genes under common control 
(Chapter 31). In the course of evolution, many of the genes of the chloro- 
plast ancestor were transferred to the plant cell’s nucleus or, in some cases, 
lost entirely, thus establishing a fully dependent relation. 


19.2 Light Absorption by Chlorophyll 
Induces Electron Transfer 


The trapping of light energy is the key to photosynthesis. The first event is 
the absorption of light by a photoreceptor molecule. The principal photore- 
ceptor in the chloroplasts of most green plants is the pigment molecule 
chlorophyll a, a substituted tetrapyrrole (Figure 19.5). The four nitrogen
atoms of the pyrroles are coordinated to a magnesium ion. Unlike a
porphyrin such as heme, chlorophyll has a reduced pyrrole ring and an
additional 5-carbon ring fused to one of the pyrrole rings. Another
distinctive feature of chlorophyll is the presence of phytol, a highly
hydrophobic 20-carbon alcohol, esterified to an acid side chain.


Figure 19.5 Chlorophyll. Like
heme, chlorophyll a is a cyclic
tetrapyrrole. One of the pyrrole
rings (shown in red) is reduced,
and an additional five-carbon ring
(shown in blue) is fused to
another pyrrole ring. A phytol
chain (shown in green) is
connected by an ester linkage.
Magnesium ion binds at the
center of the structure.

Chlorophylls are very effective photoreceptors because they contain 
networks of conjugated double bonds—alternating single and double 
bonds. Such compounds are called conjugated polyenes. In polyenes, the 
electrons are not localized to a particular atomic nucleus and consequently 
can more readily absorb light energy. Chlorophylls have very strong absorp- 
tion bands in the visible region of the spectrum, where the solar output 
reaching Earth is maximal (Figure 19.6). Chlorophyll a's peak molar
extinction coefficient (ε), a measure of a compound's ability to absorb light,
is higher than $10^{5}\ \mathrm{M^{-1}\,cm^{-1}}$, among the highest observed
for organic compounds.
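To give a feel for what an extinction coefficient of this size means, the sketch below applies the Beer-Lambert law, A = ε c l, which is the standard relation between ε and measured absorbance and is brought in here as background rather than taken from the text. The 10 μM concentration and 1-cm path length are assumed values chosen only for illustration.

```python
def absorbance(epsilon_per_m_cm: float, concentration_m: float,
               path_cm: float = 1.0) -> float:
    """Beer-Lambert law: A = epsilon * c * l."""
    return epsilon_per_m_cm * concentration_m * path_cm


def fraction_transmitted(a: float) -> float:
    """Fraction of incident light that passes through, T = 10^(-A)."""
    return 10.0 ** (-a)


EPSILON_CHL_A = 1.0e5   # M^-1 cm^-1, the order of magnitude quoted in the text
concentration = 10e-6   # 10 micromolar: an assumed, illustrative value

a = absorbance(EPSILON_CHL_A, concentration)
t = fraction_transmitted(a)
# An absorbance of about 1: roughly 10% of the light is transmitted.
print(f"A = {a:.2f}, transmitted fraction = {t:.2f}")
```

Under these assumptions, a solution only 10 μM in pigment would absorb about nine-tenths of the light at the absorption peak in a 1-cm path.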

What happens when light is absorbed by a pigment molecule such as 
chlorophyll? The energy from the light excites an electron from its ground 
energy level to an excited energy level (Figure 19.7). This high-energy 
electron can have one of two fates. For most compounds that absorb light, 
the electron simply returns to the ground state and the absorbed energy is 
converted into heat. However, if a suitable electron acceptor is nearby, the 
excited electron can move from the initial molecule to the acceptor 
(Figure 19.8). A positive charge forms on the initial molecule, owing to the 
loss of an electron, and a negative charge forms on the acceptor, owing to the 
gain of an electron. Hence, this process is referred to as photoinduced charge 
separation. 

In chloroplasts, the site at which the charge separation takes place
within each photosystem is called the reaction center. The photosynthetic
apparatus is arranged to maximize photoinduced charge separation and 
minimize an unproductive return of the electron to its ground state. The 
electron, extracted from its initial site by the absorption of light, now has 
reducing power: it can reduce other molecules to store the energy originally 
obtained from light in chemical forms. 


Figure 19.8 Photoinduced charge separation. If a suitable electron acceptor is nearby, an
electron that has been moved to a high energy level by light absorption can move from the excited
molecule (D) to the acceptor (A).


A special pair of chlorophylls initiate charge separation 


Photosynthetic bacteria such as Rhodopseudomonas viridis contain a
photosynthetic reaction center that has been revealed at atomic resolution. The
bacterial reaction center consists of four polypeptides: L (31 kd), M (36 kd),
and H (28 kd) subunits and C, a c-type cytochrome with four c-type hemes
(Figure 19.9). Sequence comparisons and low-resolution structural studies have
revealed that the bacterial reaction center is homologous to the more complex
plant systems. Thus, many of our observations of the bacterial system will
apply to plant systems as well.


Figure 19.6 Light absorption by
chlorophyll a. Chlorophyll a absorbs visible
light efficiently as judged by the extinction
coefficient near $10^{5}\ \mathrm{M^{-1}\,cm^{-1}}$.

Figure 19.7 Light absorption. The
absorption of light leads to the excitation of an
electron from its ground state to a higher
energy level.

Figure 19.9 Bacterial photosynthetic reaction center. The core of the reaction center
from Rhodopseudomonas viridis consists of two similar chains: L (red) and M (blue). An H chain
(white) and a cytochrome subunit (yellow) complete the structure. Notice that the L and M
subunits are composed largely of α helices that span the membrane. Also notice that a chain
of electron-carrying prosthetic groups, beginning with a special pair of bacteriochlorophylls and
ending at a bound quinone, runs through the structure from bottom to top in this view. [Drawn
from 1PRC.pdb.]

The L and M subunits form the structural and functional core of the 
bacterial photosynthetic reaction center (see Figure 19.9). Each of these 
homologous subunits contains five transmembrane helices, in contrast with 
the H subunit, which has just one. The H subunit lies on the cytoplasmic 
side of the cell membrane, and the cytochrome subunit lies on the exterior 
face of the cell membrane, called the periplasmic side because it faces the 
periplasm, the space between the cell membrane and the cell wall. Four 
bacteriochlorophyll b (BChl-b) molecules, two bacteriopheophytin b (BPh)
molecules, two quinones (QA and QB), and a ferrous ion are associated with
the L and M subunits.

Bacteriochlorophylls are photoreceptors similar to chlorophylls, except 
for the reduction of an additional pyrrole ring and other minor differences 
that shift their absorption maxima to the near infrared, to wavelengths as 
long as 1000 nm. Bacteriopheophytin is the term for a bacteriochlorophyll
that has two protons instead of a magnesium ion at its center. 

The reaction begins with light absorption by a pair of BChl-b molecules
that lie near the periplasmic side of the membrane in the L-M dimer. The
pair of BChl-b molecules is called the special pair because of its fundamental
role in photosynthesis. The special pair absorbs light maximally at 960 nm,
and, for this reason, is often called P960 (P stands for pigment). After
absorbing light, the excited special pair ejects an electron, which is
transferred through another BChl-b to a bacteriopheophytin (Figure 19.10,
steps 1 and 2). This initial charge separation yields a positive charge on the
special pair (P960+) and a negative charge on BPh (BPh−). The electron
ejection and transfer take place in less than 10 picoseconds ($10^{-11}$ s).

A nearby electron acceptor, a tightly bound quinone (QA), quickly
grabs the electron away from BPh− before the electron has a chance to fall
back to the P960 special pair. From QA, the electron moves to a more
loosely associated quinone, QB. The absorption of a second photon and the
movement of a second electron from the special pair through the
bacteriopheophytin to the quinones complete the two-electron reduction of QB
from Q to QH2. Because the QB-binding site lies near the cytoplasmic side
of the membrane, two protons are taken up from the cytoplasm, contributing
to the development of a proton gradient across the cell membrane (Figure
19.10, steps 5, 6, and 7).

In their high-energy states, P960+ and BPh− could undergo charge
recombination; that is, the electron on BPh− could move back to neutralize
the positive charge on the special pair. Its return to the special pair would
waste a valuable high-energy electron and simply convert the absorbed light
energy into heat. How is charge recombination prevented? Two factors in
the structure of the reaction center work together to suppress charge
recombination nearly completely (Figure 19.10, steps 3 and 4). First, the next
electron acceptor (QA) is less than 10 Å away from BPh−, and so the
electron is rapidly transferred farther away from the special pair. Second, one
of the hemes of the cytochrome subunit is less than 10 Å away from the special
pair, and so the positive charge on P960 is neutralized by the transfer of an
electron from the reduced cytochrome.


Figure 19.10 Electron chain in the
photosynthetic bacterial reaction center.
The absorption of light by the special pair
(P960) results in the rapid transfer of an
electron from this site to a bacteriopheophytin
(BPh), creating a photoinduced charge
separation (steps 1 and 2). (The asterisk on
P960 stands for excited state.) The possible
return of the electron from the pheophytin to
the oxidized special pair is suppressed by the
“hole” in the special pair being refilled with an
electron from the cytochrome subunit and the
electron from the pheophytin being
transferred to a quinone (QA) that is farther
away from the special pair (steps 3 and 4).
QA passes the electron to QB. The reduction
of a quinone (QB) on the cytoplasmic side of
the membrane results in the uptake of two
protons from the cytoplasm (steps 5 and 6).
The reduced quinone can move into the
quinone pool in the membrane (step 7).

Figure 19.11 Cyclic electron flow in the
bacterial reaction center. Excited electrons
from the P960 reaction center flow through
bacteriopheophytin (BPh), a pair of quinone
molecules (QA and QB), the cytochrome bc1
complex, and finally through cytochrome c2 to
the reaction center. The cytochrome bc1
complex pumps protons as a result of electron
flow, which powers the formation of ATP.

Figure 19.12 Two photosystems. The absorption of photons by two
distinct photosystems (PS I and PS II) is required for complete electron
flow from water to NADP+.


Cyclic electron flow reduces the cytochrome of the reaction center 


The cytochrome subunit of the reaction center must regain an electron to
complete the cycle. It does so by taking back two electrons from reduced
quinone (QH2). QH2 first enters the Q pool in the membrane, where it is
reoxidized to Q by complex bc1, which is homologous to complex III of the
respiratory electron-transport chain. Complex bc1 transfers the electrons
from QH2 to cytochrome c2, a water-soluble protein in the periplasm, and
in the process pumps protons into the periplasmic space. The electrons now
on cytochrome c2 flow to the cytochrome subunit of the reaction center. The
flow of electrons is thus cyclic (Figure 19.11). The proton gradient
generated in the course of this cycle drives the generation of ATP through the
action of ATP synthase.


19.3 Two Photosystems Generate a Proton Gradient and
NADPH in Oxygenic Photosynthesis 


Photosynthesis is more complicated in green plants than in photosynthetic 
bacteria. In green plants, photosynthesis depends on the interplay of two 
kinds of membrane-bound, light-sensitive complexes—photosystem I (PS I) 
and photosystem II (PS II), as shown in Figure 19.12. There are similarities
in photosynthesis between green plants and photosynthetic bacteria. Both
require light to energize reaction centers consisting of special pairs, called
P680 for photosystem II and P700 for photosystem I, and both transfer
electrons by using electron-transport chains. However, in plants, electron
flow is not cyclic but progresses from photosystem II to
photosystem I under most circumstances.

Photosystem I, which responds to light with wave- 
lengths shorter than 700 nm, uses light-derived high- 
energy electrons to create biosynthetic reducing power in 
the form of NADPH, a versatile reagent for driving bio- 
synthetic processes. The electrons for creating one mol- 
ecule of NADPH are taken from two molecules of water 
by photosystem II, which responds to wavelengths 
shorter than 680 nm. A molecule of O2 is generated 
as a side product of the actions of photosystem II. The elec- 
trons travel from photosystem II to photosystem I 
through cytochrome bf, a membrane-bound complex 
homologous to Complex III in oxidative phosphoryla- 
tion. Cytochrome bf generates a proton gradient across 
the thylakoid membrane that drives the formation of 
ATP. Thus, the two photosystems cooperate to produce 
NADPH and ATP. 


Photosystem II transfers electrons from water to plastoquinone 
and generates a proton gradient 


Photosystem II, an enormous transmembrane assembly of more than 
20 subunits, catalyzes the light-driven transfer of electrons from water to 
plastoquinone. This electron acceptor closely resembles ubiquinone, a
component of the mitochondrial electron-transport chain. Plastoquinone cycles
between an oxidized form (Q) and a reduced form (QH2, plastoquinol). The
overall reaction catalyzed by photosystem II is

$$2\,\mathrm{Q} + 2\,\mathrm{H_2O} \xrightarrow{\text{Light}} \mathrm{O_2} + 2\,\mathrm{QH_2}$$

The electrons in QH2 are at a higher redox potential than those in H2O.
Recall that, in oxidative phosphorylation, electrons flow from ubiquinol to
an acceptor, O2, which is at a lower potential. Photosystem II drives the
reaction in a thermodynamically uphill direction by using the free energy
of light.

This reaction is similar to one catalyzed by the bacterial system in that a 
quinone is converted from its oxidized into its reduced form. Photosystem II 
is reasonably similar to the bacterial reaction center (Figure 19.13). The core 
of the photosystem is formed by D1 and D2, a pair of similar 32-kd subunits 
that span the thylakoid membrane. These subunits are homologous to the 
L and M chains of the bacterial reaction center. Unlike the bacterial system,
photosystem II contains a large number of additional subunits that bind 
more than 30 chlorophyll molecules altogether and increase the efficiency 
with which light energy is absorbed and transferred to the reaction center 
(Section 19.5).

The photochemistry of photosystem II begins with
excitation of a special pair of chlorophyll molecules that
are bound by the D1 and D2 subunits (Figure 19.14).
Because the chlorophyll a molecules of the special pair
absorb light at 680 nm, the special pair is often called
P680. On excitation, P680 rapidly transfers an electron to
a nearby pheophytin. From there, the electron is
transferred first to a tightly bound plastoquinone at site QA and
then to a mobile plastoquinone at site QB. This electron
flow is entirely analogous to that in the bacterial system.
With the arrival of a second electron and the uptake of two
protons, the mobile plastoquinone is reduced to QH2. At
this point, the energy of two photons has been safely and
efficiently stored in the reducing potential of QH2.

The major difference between the bacterial system and
photosystem II is the source of the electrons that are used
to neutralize the positive charge formed on the special
pair. P680+, a very strong oxidant, extracts electrons from
water molecules bound at the manganese center. The structure of this center
includes a calcium ion and four manganese ions. Manganese was apparently
evolutionarily selected for this role because of its ability to exist in multiple
oxidation states (Mn2+, Mn3+, Mn4+, Mn5+) and to form strong bonds
with oxygen-containing species. The manganese center, in its reduced form,
oxidizes two molecules of water to form a single molecule of oxygen. Each
time the absorption of a photon kicks an electron out of P680, the
positively charged special pair extracts an electron from the manganese center
(Figure 19.15). However, the electrons do not come directly from the
manganese ions. A tyrosine residue (often designated Z) of subunit D1 in
photosystem II is the immediate electron donor, forming a tyrosine radical. The
tyrosine radical removes electrons from the manganese ions, which in turn
remove electrons from H2O to generate O2 and H+. Four photons must be
absorbed to extract four electrons from two water molecules (Figure 19.16).
The four electrons harvested from water are used to reduce two molecules
of Q to QH2.


Figure 19.13 The structure of
photosystem II. The D1 and D2 subunits
are shown in red and blue, respectively, and
the numerous bound chlorophyll molecules
are shown in green. Notice that the special
pair and the manganese center (the site of
oxygen evolution) lie toward the
thylakoid-lumen side of the membrane. [Drawn from
1S5L.pdb.]

Figure 19.14 Electron flow through photosystem II. Light
absorption induces electron transfer from P680 down an
electron-transfer pathway to an exchangeable plastoquinone. The positive
charge on P680 is neutralized by electron flow from water molecules
bound at the manganese center.

Evolution of oxygen is evident by the
generation of bubbles in the aquatic plant
Elodea. [Colin Milkins/Oxford Scientific Films/
Photolibrary.]

Figure 19.15 Four photons are required
to generate one oxygen molecule. When
dark-adapted chloroplasts are exposed to a
brief flash of light, one electron passes
through photosystem II. Monitoring the O2
released after each flash reveals that four
flashes are required to generate each O2
molecule. The peaks in O2 release are after
the 3rd, 7th, and 11th flashes because the
dark-adapted chloroplasts start in the S1
state, that is, the one-electron oxidized state.
[Plot: O2 evolved per flash versus flash number, flashes 1 to 12.]

Figure 19.16 A plausible
scheme for oxygen evolution
from the manganese center.
The deduced core structure of
the manganese center including
four manganese ions and one
calcium ion is shown, although
many additional ligands are
omitted for clarity. The center is
oxidized, one electron at a time,
until two bound H2O molecules
are linked to form a molecule of
O2, which is then released from
the center. A tyrosine residue
(not shown) participates in the
coupled proton-electron transfer
steps. The structures are
designated S0 to S4 to indicate
the number of electrons that
have been removed.
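The flash experiment of Figure 19.15 follows directly from the S-state scheme of Figure 19.16, and the logic can be sketched in a few lines of Python. The model below is deliberately idealized: it assumes that every flash advances the manganese center by exactly one S state with no misses, and that reaching S4 releases one O2 and resets the center to S0. The function name `oxygen_per_flash` is illustrative only.

```python
def oxygen_per_flash(n_flashes: int, start_state: int = 1) -> list:
    """Return 1 for each flash that releases an O2 molecule, otherwise 0.

    Idealized S-state cycle: each flash removes one electron (S_n -> S_n+1);
    on reaching S4 the center releases O2 and returns to S0.
    """
    state = start_state
    released = []
    for _ in range(n_flashes):
        state += 1
        if state == 4:
            released.append(1)
            state = 0
        else:
            released.append(0)
    return released


# Dark-adapted chloroplasts start in S1, as stated in Figure 19.15.
yields = oxygen_per_flash(12, start_state=1)
peak_flashes = [i + 1 for i, y in enumerate(yields) if y]
print(peak_flashes)  # [3, 7, 11]
```

Starting the same model in S0 instead would shift the peaks to flashes 4, 8, and 12, which is why the observed pattern identifies S1 as the dark-adapted state. Real measurements show damped rather than sharp peaks, an effect this sketch does not attempt to model.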

Photosystem II spans the thylakoid membrane such that the site of 
quinone reduction is on the side of the stroma, whereas the manganese 
center, hence the site of water oxidation, lies in the thylakoid lumen. Thus, 
the two protons that are taken up with the reduction of Q to QH2 come from 
the stroma, and the four protons that are liberated in the course of water 
oxidation are released into the lumen. This distribution of protons
generates a proton gradient across the thylakoid membrane characterized by
an excess of protons in the thylakoid lumen compared with the stroma
(Figure 19.17).


Cytochrome bf links photosystem II to photosystem I


Electrons flow from photosystem II to photosystem I through the
cytochrome bf complex. This complex catalyzes the transfer of electrons from
plastoquinol (QH2) to plastocyanin (Pc), a small, soluble copper protein in
the thylakoid lumen.

$$\mathrm{QH_2} + 2\,\mathrm{Pc(Cu^{2+})} \longrightarrow \mathrm{Q} + 2\,\mathrm{Pc(Cu^{+})} + 2\,\mathrm{H^{+}_{thylakoid\ lumen}}$$


The two protons from plastoquinol are released into the thylakoid lumen. 
This reaction is reminiscent of that catalyzed by Complex III in oxidative 
phosphorylation, and most components of the cytochrome bf complex are 
homologous to those of Complex III. The cytochrome bf complex includes 
four subunits: a 23-kd cytochrome with two b-type hemes, a 20-kd
Rieske-type Fe-S protein, a 33-kd cytochrome f with a c-type cytochrome, and a
17-kd chain. 

This complex catalyzes the reaction by proceeding through the Q cycle 
(see Figure 18.12). In the first half of the Q cycle, plastoquinol (QH2) is
oxidized to plastoquinone (Q), one electron at a time. The electrons from 
plastoquinol flow through the Fe-S protein to convert oxidized plastocyanin 
(Pc) into its reduced form. 

In the second half of the Q cycle, cytochrome bf reduces a molecule of 
plastoquinone from the Q pool to plastoquinol, taking up two protons from 
one side of the membrane, and then reoxidizes plastoquinol to release these 
protons on the other side. The enzyme is oriented so that protons are 
released into the thylakoid lumen and taken up from the stroma, contri- 
buting further to the proton gradient across the thylakoid membrane 
(Figure 19.18). 


Photosystem I uses light energy to generate reduced 
ferredoxin, a powerful reductant 


The final stage of the light reactions is catalyzed by photosystem I, a
transmembrane complex consisting of about 14 polypeptide chains and multiple
associated proteins and cofactors (Figure 19.19). The core of this system is
a pair of similar subunits, psaA (83 kd) and psaB (82 kd). These subunits are
quite a bit larger than the core subunits of photosystem II and the bacterial
reaction center. Nonetheless, they appear to be homologous; the terminal
40% of each subunit is similar to a corresponding subunit of photosystem II.
A special pair of chlorophyll a molecules lie at the center of the structure and
absorb light maximally at 700 nm. This center, called P700, initiates
photoinduced charge separation (Figure 19.20). The electron travels from P700
down a pathway through chlorophyll at site A0 and quinone at site A1 to a
set of 4Fe-4S clusters. The next step is the transfer of the electron to
ferredoxin (Fd), a soluble protein containing a 2Fe-2S cluster coordinated to four
cysteine residues (Figure 19.21). Ferredoxin transfers electrons to NADP+.
Meanwhile, P700+ captures an electron from reduced plastocyanin to
return to P700 so that P700 can be excited again. Thus, the overall reaction
catalyzed by photosystem I is a simple one-electron oxidation-reduction
reaction.

$$\mathrm{Pc(Cu^{+})} + \mathrm{Fd_{ox}} \xrightarrow{\text{Light}} \mathrm{Pc(Cu^{2+})} + \mathrm{Fd_{red}}$$

Given that the reduction potentials for plastocyanin and ferredoxin are
+0.37 V and −0.45 V, respectively, the standard free energy for this reaction
is $+79.1\ \mathrm{kJ\,mol^{-1}}$ ($+18.9\ \mathrm{kcal\,mol^{-1}}$). This uphill
reaction is driven by the absorption of a 700-nm photon, which has an energy
of $171\ \mathrm{kJ\,mol^{-1}}$ ($40.9\ \mathrm{kcal\,mol^{-1}}$).


Figure 19.17 Proton-gradient direction.
Photosystem II releases protons into the
thylakoid lumen and takes them up from the
stroma. The result is a pH gradient across the
thylakoid membrane with an excess of
protons (low pH) inside.

Figure 19.18 Cytochrome bf contribution
to proton gradient. The cytochrome bf
complex oxidizes QH2 to Q through the Q
cycle. Four protons are released into the
thylakoid lumen in each cycle.

Figure 19.19 The structure of
photosystem I. The psaA and psaB subunits
are shown in red and blue, respectively.
Notice the numerous bound chlorophyll
molecules, shown in green, including the
special pair, as well as the iron-sulfur clusters
that facilitate electron transfer from the
stroma. [Drawn from 1JBO.pdb.]

Figure 19.20 Electron flow through
photosystem I to ferredoxin. Light
absorption induces electron transfer from
P700 down an electron-transfer pathway that
includes a chlorophyll molecule, a quinone
molecule, and three 4Fe-4S clusters to reach
ferredoxin. The positive charge left on P700 is
neutralized by electron transfer from reduced
plastocyanin.
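The two energies quoted for the photosystem I reaction can be reproduced from first principles. The sketch below assumes the standard relations ΔG°′ = −nFΔE′0 (introduced for oxidative phosphorylation) and E = hc/λ per photon, scaled by Avogadro's number; the function names are illustrative, and the constants are standard physical values rather than figures taken from the text.

```python
FARADAY_KJ_PER_V_MOL = 96.485   # kJ V^-1 mol^-1
PLANCK_J_S = 6.62607015e-34     # J s
LIGHT_SPEED_M_S = 2.99792458e8  # m s^-1
AVOGADRO = 6.02214076e23        # mol^-1


def delta_g_standard(e_acceptor_v: float, e_donor_v: float,
                     n_electrons: int = 1) -> float:
    """Standard free-energy change in kJ/mol: -n * F * (E_acceptor - E_donor)."""
    return -n_electrons * FARADAY_KJ_PER_V_MOL * (e_acceptor_v - e_donor_v)


def photon_energy_kj_per_mol(wavelength_nm: float) -> float:
    """Energy of one mole of photons of the given wavelength, in kJ/mol."""
    joules_per_photon = PLANCK_J_S * LIGHT_SPEED_M_S / (wavelength_nm * 1e-9)
    return joules_per_photon * AVOGADRO / 1000.0


# Electron moves from plastocyanin (+0.37 V) to ferredoxin (-0.45 V).
dg = delta_g_standard(e_acceptor_v=-0.45, e_donor_v=+0.37)
e700 = photon_energy_kj_per_mol(700)
print(f"dG = {dg:+.1f} kJ/mol, 700-nm photon = {e700:.0f} kJ/mol")
```

Under these assumptions, the photon supplies roughly twice the energy that the one-electron transfer requires, which leaves a comfortable margin for the losses that accompany each step of the electron-transfer pathway.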


Figure 19.21 Structure of ferredoxin.
In plants, ferredoxin contains a 2Fe-2S cluster.
This protein accepts electrons from
photosystem I and carries them to
ferredoxin-NADP+ reductase. [Drawn from 1FXA.pdb.]


Ferredoxin-NADP+ reductase converts NADP+ into NADPH


Although reduced ferredoxin is a strong reductant, it is not useful for 
driving many reactions, in part because ferredoxin carries only one available 
electron. In contrast, NADPH, a two-electron reductant, is a widely used 
electron donor in biosynthetic processes, including the reactions of the 
Calvin cycle (Chapter 20). How is reduced ferredoxin used to drive 
the reduction of NADP+ to NADPH? This reaction is catalyzed by
ferredoxin-NADP+ reductase, a flavoprotein with an FAD prosthetic group
(Figure 19.22A). The bound FAD moiety accepts two electrons and
two protons from two molecules of reduced ferredoxin to form FADH2
(Figure 19.22B). The enzyme then transfers a hydride ion (H−) to NADP+ to
form NADPH. This reaction takes place on the stromal side of the
membrane. Hence, the uptake of a proton in the reduction of NADP+
further contributes to the generation of the proton gradient across the
thylakoid membrane.


Figure 19.22 Structure and function of ferredoxin-NADP+ reductase. (A) Structure of
ferredoxin-NADP+ reductase. This enzyme accepts electrons, one at a time, from ferredoxin (shown
in orange). (B) Ferredoxin-NADP+ reductase first accepts two electrons and two protons from two
molecules of reduced ferredoxin (Fd) to form FADH2, which then transfers two electrons and a
proton to NADP+ to form NADPH. [Drawn from 1EWY.pdb.]

The cooperation between photosystem I and photosystem II creates a 
flow of electrons from H2O to NADP+. The pathway of electron flow is
called the Z scheme of photosynthesis because the redox diagram from P680 
to P700* looks like the letter Z (Figure 19.23). 


19.4 A Proton Gradient Across the Thylakoid 
Membrane Drives ATP Synthesis 


In 1966, André Jagendorf showed that chloroplasts synthesize ATP in the
dark when an artificial pH gradient is imposed across the thylakoid
membrane. To create this transient pH gradient, he soaked chloroplasts in a pH
4 buffer for several hours and then rapidly mixed them with a pH 8 buffer
containing ADP and Pi. The pH of the stroma suddenly increased to
8, whereas the pH of the thylakoid space remained at 4. A burst of ATP
synthesis then accompanied the disappearance of the pH gradient across the
thylakoid membrane (Figure 19.24). This incisive experiment was one of the
first to unequivocally support the hypothesis put forth by Peter Mitchell
that ATP synthesis is driven by proton-motive force.


Figure 19.23 Pathway of electron flow
from H2O to NADP+ in photosynthesis.
This endergonic reaction is made possible by
the absorption of light by photosystem II
(P680) and photosystem I (P700).
Abbreviations: Ph, pheophytin; QA and QB,
plastoquinone-binding proteins; Pc,
plastocyanin.

Figure 19.24 Jagendorf's demonstration.
Chloroplasts synthesize ATP after the
imposition of a pH gradient.

The principles of ATP synthesis in chloroplasts are nearly identical with 
those in mitochondria. ATP formation is driven by a proton-motive force in 
both photophosphorylation and oxidative phosphorylation. We have seen how 
light induces electron transfer through photosystems II and I and the cyto- 
chrome bf complex. At various stages in this process, protons are released 
into the thylakoid lumen or taken up from the stroma, generating a proton 
gradient. The gradient is maintained because the thylakoid membrane is 
essentially impermeable to protons. The thylakoid space becomes markedly 
acidic, with the pH approaching 4. The light-induced transmembrane proton 
gradient is about 3.5 pH units. As discussed in Section 18.4, energy inherent 
in the proton gradient, called the proton-motive force (Δp), is described as
the sum of two components: a charge gradient and a chemical gradient. In
chloroplasts, nearly all of Δp arises from the pH gradient, whereas, in
mitochondria, the contribution from the membrane potential is larger. The
reason for this difference is that the thylakoid membrane is quite permeable
to Cl⁻ and Mg²⁺. The light-induced transfer of H⁺ into the thylakoid space
is accompanied by the transfer of either Cl⁻ in the same direction or Mg²⁺
(1 Mg²⁺ per 2 H⁺) in the opposite direction. Consequently, electrical
neutrality is maintained and no membrane potential is generated. The influx of
Mg²⁺ into the stroma plays a role in the regulation of the Calvin cycle
(Section 20.2). A pH gradient of 3.5 units across the thylakoid membrane
corresponds to a proton-motive force of 0.20 V or a ΔG of −20.0 kJ mol⁻¹
(−4.8 kcal mol⁻¹).
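
The conversion from a pH gradient to a proton-motive force can be checked numerically. The sketch below is only an illustration: it assumes a temperature of 25 °C, treats the membrane potential as zero (as the text argues for thylakoids), and uses hypothetical function names that do not come from the text.

```python
import math

R = 8.314      # gas constant, J mol^-1 K^-1
F = 96485.0    # Faraday constant, C mol^-1


def proton_motive_force(delta_ph, membrane_potential=0.0, temperature=298.15):
    """Magnitude of the proton-motive force in volts.

    delta_ph           -- pH difference across the membrane
    membrane_potential -- electrical component in volts (assumed ~0 for thylakoids)
    temperature        -- absolute temperature in kelvin (25 degrees C assumed)
    """
    # Each pH unit contributes 2.303 RT/F volts (about 0.059 V at 25 degrees C).
    volts_per_ph_unit = math.log(10) * R * temperature / F
    return membrane_potential + volts_per_ph_unit * delta_ph


def free_energy_kj_per_mol(delta_p):
    """Free-energy change (kJ per mole of protons) for flow down the gradient."""
    return -F * delta_p / 1000.0


delta_p = proton_motive_force(3.5)
print(f"{delta_p:.2f} V, {free_energy_kj_per_mol(delta_p):.1f} kJ/mol")
```

Under these assumptions, a gradient of 3.5 pH units gives roughly 0.2 V and about −20 kJ mol⁻¹, in line with the values quoted above. Passing a nonzero `membrane_potential` shows how the mitochondrial case differs.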


The ATP synthase of chloroplasts closely resembles those 
of mitochondria and prokaryotes 


The proton-motive force generated by the light reactions is converted into
ATP by the ATP synthase of chloroplasts, also called the CF₁–CF₀ complex
(C stands for chloroplast and F for factor). CF₁–CF₀ ATP synthase closely
resembles the F₁–F₀ complex of mitochondria (Section 18.4). CF₀ conducts
protons across the thylakoid membrane, whereas CF₁ catalyzes the
formation of ATP from ADP and Pᵢ.

CF₀ is embedded in the thylakoid membrane. It consists of four
different polypeptide chains known as I (17 kd), II (16.5 kd), III (8 kd), and IV
(27 kd) having an estimated stoichiometry of 1:2:12:1. Subunits I and II
have sequence similarity to subunit b of the mitochondrial F₀ subunit, III
corresponds to subunit c of the mitochondrial complex, and subunit IV is
similar in sequence to subunit a. CF₁, the site of ATP synthesis, has a
subunit composition α₃β₃γδε. The β subunits contain the catalytic sites,
similarly to the F₁ subunit of mitochondrial ATP synthase. Remarkably,
the β subunits of ATP synthase in corn chloroplasts are more than 60%
identical in amino acid sequence with those of human ATP synthase,
despite the passage of approximately 1 billion years since the separation of
the plant and animal kingdoms.

Note that the membrane orientation of CF₁–CF₀ is reversed compared
with that of the mitochondrial ATP synthase (Figure 19.25). However, the
functional orientation of the two synthases is identical: protons flow from
the lumen through the enzyme to the stroma or matrix where ATP is
synthesized. Because CF₁ is on the stromal surface of the thylakoid
membrane, the newly synthesized ATP is released directly into the stromal
space. Likewise, NADPH formed by photosystem I is released into the
stromal space. Thus, ATP and NADPH, the products of the light reactions of
photosynthesis, are appropriately positioned for the subsequent dark reactions,
in which CO₂ is converted into carbohydrate.

Figure 19.25 Comparison of photosynthesis and oxidative phosphorylation. The
light-induced electron transfer in photosynthesis drives protons into the thylakoid lumen. The excess
protons flow out of the lumen through ATP synthase to generate ATP in the stroma. In oxidative
phosphorylation, electron flow down the electron-transport chain pumps protons out of the
mitochondrial matrix. Excess protons from the intermembrane space flow into the matrix through
ATP synthase to generate ATP in the matrix.


Cyclic electron flow through photosystem I leads to the
production of ATP instead of NADPH


On occasion, when the ratio of NADPH to NADP⁺ is very high, as might
be the case if there were another source of electrons to form NADPH
(Section 20.3), NADP⁺ may be unavailable to accept electrons from
reduced ferredoxin. In this case, electrons arising from P700, the reaction
center of photosystem I, may take an alternative pathway that does not
end at NADPH. The electron in reduced ferredoxin is transferred to the
cytochrome bf complex rather than to NADP⁺. This electron then flows
back through the cytochrome bf complex to reduce plastocyanin, which can
then be reoxidized by P700⁺ to complete a cycle. The net outcome of this
cyclic flow of electrons is the pumping of protons by the cytochrome bf
complex. The resulting proton gradient then drives the synthesis of ATP.
In this process, called cyclic photophosphorylation, ATP is generated without
the concomitant formation of NADPH (Figure 19.26). Photosystem II does
not participate in cyclic photophosphorylation, and so O₂ is not formed
from H₂O.

Figure 19.26 Cyclic photophosphorylation. (A) In this pathway, electrons from reduced
ferredoxin are transferred to cytochrome bf rather than to ferredoxin-NADP⁺ reductase. The flow of
electrons through cytochrome bf pumps protons into the thylakoid lumen. These protons flow
through ATP synthase to generate ATP. Neither NADPH nor O₂ is generated by this pathway. (B) A
scheme showing the energetic basis for cyclic photophosphorylation. Abbreviations: Fd, ferredoxin;
Pc, plastocyanin.


The absorption of eight photons yields one O2, two NADPH, 
and three ATP molecules 


We can now estimate the overall stoichiometry for the light reactions. The 
absorption of four photons by photosystem II generates one molecule of O2 
and releases 4 protons into the thylakoid lumen. The two molecules of plas- 
toquinol are oxidized by the Q cycle of the cytochrome bf complex to release 
8 protons into the lumen. Finally, the electrons from four molecules of 
reduced plastocyanin are driven to ferredoxin by the absorption of four 
additional photons. The four molecules of reduced ferredoxin generate two 
molecules of NADPH. Thus, the overall reaction is 


$$2\,\mathrm{H_2O} + 2\,\mathrm{NADP^+} + 10\,\mathrm{H^+_{stroma}} \longrightarrow \mathrm{O_2} + 2\,\mathrm{NADPH} + 12\,\mathrm{H^+_{lumen}}$$


The 12 protons released in the lumen can then flow through ATP synthase.
Given that there are apparently 12 subunit III components in CF₀, we
expect that 12 protons must pass through CF₀ to complete one full rotation
of CF₁. A single rotation generates three molecules of ATP. Given the ratio
of 3 ATP for 12 protons, the overall reaction is


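$$
\begin{aligned}
2\,\mathrm{H_2O} + 2\,\mathrm{NADP^+} + 10\,\mathrm{H^+_{stroma}} &\longrightarrow \mathrm{O_2} + 2\,\mathrm{NADPH} + 12\,\mathrm{H^+_{lumen}} \\
3\,\mathrm{ADP^{3-}} + 3\,\mathrm{P_i^{2-}} + 3\,\mathrm{H^+} + 12\,\mathrm{H^+_{lumen}} &\longrightarrow 3\,\mathrm{ATP^{4-}} + 3\,\mathrm{H_2O} + 12\,\mathrm{H^+_{stroma}} \\
2\,\mathrm{NADP^+} + 3\,\mathrm{ADP^{3-}} + 3\,\mathrm{P_i^{2-}} + \mathrm{H^+} &\longrightarrow \mathrm{O_2} + 2\,\mathrm{NADPH} + 3\,\mathrm{ATP^{4-}} + \mathrm{H_2O}
\end{aligned}
$$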


Thus, eight photons are required to yield three molecules of ATP (2.7 pho- 
tons/ATP). 


Cyclic photophosphorylation is a somewhat more productive way to 
synthesize ATP. The absorption of four photons by photosystem I leads to 
the release of 8 protons into the lumen by the cytochrome bf system. These 
protons flow through ATP synthase to yield two molecules of ATP. Thus, 
each two absorbed photons yield one molecule of ATP. No NADPH is 
produced. 
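
The bookkeeping for the two modes of electron flow can be tallied in a few lines. This is a minimal sketch that takes the figures stated in the text at face value (12 copies of subunit III per CF₀ ring, 3 ATP per rotation); the function and variable names are illustrative and not part of the source.

```python
from fractions import Fraction

SUBUNIT_III_PER_RING = 12   # protons needed per full rotation (estimate from the text)
ATP_PER_ROTATION = 3        # ATP released per full rotation of CF1


def atp_from_protons(protons):
    """ATP formed when `protons` flow through ATP synthase."""
    return Fraction(protons) * ATP_PER_ROTATION / SUBUNIT_III_PER_RING


def noncyclic():
    """Linear electron flow: 8 photons, 1 O2, 2 NADPH."""
    photons = 8
    protons = 4 + 8          # 4 from water oxidation, 8 from the Q cycle
    atp = atp_from_protons(protons)
    return {"photons": photons, "protons": protons, "atp": atp,
            "nadph": 2, "o2": 1, "photons_per_atp": photons / atp}


def cyclic():
    """Cyclic flow through photosystem I: no NADPH, no O2."""
    photons = 4
    protons = 8              # released by the cytochrome bf system
    atp = atp_from_protons(protons)
    return {"photons": photons, "protons": protons, "atp": atp,
            "nadph": 0, "o2": 0, "photons_per_atp": photons / atp}


for name, tally in (("noncyclic", noncyclic()), ("cyclic", cyclic())):
    print(name, float(tally["atp"]), "ATP,",
          round(float(tally["photons_per_atp"]), 1), "photons per ATP")
```

Using exact fractions keeps the 8/3 ratio from being rounded before it is displayed, which makes the comparison between 2.7 and 2 photons per ATP explicit.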


19.5 Accessory Pigments Funnel Energy into 
Reaction Centers 


A light-harvesting system that relied only on the chlorophyll a molecules 
of the special pair would be rather inefficient for two reasons. First, chloro- 
phyll a molecules absorb light only at specific wavelengths (see Figure 19.6). 
A large gap is present in the middle of the visible region between approxi- 
mately 450 and 650 nm. This gap falls right at the peak of the solar spec- 
trum, and so failure to collect this light would constitute a considerable lost 
opportunity. Second, even on a cloudless day, many photons that can be 
absorbed by chlorophyll a pass through the chloroplast without being 
absorbed, because the density of chlorophyll a molecules in a reaction center 
is not very great. Accessory pigments, both additional chlorophylls and 
other classes of molecules, are closely associated with reaction centers. These 
pigments absorb light and funnel the energy to the reaction center for conversion 
into chemical forms. Accessory pigments prevent the reaction center from 
sitting idle. 


Resonance energy transfer allows energy to move from the site 
of initial absorbance to the reaction center 


How is energy funneled from an associated pigment to a reaction center?
The absorption of a photon does not always lead to electron excitation and
transfer. More commonly, excitation energy is transferred from one
molecule to a nearby molecule through electromagnetic interactions through
space (Figure 19.27). The rate of this process, called resonance energy
transfer, depends strongly on the distance between the energy-donor and
the energy-acceptor molecules; an increase in the distance between the
donor and the acceptor by a factor of two typically results in a decrease in
the energy-transfer rate by a factor of 2⁶ = 64. For reasons of conservation
of energy, energy transfer must be from a donor in the excited state to
an acceptor of equal or lower energy. The excited state of the special pair of
chlorophyll molecules is lower in energy than that of single chlorophyll
molecules, allowing reaction centers to trap the energy transferred from other
molecules (Figure 19.28).

Figure 19.27 Resonance energy transfer. (1) An electron can accept energy from
electromagnetic radiation of appropriate wavelength and jump to a higher energy state. (2) When the
excited electron falls back to its lower energy state, the absorbed energy is released. (3) The
released energy can be absorbed by an electron in a nearby molecule, and this electron jumps to a
high energy state.

Figure 19.28 Energy transfer from accessory pigments to reaction centers. Light
energy absorbed by accessory chlorophyll molecules or other pigments can be
transferred to reaction centers, where it drives photoinduced charge separation.
The green squares represent accessory chlorophyll molecules and the red squares
represent carotenoid molecules; the white squares designate protein.

Figure 19.29 Absorption spectra of chlorophylls a and b.
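
The steep distance dependence of resonance energy transfer is easy to explore numerically. The sketch below assumes the idealized inverse-sixth-power relation implied by the factor of 2⁶ quoted in the text; the function name is hypothetical, and real pigment pairs also depend on orientation and spectral overlap, which are ignored here.

```python
def relative_transfer_rate(distance, reference_distance):
    """Energy-transfer rate at `distance`, relative to the rate at
    `reference_distance`, assuming rate is proportional to 1 / r**6."""
    if distance <= 0 or reference_distance <= 0:
        raise ValueError("distances must be positive")
    return (reference_distance / distance) ** 6


# Doubling the donor-acceptor separation slows transfer 64-fold.
print(1 / relative_transfer_rate(2.0, 1.0))

# A 50% increase in separation already slows transfer more than 11-fold.
print(round(1 / relative_transfer_rate(1.5, 1.0), 1))
```

Because only the ratio of the two distances matters, any length unit can be used as long as both arguments share it.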


Light-harvesting complexes contain additional
chlorophylls and carotenoids


Chlorophyll b and carotenoids are important light-harvesting molecules that 
funnel energy to the reaction center. Chlorophyll b differs from chlorophyll
a in having a formyl group in place of a methyl group. This small difference
shifts its two major absorption peaks toward the center of the visible region. 
In particular, chlorophyll b efficiently absorbs light with wavelengths 
between 450 and 500 nm (Figure 19.29). 

Carotenoids are extended polyenes that absorb light between 400 and 
500 nm. The carotenoids are responsible for most of the yellow and red 
colors of fruits and flowers, and they provide the brilliance of fall, when the 
chlorophyll molecules degrade, revealing the carotenoids. 


[Structures of lycopene and β-carotene.]


In addition to their role in transferring energy to reaction centers, the 
carotenoids serve a safeguarding function. Carotenoids suppress damaging 
photochemical reactions, particularly those including oxygen that can be 
induced by bright sunlight. This protection may be especially important in 
the fall when the primary pigment chlorophyll is being degraded and thus 
not able to absorb light energy. Plants lacking carotenoids are quickly killed 
on exposure to light and oxygen. 


The accessory pigments are arranged in numerous light- 
harvesting complexes that completely surround the reaction 
center. The 26-kd subunit of light-harvesting complex II 
(LHC-II) is the most abundant membrane protein in chlo- 
roplasts. This subunit binds seven chlorophyll a molecules, 
six chlorophyll b molecules, and two carotenoid molecules. 
Similar light-harvesting assemblies exist in photosynthetic 
bacteria (Figure 19.30). 




The components of photosynthesis are highly organized 


The complexity of photosynthesis, seen already in the
elaborate interplay of complex components, extends even to
the placement of the components in the thylakoid
membranes. Thylakoid membranes of most plants are differentiated
into stacked (appressed) and unstacked (nonappressed)
regions (see Figures 19.2 and 19.3). Stacking increases the
amount of thylakoid membrane in a given chloroplast
volume. Both regions surround a common internal thylakoid
space, but only unstacked regions make direct contact with
the chloroplast stroma. Stacked and unstacked regions differ
in the nature of their photosynthetic assemblies (Figure 19.31).
Photosystem I and ATP synthase are located almost
exclusively in unstacked regions, whereas photosystem II is
present mostly in stacked regions. The cytochrome bf complex is
found in both regions. Indeed, this complex rapidly moves
back and forth between the stacked and the unstacked
regions. Plastoquinone and plastocyanin are the mobile
carriers of electrons between assemblies located in different regions of the
thylakoid membrane. A common internal thylakoid space enables protons
liberated by photosystem II in stacked membranes to be utilized by ATP
synthase molecules that are located far away in unstacked membranes.

Figure 19.30 Structure of a bacterial light-harvesting complex. Eight
polypeptides, each of which binds three chlorophyll molecules (green) and a
carotenoid molecule (red), surround a central cavity that contains the reaction
center (not shown). Notice the high concentration of accessory pigments that
surround the reaction center. [Drawn from 1LGH.pdb.]

What is the functional significance of this lateral differentiation of the 
thylakoid membrane system? The positioning of photosystem I in the 
unstacked membranes also gives it direct access to the stroma for the
reduction of NADP⁺. ATP synthase, too, is located in the unstacked region to
provide space for its large CF₁ globule and to give access to ADP. In
contrast, the tight quarters of the appressed region pose no problem for 

Figure 19.31 Location of photosynthesis components. Photosynthetic assemblies are
differentially distributed in the stacked (appressed) and unstacked (nonappressed) regions of 
thylakoid membranes. [After a drawing kindly provided by Dr. Jan M. Anderson and Dr. Bertil 
Andersson.] 


Many herbicides inhibit the light reactions of photosynthesis


Many commercial herbicides kill weeds by interfering with the action of
photosystem II or photosystem I. Inhibitors of photosystem II block
electron flow, whereas inhibitors of photosystem I divert electrons from the
terminal part of this photosystem. Photosystem II inhibitors include urea
derivatives such as diuron and triazine derivatives such as atrazine. These
chemicals bind to the QB site of the D1 subunit of photosystem II and block
the formation of plastoquinol (QH₂).

[Structures of diuron and atrazine.]

Paraquat (1,1′-dimethyl-4,4′-bipyridinium) is an inhibitor of
photosystem I. Paraquat, a dication, can accept electrons from photosystem I to
become a radical. This radical reacts with O₂ to produce reactive oxygen
species such as superoxide (O₂⁻) and hydroxyl radical (OH•). Such reactive
oxygen species react with double bonds in membrane lipids, damaging the
membrane.


19.6 The Ability to Convert Light into Chemical Energy
Is Ancient


The ability to convert light energy into chemical energy is a tremendous
evolutionary advantage. Geological evidence suggests that oxygenic pho- 
tosynthesis became important approximately 2 billion years ago. Anoxy- 
genic photosynthetic systems arose much earlier in the 3.5-billion-year history 
of life on Earth (Table 19.1). The photosynthetic system of the nonsulfur 
purple bacterium Rhodopseudomonas viridis has most features common to 
oxygenic photosynthetic systems and clearly predates them. Green sulfur 
bacteria such as Chlorobium thiosulfatophilum carry out a reaction that also 
seems to have appeared before oxygenic photosynthesis and is even more 
similar to oxygenic photosynthesis than R. viridis is. Reduced sulfur species 
such as H₂S are electron donors in the overall photosynthetic reaction:


$$\mathrm{CO_2} + 2\,\mathrm{H_2S} \xrightarrow{\text{Light}} (\mathrm{CH_2O}) + 2\,\mathrm{S} + \mathrm{H_2O}$$


Nonetheless, photosynthesis did not evolve immediately at the origin of 
life. No photosynthetic organisms have been discovered in the domain of 
Archaea, implying that photosynthesis evolved in the domain of Bacteria 
after Archaea and Bacteria diverged from a common ancestor. All domains 
of life do have electron-transport chains in common, however. As we have 
seen, components such as the ubiquinone–cytochrome c oxidoreductase
and cytochrome bf family are present in both respiratory and photosyn- 
thetic electron-transport chains. These components were the foundations 
on which light-energy-capturing systems evolved. 


Table 19.1 Major groups of photosynthetic prokaryotes


Bacteria          Photosynthetic electron donor              O₂ use

Green sulfur      H₂, H₂S, S                                 Anoxygenic
Green nonsulfur   Variety of amino acids and organic acids   Anoxygenic
Purple sulfur     H₂, H₂S, S                                 Anoxygenic
Purple nonsulfur  Usually organic molecules                  Anoxygenic
Cyanobacteria     H₂O                                        Oxygenic


Photosynthesis Takes Place in Chloroplasts

The proteins that participate in the light reactions of photosynthesis
are located in the thylakoid membranes of chloroplasts. The light
reactions result in (1) the creation of reducing power for the production of
NADPH, (2) the generation of a transmembrane proton gradient for
the formation of ATP, and (3) the production of O₂.


Light Absorption by Chlorophyll Induces Electron Transfer 
Chlorophyll molecules—tetrapyrroles with a central magnesium ion— 
absorb light quite efficiently because they are polyenes. An electron 
excited to a high-energy state by the absorption of a photon can move 
to nearby electron acceptors. In photosynthesis, an excited electron 
leaves a pair of associated chlorophyll molecules known as the special 
pair. The functional core of photosynthesis, a reaction center, from a 
photosynthetic bacterium has been studied in great detail. In this sys- 
tem, the electron moves from the special pair (containing bacteri- 
ochlorophyll) to a bacteriopheophytin (a bacteriochlorophyll lacking 
the central magnesium ion) to quinones. The reduction of quinones 
leads to the generation of a proton gradient, which drives ATP synthe- 
sis in a manner analogous to that of oxidative phosphorylation. 


Two Photosystems Generate a Proton Gradient and NADPH in 
Oxygenic Photosynthesis 

Photosynthesis in green plants is mediated by two linked photosys- 
tems. In photosystem II, the excitation of a special pair of chlorophyll 
molecules called P680 leads to electron transfer to plastoquinone in a 
manner analogous to that of the bacterial reaction center. The elec- 
trons are replenished by the extraction of electrons from a water mol- 
ecule at a center containing four manganese ions. One molecule of O2 
is generated at this center for each four electrons transferred. The plas- 
toquinol produced at photosystem II is reoxidized by the cytochrome 
bf complex, which transfers the electrons to plastocyanin, a soluble 
copper protein. From plastocyanin, the electrons enter photosystem I. 
In photosystem I, the excitation of special pair P700 releases electrons 
that flow to ferredoxin, a powerful reductant. Ferredoxin-NADP⁺
reductase, a flavoprotein located on the stromal side of the membrane,
then catalyzes the formation of NADPH. A proton gradient is
generated as electrons pass through photosystem II, through the cytochrome
bf complex, and through ferredoxin-NADP⁺ reductase.


A Proton Gradient Across the Thylakoid Membrane
Drives ATP Synthesis

The proton gradient across the thylakoid membrane creates a
proton-motive force, used by ATP synthase to form ATP. The ATP synthase
of chloroplasts (also called CF₀–CF₁) closely resembles the
ATP-synthesizing assemblies of bacteria and mitochondria (F₀–F₁). If the
NADPH:NADP⁺ ratio is high, electrons transferred to ferredoxin by
photosystem I can reenter the cytochrome bf complex. This process,
called cyclic photophosphorylation, leads to the generation of a proton
gradient by the cytochrome bf complex without the formation of
NADPH or O₂.


Accessory Pigments Funnel Energy into Reaction Centers 

Light-harvesting complexes that surround the reaction centers
contain additional molecules of chlorophyll a, as well as carotenoids and
chlorophyll b molecules, which absorb light in the center of the visible
spectrum. These accessory pigments increase the efficiency of light
capture by absorbing light and transferring the energy to reaction
centers through resonance energy transfer.


19.6 The Ability to Convert Light into Chemical Energy Is Ancient
The photosystems have structural features in common that suggest a
common evolutionary origin. Similarities in organization and
molecular structure to those of oxidative phosphorylation suggest that the
photosynthetic apparatus evolved from an early energy-transduction
system.


photoinduced charge separation (p. 569)
reaction center (p. 569)


Problems


1. Complementary powers. Photosystem I produces a
powerful reductant, whereas photosystem II produces a
powerful oxidant. Identify the reductant and oxidant and describe
their roles.


2. If a little is good. What is the advantage of having an 
extensive set of thylakoid membranes in the chloroplasts? 


3. Cooperation. Explain how light-harvesting complexes 
enhance the efficiency of photosynthesis. 


4. One thing leads to another. What is the ultimate electron
acceptor in photosynthesis? The ultimate electron donor? 
What powers the electron flow between the donor and the 
acceptor? 


5. Neutralization compensation. In chloroplasts, a greater 
pH gradient across the thylakoid membrane is required to 
power the synthesis of ATP than is required across the 
mitochondrial inner membrane. Explain this difference. 


6. Environmentally appropriate. Chlorophyll is a hydropho- 
bic molecule. Why is this property crucial for the function 
of chlorophyll? 


7. Proton origins. What are the various sources of protons 
that contribute to the generation of a proton gradient in 
chloroplasts? 


8. Efficiency matters. What fraction of the energy of 700-nm 
light absorbed by photosystem I is trapped as high-energy 
electrons? 


9. That’s not right. Explain the defect or defects in the 
hypothetical scheme for the light reactions of photosynthe- 
sis depicted here. 


10. Electron transfer. Calculate the ΔE′₀ and ΔG°′ for the
reduction of NADP⁺ by ferredoxin. Use data given in
Table 18.1.


11. To boldly go. (a) It can be argued that, if life were to exist 
elsewhere in the universe, it would require some process 
like photosynthesis. Why is this argument reasonable? (b) If 
the Enterprise were to land on a distant planet and find no
measurable oxygen in the atmosphere, could the crew con- 
clude that photosynthesis is not taking place? 


12. Weed killer 1. Dichlorophenyldimethylurea (DCMU), 
a herbicide, interferes with photophosphorylation and O₂
evolution. However, it does not block O₂ evolution in the
presence of an artificial electron acceptor such as ferricya- 
nide. Propose a site for the inhibitory action of DCMU. 


13. Weed killer 2. Predict the effect of the herbicide dichlo- 
rophenyldimethylurea (DCMU) on a plant’s ability to per- 
form cyclic photophosphorylation. 


14. Infrared harvest. Consider the relation between the 
energy of a photon and its wavelength. 


(a) Some bacteria are able to harvest 1000-nm light. What 
is the energy (in kilojoules or kilocalories) of a mole (also 
called an einstein) of 1000-nm photons? 


(b) What is the maximum increase in redox potential that 
can be induced by a 1000-nm photon? 


(c) What is the minimum number of 1000-nm photons 
needed to form ATP from ADP and Pᵢ? Assume a ΔG of
50 kJ mol⁻¹ (12 kcal mol⁻¹) for the phosphorylation reaction.


15. Missing acceptors. Suppose that a bacterial reaction 
center containing only the special pair and the quinones has 
been prepared. Given the separation of 22 Å between the
special pair and the closest quinone, estimate the rate of 
electron transfer between the excited special pair and this 
quinone. 


16. Close approach. Suppose that energy transfer between 
two chlorophyll a molecules separated by 10 Å takes place
in 10 picoseconds. Suppose that this distance is increased to
20 Å with all other factors remaining the same. How long
would energy transfer take? 


Chapter Integration Problems 


17. Functional equivalents. What structural feature of mito- 
chondria corresponds to the thylakoid membranes? 


18. Compare and contrast. Compare and contrast oxidative 
phosphorylation and photosynthesis. 


19. Energy accounts. On page 580, the balance sheet for the 
cost of the photosynthetically powered synthesis of glucose 
is presented. Eighteen molecules of ATP are required. Yet, 
when glucose undergoes combustion in cellular respira- 
tion, 30 molecules of ATP are produced. Account for the 
difference. 


20. Looking for a place to rest. Albert Szent-Györgyi,
Nobel Prize-winning biochemist, once said something to 
the effect: Life is nothing more than an electron looking for 
a place to rest. Explain how this pithy statement applies to 
photosynthesis and cellular respiration. 


Mechanism Problem 


21. Hill reaction. In 1939, Robert Hill discovered that
chloroplasts evolve O₂ when they are illuminated in the
presence of an artificial electron acceptor such as ferricyanide
[Fe³⁺(CN)₆]³⁻. Ferricyanide is reduced to ferrocyanide
[Fe²⁺(CN)₆]⁴⁻ in this process. No NADPH or reduced
plastocyanin is produced. Propose a mechanism for the 
Hill reaction. 


Data Interpretation and Chapter Integration Problem 


22. The same, but different. The α₃β₃γ complex of
mitochondrial or chloroplast ATP synthase will function as an
ATPase in vitro. The chloroplast enzyme (both synthase
and ATPase activity) is sensitive to redox control, whereas
the mitochondrial enzyme is not. To determine where the
enzymes differ, a segment of the mitochondrial γ subunit
was removed and replaced with the equivalent segment
from the chloroplast γ subunit. The ATPase activity of the
modified enzyme was then measured as a function of redox
conditions.


(a) What is the redox regulator of the ATP synthase in 
vivo? The adjoining graph shows the ATPase activity 
of modified and control enzymes under various redox 
conditions. 


[Graph: ATPase activity of the modified and control enzymes under various redox
conditions; the one surviving curve label reads "Modified enzyme and thioredoxin".]
[Data from D. Bald et al., J. Biol. Chem. 275:12757–12762, 2000.]


(b) What is the effect of increasing the reducing power of 
the reaction mixture for the control and the modified 
enzymes? 

(c) What is the effect of the addition of thioredoxin? How 
do these results differ from those in the presence of DTT 
alone? Suggest a possible explanation for the difference. 

(d) Did the researchers succeed in identifying the region of
the γ subunit responsible for redox regulation?

(e) What is the biological rationale of regulation by high
concentrations of reducing agents?

(f) What amino acids in the γ subunit are most likely
affected by the reducing conditions?

(g) What experiments might confirm your answer to 
part e? 


CHAPTER 20 The Calvin Cycle and the
Pentose Phosphate Pathway


Atmospheric carbon dioxide measurements at Mauna
Loa, Hawaii. These measurements show annual
cycles resulting from seasonal variation in carbon
dioxide fixation by the Calvin cycle in terrestrial plants.
Much of this fixation takes place in rain forests, which
account for approximately 50% of terrestrial fixation.
[Dennis Potokar/Photo Researchers.]


Photosynthesis proceeds in two parts: the light reactions and the dark
reactions. The light reactions, discussed in Chapter 19, transform light
energy into ATP and biosynthetic reducing power, NADPH. The dark
reactions use the ATP and NADPH produced by the light reactions to
reduce carbon atoms from their fully oxidized state as carbon dioxide to a
more reduced state as a hexose. Carbon dioxide is thereby trapped in a form
that is useful for many processes and most especially as a fuel. Together, the
light reactions and dark reactions of photosynthesis cooperate to transform light
energy into carbon fuel. The dark reactions are also called the Calvin cycle,
after Melvin Calvin, the biochemist who elucidated the pathway. The
components of the Calvin cycle are called the dark reactions because, in contrast
with the light reactions, these reactions do not directly depend on the
presence of light.

The second half of this chapter examines a pathway common to all
organisms, known variously as the pentose phosphate pathway, the hexose
monophosphate pathway, the phosphogluconate pathway, or the pentose
shunt. The pathway provides a means by which glucose can be oxidized to
generate NADPH, the currency of readily available reducing power in cells.
The phosphoryl group on the 2′-hydroxyl group of one of the ribose units
of NADPH distinguishes NADPH from NADH. There is a fundamental
distinction between NADPH and NADH in biochemistry: NADH is oxidized
by the respiratory chain to generate ATP, whereas NADPH serves as a
reductant in biosynthetic processes. The pentose phosphate pathway can also be
used for the catabolism of pentose sugars from the diet, the synthesis of
pentose sugars for nucleotide biosynthesis, and the catabolism and
synthesis of less common four- and seven-carbon sugars. The pentose phosphate
pathway and the Calvin cycle have in common several enzymes and
intermediates that attest to an evolutionary kinship. Like glycolysis and
gluconeogenesis, these pathways are mirror images of each other: the Calvin cycle
uses NADPH to reduce carbon dioxide to generate hexoses, whereas the
pentose phosphate pathway breaks down glucose into carbon dioxide to
generate NADPH.

Figure 20.1 Calvin cycle. The Calvin cycle consists of three stages. Stage 1 is
the fixation of carbon by the carboxylation of ribulose 1,5-bisphosphate. Stage 2
is the reduction of the fixed carbon to begin the synthesis of hexose. Stage 3 is
the regeneration of the starting compound, ribulose 1,5-bisphosphate.


20.1 The Calvin Cycle Synthesizes Hexoses from 
Carbon Dioxide and Water 


As stated in Chapter 16, glucose can be formed from noncarbohydrate pre- 
cursors, such as lactate and amino acids, by gluconeogenesis. The energy 
powering gluconeogenesis ultimately comes from previous catabolism of 
carbon fuels. In contrast, photosynthetic organisms can use the Calvin cycle 
to synthesize glucose from carbon dioxide gas and water, by using sunlight 
as an energy source. The Calvin cycle introduces into life all of the carbon 
atoms that will be used as fuel and as the carbon backbones of biomolecules. 
Photosynthetic organisms are called autotrophs (literally, "self-feeders") 
because they can convert sunlight into chemical energy, which they subse- 
quently use to power their biosynthetic processes. Organisms that obtain 
energy from chemical fuels only are called heterotrophs, and such organisms 
ultimately depend on autotrophs for their fuel. 
The Calvin cycle comprises three stages (Figure 20.1): 
1. The fixation of CO2 by ribulose 1,5-bisphosphate to form two molecules 
of 3-phosphoglycerate; 


2. The reduction of 3-phosphoglycerate to form hexose sugars; and 


3. The regeneration of ribulose 1,5-bisphosphate so that more CO2 can be 
fixed. 


This set of reactions takes place in the stroma of chloroplasts, the photo- 
synthetic organelles. 


Carbon dioxide reacts with ribulose 1,5-bisphosphate to form two 
molecules of 3-phosphoglycerate 


The first step in the Calvin cycle is the fixation of CO2. This begins with the 
conversion of ribulose 1,5-bisphosphate into a highly reactive enediol 
intermediate. The CO2 molecule condenses with the enediol intermediate to 
form an unstable six-carbon compound, which is rapidly hydrolyzed to two 
molecules of 3-phosphoglycerate. 

This highly exergonic reaction [ΔG°′ = −51.9 kJ mol⁻¹ (−12.4 kcal mol⁻¹)] 
is catalyzed by ribulose 1,5-bisphosphate carboxylase/oxygenase 
(usually called rubisco), an enzyme located on the stromal surface of the 
thylakoid membranes of chloroplasts. This important reaction is the rate- 
limiting step in hexose synthesis. Rubisco in chloroplasts consists of eight 
large (L, 55-kd) subunits and eight small (S, 13-kd) ones (Figure 20.2). 
Each L chain contains a catalytic site and a regulatory site. The S chains 
enhance the catalytic activity of the L chains. This enzyme is abundant in 
chloroplasts, accounting for approximately 30% of the total leaf protein in 
some plants. In fact, rubisco is the most abundant enzyme and probably the 
most abundant protein in the biosphere. Large amounts are present because 
rubisco is a slow enzyme; its maximal catalytic rate is only 3 s⁻¹. 


Figure 20.2 Structure of rubisco. The 
enzyme ribulose 1,5-bisphosphate carboxylase/ 
oxygenase (rubisco) comprises eight large 
subunits (one shown in red and the others in 
yellow) and eight small subunits (one shown in 
blue and the others in white). The active sites lie 
in the large subunits. [Drawn from 1RXO.pdb.] 
Figure 20.3 Role of the magnesium ion 
in the rubisco mechanism. Ribulose 
1,5-bisphosphate binds to a magnesium ion 
that is linked to rubisco through a glutamate 
residue, an aspartate residue, and the lysine 
carbamate. The coordinated ribulose 
1,5-bisphosphate gives up a proton to form a 
reactive enediolate species that reacts with 
CO2 to form a new carbon-carbon bond. 


Figure 20.4 Formation of 
3-phosphoglycerate. The overall pathway 
for the conversion of ribulose 1,5-bisphosphate 
and CO2 into two molecules of 
3-phosphoglycerate. Although the free species 
are shown, these steps take place on the 


Rubisco activity depends on magnesium and carbamate 


Rubisco requires a bound divalent metal ion for activity, usually magnesium 
ion. Like the zinc ion in the active site of carbonic anhydrase (Section 9.2), 
this metal ion serves to activate a bound substrate molecule by stabilizing a 
negative charge. Interestingly, a CO2 molecule other than the substrate is 
required to complete the assembly of the Mg²⁺-binding site in rubisco. 
This CO2 molecule adds to the uncharged ε-amino group of lysine 201 to 
form a carbamate. This negatively charged adduct then binds the Mg²⁺ ion. 
The formation of the carbamate is facilitated by the enzyme rubisco activase, 
although it will also form spontaneously at a lower rate. 



The metal center plays a key role in binding ribulose 1,5-bisphosphate 
and activating it so that it will react with CO2 (Figure 20.3). Ribulose 
1,5-bisphosphate binds to Mg²⁺ through its keto group and an adjacent 
hydroxyl group. This complex is readily deprotonated to form an enediolate 
intermediate. This reactive species, analogous to the zinc-hydroxide species 
in carbonic anhydrase, couples with CO2, forming the new carbon-carbon 
bond. The resulting product is coordinated to the Mg²⁺ ion through three 
groups, including the newly formed carboxylate. A molecule of H2O is then 
added to this β-ketoacid to form an intermediate that cleaves to form two 
molecules of 3-phosphoglycerate (Figure 20.4). 
Rubisco also catalyzes a wasteful oxygenase reaction: 
Catalytic imperfection 


The reactive intermediate generated on the Mg²⁺ ion sometimes reacts with 
O2 instead of CO2. Thus, rubisco also catalyzes a deleterious oxygenase 
reaction. The products of this reaction are phosphoglycolate and 
3-phosphoglycerate (Figure 20.5). The rate of the carboxylase reaction is four times 
that of the oxygenase reaction under normal atmospheric conditions at 
25°C; the stromal concentration of CO2 is then 10 μM and that of O2 is 
250 μM. The oxygenase reaction, like the carboxylase reaction, requires 
that lysine 201 be in the carbamate form. Because this carbamate forms only 
in the presence of CO2, rubisco is prevented from catalyzing the oxygenase 
reaction exclusively when CO2 is absent. 
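The figures quoted above can be turned into a rough estimate of how strongly rubisco prefers CO2 once the unequal gas concentrations are taken into account. The sketch below is illustrative only: the "specificity factor" formula is the conventional definition and is an assumption here, not something stated in this chapter, and the function names are hypothetical.

```python
# Values quoted in the text for 25 °C under normal atmospheric conditions.
STROMAL_CO2_UM = 10.0                # stromal CO2, micromolar
STROMAL_O2_UM = 250.0                # stromal O2, micromolar
CARBOXYLASE_TO_OXYGENASE_RATE = 4.0  # v_carboxylase / v_oxygenase


def specificity_factor(rate_ratio: float, co2: float, o2: float) -> float:
    """Rate ratio corrected for substrate concentrations.

    Assumed definition: (v_c / v_o) * ([O2] / [CO2]).
    """
    return rate_ratio * (o2 / co2)


def fraction_oxygenated(rate_ratio: float) -> float:
    """Fraction of catalytic events that are oxygenations."""
    return 1.0 / (1.0 + rate_ratio)


if __name__ == "__main__":
    omega = specificity_factor(
        CARBOXYLASE_TO_OXYGENASE_RATE, STROMAL_CO2_UM, STROMAL_O2_UM
    )
    print(omega)                                               # 100.0
    print(fraction_oxygenated(CARBOXYLASE_TO_OXYGENASE_RATE))  # 0.2
```

Under these assumptions, about one catalytic event in five is an oxygenation even though the enzyme favors CO2 roughly a hundredfold at equal concentrations, because O2 is 25 times as abundant in the stroma.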


Figure 20.5 A wasteful side reaction. The reactive enediolate 
intermediate on rubisco also reacts with molecular oxygen to form a 
hydroperoxide intermediate, which then proceeds to form one molecule 
of 3-phosphoglycerate and one molecule of phosphoglycolate. 


Phosphoglycolate is not a versatile metabolite. A salvage pathway 
recovers part of its carbon skeleton (Figure 20.6). A specific phosphatase 
converts phosphoglycolate into glycolate, which enters peroxisomes (also 
called microbodies; Figure 20.7). Glycolate is then oxidized to glyoxylate 
Figure 20.6 Photorespiratory reactions. Phosphoglycolate is formed as a product of the 
oxygenase reaction in chloroplasts. After dephosphorylation, glycolate is transported into 
peroxisomes where it is converted into glyoxylate and then glycine. In mitochondria, two glycines are 
converted into serine, after losing a carbon as CO2 and ammonium ion. The ammonium ion is 
salvaged in chloroplasts. 

Figure 20.7 Electron micrograph of a peroxisome nestled between two 
chloroplasts. [Courtesy of Dr. Sue Ellen Frederick] 
Figure 20.8 Hexose phosphate 
formation. 3-Phosphoglycerate is converted 
into fructose 6-phosphate in a pathway 
parallel to that of gluconeogenesis. 


by glycolate oxidase, an enzyme with a flavin mononucleotide prosthetic 
group. The H2O2 produced in this reaction is cleaved by catalase 
to H2O and O2. Transamination of glyoxylate then yields glycine. Two 
glycine molecules can unite to form serine, a potential precursor of glucose, 
with the release of CO2 and ammonium ion (NH4⁺). The ammonium 
ion, used in the synthesis of nitrogen-containing compounds, is 
salvaged by a glutamine synthetase reaction (see Figure 20.6 and Section 
24.3). 

This salvage pathway serves to recycle three of the four carbon atoms of 
two molecules of glycolate. However, one carbon atom is lost as CO2. This 
process is called photorespiration because O2 is consumed and CO2 is 
released. Photorespiration is wasteful because organic carbon is converted 
into CO2 without the production of ATP, NADPH, or another energy-rich 
metabolite. Indeed, photorespiration accounts for the loss of up to 25% of 
the carbon fixed. Evolutionary processes have presumably enhanced the 
preference of rubisco for carboxylation. For instance, the rubisco of higher 
plants is eightfold as specific for carboxylation as that of photosynthetic 
bacteria. 

Much research has been undertaken to generate recombinant forms of 
rubisco that display reduced oxygenase activity, but all such attempts have 
failed. This raises the question, what is the biochemical basis of this inef- 
ficiency? Structural studies show that when the reactive enediol intermedi- 
ate is formed, loops close over the active site to protect the enediol. A 
channel to the environment is maintained to allow access by CO2. 
However, like CO2, O2 is a linear molecule that also fits the channel. In 
essence, the problem lies not with the enzyme but in the unremarkable 
structure of CO2. CO2 lacks any chemical features that would allow 
discrimination between it and other gases such as O2, and thus the oxygenase 
activity is an inevitable failing of the enzyme. Another possibility exists, 
however. The oxygenase activity may not be an imperfection of the 
enzyme, but rather an imperfection in our understanding. Perhaps the 
oxygenase activity performs a biochemically important role that we have 
not yet discovered. 


Hexose phosphates are made from phosphoglycerate, 
and ribulose 1,5-bisphosphate is regenerated 


The 3-phosphoglycerate product of rubisco is next converted into fructose 
6-phosphate, which readily isomerizes to glucose 1-phosphate and glucose 
6-phosphate. The mixture of the three phosphorylated hexoses is called the 
hexose monophosphate pool. The steps in this conversion (Figure 20.8) are 
like those of the gluconeogenic pathway (see Figure 16.24), except that 
glyceraldehyde 3-phosphate dehydrogenase in chloroplasts, which gener- 
ates glyceraldehyde 3-phosphate (GAP), is specific for NADPH rather 
than NADH. These reactions and that catalyzed by rubisco bring CO2 to 
the level of a hexose, converting CO2 into a chemical fuel at the expense of 
NADPH and ATP generated from the light reactions. 

The third phase of the Calvin cycle is the regeneration of ribulose 
1,5-bisphosphate, the acceptor of CO2 in the first step. The problem is to 
construct a five-carbon sugar from six-carbon and three-carbon sugars. A 
transketolase and an aldolase play the major role in the rearrangement of 
the carbon atoms. The transketolase, which we will see again in the pentose 
phosphate pathway, requires the coenzyme thiamine pyrophosphate 
(TPP) to transfer a two-carbon unit (CO-CH2OH) from a ketose to an 
aldose. 


Ketose (n carbons) + Aldose (m carbons) ⇌ Aldose (n − 2 carbons) + Ketose (m + 2 carbons)   (transketolase) 


Aldolase, which we have already encountered in glycolysis (Section 16.1), 
catalyzes an aldol condensation between dihydroxyacetone phosphate 
(DHAP) and an aldehyde. This enzyme is highly specific for 
dihydroxyacetone phosphate, but it accepts a wide variety of aldehydes. 
With these enzymes, the construction of the five-carbon sugar proceeds as 
shown in Figure 20.9. 
Figure 20.9 Formation of five-carbon 
sugars. First, transketolase converts a six- 
carbon sugar and a three-carbon sugar into a 
four-carbon sugar and a five-carbon sugar. 
Then, aldolase combines the four-carbon 
product and a three-carbon sugar to form a 
seven-carbon sugar. Finally, this seven-carbon 
sugar reacts with another three-carbon sugar 
to form two additional five-carbon sugars. 
Figure 20.10 Regeneration of ribulose 
1,5-bisphosphate. Both ribose 5-phosphate 
and xylulose 5-phosphate are converted into 
ribulose 5-phosphate, which is then 
phosphorylated to complete the regeneration 
of ribulose 1,5-bisphosphate. 


Figure 20.11 Calvin cycle. The 
diagram shows the reactions 
necessary with the correct 
stoichiometry to convert three 
molecules of CO2 into one molecule 
of dihydroxyacetone phosphate 
(DHAP). The cycle is not as simple as 
presented in Figure 20.1; rather, it 
entails many reactions that lead 
ultimately to the synthesis of glucose 
and the regeneration of ribulose 
1,5-bisphosphate. [After J. R. Bowyer 
and R. C. Leegood. "Photosynthesis," 
in Plant Biochemistry, P. M. Dey and J. 
B. Harborne, Eds. (Academic Press, 
1997), p. 85] 
Finally, ribose 5-phosphate is converted into ribulose 5-phosphate by 
phosphopentose isomerase, whereas xylulose 5-phosphate is converted into 
ribulose 5-phosphate by phosphopentose epimerase. Ribulose 5-phosphate is 
converted into ribulose 1,5-bisphosphate through the action of phosphoribu- 
lose kinase (Figure 20.10). The sum of these reactions is 


Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate 
+ dihydroxyacetone phosphate + 3 ATP → 
3 ribulose 1,5-bisphosphate + 3 ADP 


This series of reactions completes the Calvin cycle (Figure 20.11). 
Figure 20.11 presents the required reactions with the proper stoichiometry 
to convert three molecules of CO2 into one molecule of dihydroxyacetone 
phosphate (DHAP). However, two molecules of DHAP are required for the 
synthesis of a member of the hexose monophosphate pool. Consequently, 
the cycle as presented must take place twice to yield a hexose monophos- 
phate. The outcome of the Calvin cycle is the generation of a hexose and the 
regeneration of the starting compound, ribulose 1,5-bisphosphate. In 
essence, ribulose 1,5-bisphosphate acts catalytically, similarly to oxaloace- 
tate in the citric acid cycle. 
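The carbon bookkeeping behind the regeneration phase can be checked with a few lines of arithmetic. The sketch below encodes the three rearrangement steps described in the caption of Figure 20.9 as carbon counts only; the names `REARRANGEMENTS` and `is_balanced` are hypothetical, and the enzyme label on the third step is an assumption because the caption does not name it.

```python
# Each step: (enzyme label, substrate carbon counts, product carbon counts).
REARRANGEMENTS = [
    ("transketolase", [6, 3], [4, 5]),            # C6 + C3 -> C4 + C5
    ("aldolase", [4, 3], [7]),                    # C4 + C3 -> C7
    ("transketolase (assumed)", [7, 3], [5, 5]),  # C7 + C3 -> 2 C5
]


def is_balanced(substrates, products):
    """True when no carbon atoms are gained or lost in a step."""
    return sum(substrates) == sum(products)


# Overall input from the summed equation: one fructose 6-phosphate (C6),
# two glyceraldehyde 3-phosphate (C3 each), one dihydroxyacetone phosphate (C3).
overall_in = [6, 3, 3, 3]
# Overall output: three ribulose 1,5-bisphosphate (C5 each).
overall_out = [5, 5, 5]

if __name__ == "__main__":
    for enzyme, subs, prods in REARRANGEMENTS:
        print(enzyme, is_balanced(subs, prods))
    print(sum(overall_in), sum(overall_out))  # 15 15
```

Every step conserves carbon, and the 15 carbon atoms entering as one hexose and three trioses leave as three pentoses, which is why one ATP per pentose (three in total) suffices to complete the regeneration.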


Three ATP and two NADPH molecules are used to bring 
carbon dioxide to the level of a hexose 


What is the energy expenditure for synthesizing a hexose? Six rounds of the 
Calvin cycle are required, because one carbon atom is reduced in each 
round. Twelve molecules of ATP are expended in phosphorylating 12 mol- 
ecules of 3-phosphoglycerate to 1,3-bisphosphoglycerate, and 12 molecules 
of NADPH are consumed in reducing 12 molecules of 1,3-bisphosphoglyc- 
erate to glyceraldehyde 3-phosphate. An additional six molecules of ATP 
are spent in regenerating ribulose 1,5-bisphosphate. 

We can now write a balanced equation for the net reaction of the 
Calvin cycle: 


6 CO2 + 18 ATP + 12 NADPH + 12 H2O → 
C6H12O6 + 18 ADP + 18 Pi + 12 NADP⁺ + 6 H⁺ 


Thus, three molecules of ATP and two molecules of NADPH are consumed 
in incorporating a single CO2 molecule into a hexose such as glucose or 
fructose. 
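The tally in the preceding paragraphs can be reproduced step by step. This is a minimal sketch of the accounting, using only the stoichiometry given in the text; the function name `calvin_cost` and the dictionary keys are hypothetical.

```python
def calvin_cost(n_co2: int = 6) -> dict:
    """ATP and NADPH consumed in fixing n_co2 molecules of CO2.

    Each CO2 fixed yields two 3-phosphoglycerate, each of which needs
    one ATP (phosphorylation) and one NADPH (reduction). Regenerating
    ribulose 1,5-bisphosphate costs one further ATP per CO2.
    """
    phosphoglycerate = 2 * n_co2
    atp_for_phosphorylation = phosphoglycerate
    nadph_for_reduction = phosphoglycerate
    atp_for_regeneration = n_co2
    return {
        "ATP": atp_for_phosphorylation + atp_for_regeneration,
        "NADPH": nadph_for_reduction,
    }


if __name__ == "__main__":
    per_hexose = calvin_cost(6)
    print(per_hexose)      # {'ATP': 18, 'NADPH': 12}
    print(calvin_cost(1))  # {'ATP': 3, 'NADPH': 2}
```

The per-hexose totals match the coefficients in the balanced equation, and the single-CO2 case gives the three ATP and two NADPH stated in the section heading.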


Starch and sucrose are the major carbohydrate stores in plants 


What are the fates of the members of the hexose monophosphate pool? 
These molecules are used in a variety of ways, but there are two primary 
roles. Plants contain two major storage forms of sugar: starch and sucrose. 
Starch, like its animal counterpart glycogen, is a polymer of glucose resi- 
dues, but it is less branched than glycogen because it contains a smaller 
proportion of α-1,6-glycosidic linkages (Section 11.2). Another difference 
is that ADP-glucose, not UDP-glucose, is the activated precursor. Starch is 
synthesized and stored in chloroplasts. 

In contrast, sucrose (common table sugar), a disaccharide, is synthesized 
in the cytoplasm. Plants lack the ability to transport hexose phosphates 
across the chloroplast membrane, but they are able to transport triose 
phosphates from chloroplasts to the cytoplasm. Triose phosphate interme- 
diates such as glyceraldehyde 3-phosphate cross into the cytoplasm in 
exchange for phosphate through the action of an abundant phosphate trans- 
locator. Fructose 6-phosphate formed from triose phosphates joins the 
glucose unit of UDP-glucose to form sucrose 6-phosphate (Figure 20.12). 
The hydrolysis of the phosphate ester yields sucrose, a readily transportable 
and mobilizable sugar that is stored in many plant cells, as in sugar beets 
and sugar cane. 


20.2 The Activity of the Calvin Cycle Depends on 
Environmental Conditions 


How do the light reactions communicate with the dark reactions to regulate 
this crucial process of fixing CO2 into biomolecules? The principal means of 
regulation is alteration of the stromal environment by the light reactions. The 
light reactions lead to an increase in pH and in the stromal concentrations of 
Mg²⁺, NADPH, and reduced ferredoxin, all of which contribute to the 
activation of certain Calvin-cycle enzymes (Figure 20.13). 


Rubisco is activated by light-driven changes in proton 
and magnesium ion concentrations 


As stated earlier, the rate-limiting step in the Calvin cycle is the carboxyla- 
tion of ribulose 1,5-bisphosphate to form two molecules of 3-phosphoglyc- 
erate. The activity of rubisco increases markedly on illumination because light 
facilitates the carbamate formation necessary to enzyme activity. In the 
stroma, the pH increases from 7 to 8, and the level of Mg²⁺ rises. Both 
effects are consequences of the light-driven pumping of protons into the 
thylakoid space. Mg²⁺ ions from the thylakoid space are released into the 
stroma to compensate for the influx of protons. Carbamate formation is 
favored at alkaline pH. CO2 adds to a deprotonated form of lysine 201 of 
rubisco, and Mg²⁺ ion binds to the carbamate to generate the active form of 
the enzyme. Thus, light leads to the generation of regulatory signals as well 
as ATP and NADPH. 


Thioredoxin plays a key role in regulating the Calvin cycle 


Figure 20.13 Light regulation of the 

Light-driven reactions lead to electron transfer from water to ferredoxin 
and, eventually, to NADPH. The presence of reduced ferredoxin and 
NADPH is a good signal that conditions are right for biosynthesis. One 
way in which this information is conveyed to biosynthetic enzymes is by 
thioredoxin, a 12-kd protein containing neighboring cysteine residues that 
cycle between a reduced sulfhydryl and an oxidized disulfide form 
(Figure 20.14). The reduced form of thioredoxin activates many biosyn- 
thetic enzymes by reducing disulfide bridges that control their activity and 
inhibits several degradative enzymes by the same means (Table 20.1). In 
chloroplasts, oxidized thioredoxin is reduced by ferredoxin in a reaction 
catalyzed by ferredoxin-thioredoxin reductase. This enzyme contains a 
4Fe-4S cluster that couples two one-electron oxidations of reduced ferre- 
doxin to the two-electron reduction of thioredoxin. Thus, the activities of the 
light and dark reactions of photosynthesis are coordinated through electron 
transfer from reduced ferredoxin to thioredoxin and then to component enzymes 
containing regulatory disulfide bonds (Figure 20.15). We shall return to thi- 
oredoxin when we consider the reduction of ribonucleotides (Section 25.3). 

NADPH is a signal molecule that activates two biosynthetic enzymes, 
phosphoribulose kinase and glyceraldehyde 3-phosphate dehydrogenase. In 
the dark, these enzymes are inhibited by association with an 8.5-kd protein 
called CP12. NADPH disrupts this association, leading to the release of the 
active enzymes. 


The C4 pathway of tropical plants accelerates photosynthesis 
by concentrating carbon dioxide 


The oxygenase activity of rubisco presents a biochemical challenge to 
tropical plants because the oxygenase activity increases more rapidly with 
temperature than does the carboxylase activity. How, then, do plants, such 
as sugar cane, that grow in hot climates prevent very high rates of wasteful 
photorespiration? Their solution to this problem is to achieve a high local 
concentration of CO2 at the site of the Calvin cycle in their photosynthetic 
cells. The essence of this process, which was elucidated by Marshall 
Davidson Hatch and C. Roger Slack, is that four-carbon (C4) compounds such 
as oxaloacetate and malate carry CO2 from mesophyll cells, which are in 
contact with air, to bundle-sheath cells, which are the major sites of photosynthesis 
(Figure 20.16). The decarboxylation of the four-carbon compound in a 
bundle-sheath cell maintains a high concentration of CO2 at the site of the 
Calvin cycle. The three-carbon product returns to the mesophyll cell for 
another round of carboxylation. 

The C4 pathway for the transport of CO2 starts in a mesophyll cell with 
the condensation of CO2 and phosphoenolpyruvate to form oxaloacetate in 
a reaction catalyzed by phosphoenolpyruvate carboxylase. In some species, 
oxaloacetate is converted into malate by an NADP⁺-linked malate 
dehydrogenase. Malate enters the bundle-sheath cell and is oxidatively 
decarboxylated within the chloroplasts by an NADP⁺-linked malate 
dehydrogenase (decarboxylating), also known as NADP⁺-malic enzyme. 
The released CO2 enters the Calvin cycle in the usual way by 
condensing with ribulose 1,5-bisphosphate. Pyruvate formed in this 
decarboxylation reaction returns to the mesophyll cell. Finally, 
phosphoenolpyruvate is formed from pyruvate by pyruvate-Pi dikinase. 

The net reaction of this C4 pathway is 


CO2 (in mesophyll cell) + ATP + 2 H2O → 
CO2 (in bundle-sheath cell) + AMP + 2 Pi + 2 H⁺ 


Thus, the energetic equivalent of two ATP molecules is consumed in 
transporting CO2 to the chloroplasts of the bundle-sheath cells. In essence, this process 
is active transport: the pumping of CO2 into the bundle-sheath cell is 
driven by the hydrolysis of one molecule of ATP to one molecule of AMP 


Figure 20.14 Thioredoxin. The 
oxidized form of thioredoxin contains a 
disulfide bond. When thioredoxin is reduced 
by reduced ferredoxin, the disulfide bond is 
converted into two free sulfhydryl groups. 
Reduced thioredoxin can cleave disulfide 
bonds in enzymes, activating certain 
Calvin cycle enzymes and inactivating 
some degradative enzymes. [Drawn from 
1F9M.pdb.] 
Figure 20.15 Enzyme activation by 
thioredoxin. Reduced thioredoxin activates 
certain Calvin cycle enzymes by cleaving 
regulatory disulfide bonds. 
Figure 20.16 C4 pathway. Carbon dioxide 
is concentrated in bundle-sheath cells by the 
expenditure of ATP in mesophyll cells. 


Figure 20.17 Electron micrograph of an 
open stoma and a closed stoma. [Herb 
Charles Ohlmeyer/Fran Heyl Associates.] 
and two molecules of orthophosphate. The CO2 concentration can be 
20-fold as great in the bundle-sheath cells as in the mesophyll cells. 

When the C4 pathway and the Calvin cycle operate together, the net 
reaction is 


6 CO2 + 30 ATP + 12 NADPH + 24 H2O → 
C6H12O6 + 30 ADP + 30 Pi + 12 NADP⁺ + 18 H⁺ 


Note that 30 molecules of ATP are consumed per hexose molecule formed 
when the C4 pathway delivers CO2 to the Calvin cycle, in contrast with 
18 molecules of ATP per hexose molecule in the absence of the C4 pathway. 
The high concentration of CO2 in the bundle-sheath cells of C4 plants, 
which is due to the expenditure of the additional 12 molecules of ATP, is 
critical for their rapid photosynthetic rate, because CO2 is limiting when 
light is abundant. A high CO2 concentration also minimizes the energy loss 
caused by photorespiration. 
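The difference between 18 and 30 ATP per hexose follows directly from the two-ATP-equivalent cost of moving each CO2 into the bundle-sheath cell. A minimal sketch of that comparison, using only the stoichiometry in the text (the constant and function names are hypothetical):

```python
CALVIN_ATP_PER_CO2 = 3    # ATP used by the Calvin cycle per CO2 fixed
C4_ATP_EQUIV_PER_CO2 = 2  # ATP -> AMP + 2 Pi counts as two ATP equivalents
CO2_PER_HEXOSE = 6


def atp_per_hexose(uses_c4_pathway: bool) -> int:
    """ATP equivalents consumed per hexose, with or without C4 transport."""
    per_co2 = CALVIN_ATP_PER_CO2
    if uses_c4_pathway:
        per_co2 += C4_ATP_EQUIV_PER_CO2
    return per_co2 * CO2_PER_HEXOSE


if __name__ == "__main__":
    c3 = atp_per_hexose(uses_c4_pathway=False)
    c4 = atp_per_hexose(uses_c4_pathway=True)
    print(c3, c4, c4 - c3)  # 18 30 12
```

The extra 12 ATP is the price of concentrating CO2; this count ignores photorespiratory losses, which is why it represents the C3 cost only in the absence of photorespiration.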

Tropical plants with a C4 pathway do little photorespiration because the 
high concentration of CO2 in their bundle-sheath cells accelerates the 
carboxylase reaction relative to the oxygenase reaction. This effect is especially 
important at higher temperatures. The geographic distribution of plants having 
this pathway (C4 plants) and those lacking it (C3 plants) can now be 
understood in molecular terms. C4 plants have the advantage in a hot environment 
and under high illumination, which accounts for their prevalence in the 
tropics. C3 plants, which consume only 18 molecules of ATP per hexose 
molecule formed in the absence of photorespiration (compared with 30 mol- 
ecules of ATP for C4 plants), are more efficient at temperatures lower than 
about 28°C, and so they predominate in temperate environments. 


Rubisco is present in bacteria, eukaryotes, and even archaea, though 
other photosynthetic components have not been found in archaea. 
Thus, rubisco emerged early in evolution, when the atmosphere was rich in 
CO2 and almost devoid of O2. The enzyme was not originally selected to 
operate in an environment like the present one, which is almost devoid of 
CO2 and rich in O2. Photorespiration became significant about 600 million 
years ago, when the CO2 concentration fell to present levels. The C4 
pathway is thought to have evolved in response no more than 30 million years 
ago and possibly as recently as 7 million years ago. It is interesting that none 
of the enzymes are unique to C4 plants, suggesting that this pathway made 
use of already existing enzymes. 


Crassulacean acid metabolism permits growth in arid ecosystems 


Many plants growing in hot, dry climates keep the stomata of their leaves 
closed in the heat of the day to prevent water loss (Figure 20.17). As a 
consequence, CO2 cannot be absorbed during the daylight hours, when it is 
needed for glucose synthesis. Rather, CO2 enters the leaf when the stomata 
open at the cooler temperatures of night. To store the CO2 until it can be 
used during the day, such plants make use of an adaptation called 
crassulacean acid metabolism (CAM), named after the family Crassulaceae (the 
stonecrops, a family of succulents). Carbon dioxide is fixed by the C4 pathway 
into malate, which is stored in vacuoles. During the day, malate is 
decarboxylated and the CO2 becomes available to the Calvin cycle. In contrast 
with C4 plants, CAM plants separate CO2 accumulation from CO2 utilization 
temporally rather 
than spatially. 


20.3 The Pentose Phosphate Pathway Generates NADPH 
and Synthesizes Five-Carbon Sugars 


Photosynthetic organisms can use the light reactions for generation of some 
NADPH. Another pathway, present in all organisms, meets the NADPH 
needs of nonphotosynthetic organisms and of the nonphotosynthetic tissues 
in plants. The pentose phosphate pathway is a crucial source of NADPH to 
use in reductive biosynthesis (Table 20.2) as well as for protection against 
oxidative stress. This pathway consists of two phases: (1) the oxidative gen- 
eration of NADPH and (2) the nonoxidative interconversion of sugars 
(Figure 20.18). In the oxidative phase, NADPH is generated when glucose 
6-phosphate is oxidized to ribulose 5-phosphate, which is subsequently 
converted into ribose 5-phosphate. Ribose 5-phosphate and its derivatives 
are components of RNA and DNA, as well as of ATP, NADH, FAD, and 
coenzyme A. 


Glucose 6-phosphate + 2 NADP⁺ + H2O → 
ribulose 5-phosphate + 2 NADPH + 2 H⁺ + CO2 


In the nonoxidative phase, the pathway catalyzes the interconversion of 
three-, four-, five-, six-, and seven-carbon sugars in a series of nonoxidative 
reactions. Excess five-carbon sugars may be converted into intermediates of 
the glycolytic pathway. All these reactions take place in the cytoplasm. 
These interconversions rely on the same reactions that lead to the 
regeneration of ribulose 1,5-bisphosphate in the Calvin cycle. 


Two molecules of NADPH are generated in the conversion of glucose 
6-phosphate into ribulose 5-phosphate 


The oxidative phase of the pentose phosphate pathway starts with the dehy- 
drogenation of glucose 6-phosphate at carbon 1, a reaction catalyzed by 
glucose 6-phosphate dehydrogenase (Figure 20.19). This enzyme is highly 
specific for NADP⁺; the KM for NAD⁺ is about a thousand times as great 
as that for NADP⁺. The product is 6-phosphoglucono-δ-lactone, which is an 
intramolecular ester between the C-1 carboxyl group and the C-5 hydroxyl 
group. The next step is the hydrolysis of 6-phosphoglucono-δ-lactone by a 
specific lactonase to give 6-phosphogluconate. This six-carbon sugar is then 
oxidatively decarboxylated by 6-phosphogluconate dehydrogenase to yield 
ribulose 5-phosphate. NADP⁺ is again the electron acceptor. 
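The three steps of the oxidative phase can be written down as a small table and tallied. This sketch is only a bookkeeping aid built from the description above; the field names and the `tally` and `is_linear_chain` helpers are hypothetical.

```python
# One entry per step of the oxidative phase, as described in the text.
OXIDATIVE_PHASE = [
    {"enzyme": "glucose 6-phosphate dehydrogenase",
     "substrate": "glucose 6-phosphate",
     "product": "6-phosphoglucono-delta-lactone",
     "nadph": 1, "co2": 0},
    {"enzyme": "lactonase",
     "substrate": "6-phosphoglucono-delta-lactone",
     "product": "6-phosphogluconate",
     "nadph": 0, "co2": 0},
    {"enzyme": "6-phosphogluconate dehydrogenase",
     "substrate": "6-phosphogluconate",
     "product": "ribulose 5-phosphate",
     "nadph": 1, "co2": 1},
]


def tally(steps):
    """Total NADPH generated and CO2 released across all steps."""
    return {
        "NADPH": sum(step["nadph"] for step in steps),
        "CO2": sum(step["co2"] for step in steps),
    }


def is_linear_chain(steps):
    """True when each product is the substrate of the following step."""
    return all(a["product"] == b["substrate"] for a, b in zip(steps, steps[1:]))


if __name__ == "__main__":
    print(tally(OXIDATIVE_PHASE))            # {'NADPH': 2, 'CO2': 1}
    print(is_linear_chain(OXIDATIVE_PHASE))  # True
```

The totals agree with the overall equation for the oxidative phase: two NADPH and one CO2 per glucose 6-phosphate, with the six-carbon substrate ending as a five-carbon sugar.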


The pentose phosphate pathway and glycolysis are linked 
by transketolase and transaldolase 


The preceding reactions yield two molecules of NADPH and one molecule 
of ribulose 5-phosphate for each molecule of glucose 6-phosphate oxidized. 
Table 20.2 Pathways requiring NADPH 

Synthesis 
    Fatty acid biosynthesis 
    Cholesterol biosynthesis 
    Neurotransmitter biosynthesis 
    Nucleotide biosynthesis 

Detoxification 
    Reduction of oxidized glutathione 
    Cytochrome P450 monooxygenases 
Figure 20.18 Pentose phosphate pathway. The pathway 
consists of (1) an oxidative phase that generates NADPH and 
(2) a nonoxidative phase that interconverts phosphorylated 
sugars. 


The ribulose 5-phosphate is subsequently 
isomerized to ribose 5-phosphate by phospho- 
pentose isomerase. 
Ribulose 5-phosphate ⇌ Ribose 5-phosphate   (phosphopentose isomerase) 


Although ribose 5-phosphate is a precursor to 
many biomolecules, many cells need NADPH 
for reductive biosyntheses much more than 
they need ribose 5-phosphate for incorporation 
into nucleotides and nucleic acids. For instance, 
adipose tissue, the liver, and mammary glands 
require large amounts of NADPH for fatty 
acid synthesis (Chapter 22). In these cases, 
ribose 5-phosphate is converted into the glycolytic 
intermediates glyceraldehyde 3-phosphate 
and fructose 6-phosphate by transketolase and 
transaldolase. These enzymes create a reversible 
link between the pentose phosphate pathway and 
glycolysis by catalyzing these three successive 
reactions. 


C5 + C5 ⇌ C3 + C7   (transketolase) 
C3 + C7 ⇌ C6 + C4   (transaldolase) 
C4 + C5 ⇌ C6 + C3   (transketolase) 


The net result of these reactions is the forma- 
tion of two hexoses and one triose from three 
pentoses: 


3 C5 ⇌ 2 C6 + C3 

The first of the three reactions linking the 
pentose phosphate pathway and glycolysis is 
the formation of glyceraldehyde 3-phosphate and 
sedoheptulose 7-phosphate from two pentoses. 
Figure 20.19 Oxidative phase of the pentose phosphate pathway. Glucose 6-phosphate is 
oxidized to 6-phosphoglucono-δ-lactone to generate one molecule of NADPH. The lactone product 
is hydrolyzed to 6-phosphogluconate, which is oxidatively decarboxylated to ribulose 5-phosphate 
with the generation of a second molecule of NADPH. 
The donor of the two-carbon unit in this reaction is xylulose 5-phosphate, 
an epimer of ribulose 5-phosphate. A ketose is a substrate of transketolase 
only if its hydroxyl group at C-3 has the configuration of xylulose rather 
than ribulose. Ribulose 5-phosphate is converted into the appropriate 
epimer for the transketolase reaction by phosphopentose epimerase in the 
reverse reaction of that which takes place in the Calvin cycle. 

Glyceraldehyde 3-phosphate and sedoheptulose 7-phosphate generated 
by the transketolase then react to form fructose 6-phosphate and erythrose 
4-phosphate. 


Table 20.3 Pentose phosphate pathway 


Reaction 


Oxidative phase 


This synthesis of a four-carbon sugar and a six-carbon sugar is catalyzed by 
transaldolase. 

In the third reaction, transketolase catalyzes the synthesis of fructose 
6-phosphate and glyceraldehyde 3-phosphate from erythrose 4-phosphate and 
xylulose 5-phosphate. 

The sum of these reactions is 


2 Xylulose 5-phosphate + ribose 5-phosphate ⇌ 
        2 fructose 6-phosphate + glyceraldehyde 3-phosphate 


Xylulose 5-phosphate can be formed from ribose 5-phosphate by the 
sequential action of phosphopentose isomerase and phosphopentose epim- 
erase, and so the net reaction starting from ribose 5-phosphate is 


3 Ribose 5-phosphate ⇌ 
        2 fructose 6-phosphate + glyceraldehyde 3-phosphate 


Thus, excess ribose 5-phosphate formed by the pentose phosphate pathway can 
be completely converted into glycolytic intermediates. Moreover, any ribose 
ingested in the diet can be processed into glycolytic intermediates by this 
pathway. It is evident that the carbon skeletons of sugars can be extensively 
rearranged to meet physiological needs (Table 20.3). 
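
The bookkeeping behind these three reactions can be checked mechanically. The following is a minimal sketch, not part of the original text: the dictionary and function names are illustrative assumptions, and only the carbon counts and reaction partners come from the passage above. It confirms that each reaction conserves carbon and that the three reactions sum to the stated net reaction.

```python
from collections import Counter

# Carbon atoms per sugar phosphate, as named in the text.
CARBONS = {
    "xylulose 5-phosphate": 5,
    "ribose 5-phosphate": 5,
    "sedoheptulose 7-phosphate": 7,
    "glyceraldehyde 3-phosphate": 3,
    "fructose 6-phosphate": 6,
    "erythrose 4-phosphate": 4,
}

# (enzyme, substrates, products) for the three linking reactions.
REACTIONS = [
    ("transketolase",
     ["xylulose 5-phosphate", "ribose 5-phosphate"],
     ["sedoheptulose 7-phosphate", "glyceraldehyde 3-phosphate"]),
    ("transaldolase",
     ["sedoheptulose 7-phosphate", "glyceraldehyde 3-phosphate"],
     ["fructose 6-phosphate", "erythrose 4-phosphate"]),
    ("transketolase",
     ["xylulose 5-phosphate", "erythrose 4-phosphate"],
     ["fructose 6-phosphate", "glyceraldehyde 3-phosphate"]),
]


def carbon_total(names):
    """Total number of carbon atoms in a list of sugar phosphates."""
    return sum(CARBONS[name] for name in names)


def net_reaction(reactions):
    """Sum reactions; substrates count negative, products positive."""
    net = Counter()
    for _enzyme, substrates, products in reactions:
        for name in substrates:
            net[name] -= 1
        for name in products:
            net[name] += 1
    # Intermediates that are made and then consumed cancel to zero.
    return {name: n for name, n in net.items() if n != 0}


for enzyme, substrates, products in REACTIONS:
    balanced = carbon_total(substrates) == carbon_total(products)
    print(enzyme, "carbon balanced:", balanced)

print(net_reaction(REACTIONS))
```

Sedoheptulose 7-phosphate and erythrose 4-phosphate cancel in the sum, which is why they appear only as intermediates.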


Mechanism: Transketolase and transaldolase stabilize 
carbanionic intermediates by different mechanisms 


The reactions catalyzed by transketolase and transaldolase are distinct, yet 
similar in many ways. One difference is that transketolase transfers a 
two-carbon unit, whereas transaldolase transfers a three-carbon unit. Each 
of these units is transiently attached to the enzyme in the course of the 
reaction. 


Reaction                                                                      Enzyme

Oxidative phase
Glucose 6-phosphate + NADP⁺ → 6-phosphoglucono-δ-lactone + NADPH + H⁺         Glucose 6-phosphate dehydrogenase
6-Phosphoglucono-δ-lactone + H₂O → 6-phosphogluconate + H⁺                    Lactonase
6-Phosphogluconate + NADP⁺ → ribulose 5-phosphate + CO₂ + NADPH + H⁺          6-Phosphogluconate dehydrogenase

Nonoxidative phase
Ribulose 5-phosphate ⇌ ribose 5-phosphate                                     Phosphopentose isomerase
Ribulose 5-phosphate ⇌ xylulose 5-phosphate                                   Phosphopentose epimerase
Xylulose 5-phosphate + ribose 5-phosphate ⇌
    sedoheptulose 7-phosphate + glyceraldehyde 3-phosphate                    Transketolase
Sedoheptulose 7-phosphate + glyceraldehyde 3-phosphate ⇌
    fructose 6-phosphate + erythrose 4-phosphate                              Transaldolase
Xylulose 5-phosphate + erythrose 4-phosphate ⇌
    fructose 6-phosphate + glyceraldehyde 3-phosphate                         Transketolase


Figure 20.20 Transketolase mechanism. (1) Thiamine pyrophosphate (TPP) ionizes to form a 
carbanion. (2) The carbanion of TPP attacks the ketose substrate. (3) Cleavage of a carbon-carbon 
bond frees the aldose product and leaves a two-carbon fragment joined to TPP. (4) This activated 
glycoaldehyde intermediate attacks the aldose substrate to form a new carbon-carbon bond. (5) 
The ketose product is released, freeing the TPP for the next reaction cycle. 


Transketolase reaction. Transketolase contains a tightly bound thiamine 
pyrophosphate as its prosthetic group. The enzyme transfers a two-carbon 
glycoaldehyde from a ketose donor to an aldose acceptor. The site of the 
addition of the two-carbon unit is the thiazole ring of TPP. Transketolase is 
homologous to the E₁ subunit of the pyruvate dehydrogenase complex 
(Section 17.1) and the reaction mechanism is similar (Figure 20.20). 

The C-2 carbon atom of bound TPP readily ionizes to give a carbanion. 
The negatively charged carbon atom of this reactive intermediate attacks 
the carbonyl group of the ketose substrate. The resulting addition compound 
releases the aldose product to yield an activated glycoaldehyde unit. The 
positively charged nitrogen atom in the thiazole ring acts as an electron sink 
in the development of this activated intermediate. The carbonyl group of a 
suitable aldose acceptor then condenses with the activated glycoaldehyde 
unit to form a new ketose, which is released from the enzyme. 


Transaldolase reaction. Transaldolase transfers a three-carbon dihydroxyacetone 
unit from a ketose donor to an aldose acceptor. Transaldolase, in 
contrast with transketolase, does not contain a prosthetic group. Rather, a 
Schiff base is formed between the carbonyl group of the ketose substrate and the 
ε-amino group of a lysine residue at the active site of the enzyme (Figure 20.21). 
This kind of covalent enzyme-substrate intermediate is like that formed 
in fructose 1,6-bisphosphate aldolase in the glycolytic pathway (Section 16.1) 
and, indeed, the enzymes are homologous. The Schiff base becomes protonated, 
the bond between C-3 and C-4 is split, and an aldose is released. 
The negative charge on the Schiff-base carbanion moiety is stabilized by 
resonance. The positively charged nitrogen atom of the protonated Schiff 
base acts as an electron sink. The Schiff-base adduct is stable until a suitable 
aldose becomes bound. The dihydroxyacetone moiety then reacts 
with the carbonyl group of the aldose. The ketose product is released by 
hydrolysis of the Schiff base. The nitrogen atom of the protonated Schiff 
base plays the same role in transaldolase as the thiazole-ring nitrogen atom 
does in transketolase. In each enzyme, a group within an intermediate reacts 
like a carbanion in attacking a carbonyl group to form a new carbon-carbon 
bond. In each case, the charge on the carbanion is stabilized by resonance 
(Figure 20.22). 


Figure 20.21 Transaldolase mechanism. 
(1) The reaction begins with the formation of 
a Schiff base between a lysine residue in 
transaldolase and the ketose substrate. 
Protonation of the Schiff base (2) leads to 
the release of the aldose product (3), leaving 
a three-carbon fragment attached to the 
lysine residue. (4) This intermediate adds to 
the aldose substrate, with a concomitant 
protonation to form a new carbon-carbon 
bond. Subsequent deprotonation (5) and 
hydrolysis of the Schiff base (6) release the 
ketose product from the lysine side chain, 
completing the reaction cycle. 


20.4 The Metabolism of Glucose 6-phosphate 
by the Pentose Phosphate Pathway Is 
Coordinated with Glycolysis 


Glucose 6-phosphate is metabolized by both the glycolytic 
pathway (Chapter 16) and the pentose phosphate pathway. 
How is the processing of this important metabolite parti- 

Figure 20.22 Carbanion intermediates. For transketolase and 
transaldolase, a carbanion intermediate is stabilized by resonance. In 
transketolase, TPP stabilizes this intermediate; in transaldolase, a 
protonated Schiff base plays this role. 

... dehydrogenation of glucose 6-phosphate because it is needed as 
the electron acceptor. The effect of low levels of NADP⁺ is 
intensified by the fact that NADPH competes with NADP⁺ 
in binding to the enzyme. The ratio of NADP⁺ to NADPH 
in the cytoplasm of a liver cell from a well-fed rat is about 0.014, several 
orders of magnitude lower than the ratio of NAD⁺ to NADH, which is 700 
under the same conditions. The marked effect of the NADP⁺ level on the 
rate of the oxidative phase ensures that NADPH is not generated unless the 
supply needed for reductive biosyntheses is low. The nonoxidative phase of 
the pentose phosphate pathway is controlled primarily by the availability of 
substrates. 
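
To make "several orders of magnitude" concrete, the two ratios quoted above can be compared directly. This is a minimal sketch under the assumption that each ratio is simply [oxidized]/[reduced] for one coenzyme pool; the function name `oxidized_fraction` is illustrative and does not come from the text.

```python
import math

# Ratios quoted in the text for the cytoplasm of a well-fed rat liver cell.
NADP_TO_NADPH = 0.014   # [NADP+] / [NADPH]
NAD_TO_NADH = 700.0     # [NAD+] / [NADH]


def oxidized_fraction(ratio):
    """Fraction of a coenzyme pool in the oxidized form, given ox/red ratio."""
    return ratio / (1.0 + ratio)


fold_difference = NAD_TO_NADH / NADP_TO_NADPH
orders_of_magnitude = math.log10(fold_difference)

print(f"fold difference between the ratios: {fold_difference:.0f}")
print(f"orders of magnitude: {orders_of_magnitude:.1f}")
print(f"fraction of NADP pool oxidized: {oxidized_fraction(NADP_TO_NADPH):.3f}")
print(f"fraction of NAD pool oxidized:  {oxidized_fraction(NAD_TO_NADH):.3f}")
```

On these numbers, almost all of the NADP pool is reduced and almost all of the NAD pool is oxidized, so NADP⁺ is scarce and its availability can limit the dehydrogenase.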


The flow of glucose 6-phosphate depends on the need 
for NADPH, ribose 5-phosphate, and ATP 


We can grasp the intricate interplay between glycolysis and the pentose 
phosphate pathway by examining the metabolism of glucose 6-phosphate in 
four different metabolic situations (Figure 20.23). 


Mode 1. Much more ribose 5-phosphate than NADPH is required. For 
example, rapidly dividing cells need ribose 5-phosphate for the synthesis 
of nucleotide precursors of DNA. Most of the glucose 6-phosphate is converted 
into fructose 6-phosphate and glyceraldehyde 3-phosphate by the 
glycolytic pathway. Transaldolase and transketolase then convert two 
molecules of fructose 6-phosphate and one molecule of glyceraldehyde 
3-phosphate into three molecules of ribose 5-phosphate by a reversal of the 
reactions described earlier. The stoichiometry of mode 1 is 


5 Glucose 6-phosphate + ATP → 6 ribose 5-phosphate + ADP + 2 H⁺ 


Figure 20.23 Four modes of the pentose 
phosphate pathway. Major products are 
shown in color. 


Table 20.4 Tissues with active pentose phosphate pathways 

Tissue             Function 

Adrenal gland      Steroid synthesis 
Liver              Fatty acid and cholesterol synthesis 
Testes             Steroid synthesis 
Adipose tissue     Fatty acid synthesis 
Ovary              Steroid synthesis 
Mammary gland      Fatty acid synthesis 
Red blood cells    Maintenance of reduced glutathione 


Mode 2. The needs for NADPH and for ribose 5-phosphate are balanced. 
The predominant reaction under these conditions is the formation of two 
molecules of NADPH and one molecule of ribose 5-phosphate from one 
molecule of glucose 6-phosphate in the oxidative phase of the pentose phos- 
phate pathway. The stoichiometry of mode 2 is 


Glucose 6-phosphate + 2 NADP⁺ + H₂O → 
        ribose 5-phosphate + 2 NADPH + 2 H⁺ + CO₂ 


Mode 3. Much more NADPH than ribose 5-phosphate is required. For 
example, adipose tissue requires a high level of NADPH for the synthesis of 
fatty acids (Table 20.4). In this case, glucose 6-phosphate is completely 
oxidized to CO₂. Three groups of reactions are active in this situation. First, 
the oxidative phase of the pentose phosphate pathway forms two mole- 
cules of NADPH and one molecule of ribose 5-phosphate. Then, ribose 
5-phosphate is converted into fructose 6-phosphate and glyceralde- 
hyde 3-phosphate by transketolase and transaldolase. Finally, glucose 
6-phosphate is resynthesized from fructose 6-phosphate and glyceralde- 
hyde 3-phosphate by the gluconeogenic pathway. The stoichiometries of 
these three sets of reactions are 


6 Glucose 6-phosphate + 12 NADP⁺ + 6 H₂O → 
        6 ribose 5-phosphate + 12 NADPH + 12 H⁺ + 6 CO₂ 
6 Ribose 5-phosphate → 
        4 fructose 6-phosphate + 2 glyceraldehyde 3-phosphate 
4 Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate + H₂O → 
        5 glucose 6-phosphate + Pᵢ 

The sum of the mode 3 reactions is 


Glucose 6-phosphate + 12 NADP⁺ + 7 H₂O → 
        6 CO₂ + 12 NADPH + 12 H⁺ + Pᵢ 


Thus, the equivalent of glucose 6-phosphate can be completely oxidized to CO₂ 
with the concomitant generation of NADPH. In essence, ribose 5-phosphate 
produced by the pentose phosphate pathway is recycled into glucose 
6-phosphate by transketolase, transaldolase, and some of the enzymes of 
the gluconeogenic pathway. 
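
The mode 3 sum can be reproduced by adding the three sets of reactions term by term. The sketch below is illustrative only: the abbreviations (G6P, R5P, F6P, GAP, Pi) and the helper `add_reactions` are assumptions introduced here, while the coefficients are taken from the stoichiometries given for mode 3.

```python
from collections import Counter


def add_reactions(*reactions):
    """Sum reactions written as {species: coefficient}.

    Reactants carry negative coefficients, products positive ones.
    Species whose coefficients cancel are dropped from the result.
    """
    net = Counter()
    for reaction in reactions:
        for species, coefficient in reaction.items():
            net[species] += coefficient
    return {s: c for s, c in net.items() if c != 0}


# Oxidative phase, run six times.
oxidative = {"G6P": -6, "NADP+": -12, "H2O": -6,
             "R5P": 6, "NADPH": 12, "H+": 12, "CO2": 6}

# Transketolase and transaldolase rearrangements.
nonoxidative = {"R5P": -6, "F6P": 4, "GAP": 2}

# Resynthesis of glucose 6-phosphate by gluconeogenic enzymes.
gluconeogenic = {"F6P": -4, "GAP": -2, "H2O": -1, "G6P": 5, "Pi": 1}

net = add_reactions(oxidative, nonoxidative, gluconeogenic)
for species, coefficient in sorted(net.items(), key=lambda item: item[1]):
    print(f"{coefficient:+d} {species}")
```

Five of the six glucose 6-phosphate molecules consumed in the first step are regenerated in the last, leaving a net consumption of one.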


Mode 4. Both NADPH and ATP are required. Alternatively, ribose 
5-phosphate formed by the oxidative phase of the pentose phosphate 
pathway can be converted into pyruvate. Fructose 6-phosphate and glyceraldehyde 
3-phosphate derived from ribose 5-phosphate enter the glycolytic 
pathway rather than reverting to glucose 6-phosphate. In this mode, ATP 
and NADPH are concomitantly generated, and five of the six carbons of 
glucose 6-phosphate emerge in pyruvate. 


3 Glucose 6-phosphate + 6 NADP⁺ + 5 NAD⁺ + 5 Pᵢ + 8 ADP → 
        5 pyruvate + 3 CO₂ + 6 NADPH + 5 NADH 
        + 8 ATP + 2 H₂O + 8 H⁺ 


Pyruvate formed by these reactions can be oxidized to generate more ATP 
or it can be used as a building block in a variety of biosyntheses. 
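
The coefficients in the mode 4 equation follow from tallying each stage in turn. The sketch below is a hedged illustration rather than anything from the original text: the function `mode4_yield` is hypothetical, and the per-step yields for glycolysis (one ATP spent per fructose 6-phosphate, and two ATP, one NADH, and one water formed per glyceraldehyde 3-phosphate converted into pyruvate) are assumed from the standard glycolytic pathway (Chapter 16).

```python
def mode4_yield(n_g6p=3):
    """Tally mode 4 products for n_g6p molecules of glucose 6-phosphate.

    n_g6p must be a multiple of 3, because the nonoxidative phase
    rearranges three pentoses into two hexoses and one triose.
    """
    if n_g6p % 3 != 0:
        raise ValueError("n_g6p must be a multiple of 3")

    # Oxidative phase: each G6P gives 1 ribose 5-phosphate, 2 NADPH and
    # 1 CO2, and uses 1 H2O (hydrolysis of the lactone).
    r5p = n_g6p
    nadph = 2 * n_g6p
    co2 = n_g6p
    water = -n_g6p

    # Nonoxidative phase: 3 R5P -> 2 F6P + 1 GAP.
    f6p = 2 * r5p // 3
    gap = r5p // 3

    # Upper glycolysis (assumed): each F6P costs 1 ATP and yields 2 GAP.
    atp = -f6p
    gap += 2 * f6p

    # Lower glycolysis (assumed): each GAP uses 1 Pi and yields
    # 1 NADH, 2 ATP, 1 H2O and 1 pyruvate.
    nadh = gap
    phosphate_used = gap
    atp += 2 * gap
    water += gap
    pyruvate = gap

    return {"pyruvate": pyruvate, "CO2": co2, "NADPH": nadph,
            "NADH": nadh, "ATP": atp, "H2O": water,
            "Pi_used": phosphate_used}


result = mode4_yield(3)
print(result)

# Carbon check: 3 hexoses in, 5 pyruvate (3 C each) plus 3 CO2 out.
carbons_in = 3 * 6
carbons_out = 3 * result["pyruvate"] + result["CO2"]
print("carbon balanced:", carbons_in == carbons_out)
```

The tally gives a net of 8 ATP because 10 ATP are formed from five molecules of glyceraldehyde 3-phosphate while 2 are spent phosphorylating fructose 6-phosphate.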


Through the looking-glass: The Calvin cycle and the pentose phosphate 
pathway are mirror images 


The complexities of the Calvin cycle and the pentose phosphate pathway 
are easier to comprehend if we consider them as functional mirror 
images of each other. The Calvin cycle begins with the fixation of CO₂ and 
proceeds to use NADPH in the synthesis of glucose. The pentose phosphate 
pathway begins with the oxidation of a glucose-derived carbon atom to CO₂ 
and concomitantly generates NADPH. The regeneration phase of the 
Calvin cycle converts C₆ and C₃ molecules back into the starting material, 
the C₅ molecule ribulose 1,5-bisphosphate. The pentose phosphate pathway 
converts a C₅ molecule, ribose 5-phosphate, into C₆ and C₃ intermediates 
of the glycolytic pathway. Not surprisingly, in photosynthetic organisms, 
many enzymes are common to the two pathways. We see the economy 
of evolution: the use of identical enzymes for similar reactions with different 
ends. 


20.5 Glucose 6-phosphate Dehydrogenase Plays a Key 
Role in Protection Against Reactive Oxygen Species 


The NADPH generated by the pentose phosphate pathway plays a vital 
role in protecting the cells from reactive oxygen species (ROS). Reactive 
oxygen species generated in oxidative metabolism inflict damage on all 
classes of macromolecules and can ultimately lead to cell death. Indeed, 
ROS are implicated in a number of human diseases (see Table 18.3). 
Reduced glutathione (GSH), a tripeptide with a free sulfhydryl group, com- 
bats oxidative stress by reducing ROS to harmless forms. Its task accom- 
plished, the glutathione is now in the oxidized form (GSSG) and must be 
reduced to regenerate GSH. The reducing power is supplied by the NADPH 
generated by glucose 6-phosphate dehydrogenase in the pentose phosphate 
pathway. Indeed, cells with reduced levels of glucose 6-phosphate dehydro- 
genase are especially sensitive to oxidative stress. This stress is most acute 
in red blood cells because they lack mitochondria and have no alternative 
means of generating reducing power. 


Glucose 6-phosphate dehydrogenase deficiency causes a 
drug-induced hemolytic anemia 


Reduced glutathione (γ-glutamylcysteinylglycine) 

Vicia faba. The Mediterranean plant Vicia 
faba is a source of fava beans that contain 
the purine glycoside vicine. [Inga Spence/ 
Visuals Unlimited.] 

Figure 20.24 Red blood cells with Heinz 
bodies. The light micrograph shows red 
blood cells obtained from a person deficient 
in glucose 6-phosphate dehydrogenase. The 
dark particles, called Heinz bodies, inside the 
cells are clumps of denatured hemoglobin 
that adhere to the plasma membrane and 
stain with basic dyes. Red blood cells in such 
people are highly susceptible to oxidative 
damage. [Courtesy of Dr. Stanley Schrier.] 


The importance of the pentose phosphate pathway is highlighted by 
some people’s anomalous responses to certain drugs. For instance, 
pamaquine, an antimalarial drug introduced in 1926, was associated with 
the appearance of severe and mysterious ailments. Most patients tolerated 
the drug well, but a few developed severe symptoms within a few days after 
therapy was started. The urine turned black, jaundice developed, and the 
hemoglobin content of the blood dropped sharply. In some cases, massive 
destruction of red blood cells caused death. 

This drug-induced hemolytic anemia was shown 30 years later to be 
caused by a deficiency of glucose 6-phosphate dehydrogenase, the enzyme 
catalyzing the first step in the oxidative branch of the pentose phosphate 
pathway. The result is a dearth of NADPH in all cells, but this deficiency is 
most acute in red blood cells. This defect, which is inherited on the X chro- 
mosome, is the most common disease that results from an enzyme malfunc- 
tion, affecting hundreds of millions of people. The major role of NADPH in 
red cells is to reduce the disulfide form of glutathione to the sulfhydryl 
form. The enzyme that catalyzes the regeneration of reduced glutathione is 
glutathione reductase. 


GSSG + NADPH + H⁺ → 2 GSH + NADP⁺   (glutathione reductase)

GSSG: oxidized glutathione, two γ-Glu-Cys-Gly chains joined by a disulfide bond 
GSH: reduced glutathione, γ-Glu-Cys-Gly with a free sulfhydryl group 


Red blood cells with a lowered level of reduced glutathione are more suscep- 
tible to hemolysis. 

Pamaquine sensitivity is not simply a historical oddity about malaria 
treatment many decades ago. Vicine, a purine glycoside of fava beans, 
has a similar effect, and fava beans are still consumed today in countries surrounding the 
Mediterranean. People deficient in glucose 6-phosphate dehydrogenase 
suffer hemolysis from eating fava beans or inhaling the pollen of the fava 
flowers, a response called favism. How can we explain pamaquine-induced 
hemolysis biochemically? Pamaquine is an oxidative agent that leads to the 
generation of peroxides, reactive oxygen species that can damage mem- 
branes as well as other biomolecules. Peroxides are normally eliminated by 
the enzyme glutathione peroxidase, which uses reduced glutathione as a 
reducing agent. 


2 GSH + ROOH → GSSG + H₂O + ROH   (glutathione peroxidase)


In the absence of glucose 6-phosphate dehydrogenase, peroxides continue 
to damage membranes because no NADPH is being produced to restore 
reduced glutathione. 

Reduced glutathione is also essential for maintaining the normal structure 
of red blood cells because it preserves the structure of hemoglobin. The 
reduced form of glutathione serves as a sulfhydryl buffer that keeps the 
cysteine residues of hemoglobin in the reduced sulfhydryl form. Without adequate 
levels of reduced glutathione, the hemoglobin sulfhydryl groups can no 
longer be maintained in the reduced form. Hemoglobin molecules then 
cross-link with one another to form aggregates called Heinz bodies on cell 
membranes (Figure 20.24). Membranes damaged by Heinz bodies and 
reactive oxygen species become deformed, and the cell is likely to undergo 
lysis. Thus, the answer to our question is that glucose 6-phosphate dehy- 
drogenase is required to maintain reduced glutathione levels to protect 
against oxidative stress. In the absence of oxidative stress, however, the 
deficiency is quite benign. The sensitivity to pamaquine of people having 
this dehydrogenase deficiency also clearly demonstrates that atypical reac- 
tions to drugs may have a genetic basis. 


A deficiency of glucose 6-phosphate dehydrogenase confers 
an evolutionary advantage in some circumstances 


The incidence of the most common form of glucose 6-phosphate 
dehydrogenase deficiency, characterized by a 10-fold reduction in 
enzymatic activity in red blood cells, is 11% among Americans of African 
heritage. This high frequency suggests that the deficiency may be advanta- 
geous under certain environmental conditions. Indeed, glucose 6-phosphate 
dehydrogenase deficiency protects against falciparum malaria. The parasites 
causing this disease require reduced glutathione and the products of the 
pentose phosphate pathway for optimal growth. Thus, glucose 6-phosphate 
dehydrogenase deficiency is a mechanism of protection against malaria, 
which accounts for its high frequency in malaria-infested regions of the 
world. We see here once again the interplay of heredity and environment in 
the production of disease. 


Summary 


20.1 The Calvin Cycle Synthesizes Hexoses from Carbon Dioxide and Water 
ATP and NADPH formed in the light reactions of photosynthesis 
are used to convert CO₂ into hexoses and other organic compounds. 
The dark phase of photosynthesis, called the Calvin cycle, starts with 
the reaction of CO₂ and ribulose 1,5-bisphosphate to form two 
molecules of 3-phosphoglycerate. The steps in the conversion of 
3-phosphoglycerate into fructose 6-phosphate and glucose 6-phosphate 
are like those of gluconeogenesis, except that glyceraldehyde 
3-phosphate dehydrogenase in chloroplasts is specific for NADPH 
rather than NADH. Ribulose 1,5-bisphosphate is regenerated from 
fructose 6-phosphate, glyceraldehyde 3-phosphate, and dihydroxyacetone 
phosphate by a complex series of reactions. Several of the steps 
in the regeneration of ribulose 1,5-bisphosphate are like those of the 
pentose phosphate pathway. Three molecules of ATP and two molecules 
of NADPH are consumed for each molecule of CO₂ converted 
into a hexose. Starch in chloroplasts and sucrose in the cytoplasm are 
the major carbohydrate stores in plants. 

Rubisco also catalyzes a competing oxygenase reaction, which 
produces phosphoglycolate and 3-phosphoglycerate. The recycling of 
phosphoglycolate leads to the release of CO₂ and further consumption 
of O₂ in a process called photorespiration. 


20.2 The Activity of the Calvin Cycle Depends on Environmental Conditions 
Reduced thioredoxin formed by the light-driven transfer of electrons 
from ferredoxin activates enzymes of the Calvin cycle by reducing 
disulfide bridges. The light-induced increase in pH and Mg²⁺ level of 
the stroma is important in stimulating the carboxylation of ribulose 
1,5-bisphosphate by rubisco. Photorespiration is minimized in tropical 
plants, which have an accessory pathway, the C₄ pathway, for 
concentrating CO₂ at the site of the Calvin cycle. This pathway 
enables tropical plants to take advantage of high levels of light and 
minimize the oxygenation of ribulose 1,5-bisphosphate. Plants in arid 
ecosystems employ crassulacean acid metabolism to prevent dehydration. 
In CAM plants, the C₄ pathway is active during the night, when 
the plant exchanges gases with the air. During the day, gas exchange is 
eliminated and CO₂ is generated from malate stored in vacuoles. 


20.3 The Pentose Phosphate Pathway Generates NADPH and Synthesizes 
Five-Carbon Sugars 
Whereas the Calvin cycle is present only in photosynthetic organisms, 
the pentose phosphate pathway is present in all organisms. The pentose 
phosphate pathway generates NADPH and ribulose 5-phosphate 
in the cytoplasm, which is subsequently isomerized to ribose 5-phosphate. 
NADPH is used in reductive biosyntheses, whereas ribose 
5-phosphate is used in the synthesis of RNA, DNA, and nucleotide 
coenzymes. The pentose phosphate pathway starts with the dehydrogenation 
of glucose 6-phosphate to form a lactone, which is hydrolyzed 
to give 6-phosphogluconate and then oxidatively decarboxylated 
to yield ribulose 5-phosphate. NADP⁺ is the electron acceptor in both 
of these oxidations. The last step is the isomerization of ribulose 
5-phosphate (a ketose) to ribose 5-phosphate (an aldose). A different 
mode of the pathway is active when cells need much more NADPH 
than ribose 5-phosphate. Under these conditions, ribose 5-phosphate 
is converted into glyceraldehyde 3-phosphate and fructose 6-phosphate 
by transketolase and transaldolase. These two enzymes create a 
reversible link between the pentose phosphate pathway and gluconeogenesis. 
Xylulose 5-phosphate, sedoheptulose 7-phosphate, and erythrose 
4-phosphate are intermediates in these interconversions. In this 
way, 12 molecules of NADPH can be generated for each molecule of 
glucose 6-phosphate that is completely oxidized to CO₂. 


20.4 The Metabolism of Glucose 6-phosphate by the Pentose Phosphate 
Pathway Is Coordinated with Glycolysis 
Only the nonoxidative branch of the pathway is significantly active 
when much more ribose 5-phosphate than NADPH needs to be synthesized. 
Under these conditions, fructose 6-phosphate and glyceraldehyde 
3-phosphate (formed by the glycolytic pathway) are converted 
into ribose 5-phosphate without the formation of NADPH. 
Alternatively, ribose 5-phosphate formed by the oxidative branch can 
be converted into pyruvate through fructose 6-phosphate and glyceraldehyde 
3-phosphate. In this mode, ATP and NADPH are generated, 
and five of the six carbons of glucose 6-phosphate emerge in pyruvate. 
The interplay of the glycolytic and pentose phosphate pathways 
enables the levels of NADPH, ATP, and building blocks such as 
ribose 5-phosphate and pyruvate to be continuously adjusted to meet 
cellular needs. 


20.5 Glucose 6-phosphate Dehydrogenase Plays a Key Role in Protection 
Against Reactive Oxygen Species 
NADPH generated by glucose 6-phosphate dehydrogenase maintains 
the appropriate levels of reduced glutathione required to combat oxidative 
stress and maintain the proper reducing environment in the cell. 
Cells with diminished glucose 6-phosphate dehydrogenase activity are 
especially sensitive to oxidative stress. 


Key Terms 


Calvin cycle (dark reactions) (p. 589) 
autotroph (p. 590) 
heterotroph (p. 590) 
rubisco (ribulose 1,5-bisphosphate carboxylase/oxygenase) (p. 591) 
peroxisome (microbody) (p. 593) 
photorespiration (p. 594) 
hexose monophosphate pool (p. 594) 
transketolase (p. 594) 
aldolase (p. 595) 
starch (p. 597) 
sucrose (p. 597) 
thioredoxin (p. 599) 
C₄ pathway (p. 600) 
C₄ plant (p. 600) 
C₃ plant (p. 600) 
crassulacean acid metabolism (CAM) (p. 601) 
pentose phosphate pathway (p. 601) 
glucose 6-phosphate dehydrogenase (p. 601) 
glutathione (p. 609) 


Problems 


1. A vital cycle. Why is the Calvin cycle crucial to the func- 
tioning of all life forms? 


2. Compare and contrast. Identify the similarities and dif- 
ferences between the Krebs cycle and the Calvin cycle. 


3. Labeling experiments. When Melvin Calvin performed 
his initial experiments on carbon fixation, he exposed 
algae to radioactive carbon dioxide. After 5 seconds, only 
a single organic compound contained radioactivity but, 
after 60 seconds, many compounds had incorporated 
radioactivity. (a) What compound initially contained the 
radioactivity? (b) What compounds contained radioactivity 
after 60 seconds? 


4. Three-part harmony. The Calvin cycle can be thought of 
as occurring in three parts or stages. Describe the stages. 


5. Not always to the swiftest. Suggest a reason why rubisco 
might be the most abundant enzyme in the world. 


6. A requirement. In an atmosphere devoid of CO₂ but rich 
in O₂, the oxygenase activity of rubisco disappears. Why? 


7. Reduce locally. Glyceraldehyde 3-phosphate dehydroge- 
nase in chloroplasts uses NADPH to participate in the 
synthesis of glucose. In gluconeogenesis in the cytoplasm, 
the isozyme of the dehydrogenase uses NADH. Why is it 
advantageous for the chloroplast enzyme to use NADPH? 


8. Total eclipse. An illuminated suspension of Chlorella is 
actively carrying out photosynthesis. Suppose that the light 
is suddenly switched off. How would the levels of 3-phos- 
phoglycerate and ribulose 1,5-bisphosphate change in the 
next minute? 


9. CO₂ deprivation. An illuminated suspension of Chlorella 
is actively carrying out photosynthesis in the presence of 1% 
CO₂. The concentration of CO₂ is abruptly reduced to 
0.003%. What effect would this reduction have on the levels 
of 3-phosphoglycerate and ribulose 1,5-bisphosphate in the 
next minute? 


10. Salvage operation. Write a balanced equation for the 
transamination of glyoxylate to yield glycine. 


11. Dog days of August. Before the days of pampered lawns, 
most homeowners practiced horticultural Darwinism. A 
result was that the lush lawns of early summer would often 
convert into robust cultures of crabgrass in the dog days of 
August. Provide a possible biochemical explanation for this 
transition. 


12. Is it hot in here, or is it just me? Why is the C₄ pathway 
valuable for tropical plants? 


13. No free lunch. Explain why maintaining a high concentration 
of CO₂ in the bundle-sheath cells of C₄ plants is an 
example of active transport. How much ATP is required 
per CO₂ to maintain a high concentration of CO₂ in the 
bundle-sheath cells of C₄ plants? 


14. Breathing pictures? What is photorespiration, what is its 
cause, and why is it believed to be wasteful? 


15. Global warming. C3 plants are most common in higher 
latitudes and become less common at latitudes near the 
equator. The reverse is true of C4 plants. How might global 
warming affect this distribution? 


16. Communication. What are the light-dependent changes 
in the stroma that regulate the Calvin cycle? 


17. Linked in. Describe how the pentose phosphate 
pathway and glycolysis are linked by transaldolase and 
transketolase. 


18. Tracing glucose. Glucose labeled with ¹⁴C at C-6 is 
added to a solution containing the enzymes and cofactors of 
the oxidative phase of the pentose phosphate pathway. 
What is the fate of the radioactive label? 


19. Recurring decarboxylations. Which reaction in the citric 
acid cycle is most analogous to the oxidative decarboxyla- 
tion of 6-phosphogluconate to ribulose 5-phosphate? 
What kind of enzyme-bound intermediate is formed in 
both reactions? 


20. Synthetic stoichiometries. What is the stoichiometry 
of the synthesis of (a) ribose 5-phosphate from glucose 
6-phosphate without the concomitant generation of 
NADPH? (b) NADPH from glucose 6-phosphate without 
the concomitant formation of pentose sugars? 


21. Offal or awful? Liver and other organ meats contain 
large quantities of nucleic acids. In the course of digestion, 
RNA is hydrolyzed to ribose, among other chemicals. 
Explain how ribose can be used as a fuel. 


22. A required ATP. The metabolism of glucose 6-phos- 
phate into ribose 5-phosphate by the joint efforts of the 
pentose phosphate pathway and glycolysis can be summa- 
rized by the following equation. 


5 Glucose 6-phosphate + ATP → 
        6 ribose 5-phosphate + ADP 

Which reaction requires the ATP? 


23. No respiration. Glucose is normally completely oxidized 
to CO₂ in the mitochondria. In what circumstance can 
glucose be completely oxidized to CO₂ in the cytoplasm? 


24. Watch your diet, doctor. The noted psychiatrist 
Hannibal Lecter once remarked to FBI Agent Clarice 
Starling that he enjoyed liver with some fava beans and a 
nice Chianti. Why might this diet be dangerous for some 
people? 


25. No redundancy. Why do deficiencies in glucose 
6-phosphate dehydrogenase frequently present as anemia? 


26. Damage control. What is the role of glutathione in pro- 
tection against damage by reactive oxygen species? Why is 
the pentose phosphate pathway crucial to this protection? 


27. Reductive power. What ratio of NADPH to NADP⁺ is 
required to sustain [GSH] = 10 mM and [GSSG] = 1 mM? 
Use the redox potentials given in Table 18.1. 


Mechanism Problems 


28. An alternative approach. The mechanisms of some 
aldolases do not include Schiff-base intermediates. Instead, 
these enzymes require bound metal ions. Propose such a 
mechanism for the conversion of dihydroxyacetone phos- 
phate and glyceraldehyde 3-phosphate into fructose 
1,6-bisphosphate. 


29. A recurring intermediate. Phosphopentose isomerase 
interconverts the aldose ribose 5-phosphate and the ketose 
ribulose 5-phosphate. Propose a mechanism. 


Chapter Integration Problems 


30. Catching carbons. Radioactive-labeling experiments 
can yield estimates of how much glucose 6-phosphate is 
metabolized by the pentose phosphate pathway and how 
much is metabolized by the combined action of glycolysis 
and the citric acid cycle. Suppose that you have samples of 
two different tissues as well as two radioactively labeled 
glucose samples, one with glucose labeled with ¹⁴C at C-1
and the other with glucose labeled with ¹⁴C at C-6. Design
an experiment that would enable you to determine the rela- 
tive activity of the aerobic metabolism of glucose compared 
with metabolism by the pentose phosphate pathway. 


31. Photosynthetic efficiency. Use the following information 
to estimate the efficiency of photosynthesis. 


The ΔG°′ for the reduction of CO2 to the level of hexose is
+477 kJ mol⁻¹ (+114 kcal mol⁻¹).

A mole of 600-nm photons has an energy content of 199 kJ 
(47.6 kcal). 


Assume that the proton gradient generated in producing 
the required NADPH is sufficient to drive the synthesis of 
the required ATP. 


32. A violation of the First Law? The complete combustion 
of glucose to CO2 and H2O yields 30 ATP, as shown in 
Table 18.4. However, the synthesis of glucose requires only 
18 ATP. How is it possible that glucose synthesis from CO2
and H2O requires only 18 ATP but combustion to CO2 and
H2O yields 30 ATP? Is it a violation of the First Law of
Thermodynamics or perhaps a miracle? 


Data Interpretation Problem 


33. Deciding between 3 and 4. Graph A shows the photo-
synthetic activity of two species of plant, one a C4 plant and
the other a C3 plant, as a function of leaf temperature.

(a) Which data were most likely generated by the C4 plant
and which by the C3 plant? Explain.


(b) Suggest some possible explanations for why the photo- 
synthetic activity falls at higher temperatures. 


Graph B illustrates how the photosynthetic activity of C3
and C4 plants varies with CO2 concentration when tem-
perature (30°C) and light intensity (high) are constant.

(c) Why can C4 plants thrive at CO2 concentrations that do
not support the growth of C3 plants?


(d) Suggest a plausible explanation why C3 plants continue
to increase photosynthetic activity at higher CO2 concen-
trations, whereas C4 plants reach a plateau.


Glycogen Metabolism 


Glucose is an important fuel and, as we will see, a key precursor for the
biosynthesis of many molecules. However, glucose cannot be stored,
because high concentrations of glucose disrupt the osmotic balance of the 
cell, which would cause cell damage or death. How can adequate stores of 
glucose be maintained without damaging the cell? The solution to this prob- 
lem is to store glucose as a nonosmotically active polymer called glycogen. 

Glycogen is a readily mobilized storage form of glucose. It is a very large, 
branched polymer of glucose residues that can be broken down to yield 
glucose molecules when energy is needed (Figure 21.1). A glycogen mole- 
cule has approximately 12 layers of glucose molecules and can be as large 
as 40 nm. Most of the glucose residues in glycogen are linked by α-1,4-
glycosidic bonds (Figure 21.2). Branches at about every tenth residue are
created by α-1,6-glycosidic bonds. Recall that α-glycosidic linkages form
open helical polymers, whereas β linkages produce nearly straight strands
that form structural fibrils, as in cellulose (see Figure 11.14). 

Glycogen is not as reduced as fatty acids are and consequently not as 
energy rich. Why isn’t all excess fuel stored as fatty acids rather than as 
glycogen? The controlled release of glucose from glycogen maintains blood-
glucose levels between meals. The circulating blood keeps the brain supplied
with glucose, which is virtually the only fuel used by the brain, except dur-
ing prolonged starvation. Moreover, the readily mobilized glucose from
glycogen is a good source of energy for sudden, strenuous activity. Unlike
fatty acids, the released glucose can provide energy in the absence of oxygen
and can thus supply energy for anaerobic activity.

Glucose for energy. Signaling cascades lead to the mobilization of glycogen
to produce glucose, an energy source for runners. [(Left)
Steve Krull/Alamy.]

Figure 21.1 Glycogen. At the core of the
glycogen molecule is the protein glycogenin
(p. 628). The nonreducing ends of the
glycogen molecule form the surface of the
glycogen granule. Degradation takes place
at this surface. [After R. Melendez et al.
Biophys. J. 77:1327-1332, 1999]

Figure 21.2 Glycogen structure. In this structure of two outer branches of a glycogen molecule,
the residues at the nonreducing ends are shown in red and the residue that starts a branch is
shown in green. The rest of the glycogen molecule is represented by R.

Figure 21.3 Electron micrograph of a
liver cell. The dense particles in the
cytoplasm are glycogen granules. [Courtesy of
Dr. George Palade.]

Although most tissues have some glycogen, the two major sites of glyco- 
gen storage are the liver and skeletal muscle. The concentration of glycogen 
is higher in the liver than in muscle (10% versus 2% by weight), but more 
glycogen is stored in skeletal muscle overall because of muscle’s much 
greater mass. Glycogen is present in the cytoplasm, with the molecule 
appearing as granules (Figure 21.3). In the liver, glycogen synthesis and 
degradation are regulated to maintain blood-glucose levels as required to 
meet the needs of the organism as a whole. In contrast, in muscle, these 
processes are regulated to meet the energy needs of the muscle itself. 


Glycogen metabolism is the regulated release and storage of glucose 


Glycogen degradation and synthesis are simple biochemical processes. 
Glycogen degradation consists of three steps: (1) the release of glucose 
1-phosphate from glycogen, (2) the remodeling of the glycogen substrate to 
permit further degradation, and (3) the conversion of glucose 1-phosphate 
into glucose 6-phosphate for further metabolism. The glucose 6-phosphate
derived from the breakdown of glycogen has three possible fates
(Figure 21.4): (1) it is the initial substrate for glycolysis, (2) it can be converted
into free glucose for release into the bloodstream, and (3) it can be processed 
by the pentose phosphate pathway to yield NADPH and ribose derivatives. 
The conversion of glycogen into free glucose takes place mainly in the liver. 

Glycogen synthesis, which takes place when glucose is abundant, 
requires an activated form of glucose, uridine diphosphate glucose (UDP- 
glucose), formed by the reaction of UTP and glucose 1-phosphate. As is the 
case for glycogen degradation, the glycogen molecule must be remodeled for 
continued synthesis. 

The regulation of glycogen degradation and synthesis is complex. 
Several enzymes taking part in glycogen metabolism allosterically respond 
to metabolites that signal the energy needs of the cell. Through these allosteric 
responses, enzyme activity is adjusted to meet the needs of the cell. In addition, 
hormones may initiate signal cascades that lead to the reversible phospho- 
rylation of enzymes, which alters their catalytic rates. Regulation by hor- 
mones adjusts glycogen metabolism to meet the needs of the entire organism. 


21.1 Glycogen Breakdown Requires the Interplay 
of Several Enzymes 


The efficient breakdown of glycogen to provide glucose 6-phosphate for 
further metabolism requires four enzyme activities: one to degrade glyco- 
gen, two to remodel glycogen so that it can be a substrate for degradation, 
and one to convert the product of glycogen breakdown into a form suitable 
for further metabolism. We will examine each of these activities in turn. 


Phosphorylase catalyzes the phosphorolytic cleavage of glycogen to 
release glucose 1-phosphate 


Glycogen phosphorylase, the key enzyme in glycogen breakdown, cleaves 
its substrate by the addition of orthophosphate (Pi) to yield glucose
1-phosphate. The cleavage of a bond by the addition of orthophosphate is 
referred to as phosphorolysis. 


Glycogen (n residues) + Pi ⇌ glucose 1-phosphate + glycogen (n − 1 residues)


Phosphorylase catalyzes the sequential removal of glucosyl residues from 
the nonreducing ends of the glycogen molecule (the ends with a free OH 
group on carbon 4). Orthophosphate splits the glycosidic linkage between 
C-1 of the terminal residue and C-4 of the adjacent one. Specifically, it 
cleaves the bond between the C-1 carbon atom and the glycosidic oxygen 
atom, and the α configuration at C-1 is retained.


[Reaction scheme: glycogen (n residues) + HPO₄²⁻ → glucose 1-phosphate + glycogen (n − 1 residues)]


Glucose 1-phosphate released from glycogen can be readily converted into
glucose 6-phosphate, an important metabolic intermediate, by the enzyme
phosphoglucomutase.

Figure 21.5 Structure of glycogen
phosphorylase. This enzyme forms a
homodimer: one subunit is shown in white
and the other in yellow. Each catalytic site
includes a pyridoxal phosphate (PLP) group,
linked to lysine 680 of the enzyme.
The binding site for the phosphate (Pi)
substrate is shown. Notice that the catalytic
site lies between the C-terminal domain and
the glycogen-binding site. A narrow crevice,
which binds four or five glucose units of
glycogen, connects the two sites. The
separation of the sites allows the catalytic site
to phosphorolyze several glucose units before
the enzyme must rebind the glycogen
substrate. [Drawn from 1NOI.pdb.]

The reaction catalyzed by phosphorylase is readily reversible in vitro. At
pH 6.8, the equilibrium ratio of orthophosphate to glucose 1-phosphate is
3.6. The value of ΔG°′ for this reaction is small because a glycosidic bond is
replaced by a phosphoryl ester bond that has a nearly equal transfer poten-
tial. However, phosphorolysis proceeds far in the direction of glycogen
breakdown in vivo because the [Pi]/[glucose 1-phosphate] ratio is usually
greater than 100, substantially favoring phosphorolysis. We see here an 
example of how the cell can alter the free-energy change to favor a reaction’s 
occurrence by altering the ratio of substrate and product. 
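
The effect of the concentration ratio can be checked with a short calculation. The following is a minimal sketch, not part of the original text: it assumes a temperature of 25°C (the text states only the pH), treats the glycogen terms as canceling, and uses hypothetical function names. It converts the equilibrium ratio of 3.6 into ΔG°′ and then evaluates ΔG at a [Pi]/[glucose 1-phosphate] ratio of 100.

```python
import math

R_KJ = 8.314e-3   # gas constant in kJ mol^-1 K^-1
T_K = 298.15      # ASSUMPTION: 25 degrees C; the text gives only the pH

def standard_free_energy(pi_over_g1p_at_equilibrium):
    """Delta G standard (kJ/mol) for glycogen(n) + Pi -> G1P + glycogen(n-1)."""
    # K'eq = [G1P]/[Pi] at equilibrium; glycogen(n) and glycogen(n-1)
    # are treated as canceling (a simplifying assumption).
    k_eq = 1.0 / pi_over_g1p_at_equilibrium
    return -R_KJ * T_K * math.log(k_eq)

def actual_free_energy(dg_standard, pi_over_g1p):
    """Delta G (kJ/mol) at a given [Pi]/[G1P] ratio."""
    mass_action_ratio = 1.0 / pi_over_g1p   # [G1P]/[Pi]
    return dg_standard + R_KJ * T_K * math.log(mass_action_ratio)

dg_standard = standard_free_energy(3.6)          # ratio quoted in the text
dg_in_cell = actual_free_energy(dg_standard, 100.0)

print(f"Standard free energy: {dg_standard:+.1f} kJ/mol")
print(f"Free energy at [Pi]/[G1P] = 100: {dg_in_cell:+.1f} kJ/mol")
```

Under these assumptions the standard free-energy change is slightly positive (about +3 kJ mol⁻¹), whereas the value at the cellular ratio is clearly negative (about −8 kJ mol⁻¹), which is the shift toward phosphorolysis that the paragraph describes.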

The phosphorolytic cleavage of glycogen is energetically advantageous 
because the released sugar is already phosphorylated. In contrast, a hydrolytic 
cleavage would yield glucose, which would then have to be phosphorylated 
at the expense of a molecule of ATP to enter the glycolytic pathway. An 
additional advantage of phosphorolytic cleavage for muscle cells is that 
no transporters exist for glucose 1-phosphate, which is negatively 
charged under physiological conditions, and so it cannot be transported out 
of the cell. 


Mechanism: Pyridoxal phosphate participates in the 
phosphorolytic cleavage of glycogen 


The special challenge faced by phosphorylase is to cleave glycogen phos- 
phorolytically rather than hydrolytically to save the ATP required to phos- 
phorylate free glucose. Thus, water must be excluded from the active site. 
Phosphorylase is a dimer of two identical 97-kd subunits. Each subunit is 
compactly folded into an amino-terminal domain (480 residues) containing a 
glycogen-binding site and a carboxyl-terminal domain (360 residues; Figure 21.5). 
The catalytic site in each subunit is located in a deep crevice formed by 
residues from both domains. What is the mechanistic basis of the phospho-
rolytic cleavage of glycogen?

Several clues suggest a mechanism by which phosphorylase achieves the
exclusion of water. First, both the glycogen substrate and the glucose
1-phosphate product have an α configuration at C-1. A direct attack by
phosphate on C-1 of a sugar would invert the configuration at this carbon
atom because the reaction would proceed through a pentacovalent transi-
tion state. The fact that the glucose 1-phosphate formed has an α rather
than a β configuration suggests that an even number of steps (most simply,
two) is required. The most likely explanation for these results is that a 
carbonium ion intermediate is formed from the glucose residue. 

A second clue to the catalytic mechanism of phosphorylase is its require-
ment for the coenzyme pyridoxal phosphate (PLP), a derivative of pyridox-
ine (vitamin B6, Section 15.4). The aldehyde group of this coenzyme forms
a Schiff-base linkage with a specific lysine side chain of the enzyme
(Figure 21.6). Structural studies indicate that the reacting orthophosphate
group takes a position between the 5′-phosphate group of PLP and the
glycogen substrate (Figure 21.7). The 5′-phosphate group of PLP acts in
tandem with orthophosphate by serving as a proton donor and then as a pro-
ton acceptor (i.e., as a general acid-base catalyst). Orthophosphate (in the
HPO₄²⁻ form) donates a proton to the oxygen atom attached to carbon 4 of
the departing glycogen chain and simultaneously acquires a proton from
PLP. The carbocation (carbonium ion) intermediate formed in this step is
then attacked by orthophosphate to form α-glucose 1-phosphate, with the
concomitant return of a hydrogen atom to pyridoxal phosphate. The special
role of pyridoxal phosphate in the reaction is necessary because water is
excluded from the active site.

The glycogen-binding site is 30 Å away from the catalytic site (see
Figure 21.5), but it is connected to the catalytic site by a narrow crevice able 
to accommodate four or five glucose units. The large separation between 
the binding site and the catalytic site enables the enzyme to phosphorolyze 
many residues without having to dissociate and reassociate after each cata- 
lytic cycle. An enzyme that can catalyze many reactions without having to 
dissociate and reassociate after each catalytic step is said to be processive—a 
property of enzymes that synthesize and degrade large polymers. We will 
see such enzymes again when we consider DNA and RNA synthesis. 


A debranching enzyme also is needed for the breakdown of glycogen 


Glycogen phosphorylase acting alone degrades glycogen to a limited extent.
The enzyme can break α-1,4-glycosidic bonds on glycogen branches but
soon encounters an obstacle. The α-1,6-glycosidic bonds at the branch
points are not susceptible to cleavage by phosphorylase. Indeed, phosphor-
ylase stops cleaving α-1,4 linkages when it reaches a terminal residue four
residues away from a branch point. Because about 1 in 10 residues is
branched, cleavage by the phosphorylase alone would come to a halt after
the release of six glucose molecules per branch.

Figure 21.6 PLP-Schiff-base linkage. A
pyridoxal phosphate (PLP) group (red) forms
a Schiff base with a lysine residue (blue) at
the active site of phosphorylase.

Figure 21.7 Phosphorylase mechanism. A
bound HPO₄²⁻ group (red) favors the
cleavage of the glycosidic bond by donating a
proton to the departing glucose (black). This
reaction results in the formation of a
carbocation and is favored by the transfer of a
proton from the protonated phosphate group
of the bound pyridoxal phosphate (PLP)
group (blue). The carbocation and the
orthophosphate combine to form glucose
1-phosphate.

How can the remainder of the glycogen molecule be mobilized for use 
as a fuel? Two additional enzymes, a transferase and α-1,6-glucosidase,
remodel the glycogen for continued degradation by the phosphorylase
(Figure 21.8). The transferase shifts a block of three glucosyl residues from one
outer branch to another. This transfer exposes a single glucose residue joined
by an α-1,6-glycosidic linkage. α-1,6-Glucosidase, also known as the
debranching enzyme, hydrolyzes the α-1,6-glycosidic bond.

Figure 21.8 Glycogen remodeling. First,
α-1,4-glycosidic bonds on each branch are
cleaved by phosphorylase, leaving four
residues along each branch. The transferase
shifts a block of three glucosyl residues from
one outer branch to the other. In this reaction,
the α-1,4-glycosidic link between the blue and
the green residues is broken and a new α-1,4
link between the blue and the yellow residues
is formed. The green residue is then removed
by α-1,6-glucosidase, leaving a linear chain
with all α-1,4 linkages, suitable for further
cleavage by phosphorylase.


A free glucose molecule is released and then phosphory-
lated by the glycolytic enzyme hexokinase. Thus, the
transferase and α-1,6-glucosidase convert the branched
structure into a linear one, which paves the way for fur-
ther cleavage by phosphorylase. In eukaryotes, the trans-
ferase and the α-1,6-glucosidase activities are present in a
single 160-kd polypeptide chain, providing yet another 
example of a bifunctional enzyme (see Figure 16.29). 
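
The bookkeeping of the three activities can be sketched in a few lines. This is an illustrative tally only, assuming an idealized outer branch of at least four residues; the function name and the returned field names are hypothetical and not part of the original text.

```python
def degrade_outer_branch(residues_on_branch):
    """Tally the products from one idealized outer branch of glycogen.

    ASSUMPTION: the branch carries at least four residues, counted from
    the branch point outward. Returns a dict with hypothetical field names.
    """
    stall_distance = 4  # phosphorylase stops four residues from a branch point
    if residues_on_branch < stall_distance:
        raise ValueError("this sketch assumes a branch of at least 4 residues")

    # Step 1: phosphorylase releases glucose 1-phosphate until it stalls.
    g1p_before_remodeling = residues_on_branch - stall_distance

    # Step 2: the transferase moves a block of three residues to another chain.
    residues_transferred = 3

    # Step 3: alpha-1,6-glucosidase hydrolyzes the single remaining residue,
    # releasing it as free glucose (not as glucose 1-phosphate).
    free_glucose = stall_distance - residues_transferred

    return {
        "g1p_before_remodeling": g1p_before_remodeling,
        "residues_transferred": residues_transferred,
        "free_glucose": free_glucose,
    }

# About 1 residue in 10 is a branch point, so take a 10-residue branch.
print(degrade_outer_branch(10))
```

For a 10-residue branch the tally gives six residues released before phosphorylase stalls, three residues moved to another chain (where phosphorylase can later reach them), and one residue released as free glucose.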


Phosphoglucomutase converts glucose 1-phosphate 
into glucose 6-phosphate 


Glucose 1-phosphate formed in the phosphorolytic cleav- 
age of glycogen must be converted into glucose 6-phos- 
phate to enter the metabolic mainstream. This shift of a 
phosphoryl group is catalyzed by phosphoglucomutase. 
Recall that this enzyme is also used in galactose metabo- 
lism (Section 16.1). To effect this shift, the enzyme exchanges 
a phosphoryl group with the substrate (Figure 21.9). The 
catalytic site of an active mutase molecule contains a 
phosphorylated serine residue. The phosphoryl group is 
transferred from the serine residue to the C-6 hydroxyl
group of glucose 1-phosphate to form glucose 1,6-bisphosphate. The C-1
phosphoryl group of this intermediate is then shuttled to the same serine 
residue, resulting in the formation of glucose 6-phosphate and the regenera- 
tion of the phosphoenzyme. 

These reactions are like those of phosphoglycerate mutase, a glycolytic 
enzyme (Section 16.1). The role of glucose 1,6-bisphosphate in the inter- 
conversion of the phosphoglucoses is like that of 2,3-bisphosphoglycerate 
(2,3-BPG) in the interconversion of 2-phosphoglycerate and 3-phospho-
glycerate in glycolysis. A phosphoenzyme intermediate participates in both
reactions.


The liver contains glucose 6-phosphatase, a hydrolytic
enzyme absent from muscle


A major function of the liver is to maintain a nearly constant level of glucose in 
the blood. The liver releases glucose into the blood during muscular activity 
and between meals. The released glucose is taken up primarily by the brain 
and skeletal muscle. In contrast with unmodified glucose, however, the 
phosphorylated glucose produced by glycogen breakdown is not transported 
out of cells. The liver contains a hydrolytic enzyme, glucose 6-phosphatase,
that enables glucose to leave that organ. This enzyme cleaves the phosphoryl
group to form free glucose and orthophosphate. This glucose 6-phosphatase 
is the same enzyme that releases free glucose at the conclusion of gluconeo- 
genesis. It is located on the lumenal side of the smooth endoplasmic 
reticulum membrane. Recall that glucose 6-phosphate is transported into 
the endoplasmic reticulum; glucose and orthophosphate formed by hydro- 
lysis are then shuttled back into the cytoplasm (Section 16.1). 


Glucose 6-phosphate + H2O → glucose + Pi


Glucose 6-phosphatase is absent from most other tissues. Muscle tissues 
retain glucose 6-phosphate for the generation of ATP. In contrast, glucose 
is not a major fuel for the liver. 


21.2 Phosphorylase Is Regulated by Allosteric Interactions 
and Reversible Phosphorylation 


Glycogen metabolism is precisely controlled by multiple interlocking 
mechanisms. The focus of this control is the enzyme glycogen phosphory- 
lase. Phosphorylase is regulated by several allosteric effectors that signal the 
energy state of the cell as well as by reversible phosphorylation, which is respon- 
sive to hormones such as insulin, epinephrine, and glucagon. We will examine 
the differences in the control of glycogen metabolism in two tissues: skeletal 
muscle and liver. These differences are due to the fact that the muscle uses 
glucose to produce energy for itself, whereas the liver maintains glucose homeo- 
stasis of the organism as a whole. 


Muscle phosphorylase is regulated by the intracellular energy charge 


The dimeric skeletal-muscle phosphorylase exists in two interconvertible
forms: a usually active phosphorylase a and a usually inactive phosphorylase b
(Figure 21.10). Each of these two forms exists in equilibrium between an
active relaxed (R) state and a much less active tense (T) state, but the equi-
librium for phosphorylase a favors the active R state, whereas the equilib-
rium for phosphorylase b favors the less-active T state (Figure 21.11). The
default state of muscle phosphorylase is the b form, because muscle
phosphorylase needs to be active mainly during muscle
contraction. Muscle phosphorylase b is activated by the presence of
high concentrations of AMP, which binds to a nucleotide-binding
site and stabilizes the conformation of phosphorylase b in the active
R state (Figure 21.12). Thus, when a muscle contracts and ATP is
converted into AMP, the phosphorylase is signaled to degrade glyco-
gen. ATP acts as a negative allosteric effector by competing with
AMP. Thus, the transition of phosphorylase b between the active R
state and the less-active T state is controlled by the energy charge of the
muscle cell. Glucose 6-phosphate also binds to and stabilizes the less-
active state of phosphorylase b, an example of feedback inhibition. Under
most physiological conditions, phosphorylase b is inactive because of the inhibi-
tory effects of ATP and glucose 6-phosphate. In contrast, phosphorylase a is
fully active, regardless of the levels of AMP, ATP, and glucose 6-phosphate.
In resting muscle, nearly all the enzyme is in the inactive b form.

Figure 21.9 Reaction catalyzed by
phosphoglucomutase. A phosphoryl group
is transferred from the enzyme to the
substrate, and a different phosphoryl group is
transferred back to restore the enzyme to its
initial state.

Figure 21.10 Structures of phosphorylase a and phosphorylase b. Phosphorylase a
is phosphorylated on serine 14 of each subunit. This modification favors the structure of the
more-active R state. One subunit is shown in white, with helices and loops important for regulation
shown in blue and red. The other subunit is shown in yellow, with the regulatory structures shown
in orange and green. Phosphorylase b is not phosphorylated and exists predominantly in the T state.
Notice that the catalytic sites are partly occluded in the T state. [Drawn from 1GPA.pdb and
1NOJ.pdb.]

Figure 21.11 Phosphorylase regulation. Both
phosphorylase b and phosphorylase a exist as equilibria
between an active R state and a less-active T state.
Phosphorylase b is usually inactive because the
equilibrium favors the T state. Phosphorylase a is usually
active because the equilibrium favors the R state.
Regulatory structures are shown in blue and green.

Figure 21.12 Allosteric regulation of muscle phosphorylase. A low energy charge,
represented by high concentrations of AMP, favors the transition to the R state.

Phosphorylase b is converted into phosphorylase a by the phosphoryla- 
tion of a single serine residue (serine 14) in each subunit. This conversion is 
initiated by hormones. Fear or the excitement of exercise will cause levels of 
the hormone epinephrine to increase. The increase in hormone levels and 
the electrical stimulation of muscle result in phosphorylation of the enzyme 
to the phosphorylase a form. The regulatory enzyme phosphorylase kinase 
catalyzes this covalent modification. 

Comparison of the structures of phosphorylase a in the R state and 
phosphorylase b in the T state reveals that subtle structural changes at 
the subunit interfaces are transmitted to the active sites (see Figure 21.10). 
The transition from the T state (the prevalent state of phosphorylase b) to the 
R state (the prevalent state of phosphorylase a) entails a 10-degree rotation 
around the twofold axis of the dimer. Most importantly, this transition is 
associated with structural changes in α helices that move a loop out of the
active site of each subunit. Thus, the T state is less active because the cata- 
lytic site is partly blocked. In the R state, the catalytic site is more accessible 
and a binding site for orthophosphate is well organized. 
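
One way to put numbers on the R/T equilibria described in this section is a simple two-state (concerted) allosteric model. The sketch below is illustrative only: the model is a standard textbook simplification rather than a description taken from this chapter, and every constant in it (the T/R ratios, the AMP dissociation constants, the concentrations) is a hypothetical value chosen to show the direction of the effect, not a measured property of phosphorylase.

```python
def fraction_in_r_state(t_over_r, activator, k_r, k_t, n_sites=2):
    """Fraction of enzyme in the R state in a two-state concerted model.

    t_over_r  : [T]/[R] with no activator bound
    activator : activator concentration (same units as k_r and k_t)
    k_r, k_t  : dissociation constants of the activator for R and T
    n_sites   : activator sites per enzyme (2 for a dimer)
    All numerical inputs used below are HYPOTHETICAL.
    """
    # Preferential binding to R (k_r < k_t) pulls the equilibrium toward R.
    shift = ((1 + activator / k_t) / (1 + activator / k_r)) ** n_sites
    return 1.0 / (1.0 + t_over_r * shift)

# Hypothetical parameters: phosphorylase b strongly favors T,
# phosphorylase a favors R; AMP binds R 100-fold more tightly than T.
b_resting = fraction_in_r_state(t_over_r=1000.0, activator=0.0, k_r=0.01, k_t=1.0)
b_with_amp = fraction_in_r_state(t_over_r=1000.0, activator=1.0, k_r=0.01, k_t=1.0)
a_form = fraction_in_r_state(t_over_r=0.1, activator=0.0, k_r=0.01, k_t=1.0)

print(f"b form, no AMP:   {b_resting:.3f}")
print(f"b form, high AMP: {b_with_amp:.3f}")
print(f"a form, no AMP:   {a_form:.3f}")
```

With these invented constants, the b form is almost entirely in the T state at rest, shifts largely to the R state when AMP is high, and the a form is mostly in the R state without any effector, which mirrors the qualitative behavior described above.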


Liver phosphorylase produces glucose for use by other tissues 


The role of glycogen degradation in the liver is to form glucose for export to 
other tissues when the blood-glucose level is low. Consequently, we can 
think of the default state of liver phosphorylase as being the a form: glucose 
is to be generated unless the enzyme is signaled otherwise. The liver phos-
phorylase a form thus exhibits the most responsive R ⇌ T transition
(Figure 21.13). The binding of glucose shifts the a form from the active R 
state to the less-active T state. In essence, the enzyme reverts to the low- 
activity T state only when it detects the presence of sufficient glucose. If 
glucose is present in the diet, there is no need to degrade 
glycogen. As we will see later, the presence of glucose also 
facilitates the a-to-b transition. 

The regulation of liver phosphorylase differs from that 
of muscle phosphorylase. In muscle, the default state is the 
b form: there is no need to generate glucose unless energy 
is required. As discussed previously, AMP shifts the 
muscle b form from the T to the R state. Unlike the enzyme 
in muscle, the liver phosphorylase is insensitive to regula- 
tion by AMP because the liver does not undergo the dra- 
matic changes in energy charge seen in a contracting 
muscle. We see here a clear example of the use of isozymic 
forms of the same enzyme to establish the tissue-specific 
biochemical properties of muscle and the liver. In human
beings, liver phosphorylase and muscle phosphorylase are
approximately 90% identical in amino acid sequence, yet
the 10% difference results in subtle but important shifts in
the stability of various forms of the enzyme.

Figure 21.13 Allosteric regulation of liver phosphorylase. The
binding of glucose to phosphorylase a shifts the equilibrium to the T
state and inactivates the enzyme. Thus, glycogen is not mobilized
when glucose is already abundant.


Phosphorylase kinase is activated by phosphorylation and calcium ions 


Phosphorylase kinase activates phosphorylase b by attaching a phosphoryl
group. The subunit composition of phosphorylase kinase in skeletal muscle
is (αβγδ)4, and the mass of this very large protein is 1200 kd. The catalytic
activity resides in the γ subunit, whereas the other subunits serve regula-
tory functions. This kinase is under dual control: it is activated both by
phosphorylation by protein kinase A (PKA) and by increases in
Ca²⁺ levels (Figure 21.14). Like its own substrate, phosphorylase kinase is
activated by phosphorylation: the kinase is converted from a low-activity
form into a high-activity one by phosphorylation of its β subunit. The activa-
tion of phosphorylase kinase is one step in a signal-transduction cascade
initiated by hormones.

Phosphorylase kinase can also be partly activated by Ca²⁺ levels of the
order of 1 μM. Its δ subunit is calmodulin, a calcium sensor that stimulates
many enzymes in eukaryotes. This mode of activation of the kinase is espe-
cially noteworthy in muscle, where contraction is triggered by the release of
Ca²⁺ from the sarcoplasmic reticulum. Phosphorylase kinase attains maxi-
mal activity only after both phosphorylation of the β subunit and activation
of the δ subunit by Ca²⁺ binding.

Figure 21.14 Activation of phosphorylase
kinase. Phosphorylase kinase, an (αβγδ)4
assembly, is activated by hormones that
lead to the phosphorylation of the β subunit
and by Ca²⁺ binding to the δ subunit.
Both types of stimulation are required for
maximal enzyme activity. When active, the
enzyme converts phosphorylase b into
phosphorylase a.


21.3 Epinephrine and Glucagon Signal the Need 
for Glycogen Breakdown 


Protein kinase A activates phosphorylase kinase, which in turn activates 
glycogen phosphorylase. What activates protein kinase A? What is the 
signal that ultimately triggers an increase in glycogen breakdown? 


G proteins transmit the signal for the initiation of glycogen breakdown 


Several hormones greatly affect glycogen metabolism. Glucagon and epi- 
nephrine trigger the breakdown of glycogen. Muscular activity or its anti- 
cipation leads to the release of epinephrine (adrenaline), a catecholamine derived 
from tyrosine, from the adrenal medulla. Epinephrine markedly stimulates 
glycogen breakdown in muscle and, to a lesser extent, in the liver. The liver 
is more responsive to glucagon, a polypeptide hormone secreted by the 
α cells of the pancreas when the blood-sugar level is low. Physiologically,
glucagon signifies the starved state (Figure 21.15).

Figure 21.15 PATHWAY INTEGRATION:
Hormonal control of glycogen
breakdown. Glucagon stimulates liver-
glycogen breakdown when blood glucose is
low. Epinephrine enhances glycogen
breakdown in muscle and the liver to provide
fuel for muscle contraction.

Figure 21.16 Regulatory cascade for
glycogen breakdown. Glycogen degradation
is stimulated by hormone binding to 7TM
receptors. Hormone binding initiates a
G-protein-dependent signal-transduction
pathway that results in the phosphorylation
and activation of glycogen phosphorylase.

How do hormones trigger the breakdown of glycogen? They initiate a
cyclic AMP signal-transduction cascade, already discussed in Section 16.1
(Figure 21.16).


1. The signal molecules epinephrine and glucagon bind to specific seven-
transmembrane (7TM) receptors in the plasma membranes of target cells
(Section 14.1). Epinephrine binds to the β-adrenergic receptor in muscle,
whereas glucagon binds to the glucagon receptor in the liver. These binding
events activate the Gs protein. A specific external signal has been transmitted
into the cell through structural changes, first in the receptor and then in the
G protein.


2. The GTP-bound subunit of Gs activates the transmembrane protein
adenylate cyclase. This enzyme catalyzes the formation of the second mes-
senger cyclic AMP from ATP.


3. The elevated cytoplasmic level of cyclic AMP activates protein kinase A 
(Section 10.3). The binding of cyclic AMP to inhibitory regulatory subunits 
triggers their dissociation from the catalytic subunits. The free catalytic 
subunits are now active. 


4. Protein kinase A phosphorylates phosphorylase kinase, first on the β
subunit and then on the α subunit. The activated phosphorylase kinase
subsequently activates glycogen phosphorylase.


The cyclic AMP cascade highly amplifies the effects of hormones. The 
binding of a small number of hormone molecules to cell-surface receptors 
leads to the release of a very large number of sugar units. Indeed, much of 
the stored glycogen would be mobilized within seconds were it not for a 
counterregulatory system. 
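
The compounding nature of such a cascade can be illustrated with a short calculation. The sketch below is only an illustration: the stage names loosely follow the steps above, but the per-stage gains are hypothetical numbers chosen for round arithmetic, not values reported in this chapter.

```python
from math import prod

# Hypothetical per-stage gains: the assumed number of downstream molecules
# activated or produced per upstream molecule. Illustrative values only.
STAGE_GAINS = {
    "hormone-receptor -> active Gs": 10,
    "adenylate cyclase -> cyclic AMP": 100,
    "protein kinase A -> phosphorylase kinase": 10,
    "phosphorylase kinase -> phosphorylase a": 10,
    "phosphorylase a -> glucose 1-phosphate": 100,
}


def cascade_output(hormone_molecules: int, gains: dict = STAGE_GAINS) -> int:
    """Downstream output when every stage multiplies the signal by its gain."""
    return hormone_molecules * prod(gains.values())


if __name__ == "__main__":
    # One bound hormone molecule, amplified through all five assumed stages.
    print(cascade_output(1))
```

With these assumed gains, one bound hormone molecule corresponds to ten million released glucosyl units. The particular figure is arbitrary; the point is that modest gains at each stage multiply.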

The signal-transduction processes in the liver are more complex than 
those in muscle. Epinephrine can also elicit glycogen degradation in the 
liver. However, in addition to binding to the β-adrenergic receptor, it binds 
to the 7TM α-adrenergic receptor, which then initiates the phosphoinositide 
cascade (Section 14.2) that induces the release of Ca²⁺ from endoplasmic 
reticulum stores. Recall that the δ subunit of phosphorylase kinase is the 
Ca²⁺ sensor calmodulin. The binding of Ca²⁺ to calmodulin leads to a 
partial activation of phosphorylase kinase. Stimulation by both glucagon 
and epinephrine leads to maximal mobilization of liver glycogen. 


Glycogen breakdown must be rapidly turned off when necessary 


There must be a way to shut down the high-gain system of glycogen break- 
down quickly to prevent the wasteful depletion of glycogen after energy 
needs have been met. When glucose needs have been satisfied, phosphory- 
lase kinase and glycogen phosphorylase are dephosphorylated and inacti- 
vated. Simultaneously, glycogen synthesis is activated. 

The signal-transduction pathway leading to the activation of glycogen 
phosphorylase is shut down automatically when the initiating hormone is 
no longer present. The inherent GTPase activity of the G protein converts 
the bound GTP into inactive GDP, and phosphodiesterases always present 
in the cell convert cyclic AMP into AMP. Protein phosphatase 1 (PP1) 
removes the phosphoryl groups from phosphorylase kinase, thereby inacti- 
vating the enzyme. Finally, protein phosphatase 1 also removes the phos- 
phoryl group from glycogen phosphorylase, converting the enzyme into the 
usually inactive b form. 


The regulation of glycogen phosphorylase became more sophisticated 
as the enzyme evolved 


Analyses of the primary structures of glycogen phosphorylase from 
human beings, rats, Dictyostelium (slime mold), yeast, potatoes, and 
E. coli have enabled inferences to be made about the evolution of this 
important enzyme. The 16 residues that come into contact with glucose at 
the active site are identical in nearly all the enzymes. There is more varia- 
tion but still substantial conservation of the 15 residues at the pyridoxal 
phosphate-binding site. Likewise, the glycogen-binding site is well con- 
served in all the enzymes. The high degree of similarity among these three 
sites shows that the catalytic mechanism has been maintained throughout 
evolution. 

Differences arise, however, when we compare the regulatory sites. 
The simplest type of regulation would be feedback inhibition by glucose 
6-phosphate. Indeed, the glucose 6-phosphate regulatory site is highly con- 
served among most of the phosphorylases. The crucial amino acid residues 
that participate in regulation by phosphorylation and nucleotide binding are 
well conserved only in the mammalian enzymes. Thus, this level of regula- 
tion was a later evolutionary acquisition. 


21.4 Glycogen Is Synthesized and Degraded 
by Different Pathways 


As with glycolysis and gluconeogenesis, biosynthetic and degradative path- 
ways rarely operate by precisely the same reactions in the forward and 
reverse directions. Glycogen metabolism provided the first known example 
of this important principle. Separate pathways afford much greater flexibility, 
both in energetics and in control. 

In 1957, Luis Leloir and his coworkers showed that glycogen is synthe- 
sized by a pathway that utilizes uridine diphosphate glucose (UDP-glucose) 
rather than glucose 1-phosphate as the activated glucose donor. 


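$$
\begin{aligned}
\text{Synthesis:}\quad & \text{Glycogen}_{n} + \text{UDP-glucose} \longrightarrow \text{glycogen}_{n+1} + \text{UDP} \\
\text{Degradation:}\quad & \text{Glycogen}_{n+1} + \mathrm{P_i} \longrightarrow \text{glycogen}_{n} + \text{glucose 1-phosphate}
\end{aligned}
$$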


UDP-glucose is an activated form of glucose 


UDP-glucose, the glucose donor in the biosynthesis of glycogen, is an 
activated form of glucose, just as ATP and acetyl CoA are activated forms 
of orthophosphate and acetate, respectively. The C-1 carbon atom of the 
glucosyl unit of UDP-glucose is activated because its hydroxyl group is 
esterified to the diphosphate moiety of UDP. 

UDP-glucose is synthesized from glucose 1-phosphate and uridine tri- 
phosphate (UTP) in a reaction catalyzed by UDP-glucose pyrophosphory- 
lase. This reaction liberates the outer two phosphoryl residues of UTP as 
pyrophosphate. 

This reaction is readily reversible. However, pyrophosphate is rapidly 
hydrolyzed in vivo to orthophosphate by an inorganic pyrophosphatase. 
The essentially irreversible hydrolysis of pyrophosphate drives the synthesis 
of UDP-glucose. 


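$$
\begin{aligned}
\text{Glucose 1-phosphate} + \text{UTP} &\rightleftharpoons \text{UDP-glucose} + \mathrm{PP_i} \\
\mathrm{PP_i} + \mathrm{H_2O} &\longrightarrow 2\,\mathrm{P_i} \\[4pt]
\text{Glucose 1-phosphate} + \text{UTP} + \mathrm{H_2O} &\longrightarrow \text{UDP-glucose} + 2\,\mathrm{P_i}
\end{aligned}
$$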


The synthesis of UDP-glucose exemplifies another recurring theme in 
biochemistry: many biosynthetic reactions are driven by the hydrolysis of 
pyrophosphate. 
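
How strongly pyrophosphate hydrolysis pulls the coupled reaction forward can be estimated from standard free energies. The sketch below rests on two assumptions that are not stated in this section: the pyrophosphorylase step is taken to have a standard free-energy change of about zero (consistent with "readily reversible"), and pyrophosphate hydrolysis is taken to be about -19 kJ/mol, a commonly cited approximate value. The temperature of 25 °C is likewise an assumption.

```python
import math

R = 8.314   # gas constant, J mol^-1 K^-1
T = 298.15  # assumed temperature in kelvin (25 degrees C)


def equilibrium_constant(delta_g_kj_per_mol: float) -> float:
    """Equilibrium constant from a standard free-energy change: K = exp(-dG/RT)."""
    return math.exp(-delta_g_kj_per_mol * 1000.0 / (R * T))


# Assumed standard free-energy changes (kJ/mol); not taken from the text.
DG_PYROPHOSPHORYLASE = 0.0   # glucose 1-phosphate + UTP <=> UDP-glucose + PPi
DG_PPI_HYDROLYSIS = -19.0    # PPi + H2O -> 2 Pi

# Free energies of sequential reactions add, so their equilibrium constants multiply.
dg_overall = DG_PYROPHOSPHORYLASE + DG_PPI_HYDROLYSIS
k_overall = equilibrium_constant(dg_overall)

if __name__ == "__main__":
    print(f"overall dG = {dg_overall:.1f} kJ/mol, K = {k_overall:.0f}")
```

Under these assumptions the overall equilibrium constant is on the order of a thousand or more, which is why the coupled synthesis of UDP-glucose is effectively one-way in the cell.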


Glycogen synthase catalyzes the transfer of glucose from 
UDP-glucose to a growing chain 


New glucosyl units are added to the nonreducing terminal residues of 
glycogen. The activated glucosyl unit of UDP-glucose is transferred to the 
hydroxyl group at C-4 of a terminal residue to form an α-1,4-glycosidic 
linkage. UDP is displaced by the terminal hydroxyl group of the growing 
glycogen molecule. This reaction is catalyzed by glycogen synthase, the key 
regulatory enzyme in glycogen synthesis. 

Glycogen synthase can add glucosyl residues only to a polysaccharide 
chain already containing more than four residues. Thus, glycogen synthesis 
requires a primer. This priming function is carried out by glycogenin, a 
glycosyltransferase (see Figure 11.25) composed of two identical 37-kd 
subunits. Each subunit of glycogenin catalyzes the addition of eight glucosyl 
units to the other subunit. These glucosyl units form short α-1,4-glucose 
polymers, which are covalently attached to the phenolic hydroxyl group of 
a specific tyrosine residue in each glycogenin subunit. UDP-glucose is the 
donor in this autoglycosylation. At this point, glycogen synthase takes over 
to extend the glycogen molecule. Thus, every glycogen molecule has a 
glycogenin molecule at its core (see Figure 21.1). 


Despite no detectable sequence similarity, structural studies have 
revealed that glycogen synthase is homologous to glycogen phosphorylase. 
The binding site for UDP-glucose in glycogen synthase corresponds 
in position to the pyridoxal phosphate in glycogen phosphorylase. 


A branching enzyme forms α-1,6 linkages 


Glycogen synthase catalyzes only the synthesis of α-1,4 linkages. Another
enzyme is required to form the α-1,6 linkages that make glycogen a branched
polymer. Branching takes place after a number of glucosyl residues are joined
in α-1,4 linkages by glycogen synthase (Figure 21.17). A branch is created by
the breaking of an α-1,4 link and the formation of an α-1,6 link: this reaction
is different from debranching. A block of residues, typically 7 in number, is
transferred to a more interior site. The branching enzyme that catalyzes this
reaction requires that the block of 7 or so residues must include the
nonreducing terminus, and must come from a chain at least 11 residues long. In
addition, the new branch point must be at least 4 residues away from a
preexisting one.

Branching is important because it increases the solubility of glycogen.
Furthermore, branching creates a large number of terminal residues, the sites
of action of glycogen phosphorylase and synthase (Figure 21.18). Thus,
branching increases the rate of glycogen synthesis and degradation.


Figure 21.17 Branching reaction. The branching enzyme removes an
oligosaccharide of approximately seven residues from the nonreducing end and
creates an internal α-1,6 linkage. Synthase then extends both nonreducing
ends, followed by more branching.
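
The positional constraints on the branching enzyme described above can be written as a small check. This is a simplified sketch, not a model of the enzyme: chains are reduced to a length, branch points to integer positions along a chain, and the function names are hypothetical. The three thresholds are the ones quoted in the text.

```python
from typing import Sequence

BLOCK_SIZE = 7           # residues transferred in one branching event ("typically 7")
MIN_CHAIN_LENGTH = 11    # the donor chain must be at least this many residues long
MIN_BRANCH_SPACING = 4   # minimum distance from any preexisting branch point


def can_donate_block(chain_length: int) -> bool:
    """A chain can give up its nonreducing-end block only if it is long enough."""
    return chain_length >= MIN_CHAIN_LENGTH


def can_accept_branch(position: int, existing_branch_points: Sequence[int]) -> bool:
    """A new branch point must be far enough from every preexisting one."""
    return all(abs(position - b) >= MIN_BRANCH_SPACING for b in existing_branch_points)


def branching_allowed(chain_length: int, position: int,
                      existing_branch_points: Sequence[int]) -> bool:
    """Both the donor-chain rule and the spacing rule must hold."""
    return can_donate_block(chain_length) and can_accept_branch(
        position, existing_branch_points)
```

For example, a 10-residue chain cannot donate a block regardless of where the block would be placed, and a proposed branch point two residues from an existing one is rejected even when the donor chain is long enough.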


Glycogen branching requires a single transferase activity. Glycogen 
debranching requires two enzyme activities: a transferase and an α-1,6 
glucosidase. Sequence analysis suggests that the two transferases and, 
perhaps, the α-1,6 glucosidase are members of the same enzyme family, 
termed the α-amylase family. An enzyme of this family catalyzes a reaction 
by forming a covalent intermediate attached to a conserved aspartate residue. 
Thus, the branching enzyme appears to transfer a chain of glucose 
molecules from an α-1,4 linkage to an aspartate residue on the enzyme and 
then from this site to a more interior location on the glycogen molecule to 
form an α-1,6 linkage. 


Glycogen synthase is the key regulatory enzyme in glycogen synthesis 


The activity of glycogen synthase, like that of phosphorylase, is regulated 
by covalent modification. Glycogen synthase is phosphorylated at multiple 
sites by several protein kinases, notably protein kinase A and glycogen synthase 
kinase (GSK). The resulting alteration of the charges in the protein 
leads to its inactivation. Phosphorylation has opposite effects on the enzymatic 
activities of glycogen synthase and phosphorylase. Phosphorylation converts 
the active a form of the synthase into a usually inactive b form. The phos- 
phorylated b form is active only if a high level of the allosteric activator 
glucose 6-phosphate is present, whereas the a form is active whether or not 
glucose 6-phosphate is present. 
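
The two-state behavior of glycogen synthase described above reduces to a small truth table. The sketch below treats "a high level of glucose 6-phosphate" as a simple Boolean flag, which is an assumption made for illustration; real allosteric activation is graded rather than all-or-none, and the function name is hypothetical.

```python
def glycogen_synthase_active(phosphorylated: bool,
                             high_glucose_6_phosphate: bool) -> bool:
    """Activity of glycogen synthase under the simplified two-state description.

    Unphosphorylated enzyme (a form): active with or without glucose 6-phosphate.
    Phosphorylated enzyme (b form): active only when glucose 6-phosphate is high.
    """
    if not phosphorylated:
        return True
    return high_glucose_6_phosphate


if __name__ == "__main__":
    for phosphorylated in (False, True):
        for g6p in (False, True):
            form = "b" if phosphorylated else "a"
            print(form, g6p, glycogen_synthase_active(phosphorylated, g6p))
```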


Glycogen is an efficient storage form of glucose 


What is the cost of converting glucose 6-phosphate into glycogen and 
back into glucose 6-phosphate? The pertinent reactions have already been 
described, except for reaction 5, which is the regeneration of UTP. ATP 
phosphorylates UDP in a reaction catalyzed by nucleoside diphosphokinase. 


Figure 21.18 Cross section of a glycogen 
molecule. The component labeled G is 
glycogenin. 
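
$$
\begin{aligned}
\text{Glucose 6-phosphate} &\longrightarrow \text{glucose 1-phosphate} && (1) \\
\text{Glucose 1-phosphate} + \text{UTP} &\longrightarrow \text{UDP-glucose} + \mathrm{PP_i} && (2) \\
\mathrm{PP_i} + \mathrm{H_2O} &\longrightarrow 2\,\mathrm{P_i} && (3) \\
\text{UDP-glucose} + \text{glycogen}_{n} &\longrightarrow \text{glycogen}_{n+1} + \text{UDP} && (4) \\
\text{UDP} + \text{ATP} &\longrightarrow \text{UTP} + \text{ADP} && (5)
\end{aligned}
$$

$$
\text{Sum: Glucose 6-phosphate} + \text{ATP} + \text{glycogen}_{n} + \mathrm{H_2O} \longrightarrow \text{glycogen}_{n+1} + \text{ADP} + 2\,\mathrm{P_i}
$$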


Thus, 1 molecule of ATP is hydrolyzed to incorporate glucose 6-phosphate 
into glycogen. The energy yield from the breakdown of glycogen is highly 
efficient. About 90% of the residues are phosphorolytically cleaved to glu- 
cose 1-phosphate, which is converted at no cost into glucose 6-phosphate. 
The other 10% are branch residues, which are hydrolytically cleaved. One 
molecule of ATP is then used to phosphorylate each of these glucose mol- 
ecules to glucose 6-phosphate. The complete oxidation of glucose 6-phos- 
phate yields about 31 molecules of ATP, and storage consumes slightly 
more than 1 molecule of ATP per molecule of glucose 6-phosphate; so the 
overall efficiency of storage is nearly 97%. 
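
The arithmetic behind the efficiency figure can be made explicit. The sketch below uses only the numbers quoted in the paragraph above (about 31 ATP per glucose 6-phosphate, 1 ATP for synthesis, and 1 ATP to rephosphorylate each of the roughly 10% of residues released as free glucose); the variable names are illustrative.

```python
ATP_YIELD_PER_G6P = 31.0           # complete oxidation of glucose 6-phosphate
ATP_COST_SYNTHESIS = 1.0           # ATP used to regenerate UTP (reaction 5)
FRACTION_HYDROLYTIC = 0.10         # branch residues released as free glucose
ATP_COST_REPHOSPHORYLATION = 1.0   # ATP to convert each free glucose to glucose 6-phosphate

# Average ATP spent per glucose 6-phosphate stored and later recovered.
storage_cost = ATP_COST_SYNTHESIS + FRACTION_HYDROLYTIC * ATP_COST_REPHOSPHORYLATION

# Fraction of the oxidative ATP yield that survives the round trip through glycogen.
efficiency = (ATP_YIELD_PER_G6P - storage_cost) / ATP_YIELD_PER_G6P

if __name__ == "__main__":
    print(f"cost = {storage_cost:.1f} ATP, efficiency = {efficiency:.1%}")
```

The cost works out to 1.1 ATP per residue, giving an efficiency between 96% and 97%, in line with the "nearly 97%" stated above.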


21.5 Glycogen Breakdown and Synthesis Are 
Reciprocally Regulated 


An important control mechanism prevents glycogen from being synthesized
at the same time as it is being broken down. The same glucagon- and
epinephrine-triggered cAMP cascades that initiate glycogen breakdown in the
liver and muscle, respectively, also shut off glycogen synthesis. Glucagon and
epinephrine control both glycogen breakdown and glycogen synthesis through
protein kinase A (Figure 21.19). Recall that protein kinase A adds a
phosphoryl group to phosphorylase kinase, activating that enzyme and initiating
glycogen breakdown. Likewise, protein kinase A adds a phosphoryl group
to glycogen synthase, but this phosphorylation leads to a decrease in
enzymatic activity. Other kinases, such as glycogen synthase kinase, help to
inactivate the synthase. In this way, glycogen breakdown and synthesis are
reciprocally regulated. How is the enzymatic activity reversed so that
glycogen breakdown halts and glycogen synthesis begins?


Figure 21.19 Coordinate control of glycogen metabolism. Glycogen
metabolism is regulated, in part, by hormone-triggered cyclic AMP cascades.
The sequence of reactions leading to the activation of protein kinase A
ultimately activates glycogen degradation. At the same time, protein kinase
A also inactivates glycogen synthase, shutting down glycogen synthesis.


Protein phosphatase 1 reverses the regulatory effects of kinases 
on glycogen metabolism 


After a bout of exercise, muscle must shift from a glycogen-degrading mode 
to one of glycogen replenishment. A first step in this metabolic task is to 
shut down the phosphorylated proteins that stimulate glycogen breakdown. 
This task is accomplished by protein phosphatases that catalyze the hydroly- 
sis of phosphorylated serine and threonine residues in proteins. Protein 
phosphatase 1 plays key roles in regulating glycogen metabolism (Figure 21.20). 
PP1 inactivates phosphorylase a and phosphorylase kinase by dephospho- 
rylating them. PP1 decreases the rate of glycogen breakdown; it reverses the 
effects of the phosphorylation cascade. Moreover, PP1 also removes phos- 
phoryl groups from glycogen synthase b to convert it into the much more active 
glycogen synthase a form. Here, PP1 also accelerates glycogen synthesis. PP1 
is yet another molecular device for coordinating carbohydrate storage. 
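
The opposing effects of protein kinase A and PP1 can be summarized as a toy state model. This is a deliberately coarse sketch: each enzyme is reduced to a single phosphorylated/unphosphorylated flag, allosteric effectors and the regulation of PP1 itself are ignored, and the class and function names are hypothetical.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Phosphorylation:
    """Phosphorylation state of the three enzymes discussed in the text."""
    phosphorylase_kinase: bool
    phosphorylase: bool
    synthase: bool


def protein_kinase_a(state: Phosphorylation) -> Phosphorylation:
    # PKA phosphorylates phosphorylase kinase and glycogen synthase; the
    # activated phosphorylase kinase in turn phosphorylates phosphorylase.
    return replace(state, phosphorylase_kinase=True, phosphorylase=True, synthase=True)


def protein_phosphatase_1(state: Phosphorylation) -> Phosphorylation:
    # PP1 removes the phosphoryl groups from all three enzymes.
    return replace(state, phosphorylase_kinase=False, phosphorylase=False, synthase=False)


def activities(state: Phosphorylation) -> dict:
    # Phosphorylation activates phosphorylase (a form) but inactivates synthase (b form).
    return {"breakdown": state.phosphorylase, "synthesis": not state.synthase}
```

In this simplified picture, breakdown and synthesis are never active together: applying `protein_kinase_a` switches the cell to breakdown only, and applying `protein_phosphatase_1` switches it back to synthesis only.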

The catalytic subunit of PP1 is a 37-kd single-domain protein. This
subunit is usually bound to one of a family of regulatory subunits with
masses of approximately 120 kd; in skeletal muscle and heart, the most
prevalent regulatory subunit is called GM, whereas, in the liver, the most
prevalent subunit is GL. These regulatory subunits have modular structures
with domains that participate in interactions with glycogen, with the
catalytic subunit, and with target enzymes. Thus, these regulatory subunits
act as scaffolds, bringing together the phosphatase and its substrates in the
context of a glycogen particle.


Figure 21.20 Regulation of glycogen synthesis by protein phosphatase 1.
Protein phosphatase 1 stimulates glycogen synthesis while inhibiting glycogen
breakdown.


Figure 21.21 Regulation of protein phosphatase 1 (PP1) in muscle takes
place in two steps. Phosphorylation of GM by protein kinase A dissociates the
catalytic subunit from its substrates in the glycogen particle. Phosphorylation
of the inhibitor subunit by protein kinase A inactivates the catalytic unit
of PP1.

The phosphatase activity of PP1 must be reduced when glycogen degradation
is called for (Figure 21.21). In such cases, epinephrine or glucagon
has activated the cAMP cascade and protein kinase A is active. Protein
kinase A reduces the activity of PP1 by two mechanisms. First, in muscle,
GM is phosphorylated in the domain responsible for binding the catalytic
subunit. The catalytic subunit is released from glycogen and from its
substrates, and dephosphorylation is greatly reduced. Second, almost all
tissues contain small proteins that, when phosphorylated, bind to the
catalytic subunit of PP1 and inhibit it. Thus, when glycogen degradation is
switched on by cAMP, the accompanying phosphorylation of these inhibitors
keeps phosphorylase in its active a form and glycogen synthase in its inactive
b form.


Insulin stimulates glycogen synthesis by inactivating
glycogen synthase kinase


After exercise, people often consume carbohydrate-rich foods to restock their
glycogen stores. How is glycogen synthesis stimulated? When blood-glucose
levels are high, insulin stimulates the synthesis of glycogen by inactivating
glycogen synthase kinase, the enzyme that maintains glycogen synthase in its
phosphorylated, inactive state (Figure 21.22). The first step in the action of
insulin is its binding to a receptor tyrosine kinase in the plasma membrane
(Section 14.2). The binding of insulin activates the tyrosine kinase activity
of the receptor so that it phosphorylates insulin-receptor substrates (IRSs).
These phosphorylated proteins trigger signal-transduction pathways that
eventually lead to the activation of protein kinases that phosphorylate and
inactivate glycogen synthase kinase. The inactive kinase can no longer maintain
glycogen synthase in its phosphorylated, inactive state. Protein phosphatase
1 dephosphorylates glycogen synthase, activating it, and restoring glycogen
reserves. Recall that insulin also generates an increase in the amount of
glucose in the cell by increasing the number of glucose transporters in the
membrane. The net effect of insulin is thus the replenishment of glycogen
stores.


Figure 21.22 Insulin inactivates glycogen synthase kinase. Insulin triggers a
cascade that leads to the phosphorylation and inactivation of glycogen synthase
kinase and prevents the phosphorylation of glycogen synthase. Protein
phosphatase 1 (PP1) removes the phosphates from glycogen synthase, thereby
activating the enzyme and allowing glycogen synthesis. IRS, insulin-receptor
substrate.


Glycogen metabolism in the liver regulates the blood-glucose level 


After a meal rich in carbohydrates, blood-glucose levels rise, and glycogen 
synthesis is stepped up in the liver. Although insulin is the primary signal 
for glycogen synthesis, another is the concentration of glucose in the blood, 
which normally ranges from about 80 to 120 mg per 100 ml (4.4-6.7 mM). 
The liver senses the concentration of glucose in the blood and takes up or 
releases glucose accordingly. The amount of liver phosphorylase a decreases 
rapidly when glucose is infused (Figure 21.23). After a lag period, the 
amount of glycogen synthase a increases, which results in glycogen synthesis. 
In fact, phosphorylase a is the glucose sensor in liver cells. Phosphorylase a 
and PP1 are localized to the glycogen particle by interactions with the GL 
subunit of PP1. The binding of glucose to phosphorylase a shifts its allos- 
teric equilibrium from the active R form to the inactive T form. This con- 
formational change renders the phosphoryl group on serine 14 a substrate for 
protein phosphatase 1. PP1 binds tightly to phosphorylase a only when the 
phosphorylase is in the R state but is inactive when bound. When glucose 
induces the transition to the T form, PP1 and the phosphorylase dissociate 
from each other and the glycogen particle, and PP1 becomes active. Recall 
that the R ↔ T transition of muscle phosphorylase a is unaffected by glucose 
and is thus unaffected by the rise in blood-glucose levels (Section 21.2). 
Efforts are underway to develop drugs that disrupt the interaction of liver 
phosphorylase with the GL subunit as a treatment for type 2 diabetes 
(Section 27.2). Type 2 diabetes is characterized by excess blood glucose. 
Hence, disrupting the association of phosphorylase with the GL subunit would 
render it a substrate for PP1, and glucose release into the blood would be 
inhibited. 
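
The unit conversion quoted earlier in this section (80 to 120 mg per 100 ml, or about 4.4 to 6.7 mM) can be checked directly. The sketch below assumes a molar mass of 180.16 g/mol for glucose, a standard value that is not given in the text; the function name is illustrative.

```python
GLUCOSE_MOLAR_MASS = 180.16  # g/mol for C6H12O6 (assumed standard value)


def mg_per_100ml_to_millimolar(mg_per_100ml: float) -> float:
    """Convert a glucose concentration from mg per 100 ml to mmol per litre."""
    mg_per_litre = mg_per_100ml * 10.0
    # mg/L divided by g/mol gives mmol/L, because the factor of 1000 cancels.
    return mg_per_litre / GLUCOSE_MOLAR_MASS


if __name__ == "__main__":
    for level in (80, 120):
        print(level, "mg per 100 ml =", round(mg_per_100ml_to_millimolar(level), 1), "mM")
```

Rounded to one decimal place, the two limits come out at 4.4 and 6.7 mM, matching the range given above.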

How does glucose activate glycogen synthase? The conversion of phosphorylase
a into b is accompanied by the release of PP1, which is then free to activate
glycogen synthase and dephosphorylate glycogen phosphorylase (Figure 21.24).
The removal of the phosphoryl group of inactive glycogen synthase b converts it
into the active a form. Initially, there are about 10 phosphorylase a
molecules per molecule of phosphatase. Hence, the activity of glycogen synthase
begins to increase only after most of phosphorylase a is converted into b. The
lag between the decrease in glycogen degradation and the increase in glycogen
synthesis prevents the two pathways from operating simultaneously.
This remarkable glucose-sensing system depends on three key elements:
(1) communication between the allosteric site for glucose and the serine
phosphate, (2) the use of PP1 to inactivate phosphorylase and activate
glycogen synthase, and (3) the binding of the phosphatase to phosphorylase
a to prevent the premature activation of glycogen synthase.


Figure 21.23 Blood glucose regulates liver-glycogen metabolism. The infusion
of glucose into the bloodstream leads to the inactivation of phosphorylase,
followed by the activation of glycogen synthase, in the liver.
[After W. Stalmans, H. De Wulf, L. Hue, and H.-G. Hers. Eur. J. Biochem.
41:117-134, 1974.]


Figure 21.24 Glucose regulation of liver-glycogen metabolism. Glucose binds to
and inhibits glycogen phosphorylase a in the liver, facilitating the formation
of the T state of phosphorylase a. The T state of phosphorylase a does not bind
protein phosphatase 1 (PP1), leading to the dissociation and activation of
PP1 from glycogen phosphorylase a. The free PP1 dephosphorylates glycogen
phosphorylase a and glycogen synthase b, leading to the inactivation of
glycogen breakdown and the activation of glycogen synthesis.


Table 21.1 Glycogen-storage diseases


A biochemical understanding of glycogen-storage diseases is possible 


Edgar von Gierke described the first glycogen-storage disease in 1929.
A patient with this disease has a huge abdomen caused by a massive
enlargement of the liver. There is a pronounced hypoglycemia between meals.
Furthermore, the blood-glucose level does not rise on administration of
epinephrine and glucagon. An infant with this glycogen-storage disease may
have convulsions because of the low blood-glucose level.

The enzymatic defect in von Gierke disease was elucidated in 1952 by
Carl and Gerty Cori. They found that glucose 6-phosphatase is missing from
the liver of a patient with this disease. This finding was the first
demonstration of an inherited deficiency of a liver enzyme. The glycogen in
the liver is normal in structure but is present in abnormally large amounts.
The absence of glucose 6-phosphatase in the liver causes hypoglycemia because
glucose cannot be formed from glucose 6-phosphate. This phosphorylated sugar
does not leave the liver, because it cannot cross the plasma membrane. The
presence of excess glucose 6-phosphate triggers an increase in glycolysis in
the liver, leading to a high level of lactate and pyruvate in the blood.
Patients who have von Gierke disease also have an increased dependence on fat
metabolism. This disease can also be produced by a mutation in the gene
that encodes the glucose 6-phosphate transporter. Recall that glucose
6-phosphate must be transported into the lumen of the endoplasmic reticulum
to be hydrolyzed by phosphatase. Mutations in the other three essential
proteins of this system can likewise lead to von Gierke disease.


| Type | Defective enzyme | Organ affected | Glycogen in the affected organ | Clinical features |
|---|---|---|---|---|
| I (Von Gierke) | Glucose 6-phosphatase or transport system | Liver and kidney | Increased amount; normal structure. | Massive enlargement of the liver. Failure to thrive. Severe hypoglycemia, ketosis, hyperuricemia, hyperlipemia. |
| II (Pompe) | α-1,4-Glucosidase (lysosomal) | All organs | Massive increase in amount; normal structure. | Cardiorespiratory failure causes death, usually before age 2. |
| III (Cori) | Amylo-1,6-glucosidase (debranching enzyme) | Muscle and liver | Increased amount; short outer branches. | Like type I, but milder course. |
| IV (Andersen) | Branching enzyme (α-1,4 → α-1,6) | Liver and spleen | Normal amount; very long outer branches. | Progressive cirrhosis of the liver. Liver failure causes death, usually before age 2. |
| V (McArdle) | Phosphorylase | Muscle | Moderately increased amount; normal structure. | Limited ability to perform strenuous exercise because of painful muscle cramps. Otherwise patient is normal and well developed. |
| VI (Hers) | Phosphorylase | Liver | Increased amount. | Like type I, but milder course. |
| VII | Phosphofructokinase | Muscle | Increased amount; normal structure. | Like type V. |
| VIII | Phosphorylase kinase | Liver | Increased amount; normal structure. | Mild liver enlargement. Mild hypoglycemia. |

Note: Types I through VII are inherited as autosomal recessives. Type VIII is sex linked.

Seven other glycogen-storage diseases have been characterized 
(Table 21.1). In Pompe disease (type II), lysosomes become engorged with 
glycogen because they lack α-1,4-glucosidase, a hydrolytic enzyme 
confined to these organelles (Figure 21.25). Carl and Gerty Cori also elucidated 
the biochemical defect in another glycogen-storage disease (type III), which 
cannot be distinguished from von Gierke disease (type I) by physical 
examination alone. In type III disease, the structure of liver and muscle glycogen 
is abnormal and the amount is markedly increased. Most striking, the outer 
branches of the glycogen are very short. Patients having this type lack the 
debranching enzyme (α-1,6-glucosidase), and so only the outermost branches 
of glycogen can be effectively utilized. Thus, only a small fraction of this 
abnormal glycogen is functionally active as an accessible store of glucose. 

A defect in glycogen metabolism confined to muscle is found in McArdle 
disease (type V). Muscle phosphorylase activity is absent, and a patient’s 
capacity to perform strenuous exercise is limited because of painful muscle 
cramps. The patient is otherwise normal and well developed. Thus, effec- 
tive utilization of muscle glycogen is not essential for life. Phosphorus-31 
nuclear magnetic resonance studies of these patients have been very infor- 
mative. The pH of skeletal-muscle cells of normal people drops during 
strenuous exercise because of the production of lactate. In contrast, the 
muscle cells of patients with McArdle disease become more alkaline during 
exercise because of the breakdown of creatine phosphate (Section 15.2). 
Lactate does not accumulate in these patients, because the glycolytic rate of 
their muscle is much lower than normal; their glycogen cannot be mobi- 
lized. NMR studies have also shown that the painful cramps in this disease 
are correlated with high levels of ADP (Figure 21.26). NMR spectroscopy 
is a valuable, noninvasive technique for assessing dietary and exercise 
therapy for this disease. 


Summary 


Glycogen, a readily mobilized fuel store, is a branched polymer of 
glucose residues. Most of the glucose units in glycogen are linked by 
α-1,4-glycosidic bonds. At about every tenth residue, a branch is created 
by an α-1,6-glycosidic bond. Glycogen is present in large 
amounts in muscle cells and in liver cells, where it is stored in the cyto- 
plasm in the form of hydrated granules. 


21.1 Glycogen Breakdown Requires the Interplay of Several Enzymes 

Most of the glycogen molecule is degraded to glucose 1-phosphate by 
the action of glycogen phosphorylase, the key enzyme in glycogen 
breakdown. The glycosidic linkage between C-1 of a terminal residue 
and C-4 of the adjacent one is split by orthophosphate to give glu- 
cose 1-phosphate, which can be reversibly converted into glucose 
6-phosphate. Branch points are degraded by the concerted action of an 
oligosaccharide transferase and an α-1,6-glucosidase. 


21.2 Phosphorylase Is Regulated by Allosteric Interactions and 
Reversible Phosphorylation 
Phosphorylase b, which is usually inactive, is converted into active
phosphorylase a by the phosphorylation of a single serine residue in
each subunit. This reaction is catalyzed by phosphorylase kinase. The
b form in muscle can also be activated by the binding of AMP, an effect
counteracted by ATP and glucose 6-phosphate. The a form in the liver
is inhibited by glucose. The AMP-binding sites and phosphorylation
sites are located at the subunit interface. In muscle, phosphorylase is
activated to generate glucose for use inside the cell as a fuel for
contractile activity. In contrast, liver phosphorylase is activated to liberate
glucose for export to other organs, such as skeletal muscle and the
brain.


Figure 21.25 Glycogen-engorged lysosome. This electron micrograph shows
skeletal muscle from an infant with type II glycogen-storage disease (Pompe
disease). The lysosomes are filled with glycogen because of a deficiency in
α-1,4-glucosidase, a hydrolytic enzyme confined to lysosomes. The amount of
glycogen in the cytoplasm is normal. [From H.-G. Hers and F. Van Hoof,
Eds., Lysosomes and Storage Diseases (Academic Press, 1973), p. 205.]


Figure 21.26 NMR study of human arm muscle. The level of ADP during exercise
increases much more in a patient with McArdle glycogen-storage disease (type V)
than in normal controls. [After G. K. Radda. Biochem. Soc. Trans. 14:517-525,
1986.]


21.3 Epinephrine and Glucagon Signal the Need for Glycogen Breakdown 


Epinephrine and glucagon stimulate glycogen breakdown through 
specific 7TM receptors. Muscle is the primary target of epineph- 
rine, whereas the liver is responsive to glucagon. Both signal mole- 
cules initiate a kinase cascade that leads to the activation of glycogen 
phosphorylase. 


21.4 Glycogen Is Synthesized and Degraded by Different Pathways 


The pathway for glycogen synthesis differs from that for glycogen 
breakdown. UDP-glucose, the activated intermediate in glycogen 
synthesis, is formed from glucose 1-phosphate and UTP. Glycogen 
synthase catalyzes the transfer of glucose from UDP-glucose to the 
C-4 hydroxyl group of a terminal residue in the growing glycogen 
molecule. Synthesis is primed by glycogenin, an autoglycosylating 
protein that contains a covalently attached oligosaccharide unit on a 
specific tyrosine residue. A branching enzyme converts some of the 
α-1,4 linkages into α-1,6 linkages to increase the number of ends so 
that glycogen can be made and degraded more rapidly. 


21.5 Glycogen Breakdown and Synthesis Are Reciprocally Regulated 


Glycogen synthesis and degradation are coordinated by several ampli- 
fying reaction cascades. Epinephrine and glucagon stimulate glycogen 
breakdown and inhibit its synthesis by increasing the cytoplasmic 
level of cyclic AMP, which activates protein kinase A. Protein kinase 
A activates glycogen breakdown by attaching a phosphate to phosphor- 
ylase kinase and inhibits glycogen synthesis by phosphorylating 
glycogen synthase. 

The glycogen-mobilizing actions of protein kinase A are reversed 
by protein phosphatase 1, which is regulated by several hormones. 
Epinephrine inhibits this phosphatase by blocking its attachment to 
glycogen molecules and by turning on an inhibitor. Insulin, in con- 
trast, triggers a cascade that phosphorylates and inactivates glycogen 
synthase kinase, one of the enzymes that inhibits glycogen synthase. 
Hence, glycogen synthesis is decreased by epinephrine and increased 
by insulin. Glycogen synthase and phosphorylase are also regulated 
by noncovalent allosteric interactions. In fact, phosphorylase is a key 
part of the glucose-sensing system of liver cells. Glycogen metabolism 
exemplifies the power and precision of reversible phosphorylation in 
regulating biological processes. 


Key Terms 


glycogen phosphorylase (p. 617)
phosphorolysis (p. 617)
pyridoxal phosphate (PLP) (p. 619)
phosphorylase kinase (p. 623)
calmodulin (p. 624)
epinephrine (adrenaline) (p. 624)
glucagon (p. 624)
protein kinase A (PKA) (p. 626)
uridine diphosphate glucose (UDP-glucose) (p. 627)
glycogen synthase (p. 628)
glycogenin (p. 628)
protein phosphatase 1 (PP1) (p. 631)
insulin (p. 632)


Problems 


1. Choice is good. Glycogen is not as reduced as fatty acids 
are and consequently not as energy rich. Why do animals 
store any energy as glycogen? Why not convert all excess 
fuel into fatty acids? 


2. If a little is good, a lot is better. α-Amylose is an 
unbranched glucose polymer. Why would this polymer not 
be as effective a storage form of glucose as glycogen? 


3. Telltale products. A sample of glycogen from a patient 
with liver disease is incubated with orthophosphate, phos- 
phorylase, the transferase, and the debranching enzyme 
(α-1,6-glucosidase). The ratio of glucose 1-phosphate to 
glucose formed in this mixture is 100. What is the most 
likely enzymatic deficiency in this patient? 


4. Dare to be different. Compare the allosteric regulation of 
phosphorylase in the liver and in muscle, and explain the 
significance of the difference. 


5. A thumb on the balance. The reaction catalyzed by phos- 
phorylase is readily reversible in vitro. At pH 6.8, the equi- 
librium ratio of orthophosphate to glucose 1-phosphate 
is 3.6. The value of ΔG°′ for this reaction is small because a 
glycosidic bond is replaced by a phosphoryl ester bond that 
has a nearly equal transfer potential. However, phospho- 
rolysis proceeds far in the direction of glycogen breakdown 
in vivo. Suggest one means by which the reaction can be 
made irreversible in vivo. 


6. Excessive storage. Suggest an explanation for the fact that 
the amount of glycogen in type I glycogen-storage disease 
(von Gierke disease) is increased. 


7. Recouping an essential phosphoryl. The phosphoryl group 
on phosphoglucomutase is slowly lost by hydrolysis. 
Propose a mechanism that utilizes a known catalytic inter- 
mediate for restoring this essential phosphoryl group. How 
might this phosphoryl donor be formed? 


8. Not all absences are equal. Hers disease results from an 
absence of liver glycogen phosphorylase and may result in 
serious illness. In McArdle disease, muscle glycogen phos- 
phorylase is absent. Although exercise is difficult for patients 
suffering from McArdle disease, the disease is rarely life 
threatening. Account for the different manifestations of the 
absence of glycogen phosphorylase in the two tissues. What 
does the existence of these two different diseases indicate 
about the genetic nature of the phosphorylase? 


9. Hydrophobia. Why is water excluded from the active site 
of phosphorylase? Predict the effect of a mutation that 
allows water molecules to enter. 


10. Removing all traces. In human liver extracts, the catalytic
activity of glycogenin was detectable only after treatment
with α-amylase (p. 629). Why was α-amylase necessary 
to reveal the glycogenin activity? 


11. Two in one. A single polypeptide chain houses the transferase
and debranching enzyme. Cite a potential advantage 
of this arrangement. 


12. How did they do that? A strain of mice has been developed
that lacks the enzyme phosphorylase kinase. Yet, after 
strenuous exercise, the glycogen stores of a mouse of this 
strain are depleted. Explain how this depletion is possible. 


13. An appropriate inhibitor. What is the rationale for the 
inhibition of muscle glycogen phosphorylase by glucose 
6-phosphate when glucose 1-phosphate is the product of 
the phosphorylase reaction? 


14. Passing along the information. Outline the signal- 
transduction cascade for glycogen degradation in muscle. 


15. Slammin’ on the breaks. There must be a way to shut 
down glycogen breakdown quickly to prevent the wasteful 
depletion of glycogen after energy needs have been met. 
What mechanisms are employed to turn off glycogen 
breakdown? 


16. Diametrically opposed. Phosphorylation has opposite 
effects on glycogen synthesis and breakdown. What is the 
advantage of its having opposing effects? 


17. Feeling depleted. Glycogen depletion resulting from 
intense, extensive exercise can lead to exhaustion and the 
inability to continue exercising. Some people also experi- 
ence dizziness, an inability to concentrate, and a loss of 
muscle control. Account for these symptoms. 


18. Everyone had a job to do. What accounts for the fact that 
liver phosphorylase is a glucose sensor, whereas muscle 
phosphorylase is not? 


19. If you insist. Why does activation of the phosphor- 
ylated b form of glycogen synthase by high concentrations 
of glucose 6-phosphate make good biochemical sense? 


20. An ATP saved is an ATP earned. The complete oxidation
of glucose 6-phosphate derived from free glucose yields 
30 molecules of ATP, whereas the complete oxidation of glucose
6-phosphate derived from glycogen yields 31 molecules
of ATP. Account for this difference. 


21. Dual roles. Phosphoglucomutase is crucial for glycogen 
breakdown as well as for glycogen synthesis. Explain the 
role of this enzyme in each of the two processes. 


22. Working at cross-purposes. Write a balanced equation 
showing the effect of simultaneous activation of glycogen 
phosphorylase and glycogen synthase. Include the reactions 
catalyzed by phosphoglucomutase and UDP-glucose pyrophosphorylase. 


23. Achieving immortality. Glycogen synthase requires a 
primer. A primer was formerly thought to be provided 
when the existing glycogen granules are divided between 
the daughter cells produced by cell division. In other words, 
parts of the original glycogen molecule were simply passed 
from generation to generation. Would this strategy have 
been successful in passing glycogen stores from generation 
to generation? How are new glycogen molecules now known 
to be synthesized? 


24. Synthesis signal. How does insulin stimulate glycogen 
synthesis? 


Mechanism Problem 


25. Family resemblance. Propose mechanisms for the two 
enzymes catalyzing steps in glycogen debranching on the basis 
of their potential membership in the α-amylase family. 


Chapter Integration Problems 


26. Carbohydrate conversion. Write a balanced equation for 
the formation of glycogen from galactose. 


27. Working together. What enzymes are required for the 
liver to release glucose into the blood when an organism is 
asleep and fasting? 


28. A shattering experience. Crystals of phosphorylase a 
grown in the presence of glucose shatter when a substrate 
such as glucose 1-phosphate is added. Why? 


29. I know I’ve seen that face before. UDP-glucose is the 
activated form of glucose used in glycogen synthesis. 
However, we have previously met other similar activated 
forms of carbohydrate in our consideration of metabolism. 
Where else have we seen UDP-carbohydrate? 


30. Same symptoms, different cause. Suggest another muta- 
tion in glucose metabolism that causes symptoms similar to 
those of von Gierke disease. 


Data Interpretation Problems 


31. Glycogen isolation 1. The liver is a major storage site 
for glycogen. Purified from two samples of human liver, 
glycogen was either treated or not treated with α-amylase 
and subsequently analyzed by SDS-PAGE and western 
blotting with the use of antibodies to glycogenin. The 
results are presented in the adjoining illustration. 


[Figure: Glycogen isolation 1. Western blot of glycogen from Sample 1 and
Sample 2, with and without α-amylase treatment; molecular-mass markers at
212, 158, 116, 66, 56, and 43 kd. Courtesy of Dr. Peter J. Roach, Indiana
University School of Medicine.]


(a) Why are no proteins visible in the lanes without amy- 
lase treatment? 


(b) What is the effect of treating the samples with α-amylase?
Explain the results. 


(c) List other proteins that you might expect to be associ- 
ated with glycogen. Why are other proteins not visible? 


32. Glycogen isolation 2. The gene for glycogenin was trans- 
fected into a cell line that normally stores only small 
amounts of glycogen. The cells were then manipulated 
according to the following protocol, and glycogen was 
isolated and analyzed by SDS-PAGE and western blotting 
by using an antibody to glycogenin with and without 
α-amylase treatment. The results are presented in the 
adjoining illustration. 


The protocol: Cells cultured in growth medium and 
25 mM glucose (lane 1) were switched to medium containing
no glucose for 24 hours (lane 2). Glucose-starved cells 
were refed with medium containing 25 mM glucose for 1 
hour (lane 3) or 3 hours (lane 4). Samples (12 μg of protein) 
were either treated or not treated with α-amylase, as indicated,
before being loaded on the gel. 


(a) Why did the western analysis produce a “smear” (that 
is, the high-molecular-weight staining) in lane 1(−)? 


(b) What is the significance of the decrease in
high-molecular-weight staining in lane 2(−)? 


(c) What is the significance of the difference between 
lanes 2(−) and 3(−)? 


(d) Suggest a plausible reason why there is essentially no 
difference between lanes 3(−) and 4(−). 


(e) Why are the bands at 66 kd the same in the lanes 
treated with amylase, despite the fact that the cells were 
treated differently? 


Fats provide efficient means for storing energy for later use. (Right) The processes of 
fatty acid synthesis (preparation for energy storage) and fatty acid degradation 
(preparation for energy use) are, in many ways, the reverse of each other. (Above) 
Studies of mice are revealing the interplay between these pathways and the 
biochemical bases of appetite and weight control. [Photograph © Jackson/Visuals 
Unlimited.] 


We turn now from the metabolism of carbohydrates to that of fatty acids. 
A fatty acid contains a long hydrocarbon chain and a terminal carboxylate
group. Fatty acids have four major physiological roles. First, fatty acids 
are fuel molecules. They are stored as triacylglycerols (also called neutral fats or 
triglycerides), which are uncharged esters of fatty acids with glycerol. 
Triacylglycerols are stored in adipose tissue, composed of cells called adipocytes
(Figure 22.1). Fatty acids mobilized from triacylglycerols are oxidized to 
meet the energy needs of a cell or organism. During rest or moderate exercise, 
such as walking, fatty acids are our primary source of energy. Second, fatty 
acids are building blocks of phospholipids and glycolipids. These amphipathic 
molecules are important components of biological membranes, as discussed 
in Chapter 12. Third, many proteins are modified by the covalent attachment 
of fatty acids, which targets the proteins to membrane locations. Fourth, fatty 
acid derivatives serve as hormones and intracellular messengers. In this chapter, 
we focus on the degradation and synthesis of fatty acids. 


OUTLINE 

22.1 Triacylglycerols Are Highly Concentrated Energy Stores 
22.2 The Use of Fatty Acids As Fuel Requires Three Stages of Processing 
22.3 Unsaturated and Odd-Chain Fatty Acids Require Additional Steps for Degradation 
22.4 Fatty Acids Are Synthesized by Fatty Acid Synthase 
22.5 The Elongation and Unsaturation of Fatty Acids Are Accomplished by Accessory Enzyme Systems 


Figure 22.1 Electron micrograph of an adipocyte. A small band of cytoplasm 
surrounds the large deposit of triacylglycerols. 
[Biophoto Associates/Photo Researchers.] 


Fatty acid degradation and synthesis mirror each other in 
their chemical reactions 


Fatty acid degradation and synthesis consist of four steps that are the reverse 
of each other in their basic chemistry. Degradation is an oxidative process 
that converts a fatty acid into a set of activated acetyl units (acetyl CoA) that 
can be processed by the citric acid cycle (Figure 22.2). An activated fatty 
acid is oxidized to introduce a double bond; the double bond is hydrated to 
introduce a hydroxyl group; the alcohol is oxidized to a ketone; and, finally, 
the fatty acid is cleaved by coenzyme A to yield acetyl CoA and a fatty acid 
chain two carbons shorter. If the fatty acid has an even number of carbon 
atoms and is saturated, the process is simply repeated until the fatty acid is 
completely converted into acetyl CoA units. 

Fatty acid synthesis is essentially the reverse of this process. The process
starts with the individual units to be assembled: in this case, an 
activated acyl group (most simply, an acetyl unit) and a malonyl unit (see 
Figure 22.2). The malonyl unit condenses with the acetyl unit to form a 
four-carbon fragment. To produce the required hydrocarbon chain, the 
carbonyl group is reduced to a methylene group in three steps: a reduction, 
a dehydration, and another reduction, exactly the opposite of degradation. 
The product of the reduction is butyryl CoA. Another activated malonyl 
group condenses with the butyryl unit, and the process is repeated until a 
C16 or shorter fatty acid is synthesized. 

Figure 22.2 Steps in fatty acid degradation and synthesis. The two processes are in many 
ways mirror images of each other. 
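
The mirror-image relation between the two pathways can be checked with a small sketch. The step names below are taken from the two preceding paragraphs; the list and function names are invented for illustration, and the sketch models only the type of chemistry in each step, not the enzymes, carriers, or cofactors involved.

```python
# Reaction types in one round of degradation, in the order given in the text.
DEGRADATION_STEPS = ["oxidation", "hydration", "oxidation", "cleavage"]

# Chemical opposite of each reaction type (illustrative mapping).
OPPOSITE = {
    "oxidation": "reduction",
    "hydration": "dehydration",
    "cleavage": "condensation",
}


def mirror(steps):
    """Reverse the order of the steps and invert each reaction type."""
    return [OPPOSITE[step] for step in reversed(steps)]


# Reaction types in one round of synthesis, as described in the text.
SYNTHESIS_STEPS = ["condensation", "reduction", "dehydration", "reduction"]

if __name__ == "__main__":
    print(mirror(DEGRADATION_STEPS))
    print(mirror(DEGRADATION_STEPS) == SYNTHESIS_STEPS)
```

Reversing the degradation sequence and inverting each step reproduces the order stated for synthesis: condensation, reduction, dehydration, reduction.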


22.1 Triacylglycerols Are Highly Concentrated 
Energy Stores 


Triacylglycerols are highly concentrated stores of metabolic energy because 
they are reduced and anhydrous. The yield from the complete oxidation 
of fatty acids is about 38 kJ g⁻¹ (9 kcal g⁻¹), in contrast with about 17 kJ g⁻¹ 
(4 kcal g⁻¹) for carbohydrates and proteins. The basis of this large difference 
in caloric yield is that fatty acids are much more reduced than carbohydrates 
or proteins. Furthermore, triacylglycerols are nonpolar, and so they are 
stored in a nearly anhydrous form, whereas much more polar carbohydrates 
are more highly hydrated. In fact, 1 g of dry glycogen binds about 2 g of 
water. Consequently, a gram of nearly anhydrous fat stores 6.75 times as much 
energy as a gram of hydrated glycogen, which is likely the reason that triacyl- 
glycerols rather than glycogen were selected in evolution as the major 
energy reservoir. Consider a typical 70-kg man, who has fuel reserves of 
420,000 kJ (100,000 kcal) in triacylglycerols, 100,000 kJ (24,000 kcal) 
in protein (mostly in muscle), 2500 kJ (600 kcal) in glycogen, and 170 kJ 
(40 kcal) in glucose. Triacylglycerols constitute about 11 kg of his total body 
weight. If this amount of energy were stored in glycogen, his total body 
weight would be 64 kg greater. The glycogen and glucose stores provide 
enough energy to sustain physiological function for about 24 hours, where- 
as the triacylglycerol stores allow survival for several weeks. 
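
The figures in this paragraph can be reproduced with a short calculation. The sketch below uses only the rounded values quoted above (9 and 4 kcal g⁻¹, 2 g of water per gram of dry glycogen, 11 kg of triacylglycerol); the function names are invented for illustration, and because the inputs are rounded the result for the extra body mass is approximate.

```python
# Rounded values quoted in the text.
FAT_KCAL_PER_G = 9.0                 # nearly anhydrous triacylglycerol
GLYCOGEN_KCAL_PER_DRY_G = 4.0        # carbohydrate, dry weight
WATER_G_PER_DRY_G_GLYCOGEN = 2.0     # 1 g of dry glycogen binds about 2 g of water


def fat_to_hydrated_glycogen_ratio() -> float:
    """Energy per gram of fat divided by energy per gram of hydrated glycogen."""
    hydrated_mass_per_dry_g = 1.0 + WATER_G_PER_DRY_G_GLYCOGEN
    return FAT_KCAL_PER_G * hydrated_mass_per_dry_g / GLYCOGEN_KCAL_PER_DRY_G


def extra_mass_if_stored_as_glycogen(fat_mass_kg: float) -> float:
    """Additional body mass (kg) needed to hold the same energy as hydrated glycogen."""
    return fat_mass_kg * fat_to_hydrated_glycogen_ratio() - fat_mass_kg


if __name__ == "__main__":
    print(fat_to_hydrated_glycogen_ratio())          # 6.75
    print(extra_mass_if_stored_as_glycogen(11.0))    # 63.25
```

With these rounded inputs the extra mass comes out at about 63 kg, in line with the figure of 64 kg given in the text.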

In mammals, the major site of triacylglycerol accumulation is the cyto- 
plasm of adipose cells (fat cells). This fuel-rich tissue is located throughout 
the body, notably under the skin (subcutaneous fat) and surrounding the 
internal organs (visceral fat). Droplets of triacylglycerol coalesce to form a 
large globule, called a lipid droplet, which may occupy most of the cell 
volume (see Figure 22.1). The lipid droplet is surrounded by a monolayer of 
phospholipids and proteins required for triacylglycerol metabolism. Adipose 
cells are specialized for the synthesis and storage of triacylglycerols and for 
their mobilization into fuel molecules that are transported to other tissues 
by the blood. Muscle also stores triacylglycerols for its own energy needs. 
Indeed, triacylglycerols are evident as the “marbling” of expensive cuts of 
beef. 

The utility of triacylglycerols as an energy source is dramatically illus- 
trated by the abilities of migratory birds, which can fly great distances 
without eating after having stored energy as triacylglycerols. Examples 
are the American golden plover and the ruby-throated hummingbird. The 
golden plover flies from Alaska to the southern tip of South America; a 
large segment of the flight (3800 km, or 2400 miles) is over open ocean, 
where the birds cannot feed. The ruby-throated hummingbird can fly non- 
stop across the Gulf of Mexico. Fatty acids provide the energy source for 
both these prodigious feats. 


Dietary lipids are digested by pancreatic lipases 


Most lipids are ingested in the form of triacylglycerols and must be degraded 
to fatty acids for absorption across the intestinal epithelium. Intestinal 
enzymes called lipases, secreted by the pancreas, degrade triacylglycerols to 
free fatty acids and monoacylglycerol (Figure 22.3). Lipids present a special 
problem because, unlike carbohydrates and proteins, these molecules are 
not soluble in water. How are they made accessible to the 
lipases, which are in aqueous solution? The solution is to 
wrap lipids in a soluble container. Triacylglycerols in the 
intestinal lumen are incorporated into micelles composed of 
bile salts (Figure 22.4), amphipathic molecules synthesized 
from cholesterol in the liver and secreted from the gall bladder.
The ester bond of each lipid is oriented toward the 
surface of the micelle, rendering the bond more susceptible to digestion by 
lipases in aqueous solution. The final digestion products are carried in 
micelles to the intestinal epithelium where they are transported across the 
plasma membrane (Figure 22.5). If the production of bile salts is inadequate 
due to liver disease, large amounts of fats (as much as 30 g day⁻¹) are excreted
in the feces. This condition is referred to as steatorrhea, after stearic acid, 
a common fatty acid. 


Triacylglycerols fuel the long migration 
flights of the American golden plover 
(Pluvialis dominica). [Gerard Fuehrer/ 
Visuals Unlimited.] 


Figure 22.3 Action of pancreatic lipases. Lipases secreted by the pancreas convert 
triacylglycerols into fatty acids and monoacylglycerol for absorption into the intestine. 


Figure 22.4 Glycocholate. Bile salts, such 
as glycocholate, facilitate lipid digestion in the 
intestine. 


Figure 22.5 Chylomicron formation. Free 
fatty acids and monoacylglycerols are 
absorbed by intestinal epithelial cells. 
Triacylglycerols are resynthesized and 
packaged with other lipids and apolipoprotein 
B-48 to form chylomicrons, which are then 
released into the lymph system. 


Dietary lipids are transported in chylomicrons 


In the intestinal mucosal cells, the triacylglycerols are resynthesized from 
fatty acids and monoacylglycerols and then packaged into lipoprotein transport
particles called chylomicrons, stable particles approximately 2000 Å 
(200 nm) in diameter (see Figure 22.5). These particles are composed 
mainly of triacylglycerols, with apolipoprotein B-48 (apo B-48) as the main 
protein component. Protein constituents of lipoprotein particles are called 
apolipoproteins. Chylomicrons also transport fat-soluble vitamins and 
cholesterol. 

The chylomicrons are released into the lymph system and then into the 
blood. These particles bind to membrane-bound lipases, primarily at adi- 
pose tissue and muscle, where the triacylglycerols are once again degraded 
into free fatty acids and monoacylglycerol for transport into the tissue. The 
triacylglycerols are then resynthesized inside the cell and stored. In the 
muscle, they can be oxidized to provide energy. 


22.2 The Use of Fatty Acids As Fuel Requires 
Three Stages of Processing 


Tissues throughout the body gain access to the lipid energy reserves stored 
in adipose tissue through three stages of processing. First, the lipids must be 
mobilized. In this process, triacylglycerols are degraded to fatty acids and 
glycerol, which are released from the adipose tissue and transported to the 
energy-requiring tissues. Second, at these tissues, the fatty acids must 
be activated and transported into mitochondria for degradation. Third, the 
fatty acids are broken down in a step-by-step fashion into acetyl CoA, 
which is then processed in the citric acid cycle. 


Triacylglycerols are hydrolyzed by hormone-stimulated lipases 


Consider someone who has just awakened from a night’s sleep and begins a 
bout of exercise. Glycogen stores will be low, but lipids are readily available. 
How are these lipid stores mobilized? 

Before fats can be used as fuels, the triacylglycerol storage form must be 
hydrolyzed to yield isolated fatty acids. This reaction is catalyzed by a 
hormonally controlled lipase. Under the physiological conditions facing an 
early-morning runner, glucagon and epinephrine will be present. In 
adipose tissue, these hormones trigger 7TM receptors that activate adenylate
cyclase (Section 14.1). The increased level of cyclic AMP then stimulates 
protein kinase A, which phosphorylates two key proteins: perilipin, a 
fat-droplet-associated protein, and hormone-sensitive lipase (Figure 22.6). 
The phosphorylation of perilipin has two crucial effects. First, it restructures 
the fat droplet so that the triacylglycerols are more accessible for 
mobilization. Second, the phosphorylation of perilipin triggers the release of 
a coactivator for the adipose triglyceride lipase (ATGL). ATGL initiates the 
mobilization of triacylglycerols by releasing a fatty acid from triacylglycerol, 
forming diacylglycerol. Diacylglycerol is converted into a free fatty acid 
and monoacylglycerol by the hormone-sensitive lipase. Finally, a monoacyl- 
glycerol lipase completes the mobilization of fatty acids with the production 
of a free fatty acid and glycerol. Thus, epinephrine and glucagon induce lipoly- 
sis. Although their role in muscle is not as firmly established, these hormones 
probably also regulate the use of triacylglycerol stores in that tissue. 
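
The three hydrolytic steps described above can be laid out as a simple lookup. This is only an illustrative sketch: the data structure and the function name `mobilize` are invented here, and the hormonal control, perilipin, and the ATGL coactivator are deliberately left out.

```python
# (substrate, enzyme named in the text, product); each step frees one fatty acid.
LIPOLYSIS_STEPS = [
    ("triacylglycerol", "adipose triglyceride lipase (ATGL)", "diacylglycerol"),
    ("diacylglycerol", "hormone-sensitive lipase", "monoacylglycerol"),
    ("monoacylglycerol", "monoacylglycerol lipase", "glycerol"),
]


def mobilize(substrate="triacylglycerol"):
    """Walk the lipase steps starting from `substrate`.

    Returns the final product, the number of fatty acids released,
    and a readable log of the steps taken.
    """
    current = substrate
    fatty_acids_released = 0
    log = []
    for start, enzyme, product in LIPOLYSIS_STEPS:
        if current == start:
            fatty_acids_released += 1
            log.append(f"{enzyme}: {start} -> {product} + fatty acid")
            current = product
    return current, fatty_acids_released, log


if __name__ == "__main__":
    final_product, released, steps = mobilize()
    for line in steps:
        print(line)
    print(final_product, released)
```

Starting from triacylglycerol, the walk ends at glycerol with three fatty acids released, one per lipase.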


Figure 22.6 Mobilization of 
triacylglycerols. Triacylglycerols in adipose 
tissue are converted into free fatty acids in 
response to hormonal signals. The 
phosphorylation of perilipin restructures the 
lipid droplet and releases the coactivator of 
ATGL. The activation of ATGL by binding with 
its coactivator initiates the mobilization. 
Hormone-sensitive lipase releases a fatty acid 
from diacylglycerol. Monoacylglycerol lipase 
completes the mobilization process. 
Abbreviations: 7TM, seven transmembrane 
receptor; ATGL, adipose triglyceride lipase; CA, 
coactivator; HS lipase, hormone-sensitive 
lipase; MAG lipase, monoacylglycerol lipase; 
DAG, diacylglycerol; TAG, triacylglycerol. 



CHAPTER 22 Fatty Acid Metabolism 



The released fatty acids are not soluble in blood plasma, and so the blood 
protein albumin binds the fatty acids and serves as a carrier. By these means, 
free fatty acids are made accessible as a fuel in other tissues. At the tissues, 
fatty acid transport protein facilitates the transit of the fatty acids across the 
plasma membrane. 

Glycerol formed by lipolysis is absorbed by the liver and phosphorylated. 
It is then oxidized to dihydroxyacetone phosphate, which is isomerized to 
glyceraldehyde 3-phosphate. This molecule is an intermediate in both the 
glycolytic and the gluconeogenic pathways. 

Hence, glycerol can be converted into pyruvate or glucose in the liver, 
which contains the appropriate enzymes (Figure 22.7). The reverse process 
can take place by the reduction of dihydroxyacetone phosphate to glycerol 
3-phosphate. Hydrolysis by a phosphatase then gives glycerol. Thus, 
glycerol and glycolytic intermediates are readily interconvertible. 


Figure 22.7 Lipolysis generates fatty 
acids and glycerol. The fatty acids are used 
as fuel by many tissues. The liver processes 
glycerol by either the glycolytic or the 
gluconeogenic pathway, depending on its 
metabolic circumstances. Abbreviation: CAC, 
citric acid cycle. 


Fatty acids are linked to coenzyme A before they are oxidized 


Eugene Kennedy and Albert Lehninger showed in 1949 that fatty acids are 
oxidized in mitochondria. Subsequent work demonstrated that they are first 
activated through the formation of a thioester linkage to coenzyme A before 
they enter the mitochondrial matrix. Adenosine triphosphate drives the 
formation of the thioester linkage between the carboxyl group of a fatty acid 
and the sulfhydryl group of coenzyme A. This activation reaction takes 
place on the outer mitochondrial membrane, where it is catalyzed by acyl 
CoA synthetase (also called fatty acid thiokinase). 


Fatty acid + CoA + ATP ⇌ acyl CoA + AMP + PPᵢ 


Paul Berg showed that acyl CoA synthetase accomplishes the activation 
of a fatty acid in two steps. First, the fatty acid reacts with ATP to form an 
acyl adenylate. In this mixed anhydride, the carboxyl group of a fatty acid is 
bonded to the phosphoryl group of AMP. The other two phosphoryl groups 
of the ATP substrate are released as pyrophosphate. In the second step, the 
sulfhydryl group of coenzyme A attacks the acyl adenylate, which is tightly 
bound to the enzyme, to form acyl CoA and AMP. 


RCOO⁻ + ATP ⇌ RCO-AMP (acyl adenylate) + PPᵢ 
RCO-AMP + CoA ⇌ RCO-CoA + AMP 


These partial reactions are freely reversible. In fact, the equilibrium constant 
for the sum of these reactions is close to 1. One high-transfer-potential compound
is cleaved (between PPᵢ and AMP) and one high-transfer-potential 
compound is formed (the thioester acyl CoA). How is the overall reaction 
driven forward? The answer is that pyrophosphate is rapidly hydrolyzed by 
a pyrophosphatase. The complete reaction is 


RCOO⁻ + CoA + ATP + H₂O → RCO-CoA + AMP + 2 Pᵢ + 2 H⁺ 


This reaction is quite favorable because the equivalent of two molecules of 
ATP is hydrolyzed, whereas only one high-transfer-potential compound is 
formed. We see here another example of a recurring theme in biochemistry: 
many biosynthetic reactions are made irreversible by the hydrolysis of inorganic 
pyrophosphate. 
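
The pull exerted by pyrophosphate hydrolysis can be illustrated with the relation ΔG°′ = −RT ln K. The sketch below takes the equilibrium constant of the activation reaction as exactly 1, following the statement above that it is close to 1. The value used for the standard free energy of pyrophosphate hydrolysis (−19 kJ mol⁻¹) and the temperature of 25°C are assumptions introduced here, not figures from this passage, and the function names are invented for illustration.

```python
import math

R_KJ_PER_MOL_K = 8.314e-3    # gas constant in kJ mol^-1 K^-1
TEMPERATURE_K = 298.15       # assumed temperature (25 degrees C)


def standard_free_energy(k_eq: float) -> float:
    """Standard free-energy change (kJ/mol) for an equilibrium constant."""
    return -R_KJ_PER_MOL_K * TEMPERATURE_K * math.log(k_eq)


def equilibrium_constant(delta_g: float) -> float:
    """Equilibrium constant for a standard free-energy change (kJ/mol)."""
    return math.exp(-delta_g / (R_KJ_PER_MOL_K * TEMPERATURE_K))


# Activation alone: the text states that K is close to 1; 1.0 is used here.
DELTA_G_ACTIVATION = standard_free_energy(1.0)

# Assumed standard free energy of pyrophosphate hydrolysis (not from this passage).
DELTA_G_PPI_HYDROLYSIS = -19.0

# Free energies of coupled reactions add; equilibrium constants multiply.
DELTA_G_OVERALL = DELTA_G_ACTIVATION + DELTA_G_PPI_HYDROLYSIS
K_OVERALL = equilibrium_constant(DELTA_G_OVERALL)

if __name__ == "__main__":
    print(DELTA_G_ACTIVATION, DELTA_G_OVERALL, K_OVERALL)
```

Under these assumptions the coupled reaction has an equilibrium constant in the thousands, which is the sense in which pyrophosphate hydrolysis makes the activation effectively irreversible.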


Another motif recurs in this activation reaction. The enzyme-bound 
acyl adenylate intermediate is not unique to the synthesis of acyl CoA. 
Acyl adenylates are frequently formed when carboxyl groups are activated in 
biochemical reactions. Amino acids are activated for protein synthesis by a 
similar mechanism (Section 30.2), although the enzymes that catalyze this 
process are not homologous to acyl CoA synthetase. Thus, activation by 
adenylation recurs in part because of convergent evolution. 


Carnitine carries long-chain activated fatty acids into the 
mitochondrial matrix 


Fatty acids are activated on the outer mitochondrial membrane, whereas they 
are oxidized in the mitochondrial matrix. A special transport mechanism is 
needed to carry activated long-chain fatty acids across the inner mitochon- 
drial membrane. These fatty acids must be conjugated to carnitine, a zwit- 
terionic alcohol. The acyl group is transferred from the sulfur atom of 
coenzyme A to the hydroxyl group of carnitine to form acyl carnitine. This 
reaction is catalyzed by carnitine acyltransferase I, also called carnitine 
palmitoyl transferase I (CPTI), which is bound to the outer mitochondrial 
membrane. 


Figure 22.8 Acyl carnitine translocase. 
The entry of acyl carnitine into the 
mitochondrial matrix is mediated by a 
translocase. Carnitine returns to the 
cytoplasmic side of the inner mitochondrial 
membrane in exchange for acyl carnitine. 


Acyl carnitine is then shuttled across the inner mitochondrial membrane by 
a translocase (Figure 22.8). The acyl group is transferred back to coenzyme 
A on the matrix side of the membrane. This reaction, which is catalyzed by 
carnitine acyltransferase II (carnitine palmitoyl transferase II), is simply the 
reverse of the reaction that takes place in the cytoplasm. The reaction 
is thermodynamically feasible because of the zwitterionic nature of carni- 
tine. The O-acyl link in carnitine has a high group-transfer potential, 
apparently because, being zwitterions, carnitine and its esters are solvated 
differently from most other alcohols and their esters. Finally, the translocase 
returns carnitine to the cytoplasmic side in exchange for an incoming acyl 
carnitine. 
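
The bookkeeping of the shuttle can be followed with a toy model. This is a minimal sketch under stated assumptions: the class and function names are invented for illustration, coenzyme A pools are not tracked, molecules are counted as whole numbers, and one carnitine is assumed to be present on each side of the membrane at the start so that the exchange can take place.

```python
from dataclasses import dataclass


@dataclass
class ShuttleState:
    """Counts of molecules on each side of the inner mitochondrial membrane."""
    cytoplasm_acyl_coa: int
    cytoplasm_carnitine: int = 1
    cytoplasm_acyl_carnitine: int = 0
    matrix_acyl_carnitine: int = 0
    matrix_carnitine: int = 1
    matrix_acyl_coa: int = 0

    def total_carnitine(self) -> int:
        """Free plus acylated carnitine in both compartments."""
        return (self.cytoplasm_carnitine + self.cytoplasm_acyl_carnitine
                + self.matrix_acyl_carnitine + self.matrix_carnitine)


def shuttle_one_acyl_group(state: ShuttleState) -> None:
    """Move one acyl group from the cytoplasmic side to the matrix."""
    if (state.cytoplasm_acyl_coa < 1 or state.cytoplasm_carnitine < 1
            or state.matrix_carnitine < 1):
        raise ValueError("need acyl CoA outside and carnitine on both sides")
    # Carnitine acyltransferase I: acyl CoA + carnitine -> acyl carnitine + CoA
    state.cytoplasm_acyl_coa -= 1
    state.cytoplasm_carnitine -= 1
    state.cytoplasm_acyl_carnitine += 1
    # Translocase: acyl carnitine enters in exchange for carnitine leaving
    state.cytoplasm_acyl_carnitine -= 1
    state.matrix_acyl_carnitine += 1
    state.matrix_carnitine -= 1
    state.cytoplasm_carnitine += 1
    # Carnitine acyltransferase II: acyl carnitine + CoA -> acyl CoA + carnitine
    state.matrix_acyl_carnitine -= 1
    state.matrix_carnitine += 1
    state.matrix_acyl_coa += 1


if __name__ == "__main__":
    example = ShuttleState(cytoplasm_acyl_coa=3)
    for _ in range(3):
        shuttle_one_acyl_group(example)
    print(example)
```

Each pass delivers one acyl group to the matrix and leaves the carnitine pools as they started, which is why carnitine acts as a carrier rather than being consumed.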


A number of diseases have been traced to a deficiency of carnitine, 
the transferase, or the translocase. The symptoms of carnitine deficiency
range from mild muscle cramping to severe weakness and even 
death. Muscle, kidney, and heart are the tissues primarily impaired. Muscle 
weakness during prolonged exercise is a symptom of a deficiency of carnitine
acyltransferases because muscle relies on fatty acids as a long-term 
source of energy. Medium-chain (C₈–C₁₀) fatty acids are oxidized normally 
in these patients because these fatty acids do not require carnitine to enter 
the mitochondria. These diseases illustrate that the impaired flow of a metab- 
olite from one compartment of a cell to another can lead to a pathological 
condition. 


Acetyl CoA, NADH, and FADH₂ are generated in each 
round of fatty acid oxidation 


A saturated acyl CoA is degraded by a recurring sequence of four reactions: 
oxidation by flavin adenine dinucleotide (FAD), hydration, oxidation by 
NAD⁺, and thiolysis by coenzyme A (Figure 22.9). The fatty acid chain is 
shortened by two carbon atoms as a result of these reactions, and FADH₂, 
NADH, and acetyl CoA are generated. Because oxidation takes place at the 
β carbon atom, this series of reactions is called the β-oxidation pathway. 

The first reaction in each round of degradation is the oxidation of acyl 
CoA by an acyl CoA dehydrogenase to give an enoyl CoA with a trans double 
bond between C-2 and C-3. 


Acyl CoA + E-FAD → trans-Δ²-enoyl CoA + E-FADH₂ 


As in the dehydrogenation of succinate in the citric acid cycle, FAD rather 
than NAD⁺ is the electron acceptor because the ΔG for this reaction is 
insufficient to drive the reduction of NAD⁺. Electrons from the FADH₂ 
prosthetic group of the reduced acyl CoA dehydrogenase are transferred to 
a second flavoprotein called electron-transferring flavoprotein (ETF). In 
turn, ETF donates electrons to ETF:ubiquinone reductase, an iron-sulfur 
protein. Ubiquinone is thereby reduced to ubiquinol, which delivers its
high-potential electrons to the second proton-pumping site of the respiratory 
chain (Section 18.3). Consequently, 1.5 molecules of ATP are generated per 
molecule of FADH₂ formed in this dehydrogenation step, as in the oxidation
of succinate to fumarate. 


[Scheme: electrons flow from the acyl CoA substrate (R-CH₂-CH₂-R′, oxidized to
R-CH=CH-R′) through E-FAD, ETF, and the iron-sulfur protein to ubiquinone (Q),
which is reduced to ubiquinol (QH₂).] 


The next step is the hydration of the double bond between C-2 and C-3 
by enoyl CoA hydratase. 


trans-Δ²-Enoyl CoA + H₂O ⇌ L-3-hydroxyacyl CoA 


Table 22.1 Principal reactions in fatty acid oxidation 

Step  Reaction                                                        Enzyme 
1     Fatty acid + CoA + ATP ⇌ acyl CoA + AMP + PPᵢ                   Acyl CoA synthetase (also called fatty acid thiokinase and fatty acid:CoA ligase)* 
2     Carnitine + acyl CoA ⇌ acyl carnitine + CoA                     Carnitine acyltransferase (also called carnitine palmitoyl transferase) 
3     Acyl CoA + E-FAD → trans-Δ²-enoyl CoA + E-FADH₂                 Acyl CoA dehydrogenases (several isozymes having different chain-length specificity) 
4     trans-Δ²-Enoyl CoA + H₂O ⇌ L-3-hydroxyacyl CoA                  Enoyl CoA hydratase (also called crotonase or 3-hydroxyacyl CoA hydrolyase) 
5     L-3-Hydroxyacyl CoA + NAD⁺ ⇌ 3-ketoacyl CoA + NADH + H⁺         L-3-Hydroxyacyl CoA dehydrogenase 
6     3-Ketoacyl CoA + CoA ⇌ acetyl CoA + acyl CoA (shortened by C₂)  β-Ketothiolase (also called thiolase) 

*An AMP-forming ligase. 


The hydration of enoyl CoA is stereospecific. Only the L isomer of
3-hydroxyacyl CoA is formed when the trans-Δ2 double bond is hydrated.
The enzyme also hydrates a cis-Δ2 double bond, but the product then is the
D isomer. We shall return to this point shortly in considering how
unsaturated fatty acids are oxidized.

The hydration of enoyl CoA is a prelude to the second oxidation reaction,
which converts the hydroxyl group at C-3 into a keto group and generates
NADH. This oxidation is catalyzed by L-3-hydroxyacyl CoA dehydrogenase,
which is specific for the L isomer of the hydroxyacyl substrate.


L-3-Hydroxyacyl CoA + NAD+ ⇌ 3-ketoacyl CoA + NADH + H+


The preceding reactions have oxidized the methylene group at C-3 to a
keto group. The final step is the cleavage of 3-ketoacyl CoA by the thiol
group of a second molecule of coenzyme A, which yields acetyl CoA and
an acyl CoA shortened by two carbon atoms. This thiolytic cleavage is
catalyzed by β-ketothiolase.


3-Ketoacyl CoA (n carbons) + HS-CoA ⇌ acetyl CoA + acyl CoA (n - 2 carbons)


Table 22.1 summarizes the reactions in fatty acid oxidation. 

The shortened acyl CoA then undergoes another cycle of oxidation, starting
with the reaction catalyzed by acyl CoA dehydrogenase (Figure 22.10).
Fatty acid chains containing from 12 to 18 carbon atoms are oxidized by the
long-chain acyl CoA dehydrogenase. The medium-chain acyl CoA dehydrogenase
oxidizes fatty acid chains having from 4 to 14 carbons, whereas the
short-chain acyl CoA dehydrogenase acts only on 4- and 6-carbon fatty acid
chains. In contrast, β-ketothiolase, hydroxyacyl dehydrogenase, and enoyl
CoA hydratase act on fatty acid molecules of almost any length.
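
The round-by-round shortening of a saturated acyl CoA, together with the chain-length ranges quoted for the acyl CoA dehydrogenase isozymes, can be sketched in a few lines of Python. This is only an illustration: the function and variable names are hypothetical, and the ranges are simply those stated in the text, not kinetic data.

```python
from typing import Dict, List, Tuple

# Chain-length ranges quoted in the text for the acyl CoA dehydrogenase isozymes.
ACYL_COA_DEHYDROGENASES: Dict[str, Tuple[int, ...]] = {
    "long-chain": tuple(range(12, 19)),    # 12 to 18 carbons
    "medium-chain": tuple(range(4, 15)),   # 4 to 14 carbons
    "short-chain": (4, 6),                 # 4- and 6-carbon chains only
}


def candidate_dehydrogenases(chain_length: int) -> List[str]:
    """Return the isozymes whose quoted range includes this chain length."""
    return [name for name, lengths in ACYL_COA_DEHYDROGENASES.items()
            if chain_length in lengths]


def beta_oxidation_rounds(n_carbons: int) -> List[Tuple[int, int, List[str]]]:
    """List (substrate length, product length, candidate isozymes) per round
    for a saturated acyl CoA with an even number of carbons."""
    if n_carbons < 4 or n_carbons % 2 != 0:
        raise ValueError("this sketch expects an even chain of at least 4 carbons")
    rounds = []
    chain = n_carbons
    while chain > 2:
        # One round = oxidation (FAD), hydration, oxidation (NAD+), thiolysis;
        # thiolysis removes a two-carbon acetyl CoA from the carboxyl end.
        rounds.append((chain, chain - 2, candidate_dehydrogenases(chain)))
        chain -= 2
    return rounds


if __name__ == "__main__":
    for substrate, product, enzymes in beta_oxidation_rounds(16):
        print(f"C{substrate} -> C{product} + acetyl CoA  ({', '.join(enzymes)})")
```

For palmitoyl CoA the loop runs seven times, and the two-carbon product of the last round is itself acetyl CoA, which is why seven rounds give eight acetyl CoA molecules.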


The complete oxidation of palmitate yields 106 molecules of ATP 


We can now calculate the energy yield derived from the oxidation of a fatty 
acid. In each reaction cycle, an acyl CoA is shortened by two carbon atoms, 
and one molecule each of FADH2, NADH, and acetyl CoA are formed. 


C(n)-acyl CoA + FAD + NAD+ + H2O + CoA →
C(n-2)-acyl CoA + FADH2 + NADH + acetyl CoA + H+


The degradation of palmitoyl CoA (C16-acyl CoA) requires seven reaction
cycles. In the seventh cycle, the C4-ketoacyl CoA is thiolyzed to two molecules
of acetyl CoA. Hence, the stoichiometry of the oxidation of palmitoyl CoA is


Palmitoyl CoA + 7 FAD + 7 NAD+ + 7 CoA + 7 H2O →
8 acetyl CoA + 7 FADH2 + 7 NADH + 7 H+

Figure 22.9 Reaction sequence for the
degradation of fatty acids. Fatty acids are
degraded by the repetition of a four-reaction
sequence consisting of oxidation, hydration,
oxidation, and thiolysis.

Figure 22.10 First three rounds in the
degradation of palmitate. Two-carbon units
are sequentially removed from the carboxyl
end of the fatty acid.

Figure 22.11 The degradation of a
monounsaturated fatty acid. cis-Δ3-Enoyl
CoA isomerase allows continued β-oxidation
of fatty acids with a single double bond.

Approximately 2.5 molecules of ATP are generated when the respiratory
chain oxidizes each of these NADH molecules, whereas 1.5
molecules of ATP are formed for each FADH2 because their electrons
enter the chain at the level of ubiquinol. Recall that the oxidation
of acetyl CoA by the citric acid cycle yields 10 molecules of
ATP. Hence, the number of ATP molecules formed in the oxidation
of palmitoyl CoA is 10.5 from the seven FADH2, 17.5 from the
seven NADH, and 80 from the eight acetyl CoA molecules, which
gives a total of 108. The equivalent of 2 molecules of ATP is consumed
in the activation of palmitate, in which ATP is split into AMP
and two molecules of orthophosphate. Thus, the complete oxidation
of a molecule of palmitate yields 106 molecules of ATP.
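
The ATP tally for palmitate can be reproduced with a short calculation. The sketch below uses the conversion factors given in the text (1.5 ATP per FADH2, 2.5 per NADH, 10 per acetyl CoA, and 2 ATP equivalents for activation); the function name and its parameters are hypothetical, and the formula applies only to saturated fatty acids with an even number of carbons.

```python
def atp_yield(n_carbons, atp_per_fadh2=1.5, atp_per_nadh=2.5,
              atp_per_acetyl_coa=10.0, activation_cost=2.0):
    """Tally ATP from the complete oxidation of a saturated, even-chain fatty acid."""
    if n_carbons < 4 or n_carbons % 2 != 0:
        raise ValueError("this sketch expects an even chain of at least 4 carbons")
    rounds = n_carbons // 2 - 1      # each round yields one FADH2 and one NADH
    acetyl_coa = n_carbons // 2      # the final round releases two acetyl CoA
    tally = {
        "FADH2": rounds * atp_per_fadh2,
        "NADH": rounds * atp_per_nadh,
        "acetyl CoA": acetyl_coa * atp_per_acetyl_coa,
    }
    tally["gross"] = sum(tally.values())
    # Activation splits ATP into AMP and two orthophosphates (2 ATP equivalents).
    tally["net"] = tally["gross"] - activation_cost
    return tally


if __name__ == "__main__":
    for name, value in atp_yield(16).items():
        print(f"{name:>10}: {value}")
```

Calling `atp_yield(16)` gives 10.5, 17.5, and 80 for the three contributions, a gross total of 108, and a net yield of 106, matching the figures above.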


22.3 Unsaturated and Odd-Chain Fatty Acids Require 
Additional Steps for Degradation 


The β-oxidation pathway accomplishes the complete degradation of
saturated fatty acids having an even number of carbon atoms. Most
fatty acids have such structures because of their mode of synthesis (to
be addressed later in this chapter). However, not all fatty acids are so
simple. The oxidation of fatty acids containing double bonds requires
additional steps, as does the oxidation of fatty acids containing an
odd number of carbon atoms.


An isomerase and a reductase are required for the oxidation 
of unsaturated fatty acids 


The oxidation of unsaturated fatty acids presents some difficulties, yet 
many such fatty acids are available in the diet. Most of the reactions are the 
same as those for saturated fatty acids. In fact, only two additional 
enzymes—an isomerase and a reductase—are needed to degrade a wide 
range of unsaturated fatty acids. 

Consider the oxidation of palmitoleate (Figure 22.11). This C16 unsaturated
fatty acid, which has one double bond between C-9 and C-10, is activated
and transported across the inner mitochondrial membrane in the
same way as saturated fatty acids are. Palmitoleoyl CoA then undergoes
three cycles of degradation, which are carried out by the same enzymes as
those in the oxidation of saturated fatty acids. However, the cis-Δ3-enoyl
CoA formed in the third round is not a substrate for acyl CoA dehydrogenase.
The presence of a double bond between C-3 and C-4 prevents the
formation of another double bond between C-2 and C-3. This impasse is
resolved by a new reaction that shifts the position and configuration of the
cis-Δ3 double bond. cis-Δ3-Enoyl CoA isomerase converts this double bond
into a trans-Δ2 double bond. The double bond is now between C-2 and C-3.
The subsequent reactions are those of the saturated fatty acid oxidation
pathway, in which the trans-Δ2-enoyl CoA is a regular substrate.

Figure 22.12 Oxidation of linoleoyl CoA. The oxidation of the
fatty acid linoleate is facilitated by the activity
of enoyl CoA isomerase and 2,4-dienoyl CoA
reductase.

Human beings require polyunsaturated fatty acids, which have multiple
double bonds, as important precursors for signal molecules, but excess
polyunsaturated fatty acids are degraded by β oxidation. However, another
problem arises with the oxidation of polyunsaturated fatty acids. Consider
linoleate, a C18 polyunsaturated fatty acid with cis-Δ9 and cis-Δ12 double
bonds (Figure 22.12). The cis-Δ3 double bond (between carbons 3 and 4)
formed after three rounds of β-oxidation is converted into a trans-Δ2 double
bond (between carbons 2 and 3) by the aforementioned isomerase. The acyl
CoA produced by another round of β-oxidation contains a cis-Δ4 (between
carbons 4 and 5) double bond. Dehydrogenation of this species by acyl CoA
dehydrogenase yields a 2,4-dienoyl intermediate (double bonds between
carbons 2 and 3 and carbons 4 and 5), which is not a substrate for the next
enzyme in the β-oxidation pathway. This impasse is circumvented by
2,4-dienoyl CoA reductase, an enzyme that uses NADPH to reduce the
2,4-dienoyl intermediate to trans-Δ3-enoyl CoA. cis-Δ3-Enoyl CoA
isomerase then converts trans-Δ3-enoyl CoA into the trans-Δ2 form, a
customary intermediate in the β-oxidation pathway. These catalytic strategies
are elegant and economical. Only two extra enzymes are needed for the
oxidation of any polyunsaturated fatty acid. Odd-numbered double bonds are
handled by the isomerase, and even-numbered ones by the reductase and the
isomerase.
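
The closing rule (odd-numbered double bonds need the isomerase; even-numbered ones need the reductase and the isomerase) lends itself to a small lookup. The sketch below is illustrative only: the function name is hypothetical, and the count of completed shortening rounds is inferred from the palmitoleate and linoleate examples (a Δ9 bond reaches Δ3 after three rounds; a Δ12 bond reaches Δ4 after four).

```python
ISOMERASE = "cis-Δ3-enoyl CoA isomerase"
REDUCTASE = "2,4-dienoyl CoA reductase"


def auxiliary_enzymes(double_bond_positions):
    """For each cis double bond (given by its Δ position in the original fatty
    acid), return (position, rounds of two-carbon shortening completed before
    the bond blocks β-oxidation, auxiliary enzymes required)."""
    plan = []
    for position in sorted(double_bond_positions):
        if position % 2 == 1:
            # Odd position: shortening brings the bond to Δ3; the isomerase
            # converts it into the trans-Δ2 form.
            plan.append((position, (position - 3) // 2, [ISOMERASE]))
        else:
            # Even position: shortening brings the bond to Δ4; dehydrogenation
            # gives a 2,4-dienoyl CoA, which needs the reductase and then the
            # isomerase.
            plan.append((position, (position - 4) // 2, [REDUCTASE, ISOMERASE]))
    return plan


if __name__ == "__main__":
    print("palmitoleate:", auxiliary_enzymes([9]))
    print("linoleate:   ", auxiliary_enzymes([9, 12]))
```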


Odd-chain fatty acids yield propionyl CoA in the final thiolysis step 


Fatty acids having an odd number of carbon atoms are minor species. They
are oxidized in the same way as fatty acids having an even number, except
that propionyl CoA and acetyl CoA, rather than two molecules of acetyl
CoA, are produced in the final round of degradation. The activated
three-carbon unit in propionyl CoA enters the citric acid cycle after it has been
converted into succinyl CoA.

Figure 22.13 Conversion of propionyl
CoA into succinyl CoA. Propionyl CoA,
generated from fatty acids with an odd
number of carbons as well as some amino
acids, is converted into the citric acid cycle
intermediate succinyl CoA.

Figure 22.14 Structure of coenzyme B12.
Coenzyme B12 is a class of molecules that
vary, depending on the component
designated X in the left-hand structure.
5'-Deoxyadenosylcobalamin is the form of the
coenzyme in methylmalonyl mutase.
Substitution of cyano and methyl groups for X
creates cyanocobalamin and methylcobalamin,
respectively.

The pathway from propionyl CoA to succinyl CoA is especially interesting
because it entails a rearrangement that requires vitamin B12 (also
known as cobalamin). Propionyl CoA is carboxylated at the expense of the
hydrolysis of a molecule of ATP to yield the D isomer of methylmalonyl
CoA (Figure 22.13). This carboxylation reaction is catalyzed by propionyl
CoA carboxylase, a biotin enzyme that has a catalytic mechanism like that
of the homologous enzyme pyruvate carboxylase. The D isomer of
methylmalonyl CoA is racemized to the L isomer, the substrate for a mutase
that converts it into succinyl CoA by an intramolecular rearrangement.
The -CO-S-CoA group migrates from C-2 to a methyl group in
exchange for a hydrogen atom. This very unusual isomerization is catalyzed
by methylmalonyl CoA mutase, which contains a derivative of cobalamin as
its coenzyme.
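
The bookkeeping for odd-chain fatty acids can be checked with a brief sketch. It assumes only what the text states: every round removes a two-carbon acetyl CoA, and an odd chain ends with one propionyl CoA (three carbons), which is then carried to succinyl CoA in three steps. The function name and the carbon-count tuples are illustrative.

```python
def thiolysis_products(n_carbons):
    """Count acetyl CoA and propionyl CoA from a saturated fatty acid."""
    if n_carbons < 2:
        raise ValueError("chain must have at least 2 carbons")
    if n_carbons % 2 == 0:
        return {"acetyl CoA": n_carbons // 2, "propionyl CoA": 0}
    # Odd chain: the final thiolysis leaves a three-carbon propionyl CoA.
    return {"acetyl CoA": (n_carbons - 3) // 2, "propionyl CoA": 1}


# Steps from propionyl CoA to succinyl CoA as described in the text:
# (substrate, product, enzyme, carbons in the acyl group of the product).
PROPIONYL_TO_SUCCINYL = [
    ("propionyl CoA", "D-methylmalonyl CoA", "propionyl CoA carboxylase", 4),
    ("D-methylmalonyl CoA", "L-methylmalonyl CoA", "racemization", 4),
    ("L-methylmalonyl CoA", "succinyl CoA", "methylmalonyl CoA mutase", 4),
]

if __name__ == "__main__":
    products = thiolysis_products(17)
    carbons = 2 * products["acetyl CoA"] + 3 * products["propionyl CoA"]
    print(products, "total carbons:", carbons)
```

For a C17 fatty acid this gives seven acetyl CoA and one propionyl CoA, which together account for all 17 carbons.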


Vitamin B12 contains a corrin ring and a cobalt atom


Cobalamin enzymes, which are present in most organisms, catalyze three
types of reactions: (1) intramolecular rearrangements; (2) methylations, as in
the synthesis of methionine; and (3) the reduction of ribonucleotides to
deoxyribonucleotides (Section 25.3). In mammals, only two reactions are known to
require coenzyme B12. The conversion of L-methylmalonyl CoA into
succinyl CoA is one, and the formation of methionine by the methylation of
homocysteine is the other. The latter reaction is especially important because
methionine is required for the generation of coenzymes that participate in
the synthesis of purines and thymine, which are needed for nucleic acid
synthesis.

The core of cobalamin consists of a corrin ring with a central cobalt atom
(Figure 22.14). The corrin ring, like a porphyrin, has four pyrrole units. Two
of them are directly bonded to each other, whereas the others are joined by
methine bridges, as in porphyrins. The corrin ring is more reduced than that
of porphyrins and the substituents are different. A cobalt atom is bonded to
the four pyrrole nitrogens. The fifth substituent linked to the cobalt atom is
a derivative of dimethylbenzimidazole that contains ribose 3-phosphate and
aminoisopropanol. One of the nitrogen atoms of dimethylbenzimidazole is
linked to the cobalt atom. In coenzyme B12, the sixth substituent linked to the
cobalt atom is a 5'-deoxyadenosyl unit. This position can also be occupied by
a cyano group, a methyl group, or other ligands. In all of these compounds,
the cobalt is in the +3 oxidation state.


Mechanism: Methylmalonyl CoA mutase catalyzes a 
rearrangement to form succinyl CoA 


The rearrangement reactions catalyzed by coenzyme B12 are exchanges of
two groups attached to adjacent carbon atoms of the substrate (Figure 22.15).
A hydrogen atom migrates from one carbon atom to the next, and an R
group (such as the -CO-S-CoA group of methylmalonyl CoA)
concomitantly moves in the reverse direction. The first step in these
intramolecular rearrangements is the cleavage of the carbon-cobalt bond of
5'-deoxyadenosylcobalamin to generate the Co2+ form of the coenzyme
and a 5'-deoxyadenosyl radical, -CH2• (Figure 22.16). In this homolytic
cleavage reaction, one electron of the Co-C bond stays with Co (reducing it
from the +3 to the +2 oxidation state), whereas the other electron stays
with the carbon atom, generating a free radical. In contrast, nearly all other
cleavage reactions in biological systems are heterolytic: an electron pair is
transferred to one of the two atoms that were bonded together.




What is the role of this very unusual -CH2• radical? This highly reactive
species abstracts a hydrogen atom from the substrate to form
5'-deoxyadenosine and a substrate radical (Figure 22.17). This substrate radical
spontaneously rearranges: the carbonyl CoA group migrates to the position
formerly occupied by H on the neighboring carbon atom to produce a
different radical. This product radical abstracts a hydrogen atom from the
methyl group of 5'-deoxyadenosine to complete the rearrangement and
return the deoxyadenosyl unit to the radical form. The role of coenzyme B12
in such intramolecular migrations is to serve as a source of free radicals for the
abstraction of hydrogen atoms.

Figure 22.15 Rearrangement reaction
catalyzed by cobalamin enzymes. The
R group can be an amino group, a hydroxyl
group, or a substituted carbon.

Figure 22.16 Formation of a
5'-deoxyadenosyl radical. The
methylmalonyl CoA mutase reaction begins
with the homolytic cleavage of the bond
joining Co3+ of coenzyme B12 to a carbon
atom of the ribose of the adenosine moiety of
the enzyme. The cleavage generates a
5'-deoxyadenosyl radical and leads to the
reduction of Co3+ to Co2+. The letter R
represents the 5'-deoxyadenosyl component
of the coenzyme, and the green oval
represents the remainder of the coenzyme.

Figure 22.17 Formation of succinyl CoA
by a rearrangement reaction. A free radical
abstracts a hydrogen atom in the
rearrangement of methylmalonyl CoA to
succinyl CoA.

Figure 22.18 Active site of
methylmalonyl CoA mutase. Notice that a
histidine residue from the enzyme binds to
cobalt in place of benzimidazole. This
arrangement of substrate and coenzyme in
the active site facilitates the cleavage of the
cobalt-carbon bond and the subsequent
abstraction of a hydrogen atom from the
substrate. [Drawn from 4REQ.pdb.]

Figure 22.19 Electron micrograph of a
peroxisome in a liver cell. A crystal of urate
oxidase is present inside the organelle, which
is bounded by a single bilayer membrane.
The dark granular structures outside the
peroxisome are glycogen particles. [Courtesy
of Dr. George Palade.]

An essential property of coenzyme B12 is the weakness of its
cobalt-carbon bond, which is readily cleaved to generate a radical. To facilitate the
cleavage of this bond, enzymes such as methylmalonyl CoA mutase displace
the benzimidazole group from the cobalamin and bind to the cobalt atom
through a histidine residue (Figure 22.18). The steric crowding around the
cobalt-carbon bond within the corrin ring system contributes to the bond
weakness.


Fatty acids are also oxidized in peroxisomes 


Although most fatty acid oxidation takes place in mitochondria, some
oxidation of fatty acids can take place in cellular organelles called peroxisomes
(Figure 22.19). These organelles are small membrane-bounded compartments
that are present in the cells of most eukaryotes. Fatty acid oxidation
in these organelles, which halts at octanoyl CoA, may serve to shorten long
chains to make them better substrates of β oxidation in mitochondria.
Peroxisomal oxidation differs from β oxidation in the initial dehydrogenation
reaction (Figure 22.20). In peroxisomes, acyl CoA dehydrogenase, a
flavoprotein, transfers electrons from the substrate to FADH2 and then
to O2 to yield H2O2 instead of capturing high-energy electrons as FADH2
for use in the electron-transport chain, as in mitochondrial β oxidation.
Peroxisomes contain high concentrations of the enzyme catalase to degrade
H2O2 into water and O2. Subsequent steps are identical with those of their
mitochondrial counterparts, although they are carried out by different
isoforms of the enzymes.


Peroxisomes do not function in patients with Zellweger syndrome.
Liver, kidney, and muscle abnormalities usually lead to death by age
six. The syndrome is caused by a defect in the import of enzymes into the
peroxisomes. Here we see a pathological condition resulting from an
inappropriate cellular distribution of enzymes.

Ketone bodies are formed from acetyl CoA when fat
breakdown predominates


The acetyl CoA formed in fatty acid oxidation enters the citric acid cycle
only if fat and carbohydrate degradation are appropriately balanced. Acetyl
CoA must combine with oxaloacetate to gain entry to the citric acid cycle.
The availability of oxaloacetate, however, depends on an adequate supply of
carbohydrate. Recall that oxaloacetate is normally formed from pyruvate,
the product of glucose degradation in glycolysis. If carbohydrate is
unavailable or improperly utilized, the concentration of oxaloacetate is lowered and
acetyl CoA cannot enter the citric acid cycle. This dependency is the
molecular basis of the adage that fats burn in the flame of carbohydrates.

In fasting or diabetes, oxaloacetate is consumed to form glucose by the
gluconeogenic pathway (Section 16.3) and hence is unavailable for
condensation with acetyl CoA. Under these conditions, acetyl CoA is diverted to
the formation of acetoacetate and D-3-hydroxybutyrate. Acetoacetate,
D-3-hydroxybutyrate, and acetone are often referred to as ketone bodies.
Abnormally high levels of ketone bodies are present in the blood of
untreated diabetics.

Acetoacetate is formed from acetyl CoA in three steps (Figure 22.21).
Two molecules of acetyl CoA condense to form acetoacetyl CoA. This
reaction, which is catalyzed by thiolase, is the reverse of the thiolysis step in the
oxidation of fatty acids. Acetoacetyl CoA then reacts with acetyl CoA and
water to give 3-hydroxy-3-methylglutaryl CoA (HMG-CoA) and CoA.

Figure 22.21 Formation of ketone bodies. The ketone bodies (acetoacetate,
D-3-hydroxybutyrate, and acetone) are formed from acetyl CoA, primarily in the liver.
Enzymes catalyzing these reactions are (1) 3-ketothiolase, (2) hydroxymethylglutaryl
CoA synthase, (3) hydroxymethylglutaryl CoA cleavage enzyme, and
(4) D-3-hydroxybutyrate dehydrogenase.
Acetoacetate spontaneously decarboxylates to form acetone.

Figure 22.22 PATHWAY
INTEGRATION: Liver supplies
ketone bodies to the peripheral
tissues. During fasting or in
untreated diabetics, the liver
converts fatty acids into ketone
bodies, which are a fuel source for
a number of tissues. Ketone-body
production is especially important
during starvation, when ketone
bodies are the predominant fuel.


This condensation resembles the one catalyzed by citrate synthase
(Section 17.2). This reaction, which has a favorable equilibrium owing to
the hydrolysis of a thioester linkage, compensates for the unfavorable
equilibrium in the formation of acetoacetyl CoA. 3-Hydroxy-3-methylglutaryl
CoA is then cleaved to acetyl CoA and acetoacetate. The sum of these
reactions is


2 Acetyl CoA + H2O → acetoacetate + 2 CoA + H+
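
The sum can be verified by adding the three steps species by species. The sketch below is a simple bookkeeping aid with hypothetical names; assigning the proton to the HMG-CoA synthase step is an assumption made for the tally, because the text states the proton only in the overall equation.

```python
from collections import Counter

# Each step is (reactants, products) with stoichiometric coefficients.
STEPS = [
    # 1. Thiolase: condensation of two acetyl CoA.
    ({"acetyl CoA": 2}, {"acetoacetyl CoA": 1, "CoA": 1}),
    # 2. HMG-CoA synthase (H+ placed here as an assumption).
    ({"acetoacetyl CoA": 1, "acetyl CoA": 1, "H2O": 1},
     {"HMG-CoA": 1, "CoA": 1, "H+": 1}),
    # 3. HMG-CoA cleavage enzyme.
    ({"HMG-CoA": 1}, {"acetyl CoA": 1, "acetoacetate": 1}),
]


def net_reaction(steps):
    """Sum the steps; negative counts are consumed, positive are produced."""
    net = Counter()
    for reactants, products in steps:
        for species, n in reactants.items():
            net[species] -= n
        for species, n in products.items():
            net[species] += n
    # Intermediates that cancel (acetoacetyl CoA, HMG-CoA) are dropped.
    return {species: n for species, n in net.items() if n != 0}


if __name__ == "__main__":
    print(net_reaction(STEPS))
```

The intermediates cancel, leaving two acetyl CoA and one water consumed and one acetoacetate, two CoA, and one proton produced.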


D-3-Hydroxybutyrate is formed by the reduction of acetoacetate in the
mitochondrial matrix by D-3-hydroxybutyrate dehydrogenase. The ratio
of hydroxybutyrate to acetoacetate depends on the NADH/NAD+ ratio
inside mitochondria.

Because it is a β-ketoacid, acetoacetate also undergoes a slow,
spontaneous decarboxylation to acetone. The odor of acetone may be detected in the
breath of a person who has a high level of acetoacetate in the blood.


Ketone bodies are a major fuel in some tissues 


The major site of the production of acetoacetate and 3-hydroxybutyrate is 
the liver. These substances diffuse from the liver mitochondria into the 
blood and are transported to other tissues such as heart and kidney 
(Figure 22.22). Acetoacetate and 3-hydroxybutyrate are normal fuels of 
respiration and are quantitatively important as sources of energy. Indeed, 
heart muscle and the renal cortex use acetoacetate in preference to glucose. 
In contrast, glucose is the major fuel for the brain and red blood cells in 
well-nourished people on a balanced diet. However, the brain adapts to the
utilization of acetoacetate during starvation and diabetes. In prolonged
starvation, 75% of the fuel needs of the brain are met by ketone bodies.

Acetoacetate is converted into acetyl CoA in two steps. First, acetoace- 
tate is activated by the transfer of CoA from succinyl CoA in a reaction 
catalyzed by a specific CoA transferase. Second, acetoacetyl CoA is cleaved 
by thiolase to yield two molecules of acetyl CoA, which can then enter the 
citric acid cycle (Figure 22.23). The liver has acetoacetate available to sup- 
ply to other organs because it lacks this particular CoA transferase. 
3-Hydroxybutyrate requires an additional step to yield acetyl CoA. It is 
first oxidized to produce acetoacetate, which is processed as heretofore 
described, and NADH for use in oxidative phosphorylation. 



Ketone bodies can be regarded as a water-soluble, transportable form of 
acetyl units. Fatty acids are released by adipose tissue and converted into 
acetyl units by the liver, which then exports them as acetoacetate. As might 
be expected, acetoacetate also has a regulatory role. High levels of acetoace- 
tate in the blood signify an abundance of acetyl units and lead to a decrease in 
the rate of lipolysis in adipose tissue. 


High blood levels of ketone bodies, the result of certain pathological
conditions, can be life threatening. The most common of these conditions
is diabetic ketosis in patients with insulin-dependent diabetes mellitus.
These patients are unable to produce insulin. As stated earlier, this hormone,
normally released after meals, signals tissues to take up glucose. In addition,
it curtails fatty acid mobilization by adipose tissue. The absence of insulin
has two major biochemical consequences (Figure 22.24). First, the liver
cannot absorb glucose and consequently cannot provide oxaloacetate to
process fatty acid-derived acetyl CoA. Second, adipose cells continue to
release fatty acids into the bloodstream, which are taken up by the liver and
converted into ketone bodies. The liver thus produces large amounts of
ketone bodies, which are moderately strong acids. The result is severe
acidosis. The decrease in pH impairs tissue function, most importantly in
the central nervous system.

Figure 22.23 Utilization of acetoacetate
as a fuel. Acetoacetate can be converted into
two molecules of acetyl CoA, which then
enter the citric acid cycle.

Figure 22.24 Diabetic ketosis results
when insulin is absent. In the absence of
insulin, fats are released from adipose tissue,
and glucose cannot be absorbed by the liver or
adipose tissue. The liver degrades the fatty
acids by β oxidation but cannot process the
acetyl CoA, because of a lack of
glucose-derived oxaloacetate (OAA). Excess ketone
bodies are formed and released into the blood.

Interestingly, diets that promote ketone-body formation, called keto- 
genic diets, are frequently used as a therapeutic option for children with 
drug-resistant epilepsy. Ketogenic diets are rich in fats and low in carbohy- 
drates, with adequate amounts of protein. In essence, the body is forced into 
starvation mode, where fats and ketone bodies become the main fuel source 
(Section 27.5). How such diets reduce the seizures suffered by the children 
is currently unknown. 


Animals cannot convert fatty acids into glucose 


A typical human being has far greater fat stores than glycogen stores. 
However, glycogen is necessary to fuel very active muscle, as well as the 
brain, which normally uses only glucose as a fuel. When glycogen stores are 
low, why can’t the body make use of fat stores and convert fatty acids into 
glucose? Because animals are unable to effect the net synthesis of glucose 
from fatty acids. Specifically, acetyl CoA cannot be converted into pyruvate 
or oxaloacetate in animals. Recall that the reaction that generates acetyl 
CoA from pyruvate is irreversible (Section 17.1). The two carbon atoms of 
the acetyl group of acetyl CoA enter the citric acid cycle, but two carbon 
atoms leave the cycle in the decarboxylations catalyzed by isocitrate
dehydrogenase and α-ketoglutarate dehydrogenase. Consequently, oxaloacetate
is regenerated, but it is not formed de novo when the acetyl unit of acetyl
CoA is oxidized by the citric acid cycle. In essence, two carbon atoms enter
the cycle as an acetyl group, but two carbons leave the cycle as CO2 before
oxaloacetate is generated. Consequently, no net synthesis of oxaloacetate is
possible. In contrast, plants have two additional enzymes enabling them to 
convert the carbon atoms of acetyl CoA into oxaloacetate (Section 17.5). 
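
The carbon argument can be made explicit with a minimal count. The sketch below tracks only the number of carbon atoms in the cycle intermediate, not their identities; the function name and default values are illustrative.

```python
def carbons_after_one_turn(oxaloacetate_carbons=4, acetyl_carbons=2):
    """Follow the carbon count through one turn of the citric acid cycle."""
    pool = oxaloacetate_carbons + acetyl_carbons  # condensation gives 6-carbon citrate
    pool -= 1  # CO2 lost at isocitrate dehydrogenase
    pool -= 1  # CO2 lost at α-ketoglutarate dehydrogenase
    return pool  # carbons in the regenerated oxaloacetate


if __name__ == "__main__":
    regenerated = carbons_after_one_turn()
    print("carbons regenerated:", regenerated, "net gain:", regenerated - 4)
```

Four carbons enter as oxaloacetate and four leave as oxaloacetate, so the net gain is zero however many acetyl units pass through.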


22.4 Fatty Acids Are Synthesized by Fatty Acid Synthase 


Fatty acids are synthesized by a complex of enzymes that together are called 
fatty acid synthase. Because eating a typical Western diet meets our physi- 
ological needs for fats and lipids, adult human beings have little need for de 
novo fatty acid synthesis. However, many tissues, such as liver and adipose 
tissue, are capable of synthesizing fatty acids, and this synthesis is required 
under certain physiological conditions. For instance, fatty acid synthesis is 
necessary during embryonic development and during lactation in mammary 
glands. Inappropriate fatty acid synthesis in the liver of alcoholics contrib- 
utes to liver failure. 

Acetyl CoA, the end product of fatty acid degradation, is the precursor
for virtually all fatty acids. The biochemical challenge is to link the
two-carbon units together and reduce the carbons to produce palmitate, a C16
fatty acid. Palmitate then serves as a precursor for the variety of other fatty
acids.


Fatty acids are synthesized and degraded by different pathways 


Although fatty acid synthesis is the reversal of the degradative pathway in 
regard to basic chemical reactions, the synthetic and degradative pathways 
are different mechanistically, again exemplifying the principle that synthetic 
and degradative pathways are almost always distinct. Some important differ- 
ences between the pathways are as follows: 


1. Synthesis takes place in the cytoplasm, in contrast with degradation,
which takes place primarily in the mitochondrial matrix.


2. Intermediates in fatty acid synthesis are covalently linked to the 
sulfhydryl groups of an acyl carrier protein (ACP), whereas intermediates in 
fatty acid breakdown are covalently attached to the sulfhydryl group of 
coenzyme A. 


3. The enzymes of fatty acid synthesis in higher organisms are joined in a 
single polypeptide chain called fatty acid synthase. In contrast, the degrada- 
tive enzymes do not seem to be associated. 


4. The growing fatty acid chain is elongated by the sequential addition of
two-carbon units derived from acetyl CoA. The activated donor of
two-carbon units in the elongation step is malonyl ACP. The elongation reaction
is driven by the release of CO2.


5. The reductant in fatty acid synthesis is NADPH, whereas the oxidants
in fatty acid degradation are NAD+ and FAD.


6. Elongation by the fatty acid synthase complex stops on the formation of 
palmitate (C16). Further elongation and the insertion of double bonds are 
carried out by other enzyme systems. 


The formation of malonyl CoA is the committed step in 
fatty acid synthesis 


Fatty acid synthesis starts with the carboxylation of acetyl CoA to malonyl
CoA. This irreversible reaction is the committed step in fatty acid
synthesis.


Acetyl CoA + ATP + HCO3- → malonyl CoA + ADP + Pi + H+


The synthesis of malonyl CoA is catalyzed by acetyl CoA carboxylase,
which contains a biotin prosthetic group. The carboxyl group of biotin is
covalently attached to the ε-amino group of a lysine residue, as in pyruvate
carboxylase (see Figure 16.24) and propionyl CoA carboxylase (p. 650). As
with these other enzymes, a carboxybiotin intermediate is formed at the
expense of the hydrolysis of a molecule of ATP. The activated CO2 group in
this intermediate is then transferred to acetyl CoA to form malonyl CoA.


Biotin-enzyme + ATP + HCO3- ⇌ CO2-biotin-enzyme + ADP + Pi


CO2-biotin-enzyme + acetyl CoA → malonyl CoA + biotin-enzyme


Acetyl CoA carboxylase is also the essential regulatory enzyme for fatty
acid metabolism (Section 22.5).


Intermediates in fatty acid synthesis are attached to an 
acyl carrier protein 


The intermediates in fatty acid synthesis are linked to an acyl carrier
protein. Specifically, they are linked to the sulfhydryl terminus of a
phosphopantetheine group. In the degradation of fatty acids, this unit is present as
part of coenzyme A, whereas, in their synthesis, it is attached to a serine
residue of the acyl carrier protein (Figure 22.25). Thus, ACP, a single
polypeptide chain of 77 residues, can be regarded as a giant prosthetic group, a
“macro CoA.”

Figure 22.25 Phosphopantetheine. Both
acyl carrier protein and coenzyme A include
phosphopantetheine as their reactive units.


Fatty acid synthesis consists of a series of condensation, reduction, 
dehydration, and reduction reactions 


The enzyme system that catalyzes the synthesis of saturated long-chain 
fatty acids from acetyl CoA, malonyl CoA, and NADPH is called the fatty 
acid synthase. The synthase is actually a complex of distinct enzymes. The 
fatty acid synthase complex in bacteria is readily dissociated into individual 
enzymes when the cells are broken apart. The availability of these isolated 
enzymes has helped biochemists elucidate the steps in fatty acid synthesis 
(Table 22.2). In fact, the reactions leading to fatty acid synthesis in higher 
organisms are very much like those of bacteria. 

The elongation phase of fatty acid synthesis starts with the formation of 
acetyl ACP and malonyl ACP. Acetyl transacylase and malonyl transacylase 
catalyze these reactions. 


Acetyl CoA + ACP ⇌ acetyl ACP + CoA
Malonyl CoA + ACP ⇌ malonyl ACP + CoA


Malonyl transacylase is highly specific, whereas acetyl transacylase can 
transfer acyl groups other than the acetyl unit, though at a much slower 
rate. The synthesis of fatty acids with an odd number of carbon atoms 
starts with propionyl ACP, which is formed from propionyl CoA by acetyl 
transacylase. 

Acetyl ACP and malonyl ACP react to form acetoacetyl ACP
(Figure 22.26). The β-ketoacyl synthase, also called the condensing enzyme,
catalyzes this condensation reaction.


Acetyl ACP + malonyl ACP → acetoacetyl ACP + ACP + CO₂


Table 22.2 Principal reactions in fatty acid synthesis in bacteria 


Step  Reaction                                                       Enzyme
1     Acetyl CoA + HCO₃⁻ + ATP → malonyl CoA + ADP + Pᵢ + H⁺         Acetyl CoA carboxylase
2     Acetyl CoA + ACP ⇌ acetyl ACP + CoA                            Acetyl transacylase
3     Malonyl CoA + ACP ⇌ malonyl ACP + CoA                          Malonyl transacylase
4     Acetyl ACP + malonyl ACP → acetoacetyl ACP + ACP + CO₂         β-Ketoacyl synthase
5     Acetoacetyl ACP + NADPH + H⁺ ⇌ D-3-hydroxybutyryl ACP + NADP⁺  β-Ketoacyl reductase
6     D-3-Hydroxybutyryl ACP ⇌ crotonyl ACP + H₂O                    3-Hydroxyacyl dehydratase
7     Crotonyl ACP + NADPH + H⁺ → butyryl ACP + NADP⁺                Enoyl reductase


In the condensation reaction, a four-carbon unit is formed from a
two-carbon unit and a three-carbon unit, and CO₂ is released. Why is the
four-carbon unit not formed from two 2-carbon units (say, two molecules of
acetyl ACP)? The answer is that the equilibrium for the synthesis of
acetoacetyl ACP from two molecules of acetyl ACP is highly unfavorable. In
contrast, the equilibrium is favorable if malonyl ACP is a reactant because
its decarboxylation contributes a substantial decrease in free energy. In
effect, ATP drives the condensation reaction, though ATP does not directly
participate in the condensation reaction. Instead, ATP is used to
carboxylate acetyl CoA to malonyl CoA. The free energy thus stored in
malonyl CoA is released in the decarboxylation accompanying the formation
of acetoacetyl ACP. Although HCO₃⁻ is required for fatty acid synthesis,
its carbon atom does not appear in the product. Rather, all the carbon
atoms of fatty acids containing an even number of carbon atoms are derived
from acetyl CoA.

The next three steps in fatty acid synthesis reduce the keto group at C-3
to a methylene group (see Figure 22.26). First, acetoacetyl ACP is reduced
to D-3-hydroxybutyryl ACP by β-ketoacyl reductase. This reaction differs
from the corresponding one in fatty acid degradation in two respects: (1)
the D rather than the L isomer is formed; and (2) NADPH is the reducing
agent, whereas NAD⁺ is the oxidizing agent in β oxidation. This difference
exemplifies the general principle that NADPH is consumed in biosynthetic
reactions, whereas NADH is generated in energy-yielding reactions. Then
D-3-hydroxybutyryl ACP is dehydrated by 3-hydroxyacyl dehydratase to form
crotonyl ACP, which is a trans-Δ²-enoyl ACP. The final step in the cycle
reduces crotonyl ACP to butyryl ACP. NADPH is again the reductant, whereas
FAD is the oxidant in the corresponding reaction in β oxidation. The
bacterial enzyme that catalyzes this step, enoyl reductase, can be
inhibited by triclosan, a broad-spectrum antibacterial agent that is added
to a variety of products such as toothpaste, soaps, and skin creams. These
last three reactions (a reduction, a dehydration, and a second reduction)
convert acetoacetyl ACP into butyryl ACP, which completes the first
elongation cycle.

In the second round of fatty acid synthesis, butyryl ACP condenses with
malonyl ACP to form a C₆-β-ketoacyl ACP. This reaction is like the one in
the first round, in which acetyl ACP condenses with malonyl ACP to form
a C₄-β-ketoacyl ACP. Reduction, dehydration, and a second reduction
convert the C₆-β-ketoacyl ACP into a C₆-acyl ACP, which is ready for a
third round of elongation. The elongation cycles continue until C₁₆-acyl
ACP is formed. This intermediate is a good substrate for a thioesterase
that hydrolyzes C₁₆-acyl ACP to yield palmitate and ACP. The thioesterase
acts as a ruler to determine fatty acid chain length. The synthesis of
longer-chain fatty acids is discussed in Section 22.5.
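
The round-by-round bookkeeping described above can be tallied in a few
lines of Python. This is only an illustrative sketch, not a kinetic model:
the function name elongate_to and the dictionary keys are invented for the
example. It assumes, as in Table 22.2, that every round adds two carbons
from malonyl ACP, releases one CO2, and consumes two NADPH.

```python
def elongate_to(target_carbons=16):
    """Tally rounds of elongation from acetyl ACP (C2) to an acyl ACP.

    Hypothetical helper for illustration; it counts molecules only.
    """
    if target_carbons < 4 or target_carbons % 2 != 0:
        raise ValueError("target must be an even number of carbons >= 4")
    chain = 2  # the acetyl ACP primer
    tally = {"rounds": 0, "malonyl_ACP": 0, "NADPH": 0, "CO2": 0}
    while chain < target_carbons:
        chain += 2                 # condensation: +3 C from malonyl, -1 C as CO2
        tally["malonyl_ACP"] += 1
        tally["CO2"] += 1
        tally["NADPH"] += 2        # one NADPH for each of the two reductions
        tally["rounds"] += 1
    return chain, tally


if __name__ == "__main__":
    print(elongate_to(16))
```

Calling elongate_to(16) reproduces the seven rounds needed to reach the
C16 acyl ACP that the thioesterase releases as palmitate.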


Fatty acids are synthesized by a multifunctional enzyme 
complex in animals 


Although the basic biochemical reactions in fatty acid synthesis are very 
similar in E. coli and eukaryotes, the structure of the synthase varies consid- 
erably. The component enzymes of animal fatty acid synthases, in contrast 
with those of E. coli and plants, are linked in a large polypeptide chain. 

The structure of a large part of the mammalian fatty acid synthase has 
recently been determined, with the acyl carrier protein and thioesterase 
remaining to be resolved. The enzyme is a dimer of identical 270-kd sub- 
units. Each chain contains all of the active sites required for activity, as well 
as an acyl carrier protein tethered to the complex (Figure 22.27A). Despite 
the fact that each chain possesses all of the enzymes required for fatty acid 
synthesis, the monomers are not active. A dimer is required. 


Figure 22.26 The steps of fatty acid synthesis. Fatty acid synthesis begins
with the condensation of malonyl ACP and acetyl ACP to form acetoacetyl
ACP. Acetoacetyl ACP is then reduced, dehydrated, and reduced again to form
butyryl ACP. Another cycle begins with the condensation of butyryl ACP and
malonyl ACP. The sequence of reactions is repeated until the final product
palmitate is formed. Steps shown: condensation (ACP + CO₂ released) to
acetoacetyl ACP; reduction (NADPH to NADP⁺) to D-3-hydroxybutyryl ACP;
dehydration (H₂O released) to crotonyl ACP; reduction (NADPH to NADP⁺) to
butyryl ACP.
Figure 22.27 The structure of the mammalian fatty acid synthase. (A) The
arrangement of the catalytic activities present in a single polypeptide
chain. (B) A cartoon of the dimer based on an x-ray crystallographic
result. The Ψ-MT and Ψ-KR are inactive domains similar to methyl
transferase and ketoreductase sequences. Although there are two domains for
DH, only one is active. The inactive domains are presented in faded colors.
Dotted lines outline domains for which the structure has not yet been
determined. Abbreviations: KS, ketosynthase; MAT, malonyl-acetyl
transferase; DH, dehydratase; Ψ-MT, methyl transferase (inactive); Ψ-KR,
ketoreductase (inactive); ER, enoyl reductase; KR, ketoreductase; ACP, acyl
carrier protein; TE, thioesterase.
Panel labels: (A) KS, MAT, DH1, DH2, Ψ-MT, Ψ-KR, ER. (B) Modification
compartment; selecting and condensing compartment.


The two component chains interact such that the enzyme activities are 
partitioned into two distinct compartments (Figure 22.27B). The selecting 
and condensing compartment binds the acetyl and malonyl substrates and 
condenses them to form the growing chain. Interestingly, the mammalian 
fatty acid synthase has one active site, malonyl-acetyl transacylase, that 
adds both acetyl CoA and malonyl CoA. In contrast, most other fatty acid 
synthases have two separate enzyme activities, one for acetyl CoA and one 
for malonyl CoA. The modification compartment is responsible for the 
reduction and dehydration activities that result in the saturated fatty acid 
product. 

Let us consider one catalytic cycle of the fatty acid synthase complex
(Figure 22.28). An elongation cycle begins when malonyl-acetyl transferase
(MAT) moves an acetyl unit from coenzyme A to the acyl carrier protein
(ACP). β-Keto synthase (β-KS) accepts the acetyl unit, which forms a
thioester with a cysteine residue at the β-KS active site. The vacant ACP is
reloaded by MAT, this time with a malonyl moiety. Malonyl ACP visits the
active site of β-KS, where the condensation of the two 2-carbon fragments
takes place on the ACP with the concomitant release of CO₂. The selecting
and condensing process concludes with the β-ketoacyl product attached to
the ACP.

The loaded ACP then sequentially visits the active sites of the
modification compartment of the enzyme, where the β-keto group of the
substrate is reduced to —OH, dehydrated, and finally reduced to yield the
saturated acyl product, still attached to the ACP. With the completion of
the modification process, the reduced product is transferred to the β-KS
while the ACP accepts another malonyl unit. Condensation takes place and is
followed by another modification cycle. The process is repeated until the
thioesterase releases the final C₁₆ palmitic acid product.

Many eukaryotic multienzyme complexes are multifunctional proteins
in which different enzymes are linked covalently. An advantage of this
arrangement is that the synthetic activity of different enzymes is
coordinated. In addition, intermediates can be efficiently handed from one
active site to another without leaving the assembly. Furthermore, a complex
of covalently joined enzymes is more stable than one formed by noncovalent
attractions. Each of the component enzymes is recognizably homologous to
its bacterial counterpart. Multifunctional enzymes such as fatty acid
synthase seem likely to have arisen in eukaryotic evolution by fusion of
the individual genes of evolutionary ancestors.


Figure 22.28 A catalytic cycle of mammalian fatty acid synthase. The cycle
begins when MAT (not shown) attaches an acetyl unit to ACP. (1) ACP
delivers the acetyl unit to KS, and MAT then attaches a malonyl unit to
ACP. (2) ACP visits KS again, which condenses the acetyl and malonyl units
to form the β-ketoacyl product, attached to the ACP. (3) ACP delivers the
β-ketoacyl product to the KR enzyme, which reduces the keto group to an
alcohol. (4) The β-hydroxyl product then visits the DH, which introduces a
double bond with the loss of water. (5) The enoyl product is delivered to
the ER enzyme, where the double bond is reduced. (6) ACP hands the reduced
product to KS and is recharged with malonyl CoA by MAT. (7) KS condenses
the two molecules on ACP, which is now ready to begin another cycle. See
Figure 22.27 for abbreviations.


The synthesis of palmitate requires 8 molecules of acetyl CoA, 
14 molecules of NADPH, and 7 molecules of ATP 


The stoichiometry of the synthesis of palmitate is 


Acetyl CoA + 7 malonyl CoA + 14 NADPH + 20 H⁺ →
palmitate + 7 CO₂ + 14 NADP⁺ + 8 CoA + 6 H₂O


The equation for the synthesis of the malonyl CoA used in the preceding 
reaction is 


7 Acetyl CoA + 7 CO₂ + 7 ATP →
7 malonyl CoA + 7 ADP + 7 Pᵢ + 14 H⁺
Lyases

Enzymes catalyzing the cleavage of C-C, C-O, or C-N bonds by elimination.
A double bond is formed in these reactions.


Figure 22.29 Transfer of acetyl CoA to the cytoplasm. Acetyl CoA is
transferred from mitochondria to the cytoplasm, and the reducing potential
of NADH is concomitantly converted into that of NADPH by this series of
reactions.


Hence, the overall stoichiometry for the synthesis of palmitate is 


8 Acetyl CoA + 7 ATP + 14 NADPH + 6 H⁺ →
palmitate + 14 NADP⁺ + 8 CoA + 6 H₂O + 7 ADP + 7 Pᵢ
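
The overall equation is simply the sum of the synthase reaction and the
carboxylase reaction, with malonyl CoA and CO2 canceling and 14 of the 20
protons offset. The short Python sketch below checks that arithmetic. The
helper net_reaction and the plain-text species labels are made up for this
illustration; the coefficients are the ones given in the two equations
above.

```python
from collections import Counter


def net_reaction(*reactions):
    """Add reactions given as (reactants, products) dicts and cancel
    species that appear on both sides. Hypothetical helper for illustration."""
    net = Counter()
    for reactants, products in reactions:
        for species, n in products.items():
            net[species] += n
        for species, n in reactants.items():
            net[species] -= n
    left = {s: -n for s, n in net.items() if n < 0}   # net reactants
    right = {s: n for s, n in net.items() if n > 0}   # net products
    return left, right


# Palmitate synthesis from acetyl CoA and malonyl CoA
synthase = (
    {"acetyl CoA": 1, "malonyl CoA": 7, "NADPH": 14, "H+": 20},
    {"palmitate": 1, "CO2": 7, "NADP+": 14, "CoA": 8, "H2O": 6},
)
# Formation of the seven malonyl CoA molecules
carboxylase = (
    {"acetyl CoA": 7, "CO2": 7, "ATP": 7},
    {"malonyl CoA": 7, "ADP": 7, "Pi": 7, "H+": 14},
)

left, right = net_reaction(synthase, carboxylase)

if __name__ == "__main__":
    print(left)
    print(right)
```

The net reactants come out as 8 acetyl CoA, 7 ATP, 14 NADPH, and 6 H+,
in agreement with the overall stoichiometry stated above.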


Citrate carries acetyl groups from mitochondria to the cytoplasm for 
fatty acid synthesis 


Fatty acids are synthesized in the cytoplasm, whereas acetyl CoA is formed
from pyruvate in mitochondria. Hence, acetyl CoA must be transferred
from mitochondria to the cytoplasm for fatty acid synthesis. Mitochondria,
however, are not readily permeable to acetyl CoA. Recall that carnitine
carries only long-chain fatty acids. The barrier to acetyl CoA is bypassed
by citrate, which carries acetyl groups across the inner mitochondrial
membrane. Citrate is formed in the mitochondrial matrix by the condensation
of acetyl CoA with oxaloacetate (Figure 22.29). When present at high
levels, citrate is transported to the cytoplasm, where it is cleaved by
ATP-citrate lyase.


Citrate + ATP + CoA + H₂O →
acetyl CoA + ADP + Pᵢ + oxaloacetate


Thus, acetyl CoA and oxaloacetate are transferred from mitochondria to the 
cytoplasm at the expense of the hydrolysis of a molecule of ATP. 


Several sources supply NADPH for fatty acid synthesis 


Oxaloacetate formed in the transfer of acetyl groups to the cytoplasm must 
now be returned to the mitochondria. The inner mitochondrial membrane 
is impermeable to oxaloacetate. Hence, a series of bypass reactions are 
needed. Most importantly, these reactions generate much of the NADPH 
needed for fatty acid synthesis. First, oxaloacetate is reduced to malate 
by NADH. This reaction is catalyzed by a malate dehydrogenase in the 
cytoplasm. 


Oxaloacetate + NADH + H⁺ ⇌ malate + NAD⁺


Second, malate is oxidatively decarboxylated by an NADP⁺-linked malate
enzyme (also called malic enzyme).


Malate + NADP⁺ → pyruvate + CO₂ + NADPH


The pyruvate formed in this reaction readily enters mitochondria, where it 
is carboxylated to oxaloacetate by pyruvate carboxylase. 


Pyruvate + CO₂ + ATP + H₂O → oxaloacetate + ADP + Pᵢ + 2 H⁺


The sum of these three reactions is


NADP⁺ + NADH + ATP + H₂O →
NADPH + NAD⁺ + ADP + Pᵢ + H⁺


Thus, one molecule of NADPH is generated for each molecule of acetyl CoA 
that is transferred from mitochondria to the cytoplasm. Hence, eight molecules 
of NADPH are formed when eight molecules of acetyl CoA are transferred 
to the cytoplasm for the synthesis of palmitate. The additional six molecules 
of NADPH required for this process come from the pentose phosphate pathway 
(Section 20.3). 
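
The NADPH accounting for palmitate can be written out as a small
calculation. The sketch below is only a restatement of the arithmetic in
the preceding paragraph; the function name nadph_sources and its parameters
are invented for the example, and the one-NADPH-per-acetyl-CoA rule is taken
directly from the text.

```python
def nadph_sources(acetyl_coa_exported=8, nadph_required=14):
    """Split the NADPH requirement between the two sources named in the text.

    Hypothetical helper: assumes one NADPH is made by malic enzyme for each
    acetyl CoA carried out of the mitochondrion as citrate, and that the
    remainder comes from the pentose phosphate pathway.
    """
    from_malic_enzyme = min(acetyl_coa_exported, nadph_required)
    from_pentose_phosphate = nadph_required - from_malic_enzyme
    return {
        "malic enzyme": from_malic_enzyme,
        "pentose phosphate pathway": from_pentose_phosphate,
    }


if __name__ == "__main__":
    print(nadph_sources())
```

With the default values for palmitate (8 acetyl CoA exported, 14 NADPH
required), the split is 8 from malic enzyme and 6 from the pentose
phosphate pathway.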

The accumulation of the precursors for fatty acid synthesis is a wonder- 
ful example of the coordinated use of multiple pathways. The citric acid 
cycle, transport of oxaloacetate from the mitochondria, and pentose phos- 
phate pathway provide the carbon atoms and reducing power, whereas 
glycolysis and oxidative phosphorylation provide the ATP to meet the 
needs for fatty acid synthesis (Figure 22.30). 
Fatty acid synthase inhibitors may be useful drugs


Fatty acid synthase is overexpressed in most human cancers and its
expression is correlated with tumor malignancy. The fatty acids are 
not stored as an energy source, but rather are used as precursors for the 
synthesis of phospholipids, which are then incorporated into membranes in 
the rapidly growing cancer cells. Researchers intrigued by this observation 
have tested inhibitors of fatty acid synthase on mice to see if the inhibitors 
slow tumor growth. These inhibitors do indeed slow tumor growth, appar- 
ently by inducing apoptosis. However, another startling observation was 
made: mice treated with inhibitors of the condensing enzyme showed remark- 
able weight loss because they ate less. Thus, fatty acid synthase inhibitors are 
exciting candidates both as antitumor and as antiobesity drugs. 


22.5 The Elongation and Unsaturation of Fatty Acids Are 
Accomplished by Accessory Enzyme Systems 


The major product of the fatty acid synthase is palmitate. In eukaryotes,
longer fatty acids are formed by elongation reactions catalyzed by enzymes
on the cytoplasmic face of the endoplasmic reticulum membrane. These
reactions add two-carbon units sequentially to the carboxyl ends of both
saturated and unsaturated fatty acyl CoA substrates. Malonyl CoA is the
two-carbon donor in the elongation of fatty acyl CoAs. Again, condensation
is driven by the decarboxylation of malonyl CoA.


Figure 22.30 PATHWAY INTEGRATION: Fatty acid synthesis. Fatty acid
synthesis requires the cooperation of various metabolic pathways located in
different cellular compartments.


Figure 22.31 Electron-transport chain in the desaturation of fatty acids.


Precursor             Formula
Linolenate (ω-3)      CH₃-(CH₂)₂=CH-R
Linoleate (ω-6)       CH₃-(CH₂)₅=CH-R
Palmitoleate (ω-7)    CH₃-(CH₂)₆=CH-R
Oleate (ω-9)          CH₃-(CH₂)₈=CH-R


Membrane-bound enzymes generate unsaturated fatty acids 


Endoplasmic reticulum systems also introduce double bonds into long-chain
acyl CoAs. For example, in the conversion of stearoyl CoA into oleoyl
CoA, a cis-Δ⁹ double bond is inserted by an oxidase that employs molecular
oxygen and NADH (or NADPH).


Stearoyl CoA + NADH + H⁺ + O₂ →
oleoyl CoA + NAD⁺ + 2 H₂O


This reaction is catalyzed by a complex of three membrane-bound proteins:
NADH-cytochrome b₅ reductase, cytochrome b₅, and a desaturase
(Figure 22.31). First, electrons are transferred from NADH to the FAD
moiety of NADH-cytochrome b₅ reductase. The heme iron atom of cytochrome
b₅ is then reduced to the Fe²⁺ state. The nonheme iron atom of the
desaturase is subsequently converted into the Fe²⁺ state, which enables it
to interact with O₂ and the saturated fatty acyl CoA substrate. A double
bond is formed and two molecules of H₂O are released. Two electrons come
from NADH and two from the single bond of the fatty acyl substrate.


A variety of unsaturated fatty acids can be formed from oleate by a
combination of elongation and desaturation reactions. For example, oleate
can be elongated to a 20:1 cis-Δ¹¹ fatty acid. Alternatively, a second
double bond can be inserted to yield an 18:2 cis-Δ⁶,Δ⁹ fatty acid.
Similarly, palmitate (16:0) can be oxidized to palmitoleate (16:1 cis-Δ⁹),
which can then be elongated to cis-vaccenate (18:1 cis-Δ¹¹).

Unsaturated fatty acids in mammals are derived from either palmitoleate 
(16:1), oleate (18:1), linoleate (18:2), or linolenate (18:3). The number of 
carbon atoms from the w end of a derived unsaturated fatty acid to the 
nearest double bond identifies its precursor. 
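
The counting rule can be made concrete with a short Python sketch. Because
Δ positions are numbered from the carboxyl end, the ω number is the chain
length minus the highest Δ position. The function name omega_class and the
lookup table PRECURSORS are invented for this illustration; the four
precursor families are the ones listed above.

```python
# omega number -> precursor family named in the text
PRECURSORS = {3: "linolenate", 6: "linoleate", 7: "palmitoleate", 9: "oleate"}


def omega_class(n_carbons, double_bond_positions):
    """Return (omega number, precursor) for an unsaturated fatty acid.

    double_bond_positions are delta positions counted from the carboxyl
    carbon. Hypothetical helper for illustration only.
    """
    if not double_bond_positions:
        raise ValueError("a saturated fatty acid has no omega class")
    omega = n_carbons - max(double_bond_positions)
    return omega, PRECURSORS.get(omega)


if __name__ == "__main__":
    print(omega_class(18, [11]))      # cis-vaccenate, elongated palmitoleate
    print(omega_class(20, [11]))      # 20:1 fatty acid made from oleate
    print(omega_class(18, [9, 12]))   # linoleate itself
```

Elongation adds carbons at the carboxyl end, so it shifts the Δ numbers
but leaves the ω number unchanged, which is why the ω number identifies
the precursor.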

Mammals lack the enzymes to introduce double bonds at carbon atoms
beyond C-9 in the fatty acid chain. Hence, mammals cannot synthesize
linoleate (18:2 cis-Δ⁹,Δ¹²) and linolenate (18:3 cis-Δ⁹,Δ¹²,Δ¹⁵). Linoleate
and linolenate are the two essential fatty acids. The term essential means
that they must be supplied in the diet because they are required by an
organism and cannot be synthesized by the organism itself. Linoleate and
linolenate furnished by the diet are the starting points for the synthesis
of a variety of other unsaturated fatty acids.


Eicosanoid hormones are derived from polyunsaturated fatty acids 


Arachidonate, a 20:4 fatty acid derived from linoleate, is the major precursor 
of several classes of signal molecules: prostaglandins, prostacyclins, throm- 
boxanes, and leukotrienes (Figure 22.32). 


A prostaglandin is a 20-carbon fatty acid containing a 
5-carbon ring (Figure 22.33). This basic compound is 
modified by reductases and isomerases to yield nine major 
classes of prostaglandins, designated PGA through PGI; a 
subscript denotes the number of carbon-carbon double 
bonds outside the ring. Prostaglandins with two double 
bonds, such as PGE₂, are derived from arachidonate;
the other two double bonds of this precursor are lost in 
forming a 5-membered ring. Prostacyclin and throm- 
boxanes are related compounds that arise from a nascent 
prostaglandin. They are generated by prostacyclin synthase 
and thromboxane synthase, respectively. Alternatively, 
arachidonate can be converted into leukotrienes by the 
action of lipoxygenase. Leukotrienes, first found in leuko- 
cytes, contain three conjugated double bonds—hence, the 
name. Prostaglandins, prostacyclin, thromboxanes, and 
leukotrienes are called eicosanoids (from the Greek eikosi, 
“twenty”) because they contain 20 carbon atoms. 
Figure 22.32 Arachidonate is the major precursor of eicosanoid
hormones. Prostaglandin synthase catalyzes the first step in a pathway 
leading to prostaglandins, prostacyclins, and thromboxanes. 
Lipoxygenase catalyzes the initial step in a pathway leading to 
leukotrienes. 


Prostaglandins and other eicosanoids are local hormones because they are 
short-lived. They alter the activities both of the cells in which they are syn- 
thesized and of adjoining cells by binding to 7TM receptors. Their effects 
may vary from one cell type to another, in contrast with the more-uniform 
actions of global hormones such as insulin and glucagon. Prostaglandins 
stimulate inflammation, regulate blood flow to particular organs, control 
ion transport across membranes, modulate synaptic transmission, and
induce sleep.


Recall that aspirin blocks access to the active site of the enzyme that
converts arachidonate into prostaglandin H₂ (Section 12.3). Because
arachidonate is the precursor of other prostaglandins, prostacyclin, and
thromboxanes, blocking this step interferes with many signaling pathways.
Aspirin’s ability to obstruct these pathways accounts for its wide-ranging
effects on inflammation, fever, pain, and blood clotting.
Figure 22.33 Structures of several eicosanoids.


Figure 22.34 Control of acetyl CoA carboxylase. Acetyl CoA carboxylase is
inhibited by phosphorylation.


Figure 22.35 Filaments of acetyl CoA carboxylase. The electron micrograph
shows the enzymatically active filamentous form of acetyl CoA carboxylase
from chicken liver. The inactive form is a dimer of 265-kd subunits.
[Courtesy of Dr. M. Daniel Lane.]


22.6 Acetyl CoA Carboxylase Plays a Key Role in 
Controlling Fatty Acid Metabolism 


Fatty acid metabolism is stringently controlled so that synthesis and degra- 
dation are highly responsive to physiological needs. Fatty acid synthesis is 
maximal when carbohydrates and energy are plentiful and when fatty acids 
are scarce. Acetyl CoA carboxylase plays an essential role in regulating fatty 
acid synthesis and degradation. Recall that this enzyme catalyzes the com- 
mitted step in fatty acid synthesis: the production of malonyl CoA (the 
activated two-carbon donor). This important enzyme is subject to both local 
and hormonal regulation. We will examine each of these levels of regulation 
in turn.


Acetyl CoA carboxylase is regulated by conditions in the cell 


Acetyl CoA carboxylase responds to changes in its immediate 
environment. Acetyl CoA carboxylase is switched off by phos- 
phorylation and activated by dephosphorylation (Figure 22.34). 
AMP-dependent protein kinase (AMPK) converts the carboxy- 
lase into an inactive form by modifying three serine residues. 
AMPK is essentially a fuel gauge; it is activated by AMP and 
inhibited by ATP. Thus, the carboxylase is inactivated when 
the energy charge is low. Fats are not synthesized when energy 
is required. 

The carboxylase is also allosterically stimulated by citrate. 
Citrate acts in an unusual manner on inactive acetyl CoA car- 
boxylase, which exists as isolated inactive dimers. Citrate 
facilitates the polymerization of the inactive dimers into active filaments 
(Figure 22.35). Citrate-induced polymerization can partly reverse the inhi- 
bition produced by phosphorylation (Figure 22.36). The level of citrate is 
high when both acetyl CoA and ATP are abundant, signifying that raw 
materials and energy are available for fatty acid synthesis. The stimulatory 
effect of citrate on the carboxylase is counteracted by palmitoyl CoA, which 
is abundant when there is an excess of fatty acids. Palmitoyl CoA causes the 
filaments to disassemble into the inactive subunits. Palmitoyl CoA also 
inhibits the translocase that transports citrate from mitochondria to the 
cytoplasm, as well as glucose 6-phosphate dehydrogenase, which generates 
NADPH in the pentose phosphate pathway. 

Acetyl CoA carboxylase also plays a role in the regulation of fatty acid 
degradation. Malonyl CoA, the product of the carboxylase reaction, is pres- 
ent at a high level when fuel molecules are abundant. Malonyl CoA inhibits 
carnitine acyltransferase I, preventing the entry of fatty acyl CoAs into the 
mitochondrial matrix in times of plenty. Malonyl CoA is an especially effec- 
tive inhibitor of carnitine acyltransferase I in heart and muscle, tissues that 
have little fatty acid synthesis capacity of their own. In these tissues, acetyl 
CoA carboxylase may be a purely regulatory enzyme. 




Acetyl CoA carboxylase is regulated by a variety of hormones 


Acetyl CoA carboxylase is controlled by the hormones glucagon, epineph- 
rine, and insulin, which denote the overall energy status of the organism. 
Insulin stimulates fatty acid synthesis by activating the carboxylase, whereas 
glucagon and epinephrine have the reverse effect. 


Regulation by glucagon and epinephrine. Consider, again, a person who 
has just awakened from a night’s sleep and begins a bout of exercise. As 
mentioned, glycogen stores will be low, but lipids are readily available for 
mobilization. 


As stated earlier, the hormones glucagon and epinephrine, present 
under conditions of fasting and exercise, will stimulate the release of 
fatty acids from triacylglycerols in fat cells, which will be released into 
the blood, and probably from muscle cells, where they will be used 
immediately as fuel. These same hormones will inhibit fatty acid syn- 
thesis by inhibiting acetyl CoA carboxylase. Although the exact mecha- 
nism by which these hormones exert their effects is not known, the net 
result is to augment the inhibition by the AMP-dependent kinase. This 
result makes sound physiological sense: when the energy level of the cell 
is low, as signified by a high concentration of AMP, and the energy level 
of the organism is low, as signaled by glucagon, fats should not be syn- 
thesized. Epinephrine, which signals the need for immediate energy, 
enhances this effect. Hence, these catabolic hormones switch off fatty acid 
synthesis by keeping the carboxylase in the inactive phosphorylated state. 


Regulation by insulin. Now consider the situation after the exercise 
has ended and the runner has had a meal. In this case, the hormone 
insulin inhibits the mobilization of fatty acids and stimulates their accu- 
mulation as triacylglycerols by muscle and adipose tissue. Insulin also 
stimulates fatty acid synthesis by activating acetyl CoA carboxylase. 
Insulin stimulates the carboxylase by stimulating the activity of a pro- 
tein phosphatase that dephosphorylates and activates acetyl CoA car- 
boxylase. Thus, the signal molecules glucagon, epinephrine, and insulin 
act in concert on triacylglycerol metabolism and acetyl CoA carboxylase 
to carefully regulate the utilization and storage of fatty acids. 


Response to diet. Long-term control is mediated by changes in the rates
of synthesis and degradation of the enzymes participating in fatty acid
synthesis. Animals that have fasted and are then fed high-carbohydrate,
low-fat diets show marked increases in their amounts of acetyl CoA
carboxylase and fatty acid synthase within a few days. This type of
regulation is known as adaptive control. This regulation, which is
mediated both by insulin and glucose, is at the level of gene transcription.


Figure 22.36 Dependence of the catalytic activity of acetyl CoA carboxylase
on the concentration of citrate. (A) Citrate can partly activate the
phosphorylated carboxylase. (B) The dephosphorylated form of the
carboxylase is highly active even when citrate is absent. Citrate partly
overcomes the inhibition produced by phosphorylation. [After G. M. Mabrouk,
I. M. Helmy, K. G. Thampy, and S. J. Wakil. J. Biol. Chem. 265:6330–6338,
1990.]


Summary 


22.1 Triacylglycerols Are Highly Concentrated Energy Stores 
Fatty acids are physiologically important as (1) fuel molecules, (2) com- 
ponents of phospholipids and glycolipids, (3) hydrophobic modifiers 
of proteins, and (4) hormones and intracellular messengers. They are 
stored in adipose tissue as triacylglycerols (neutral fat). 


22.2 The Use of Fatty Acids As Fuel Requires Three Stages of Processing 
Triacylglycerols can be mobilized by the hydrolytic action of lipases 
that are under hormonal control. Glucagon and epinephrine stimulate 
triacylglycerol breakdown by activating the lipase. Insulin, in contrast, 
inhibits lipolysis. Fatty acids are activated to acyl CoAs, transported 
across the inner mitochondrial membrane by carnitine, and degraded 
in the mitochondrial matrix by a recurring sequence of four reactions: 
oxidation by FAD, hydration, oxidation by NAD⁺, and thiolysis by
coenzyme A. The FADH₂ and NADH formed in the oxidation steps
transfer their electrons to O₂ by means of the respiratory chain,
whereas the acetyl CoA formed in the thiolysis step normally enters 
the citric acid cycle by condensing with oxaloacetate. Mammals are 
unable to convert fatty acids into glucose, because they lack a pathway 
for the net production of oxaloacetate, pyruvate, or other gluconeo- 
genic intermediates from acetyl CoA. 
22.3 Unsaturated and Odd-Chain Fatty Acids Require Additional Steps for Degradation
Fatty acids that contain double bonds or odd numbers of carbon atoms
require ancillary steps to be degraded. An isomerase and a reductase
are required for the oxidation of unsaturated fatty acids, whereas
propionyl CoA derived from chains with odd numbers of carbon
atoms requires a vitamin B₁₂-dependent enzyme to be converted into
succinyl CoA.


22.4 Fatty Acids Are Synthesized by Fatty Acid Synthase
Fatty acids are synthesized in the cytoplasm by a different pathway
from that of β oxidation. Fatty acid synthase is the enzyme complex
responsible for fatty acid synthesis. Synthesis starts with the
carboxylation of acetyl CoA to malonyl CoA, the committed step. This
ATP-driven reaction is catalyzed by acetyl CoA carboxylase, a biotin
enzyme. The intermediates in fatty acid synthesis are linked to an acyl
carrier protein. Acetyl ACP is formed from acetyl CoA, and malonyl
ACP is formed from malonyl CoA. Acetyl ACP and malonyl ACP
condense to form acetoacetyl ACP, a reaction driven by the release of
CO₂ from the activated malonyl unit. A reduction, a dehydration, and
a second reduction follow. NADPH is the reductant in these steps. 
The butyryl ACP formed in this way is ready for a second round of 
elongation, starting with the addition of a two-carbon unit from malo- 
nyl ACP. Seven rounds of elongation yield palmitoyl ACP, which is 
hydrolyzed to palmitate. In higher organisms, the enzymes catalyzing 
fatty acid synthesis are covalently linked in a multifunctional enzyme 
complex. A reaction cycle based on the formation and cleavage of 
citrate carries acetyl groups from mitochondria to the cytoplasm. 
NADPH needed for synthesis is generated in the transfer of reducing 
equivalents from mitochondria by the combined action of cytoplasmic 
malate dehydrogenase and malic enzyme and by the pentose phos- 
phate pathway. 


22.5 The Elongation and Unsaturation of Fatty Acids Are Accomplished by Accessory Enzyme Systems
Fatty acids are elongated and desaturated by enzyme systems in the
endoplasmic reticulum membrane. Desaturation requires NADH and
O₂ and is carried out by a complex consisting of a flavoprotein, a
cytochrome, and a nonheme iron protein. Mammals lack the enzymes to
introduce double bonds distal to C-9, and so they require linoleate and 
linolenate in their diets. 

Arachidonate, an essential precursor of prostaglandins and other 
signal molecules, is derived from linoleate. This 20:4 polyunsatu- 
rated fatty acid is the precursor of several classes of signal molecules— 
prostaglandins, prostacyclins, thromboxanes, and leukotrienes—that 
act as messengers and local hormones because of their transience. 
They are called eicosanoids because they contain 20 carbon atoms. 
Aspirin (acetylsalicylate), an anti-inflammatory and antithrombotic 
drug, irreversibly blocks the synthesis of these eicosanoids. 


22.6 Acetyl CoA Carboxylase Plays a Key Role in Controlling Fatty Acid Metabolism 

Fatty acid synthesis and degradation are reciprocally regulated so that 
both are not simultaneously active. Acetyl CoA carboxylase, the essen- 
tial control site, is phosphorylated and inactivated by AMP-dependent 
kinase. The phosphorylation is reversed by a protein phosphatase. 
Citrate, which signals an abundance of building blocks and energy, 
partly reverses the inhibition by phosphorylation. Carboxylase activity 
is stimulated by insulin and inhibited by glucagon and epinephrine. In 
times of plenty, fatty acyl CoAs do not enter the mitochondrial matrix, 
because malonyl CoA inhibits carnitine acyltransferase I. 


Key Terms 


triacylglycerol (neutral fat, triglyceride) (p. 639) 
acyl adenylate (p. 645) 
carnitine (p. 645) 
β-oxidation pathway (p. 646) 
vitamin B12 (cobalamin) (p. 649) 
peroxisome (p. 652) 
ketone body (p. 653) 
acyl carrier protein (ACP) (p. 657) 
fatty acid synthase (p. 657) 
malonyl CoA (p. 657) 
acetyl CoA carboxylase (p. 657) 
arachidonate (p. 664) 
prostaglandin (p. 665) 
eicosanoid (p. 665) 
AMP-dependent protein kinase (AMPK) (p. 666) 


Problems 


1. After lipolysis. Write a balanced equation for the conver- 
sion of glycerol into pyruvate. Which enzymes are required 
in addition to those of the glycolytic pathway? 


2. Forms of energy. The partial reactions leading to the syn- 
thesis of acyl CoA (equations 1 and 2, p. 645) are freely 
reversible. The equilibrium constant for the sum of these 
reactions is close to 1, meaning that the energy levels of the 
reactants and products are about equal, even though a mol- 
ecule of ATP has been hydrolyzed. Explain why these reac- 
tions are readily reversible. 


3. Activation fee. The reaction for the activation of fatty 
acids before degradation is 


R—COO− + CoA + ATP + H2O → R—CO—SCoA + AMP + 2 Pi + 2 H+ 


This reaction is quite favorable because the equivalent of 
two molecules of ATP is hydrolyzed. Explain why, from a 
biochemical bookkeeping point of view, the equivalent of 
two molecules of ATP is used despite the fact that the left 
side of the equation has only one molecule of ATP. 


4. Proper sequence. Place the following list of reactions or 
relevant locations in the β oxidation of fatty acids in the 
proper order. 

(a) Reaction with carnitine 
(b) Fatty acid in the cytoplasm 
(c) Activation of fatty acid by joining to CoA 
(d) Hydration 
(e) NAD+-linked oxidation 
(f) Thiolysis 
(g) Acyl CoA in mitochondrion 
(h) FAD-linked oxidation 


5. Remembrance of reactions past. We have encountered 
reactions similar to the oxidation, hydration, and oxidation 
reactions of fatty acid degradation earlier in our study of bio- 
chemistry. What other pathway employs this set of reactions? 


6. A phantom acetyl CoA? In the equation for fatty acid 
degradation shown here, only seven molecules of CoA are 
required to yield eight molecules of acetyl CoA. How is this 
difference possible? 


Palmitoyl CoA + 7 FAD + 7 NAD+ + 7 CoASH + 7 H2O → 
8 Acetyl CoA + 7 FADH2 + 7 NADH + 7 H+ 


7. Comparing yields. Compare the ATP yields from palm- 
itic acid and palmitoleic acid. 


8. Counting ATPs 1. What is the ATP yield for the com- 
plete oxidation of C17 (heptadecanoic) fatty acid? Assume 
that the propionyl CoA ultimately yields oxaloacetate in the 
citric acid cycle. 


9. Sweet temptation. Stearic acid is a C18 fatty acid component 
of chocolate. Suppose you had a depressing day and 
decided to settle matters by gorging on chocolate. How 
much ATP would you derive from the complete oxidation 
of stearic acid to CO2? 


10. The best storage form. Compare the ATP yield from the 
complete oxidation of glucose, a six-carbon carbohydrate, 
and hexanoic acid, a six-carbon fatty acid. Hexanoic acid is 
also called caproic acid and is responsible for the “aroma” 
of goats. Why are fats better fuels than carbohydrates? 


11. From fatty acid to ketone body. Write a balanced equa- 
tion for the conversion of stearate into acetoacetate. 


12. Generous, but not to a fault. Liver is the primary site of 
ketone-body synthesis. However, ketone bodies are not 
used by the liver but are released for other tissues to use. 
The liver does gain energy in the process of synthesizing 
and releasing ketone bodies. Calculate the number of mol- 
ecules of ATP generated by the liver in the conversion of 
palmitate, a C16 fatty acid, into acetoacetate. 


13. Counting ATPs 2. How much energy is attained with the 
complete oxidation of the ketone body D-3-hydroxybutyrate? 


14. Another view. Why might someone argue that the 
answer to Problem 13 is wrong? 


15. An accurate adage. An old biochemistry adage is that 
fats burn in the flame of carbohydrates. What is the molecular 
basis of this adage? 


16. Refsum disease. Phytanic acid is a branched-chain fatty 
acid component of chlorophyll and is a significant compo- 
nent of milk. In susceptible people, phytanic acid can accu- 
mulate, leading to neurological problems. This syndrome is 
called Refsum disease or phytanic acid storage disease. 


H3C—CH(CH3)—(CH2)3—CH(CH3)—(CH2)3—CH(CH3)—(CH2)3—CH(CH3)—CH2—COO− 

Phytanic acid 


(a) Why does phytanic acid accumulate? 


(b) What enzyme activity would you invent to prevent its 
accumulation? 


17. A hot diet. Tritium is a radioactive isotope of hydrogen 
and can be readily detected. A fully tritiated, six-carbon 
saturated fatty acid is administered to a rat, and a muscle 
biopsy of the rat is taken by concerned, sensitive, and 
discreet technical assistants. These assistants carefully isolate 
all of the acetyl CoA generated from the β oxidation of the 
radioactive fatty acid and remove the CoA to form acetate. 
What will be the overall tritium-to-carbon ratio of the iso- 
lated acetate? 


18. Finding triacylglycerols in all the wrong places. Insulin- 
dependent diabetes is often accompanied by high levels 
of triacylglycerols in the blood. Suggest a biochemical 
explanation. 


19. Counterpoint. Compare and contrast fatty acid oxidation 
and synthesis with respect to 

(a) site of the process. 
(b) acyl carrier. 
(c) reductants and oxidants. 
(d) stereochemistry of the intermediates. 
(e) direction of synthesis or degradation. 
(f) organization of the enzyme system. 


20. A supple synthesis. Myristate, a saturated C14 fatty acid, 
is used as an emollient for cosmetics and topical medicinal 
preparations. Write a balanced equation for the synthesis of 
myristate. 


21. The cost of cleanliness. Lauric acid is a 12-carbon fatty 
acid with no double bonds. The sodium salt of lauric acid 
(sodium laurate) is a common detergent used in a variety of 
products, including laundry detergent, shampoo, and 
toothpaste. How many molecules of ATP and NADPH are 
required to synthesize lauric acid? 


22. Proper organization. Arrange the following steps in 
fatty acid synthesis in their proper order. 

(a) Dehydration 

(b) Condensation 

(c) Release of a C16 fatty acid 

(d) Reduction of a carbonyl group 
(e) Formation of malonyl ACP 


23. No access to assets. What would be the effect on fatty 
acid synthesis of a mutation in ATP-citrate lyase that 
reduces the enzyme’s activity? Explain. 


24. The truth and nothing but. True or False. If false, 
explain. 

(a) Biotin is required for fatty acid synthase activity. 

(b) The condensation reaction in fatty acid synthesis is 
powered by the decarboxylation of malonyl CoA. 

(c) Fatty acid synthesis does not depend on ATP. 

(d) Palmitate is the end product of fatty acid synthase. 

(e) All of the enzyme activities required for fatty acid 
synthesis in mammals are contained in a single polypeptide 
chain. 

(f) Fatty acid synthase in mammals is active as a monomer. 


(g) The fatty acid arachidonate is a precursor for signal 
molecules. 


(h) Acetyl CoA carboxylase is inhibited by citrate. 


25. Odd fat out. Suggest how fatty acids with odd numbers 
of carbons are synthesized. 


26. Labels. Suppose that you had an in vitro fatty acid- 
synthesizing system that had all of the enzymes and cofactors 
required for fatty acid synthesis except for acetyl CoA. To 
this system, you added acetyl CoA that contained radioactive 
hydrogen (3H, tritium) and carbon 14 (14C) as shown 
here. 


The ratio of 3H/14C is 3. What would the 3H/14C ratio be 
after the synthesis of palmitic acid (C16) with the use of the 
radioactive acetyl CoA? 


27. A tight embrace. Avidin, a glycoprotein found in eggs, 
has a high affinity for biotin. Avidin can bind biotin and 
prevent its use by the body. How might a diet rich in raw 
eggs affect fatty acid synthesis? What will be the effect on 
fatty acid synthesis of a diet rich in cooked eggs? Explain. 


28. Alpha or omega? Only one acetyl CoA molecule is used 
directly in fatty acid synthesis. Identify the carbon atoms in 
palmitic acid that were donated by acetyl CoA. 


29. Now you see it, now you don’t. Although HCO3− is 
required for fatty acid synthesis, its carbon atom does not 
appear in the product. Explain how this omission is 
possible. 


30. It is all about communication. Why is citrate an appro- 
priate inhibitor of phosphofructokinase? 


31. Tracing carbon atoms. Consider a cell extract that act- 
ively synthesizes palmitate. Suppose that a fatty acid syn- 
thase in this preparation forms one molecule of palmitate in 
about 5 minutes. A large amount of malonyl CoA labeled 
with 14C in each carbon atom of its malonyl unit is suddenly 
added to this system, and fatty acid synthesis is 
stopped a minute later by altering the pH. The fatty acids 
are analyzed for radioactivity. Which carbon atom of the 
palmitate formed by this system is more radioactive—C-1 
or C-14? 


32. An unaccepting mutant. The serine residues in acetyl 
CoA carboxylase that are the target of the AMP-dependent 
protein kinase are mutated to alanine. What is a likely con- 
sequence of this mutation? 


33. Sources. For each of the following unsaturated fatty 
acids, indicate whether the biosynthetic precursor in ani- 
mals is palmitoleate, oleate, linoleate, or linolenate. 


(a) 18:1 cis-Δ11                (d) 20:3 cis-Δ5, Δ8, Δ11 
(b) 18:3 cis-Δ6, Δ9, Δ12        (e) 22:1 cis-Δ13 
(c) 20:2 cis-Δ11, Δ14           (f) 22:6 cis-Δ4, Δ7, Δ10, Δ13, Δ16, Δ19 


34. Driven by decarboxylation. What is the role of decarboxylation 
in fatty acid synthesis? Name another key reaction in a 
metabolic pathway that employs this mechanistic motif. 


35. Kinase surfeit. Suppose that a promoter mutation leads 
to the overproduction of protein kinase A in adipose 
cells. How might fatty acid metabolism be altered by this 
mutation? 


36. Blocked assets. The presence of a fuel molecule in the 
cytoplasm does not ensure that the fuel molecule can 
be effectively used. Give two examples of how impaired 
transport of metabolites between compartments leads to 
disease. 


37. Elegant inversion. Peroxisomes have an alternative 
pathway for oxidizing polyunsaturated fatty acids. They 
contain a hydratase that converts D-3-hydroxyacyl CoA 
into trans-Δ2-enoyl CoA. How can this enzyme be used to 
oxidize CoAs containing a cis double bond at an even-numbered 
carbon atom (e.g., the cis-Δ12 double bond of 
linoleate)? 


38. Covalent catastrophe. What is a potential disadvantage 
of having many catalytic sites together on one very long 
polypeptide chain? 


39. Missing acyl CoA dehydrogenases. A number of genetic 
deficiencies in acyl CoA dehydrogenases have been 
described. This deficiency presents early in life after a 
period of fasting. Symptoms include vomiting, lethargy, 
and sometimes coma. Not only are blood levels of glucose 
low (hypoglycemia), but starvation-induced ketosis is 
absent. Provide a biochemical explanation for these last two 
observations. 


40. Effects of clofibrate. High blood levels of triacylglycer- 
ides are associated with heart attacks and strokes. Clofibrate, 
a drug that increases the activity of peroxisomes, is some- 
times used to treat patients with such a condition. What is 
the biochemical basis for this treatment? 


41. A different kind of enzyme. Figure 22.36 shows the 
response of acetyl CoA carboxylase to varying amounts of 
citrate. Explain this effect in light of the allosteric effects 
that citrate has on the enzyme. Predict the effects of increas- 
ing concentrations of palmitoyl CoA. 


Mechanism Problems 


42. Variation on a theme. Thiolase is homologous in structure 
to the condensing enzyme. On the basis of this observation, 
propose a mechanism for the cleavage of 3-ketoacyl CoA by CoA. 


43. Two plus three to make four. Propose a reaction mecha- 
nism for the condensation of an acetyl unit with a malonyl 
unit to form an acetoacetyl unit in fatty acid synthesis. 


Chapter Integration Problems 


44. Ill-advised diet. Suppose that, for some bizarre reason, 
you decided to exist on a diet of whale and seal blubber, 
exclusively. 


(a) How would lack of carbohydrates affect your ability to 
utilize fats? 


(b) What would your breath smell like? 


(c) One of your best friends, after trying unsuccessfully to 
convince you to abandon this diet, makes you promise to 
consume a healthy dose of odd-chain fatty acids. Does your 
friend have your best interests at heart? Explain. 


45. Fats to glycogen. An animal is fed stearic acid that is 
radioactively labeled with [14C]carbon. A liver biopsy 
reveals the presence of 14C-labeled glycogen. How is this 
finding possible in light of the fact that animals cannot convert 
fats into carbohydrates? 


Data Interpretation Problem 


46. Mutant enzyme. Carnitine palmitoyl transferase I 
(CPTI) catalyzes the conversion of long-chain acyl CoA 
into acyl carnitine, a prerequisite for transport into mito- 
chondria and subsequent degradation. A mutant enzyme 
was constructed with a single amino acid change at position 
3 of glutamic acid for alanine. Graphs A through C show 
data from studies performed to identify the effect of the 
mutation [data from J. Shi, H. Zhu, D. N. Arvidson, and G. 
J. Woldegiorgis. J. Biol. Chem. 274:9421-9426, 1999]. 


(a) What is the effect of the mutation on enzyme activity 
when the concentration of carnitine is varied (Graph A)? 
What are the KM and Vmax values for the wild-type and 
mutant enzymes? 

(b) What is the effect when the experiment is repeated with 
varying concentrations of palmitoyl CoA (Graph B)? What 
are the KM and Vmax values for the wild-type and mutant 
enzymes? 

(c) Graph C shows the inhibitory effect of malonyl CoA on 
the wild-type and mutant enzymes. Which enzyme is more 
sensitive to malonyl CoA inhibition? 

(d) Suppose that the concentration of palmitoyl CoA = 
100 μM, that of carnitine = 100 μM, and that of malonyl 
CoA = 510 μM. Under these conditions, what is the most 
prominent effect of the mutation on the properties of the 
enzyme? 


(e) What can you conclude about the role of glutamate 3 in 
carnitine acyltransferase I function? 


CHAPTER 23 


Protein Turnover and 
Amino Acid Catabolism 




Degradation of cyclin B. This important protein in cell-cycle regulation is visible as the green areas in 
the images above (the protein was fused with green fluorescent protein). Cyclin B is prominent 
during metaphase (top) but is degraded in anaphase (bottom) to prevent the premature initiation 
of another cell cycle. A large protease complex called the proteasome digests the protein into 
peptides, which are then degraded into amino acids. These amino acids are either reused or further 
processed so that the carbon skeletons can be used as fuel or building blocks. The released amino 
group is converted into urea for excretion by the urea cycle. [(Left) Courtesy of Dr. Jonathan Pines, 
University of Cambridge, Wellcome/CRC Institute of Cancer and Developmental Biology.] 


[Figure: the urea cycle, with labels for arginine, ornithine, carbamoyl phosphate, citrulline, and argininosuccinate.] 


OUTLINE 

23.1 Proteins Are Degraded to Amino Acids 
23.2 Protein Turnover Is Tightly Regulated 
23.3 The First Step in Amino Acid Degradation Is the Removal of Nitrogen 
23.4 Ammonium Ion Is Converted into Urea in Most Terrestrial Vertebrates 
23.5 Carbon Atoms of Degraded Amino Acids Emerge As Major Metabolic Intermediates 
23.6 Inborn Errors of Metabolism Can Disrupt Amino Acid Degradation 


The digestion of dietary proteins in the intestine and the degradation of 
proteins within the cell provide a steady supply of amino acids to the cell. 
Many cellular proteins are constantly degraded and resynthesized in 
response to changing metabolic demands. Others are misfolded or become 
damaged and they, too, must be degraded. Unneeded or damaged proteins 
are marked for destruction by the covalent attachment of chains of a small 
protein called ubiquitin and then degraded by a large, ATP-dependent 
complex called the proteasome. The primary use of amino acids provided through 
degradation or digestion is as building blocks for the synthesis of proteins and 
other nitrogenous compounds such as nucleotide bases. 

Amino acids in excess of those needed for biosynthesis can neither be 
stored, in contrast with fatty acids and glucose, nor excreted. Rather, 
surplus amino acids are used as metabolic fuel. The α-amino group is 
removed, and the resulting carbon skeleton is converted into a major metabolic 
intermediate. Most of the amino groups harvested from surplus amino acids 
are converted into urea through the urea cycle, and their carbon skeletons 
are transformed into acetyl CoA, acetoacetyl CoA, pyruvate, or one of the 
intermediates of the citric acid cycle. The carbon skeletons are converted 
into glucose, glycogen, and fats. 


Table 23.1 Essential amino acids in 
human beings 

Histidine 

Isoleucine 

Leucine 

Lysine 

Methionine 

Phenylalanine 

Threonine 

Tryptophan 

Valine 


Figure 23.1 Digestion and absorption of 
proteins. Protein digestion is primarily a 
result of the activity of enzymes secreted by 
the pancreas. Aminopeptidases associated 
with the intestinal epithelium further digest 
proteins. The amino acids and di- and 
tripeptides are absorbed into the intestinal 
cells by specific transporters. Free amino acids 
are then released into the blood for use by 
other tissues. 


[Figure 23.1 labels: proteins, proteolytic enzymes, oligopeptides, amino acids.] 


Several coenzymes play key roles in amino acid degradation; foremost 
among them is pyridoxal phosphate. This coenzyme forms Schiff-base 
intermediates, which are a type of aldimine, that allow α-amino groups to be 
shuttled between amino acids and ketoacids. We will consider several 
genetic errors of amino acid degradation that lead to brain damage and 
mental retardation unless remedial action is initiated soon after birth. 
Phenylketonuria, which is caused by a block in the conversion of phenylala- 
nine into tyrosine, is readily diagnosed and can be treated by removing 
phenylalanine from the diet. The study of amino acid metabolism is espe- 
cially rewarding because it is rich in connections between basic biochemis- 
try and clinical medicine. 


23.1 Proteins Are Degraded to Amino Acids 


Dietary protein is a vital source of amino acids. Especially important dietary 
proteins are those containing the essential amino acids—amino acids that 
cannot be synthesized and must be acquired in the diet (Table 23.1). 
Proteins ingested in the diet are digested into amino acids or small peptides 
that can be absorbed by the intestine and transported in the blood. Another 
crucial source of amino acids is the degradation of cellular proteins. 


The digestion of dietary proteins begins in the stomach and is 
completed in the intestine 


Protein digestion begins in the stomach, where the acidic environment 
favors the denaturation of proteins into random coils. Denatured proteins 
are more accessible as substrates for proteolysis than are native proteins. 
The primary proteolytic enzyme of the stomach is pepsin, a nonspecific 
protease that, remarkably, is maximally active at pH 2. Thus, pepsin can 
function in the highly acidic environment of the stomach that disables 
other proteins. 

Protein degradation continues in the lumen of the intestine. The pan- 
creas secretes a variety of proteolytic enzymes into the intestinal lumen as 
inactive zymogens that are then converted into active enzymes (Sections 9.1 
and 10.4). The battery of enzymes displays a wide array of specificity, and 
so the substrates are degraded into free amino acids as well as di- and tri- 
peptides. Digestion is further enhanced by proteolytic enzymes, such as 
aminopeptidase N, that are located in the plasma membrane of the intesti- 
nal cells. Aminopeptidases digest proteins from the amino-terminal end. 
Single amino acids, as well as di- and tripeptides, are transported into the 
intestinal cells from the lumen and subsequently released into the blood for 
absorption by other tissues (Figure 23.1). 


Cellular proteins are degraded at different rates 


Protein turnover—the degradation and resynthesis of proteins—takes place 
constantly in cells. Although some proteins are very stable, many proteins 
are short lived, particularly those that participate in metabolic regulation. 
These proteins can be quickly degraded to activate or shut down a signaling 
pathway. In addition, cells must eliminate damaged proteins. A significant 
proportion of newly synthesized protein molecules are defective because of 
errors in translation or misfolding. Even proteins that are normal when first 
synthesized may undergo oxidative damage or be altered in other ways with 
the passage of time. These proteins must be removed before they accumu- 
late and aggregate. Indeed, a number of pathological conditions, such as 
certain forms of Parkinson disease and Huntington disease, are associated 
with protein aggregation. 

The half-lives of proteins range over several orders of magnitude. 
Ornithine decarboxylase, at approximately 11 minutes, has one of the 
shortest half-lives of any mammalian protein. This enzyme participates in 
the synthesis of polyamines, which are cellular cations essential for growth 
and differentiation. The life of hemoglobin, on the other hand, is limited 
only by the life of the red blood cell, and the lens protein, crystallin, by the 
life of the organism. 
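
To get a feel for what these half-lives imply, the sketch below computes the fraction of a protein pool that remains after a given time. It assumes simple first-order decay with no new synthesis, which is an assumption made here for illustration rather than something the text specifies. The function name `fraction_remaining` is hypothetical; the 11-minute half-life is the value quoted above for ornithine decarboxylase.

```python
def fraction_remaining(elapsed_min: float, half_life_min: float) -> float:
    """Fraction of an initial protein pool left after elapsed_min minutes.

    Assumes first-order decay and no resynthesis (illustrative assumption).
    """
    if half_life_min <= 0:
        raise ValueError("half-life must be positive")
    return 0.5 ** (elapsed_min / half_life_min)


ODC_HALF_LIFE_MIN = 11.0  # ornithine decarboxylase, from the text

for minutes in (11, 22, 60):
    left = fraction_remaining(minutes, ODC_HALF_LIFE_MIN)
    print(f"{minutes:>3} min: {left:.3f} of the pool remains")
```

Under these assumptions, half of the pool is gone after one half-life and three-quarters after two, which is why a short-lived regulatory enzyme can be switched off within an hour simply by halting its synthesis.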


23.2 Protein Turnover Is Tightly Regulated 


How can a cell distinguish proteins that should be degraded? Ubiquitin 
(Ub), a small (8.5-kd) protein present in all eukaryotic cells, is a tag that 
marks proteins for destruction (Figure 23.2). Ubiquitin is the cellular 
equivalent of the “black spot” of Robert Louis Stevenson’s Treasure Island: 
the signal for death. 


Ubiquitin tags proteins for destruction 


Ubiquitin is highly conserved in eukaryotes: yeast and human ubiquitin 
differ at only 3 of 76 residues. The carboxyl-terminal glycine residue of 
ubiquitin becomes covalently attached to the ε-amino groups of several 
lysine residues on a protein destined to be degraded. The energy for the 
formation of these isopeptide bonds (iso because ε- rather than α-amino 
groups are targeted) comes from ATP hydrolysis. 

Three enzymes participate in the attachment of ubiquitin to a protein 
(Figure 23.3): ubiquitin-activating enzyme, or E1; ubiquitin-conjugating 
enzyme, or E2; and ubiquitin-protein ligase, or E3. First, the C-terminal 
carboxylate group of ubiquitin becomes linked to a sulfhydryl group of E1 
by a thioester bond. This ATP-driven reaction is reminiscent of fatty acid 
activation (Section 22.2). In this reaction, the AMP moiety of ATP is linked 
to the C-terminal carboxylate of ubiquitin with the release of pyrophosphate, 
and the ubiquitin is then transferred to a sulfhydryl group of a key cysteine 
residue in E1. The activated ubiquitin is then shuttled to a sulfhydryl group 
of E2, a reaction catalyzed by E2 itself. Finally, E3 catalyzes the transfer 
of ubiquitin from E2 to an ε-amino group on the target protein. 


Figure 23.2 Structure of ubiquitin. 
Notice that ubiquitin has an extended carboxyl 
terminus, which is activated and linked to 
proteins targeted for destruction. Lysine 
residues, including lysine 48, the major site 
for linking additional ubiquitin molecules, are 
shown as ball-and-stick models. [Drawn from 
1UBI.pdb.] 


Figure 23.3 Ubiquitin conjugation. The 
ubiquitin-activating enzyme E1 adenylates 
ubiquitin (Ub) (1) and transfers the ubiquitin 
to one of its own cysteine residues (2). 
Ubiquitin is then transferred to a cysteine 
residue in the ubiquitin-conjugating enzyme 
E2 by the E2 enzyme (3). Finally, the 
ubiquitin-protein ligase E3 transfers the 
ubiquitin to a lysine residue on the target 
protein (4a and 4b). 


Figure 23.4 Structure of 
tetraubiquitin. Four ubiquitin molecules are 
linked by isopeptide bonds. Notice that each 
isopeptide bond is formed by the linkage of 
the carboxylate group at the end of the 
extended C terminus with the ε-amino group 
of a lysine residue. Dashed lines indicate the 
positions of the extended C-termini that were 
not observed in the crystal structure. This unit 
is the primary signal for degradation when 
linked to a target protein. [Drawn from 
1TBE.pdb.] 


Table 23.2 Dependence of the half-lives of cytoplasmic yeast 
proteins on the identity of their amino-terminal residues 

Highly stabilizing residues (t1/2 > 20 hours) 
Ala Cys Gly Met 
Pro Ser Thr Val 

Intrinsically destabilizing residues (t1/2 = 2 to 30 minutes) 
Arg His Ile Leu 
Lys Phe Trp Tyr 

Destabilizing residues after chemical modification (t1/2 = 3 to 30 minutes) 
Asn Asp Gln Glu 

Source: J. W. Tobias, T. E. Schrader, G. Rocap, 
and A. Varshavsky. Science 254(1991):1374-1377. 

A chain of four or more ubiquitin molecules is espe- 
cially effective in signaling the need for degradation 
(Figure 23.4). The ubiquitination reaction is processive: 
E3 remains bound to the target proteins and generates a 
chain of ubiquitin molecules by linking the ε-amino group 
of lysine residue 48 of one ubiquitin molecule to the termi- 
nal carboxylate of another. 

What determines whether a protein becomes ubiquit- 
inated? A specific sequence of amino acids, termed a degron, indicates that 
a protein should be degraded. One such signal turns out to be unexpectedly 
simple. The half-life of a cytoplasmic protein is determined to a large extent by 
its amino-terminal residue (Table 23.2). This dependency is referred to as the 
N-terminal rule or the N-terminal degron. A yeast protein with methionine 
at its N terminus typically has a half-life of more than 20 hours, whereas one 
with arginine at this position has a half-life of about 2 minutes. A highly 
destabilizing N-terminal residue such as arginine or leucine favors rapid 
ubiquitination, whereas a stabilizing residue such as methionine or proline 
does not. Other degrons thought to identify proteins for degradation include 
cyclin destruction boxes, which are amino acid sequences that mark cell-cycle 
proteins for destruction, and PEST sequences, which contain the amino acid 
sequence proline (P, single-letter abbreviation), glutamic acid (E), serine 
(S), and threonine (T). 
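
The N-terminal rule lends itself to a simple lookup. The sketch below encodes the three residue classes listed in Table 23.2 for cytoplasmic yeast proteins and reports the class of a given amino-terminal residue. The names `N_TERMINAL_CLASSES` and `n_terminal_class` are hypothetical, chosen for this illustration, and the lookup says nothing about other degrons such as cyclin destruction boxes or PEST sequences.

```python
# Residue classes and half-life ranges as listed in Table 23.2 (yeast, cytoplasmic).
N_TERMINAL_CLASSES = {
    "highly stabilizing (t1/2 > 20 hours)":
        {"Ala", "Cys", "Gly", "Met", "Pro", "Ser", "Thr", "Val"},
    "intrinsically destabilizing (t1/2 = 2 to 30 minutes)":
        {"Arg", "His", "Ile", "Leu", "Lys", "Phe", "Trp", "Tyr"},
    "destabilizing after chemical modification (t1/2 = 3 to 30 minutes)":
        {"Asn", "Asp", "Gln", "Glu"},
}


def n_terminal_class(residue: str) -> str:
    """Return the Table 23.2 class for a three-letter amino acid code."""
    code = residue.strip().capitalize()   # accept "ARG", "arg", " Arg "
    for label, residues in N_TERMINAL_CLASSES.items():
        if code in residues:
            return label
    raise ValueError(f"unrecognized three-letter code: {residue!r}")


for code in ("Met", "Arg", "Gln"):
    print(code, "->", n_terminal_class(code))
```

The three classes together account for all 20 standard amino acids, so any valid three-letter code resolves to exactly one class.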

E3 enzymes are the readers of N-terminal residues. Although most 
eukaryotes have only one or a small number of distinct E1 enzymes, all 
eukaryotes have many distinct E2 and E3 enzymes. Moreover, there appears 
to be only a single family of evolutionarily related E2 proteins but three 
distinct families of E3 proteins, all together consisting of hundreds of 
members. Indeed, the E3 family is one of the largest gene families in human 
beings. The diversity of target proteins that must be tagged for destruction 
requires a large number of E3 proteins as readers. 


Three examples demonstrate the importance of E3 proteins to normal 
cell function. Proteins that are not broken down owing to a defective 
E3 may accumulate to create a disease of protein aggregation such as juve- 
nile or early-onset Parkinson disease. A defect in another member of the E3 
family causes Angelman syndrome, a severe neurological disorder charac- 
terized by mental retardation, absence of speech, uncoordinated movement, 
and hyperactivity. Conversely, uncontrolled protein turnover also can create 
dangerous pathological conditions. For example, human papilloma virus 
(HPV) encodes a protein that activates a specific E3 enzyme. The enzyme 
ubiquitinates the tumor suppressor p53 and other proteins that control 
DNA repair, which are then destroyed. The activation of this E3 enzyme is 
observed in more than 90% of cervical carcinomas. Thus, the inappropriate 
marking of key regulatory proteins for destruction can trigger further 
events, leading to tumor formation. 


It is important to note that the role of ubiquitin is much broader than 
merely marking proteins for destruction. Although we have focused on pro- 
tein degradation, ubiquitination also regulates proteins involved in DNA 
repair, chromatin remodeling, and protein kinase activation, among other 
biochemical processes. 


The proteasome digests the ubiquitin-tagged proteins 


If ubiquitin is the mark of death, what is the executioner? A large protease
complex called the proteasome or the 26S proteasome digests the ubiquitinated
proteins. This ATP-driven multisubunit protease spares ubiquitin, which is
then recycled. The 26S proteasome is a complex of two components: a 20S
catalytic unit and a 19S regulatory unit. The 20S unit is constructed from
14 copies each of two homologous subunits (α and β) and has a mass of 700 kd
(Figure 23.5). The subunits are arranged in four rings of seven subunits that
stack to form a structure resembling a barrel. The outer two rings of the
barrel are made up of α subunits and the inner two rings of β subunits. The
20S catalytic core is a sealed barrel. Access to its interior is controlled
by a 19S regulatory unit, itself a 700-kd complex made up of 20 subunits. Two
such 19S complexes bind to the 20S proteasome core, one at each end, to form
the complete 26S proteasome (Figure 23.6). The 19S regulatory unit has three
functions. First, the 19S unit binds specifically to polyubiquitin chains,
thereby ensuring that only ubiquitinated proteins are degraded. Second, an
isopeptidase in the 19S unit cleaves off intact ubiquitin molecules from the
proteins so that they can be reused. Finally, the doomed protein is unfolded
and directed into the catalytic core. Key components of the 19S complex are
six ATPases of a type called the AAA class (ATPase associated with various
cellular activities). ATP hydrolysis assists the 19S complex to unfold the
substrate and induce conformational changes in the 20S catalytic core so that
the substrate can be passed into the center of the complex.

Figure 23.5 20S proteasome. The 20S proteasome comprises 28 homologous
subunits (α, red; β, blue), arranged in four rings of 7 subunits each. Some
of the β subunits (right) include protease active sites at their amino
termini. [Subunit drawn from 1RYP.pdb.]

Figure 23.6 26S proteasome. A 19S cap is attached to each end of the 20S
catalytic unit. [From W. Baumeister, J. Walz, F. Zuhl, and E. Seemuller.
Cell 92(1998):367-380; courtesy of Dr. Wolfgang Baumeister.]

The proteolytic active sites are sequestered in the interior of the barrel to
protect potential substrates until they are directed into the barrel. There are
three types of active sites in the β subunits, each with a different specificity,
but all employ an N-terminal threonine. The hydroxyl group of the threonine
residue is converted into a nucleophile that attacks the carbonyl groups
of peptide bonds to form acyl-enzyme intermediates. Substrates are degraded
in a processive manner without the release of degradation intermediates,
until the substrate is reduced to peptides ranging in length from seven to
nine residues. These peptide products are released from the proteasome and
further degraded by other cellular proteases to yield individual amino acids.
Thus, the ubiquitination pathway and the proteasome cooperate to degrade
unwanted proteins. Figure 23.7 presents an overview of the fates of amino
acids following proteasomal digestion.

Figure 23.7 The proteasome and other proteases generate free amino acids.
Ubiquitinated proteins are processed to peptide fragments from which the
ubiquitin is subsequently removed and recycled. The peptide fragments are
further digested to yield free amino acids, which can be used for
biosynthetic reactions, most notably protein synthesis. Alternatively, the
amino group can be removed and processed to urea (p. 685) and the carbon
skeleton can be used to synthesize carbohydrate or fats or used directly as a
fuel for cellular respiration.
[Diagram: ubiquitinated protein → proteasome → peptide fragments + released
ubiquitin; proteolysis of the peptide fragments → amino acids, which are
either left intact for biosynthesis or separated into amino groups (nitrogen
disposal by the urea cycle) and carbon skeletons (glucose or glycogen
synthesis, fatty acid synthesis, cellular respiration).]


The ubiquitin pathway and the proteasome have
prokaryotic counterparts


Both the ubiquitin pathway and the proteasome appear to be present
in all eukaryotes. Homologs of the proteasome are also found in some
prokaryotes. The proteasomes of some archaea are quite similar in overall
structure to their eukaryotic counterparts and similarly have 28 subunits
(Figure 23.8). In the archaeal proteasome, however, all α outer-ring subunits
and all β inner-ring subunits are identical; in eukaryotes, each α or β subunit
is one of seven different isoforms. This specialization provides distinct
substrate specificity.

Figure 23.8 Proteasome evolution. The archaeal proteasome consists of 14
identical α subunits and 14 identical β subunits. In the eukaryotic
proteasome, gene duplication and specialization have led to 7 distinct
subunits of each type. The overall architecture of the proteasome is
conserved.

Although ubiquitin has not been found in prokaryotes, ubiquitin’s 
molecular ancestors were recently identified in prokaryotes. Remarkably, 
these proteins take part not in protein modification but in biosynthesis of 
the coenzyme thiamine. A key enzyme in thiamine biosynthesis is ThiF,
which activates the protein ThiS as an acyl adenylate and then adds a sulfide
ion derived from cysteine (Figure 23.9). ThiF is homologous to human E1,
which includes two tandem regions of 160 amino acids that are 28% identi- 
cal in amino acid sequence with a region of ThiF from E. coli. The evolu- 
tionary relationships between these two pathways were cemented by the 
determination of the three-dimensional structure of ThiS, which revealed a 
structure very similar to that of ubiquitin, despite being only 14% identical 
in amino acid sequence (Figure 23.10). Thus, a eukaryotic system for pro- 
tein modification evolved from a preexisting prokaryotic pathway for coen- 
zyme biosynthesis. 


Protein degradation can be used to regulate biological function 


Table 23.3 lists a number of physiological processes that are controlled
at least in part by protein degradation through the ubiquitin-proteasome
pathway. In each case, the proteins being degraded are regulatory
proteins. Consider, for example, control of the inflammatory response.
A transcription factor called NF-κB (NF for nuclear factor) initiates the
expression of a number of the genes that take part in this response. This
factor is itself activated by the degradation of an attached inhibitory
protein, I-κB (I for inhibitor). In response to inflammatory signals that
bind to membrane-bound receptors, I-κB is phosphorylated at two serine
residues, creating an E3 binding site. The binding of E3 leads to the
ubiquitination and degradation of I-κB, unleashing NF-κB. The liberated
transcription factor migrates from the cytoplasm to the nucleus to stimulate
the transcription of the target genes. The NF-κB–I-κB system illustrates the
interplay of several key regulatory motifs: receptor-mediated signal
transduction, phosphorylation, compartmentalization, controlled and specific
degradation, and selective gene expression. The importance of the
ubiquitin-proteasome system for the regulation of gene expression is
highlighted by the recent approval of bortezomib (Velcade), a potent
inhibitor of the proteasome, as a therapy for multiple myeloma. Bortezomib
is a dipeptidyl boronic acid inhibitor of the proteasome.
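
The sequence of events in the NF-κB example can be traced as a small state model. The sketch below is only an illustration: the class and function names are hypothetical, and the `proteasome_active` switch is an extrapolation added to show what blocking degradation would do in this toy model, not a statement from the text about how bortezomib acts in myeloma.

```python
from dataclasses import dataclass


@dataclass
class InflammatoryState:
    """Toy bookkeeping for the NF-kB / I-kB switch (hypothetical names)."""
    ikb_phosphoserines: int = 0        # phosphorylated serine residues on I-kB
    ikb_ubiquitinated: bool = False
    ikb_degraded: bool = False
    nfkb_location: str = "cytoplasm"
    target_genes_transcribed: bool = False


def respond_to_signal(state, proteasome_active=True):
    """Apply the steps described in the text, in order."""
    # 1. The signal leads to phosphorylation of I-kB at two serine residues.
    state.ikb_phosphoserines = 2
    # 2. The two phosphoserines create the E3 binding site; E3 ubiquitinates I-kB.
    state.ikb_ubiquitinated = state.ikb_phosphoserines == 2
    # 3. Ubiquitinated I-kB is degraded only if the proteasome can act.
    state.ikb_degraded = state.ikb_ubiquitinated and proteasome_active
    # 4. Freed NF-kB moves to the nucleus and switches on its target genes.
    if state.ikb_degraded:
        state.nfkb_location = "nucleus"
        state.target_genes_transcribed = True
    return state


normal = respond_to_signal(InflammatoryState())
blocked = respond_to_signal(InflammatoryState(), proteasome_active=False)
print(normal.nfkb_location, blocked.nfkb_location)
```

In the blocked case the inhibitor protein is still tagged with ubiquitin, but the transcription factor stays in the cytoplasm because the tag is never acted on.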




The evolutionary studies of proteasomes described above have also
yielded potential clinical benefits. The bacterial pathogen 
Mycobacterium tuberculosis, the cause of tuberculosis, harbors a proteasome 
that is very similar to the human counterpart. Nevertheless, recent work has 
shown that it is possible to exploit the differences between the human and 
the bacterial proteasomes to develop specific inhibitors of the M. tuberculo- 
sis complex. Oxathiazol-2-one compounds such as HT1171 are suicide 
inhibitors of the proteolytic activity of the M. tuberculosis proteasome, but 
have no effect on the proteasomes of the human host. This is especially 
exciting because these drugs kill the nonreplicating form of M. tuberculosis, 
and thus may not require the prolonged treatment required with conven- 
tional drugs, thereby reducing the likelihood of drug resistance due to inter- 
ruption of the treatment regime. 
Figure 23.9 Biosynthesis of thiamine.
The biosynthesis of thiamine begins with the 
addition of sulfide to the carboxyl terminus of 
the protein ThiS. This protein is activated by 
adenylation and conjugated in a manner 
analogous to the first steps in the ubiquitin 
pathway. 


Table 23.3 Processes regulated by 
protein degradation 


Gene transcription 
Cell-cycle progression 
Organ formation 
Circadian rhythms 
Inflammatory response 
Tumor suppression
Cholesterol metabolism 
Antigen processing 


Figure 23.10 Structures of ThiS and
ubiquitin compared. Notice that ThiS is
structurally similar to ubiquitin despite only
14% sequence identity. This observation
suggests that a prokaryotic protein such as
ThiS evolved into ubiquitin. [Drawn from
1UBI.pdb and 1FOZ.pdb.]
23.3 The First Step in Amino Acid Degradation Is the
Removal of Nitrogen 


What is the fate of amino acids released on protein digestion or turnover? 
The first call is for use as building blocks for biosynthetic reactions. 
However, any not needed as building blocks are degraded to compounds 
able to enter the metabolic mainstream. The amino group is first removed, 
and then the remaining carbon skeleton is metabolized to glucose, one of 
several citric acid cycle intermediates, or to acetyl CoA. The major site of 
amino acid degradation in mammals is the liver, although muscles readily 
degrade the branched-chain amino acids (Leu, Ile, and Val). The fate of 
the α-amino group will be considered first, followed by that of the carbon
skeleton (Section 23.5). 


Alpha-amino groups are converted into ammonium ions by the 
oxidative deamination of glutamate 


The α-amino group of many amino acids is transferred to α-ketoglutarate
to form glutamate, which is then oxidatively deaminated to yield ammonium
ion (NH4+).

Aminotransferases catalyze the transfer of an α-amino group from an α-amino
acid to an α-ketoacid. These enzymes, also called transaminases, generally
funnel α-amino groups from a variety of amino acids to α-ketoglutarate
for conversion into NH4+.

Aspartate aminotransferase, one of the most important of these enzymes,
catalyzes the transfer of the amino group of aspartate to α-ketoglutarate.


Aspartate + α-ketoglutarate ⇌ oxaloacetate + glutamate


Alanine aminotransferase catalyzes the transfer of the amino group of alanine
to α-ketoglutarate.


Alanine + α-ketoglutarate ⇌ pyruvate + glutamate


These transamination reactions are reversible and can thus be used to
synthesize amino acids from α-ketoacids, as we shall see in Chapter 24.

The nitrogen atom in glutamate is converted into free ammonium ion by
oxidative deamination. This reaction is catalyzed by glutamate dehydrogenase.
This enzyme is unusual in being able to utilize either NAD+ or NADP+, at
least in some species. The reaction proceeds by dehydrogenation of the
C-N bond, followed by hydrolysis of the resulting ketimine.


Glutamate + NAD+ (or NADP+) ⇌ ketimine intermediate + NADH + H+ (or NADPH + H+)
Ketimine intermediate + H2O ⇌ α-ketoglutarate + NH4+


The equilibrium constant for this reaction is close to 1 in the liver, so the
direction of the reaction is determined by the concentrations of reactants
and products. Normally, the reaction is driven forward by the rapid removal
of ammonium ion. In mammals but not in other organisms, glutamate
dehydrogenase is allosterically inhibited by GTP and palmitoyl CoA and
stimulated by ADP and leucine. Glutamate dehydrogenase, essentially a
liver-specific enzyme, is located in mitochondria, as are some of the other
enzymes required for the production of urea. This compartmentalization
sequesters free ammonium ion, which is toxic.
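
Why an equilibrium constant near 1 leaves the direction to the concentrations can be seen from the standard relation ΔG = RT ln(Q/K), where Q is the mass-action ratio of products to reactants. The following sketch applies that relation; the function names and the sample values of Q are illustrative assumptions, not measurements from the text.

```python
import math

R = 8.314e-3  # gas constant in kJ mol^-1 K^-1


def delta_g(k_eq, q, temp_k=310.0):
    """Free-energy change (kJ/mol) from the equilibrium constant and mass-action ratio."""
    return R * temp_k * math.log(q / k_eq)


def direction(k_eq, q, temp_k=310.0):
    """Report which way the reaction runs spontaneously for the given Q."""
    dg = delta_g(k_eq, q, temp_k)
    if math.isclose(dg, 0.0, abs_tol=1e-9):
        return "at equilibrium"
    return "forward (deamination)" if dg < 0 else "reverse (amination)"


# With K close to 1, keeping products scarce (Q < 1) pulls the reaction forward.
print(direction(k_eq=1.0, q=0.1))   # hypothetical Q when NH4+ is rapidly removed
print(direction(k_eq=1.0, q=10.0))  # hypothetical Q when products accumulate
```

Rapid removal of ammonium ion keeps Q below K, which is what holds the reaction in the forward direction in this picture.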

The sum of the reactions catalyzed by aminotransferases and glutamate 
dehydrogenase is 


α-Amino acid + NAD+ (or NADP+) + H2O ⇌ α-ketoacid + NH4+ + NADH (or NADPH) + H+
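
The summed equation can be checked by adding the two reactions and letting the shared intermediates cancel. The sketch below does this bookkeeping with signed coefficients (reactants negative, products positive); the dictionary representation and the function name are choices made here for illustration, and only the NAD+ version of the dehydrogenase reaction is shown.

```python
from collections import Counter


def net_reaction(*reactions):
    """Add reactions written as {species: coefficient}; drop species that cancel."""
    total = Counter()
    for reaction in reactions:
        for species, coefficient in reaction.items():
            total[species] += coefficient
    return {species: c for species, c in total.items() if c != 0}


transamination = {
    "α-amino acid": -1, "α-ketoglutarate": -1,
    "α-ketoacid": 1, "glutamate": 1,
}
oxidative_deamination = {
    "glutamate": -1, "NAD+": -1, "H2O": -1,
    "α-ketoglutarate": 1, "NH4+": 1, "NADH": 1, "H+": 1,
}

net = net_reaction(transamination, oxidative_deamination)
print(net)
```

Glutamate and α-ketoglutarate drop out of the result, which reflects their role as carriers that are regenerated on each pass.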


In most terrestrial vertebrates, NH4+ is converted into urea, which is
excreted.

[Scheme: α-amino acid + α-ketoglutarate ⇌ α-ketoacid + glutamate;
glutamate + NAD+ + H2O ⇌ α-ketoglutarate + NADH + NH4+; NH4+ → urea.]


Mechanism: Pyridoxal phosphate forms Schiff-base 
intermediates in aminotransferases 


All aminotransferases contain the prosthetic group pyridoxal phosphate 
(PLP), which is derived from pyridoxine (vitamin B6). Pyridoxal phosphate
includes a pyridine ring that is slightly basic to which is attached an 
OH group that is slightly acidic. Thus, pyridoxal phosphate derivatives 
can form a stable tautomeric form in which the pyridine nitrogen atom is 
protonated and, hence, positively charged, while the OH group loses a pro- 
ton and hence is negatively charged, forming a phenolate. 
[Structures: PLP (protonated) and PLP (phenolate)]


The most important functional group on PLP is the aldehyde. This 
group forms covalent Schiff-base intermediates with amino acid substrates. 
Indeed, even in the absence of substrate, the aldehyde group of PLP usually
forms a Schiff-base linkage with the ε-amino group of a specific lysine
residue at the enzyme’s active site. A new Schiff-base linkage is formed on 
addition of an amino acid substrate. 

The α-amino group of the amino acid substrate displaces the ε-amino group
of the active-site lysine residue. In other words, an internal aldimine becomes
an external aldimine. The amino acid-PLP Schiff base that is formed
remains tightly bound to the enzyme by multiple noncovalent interactions.
The Schiff-base linkage often accepts a proton at the N, with the positive
charge stabilized by interaction with the negatively charged phenolate
group of PLP.

Figure 23.11 Transamination mechanism. (1) The external aldimine loses a
proton to form a quinonoid intermediate. (2) Reprotonation of this
intermediate at the aldehyde carbon atom yields a ketimine. (3) This
intermediate is hydrolyzed to generate the α-ketoacid product and
pyridoxamine phosphate.


The Schiff base between the amino acid substrate and PLP, the external
aldimine, loses a proton from the α-carbon atom of the amino acid to form a
quinonoid intermediate (Figure 23.11). Reprotonation of this intermediate at
the aldehyde carbon atom yields a ketimine. The ketimine is then hydrolyzed
to an α-ketoacid and pyridoxamine phosphate (PMP). These steps constitute
half of the transamination reaction.


Amino acid₁ + E-PLP ⇌ α-ketoacid₁ + E-PMP


The second half takes place by the reverse of the preceding pathway. A
second α-ketoacid reacts with the enzyme-pyridoxamine phosphate complex
(E-PMP) to yield a second amino acid and regenerate the enzyme-pyridoxal
phosphate complex (E-PLP).


α-Ketoacid₂ + E-PMP ⇌ amino acid₂ + E-PLP

[Structure: pyridoxamine phosphate (PMP)]

The sum of these partial reactions is


Amino acid₁ + α-ketoacid₂ ⇌ amino acid₂ + α-ketoacid₁
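
The two half-reactions can be pictured as an enzyme that alternates between its PLP and PMP forms. The sketch below models only that alternation, using the amino acid and α-ketoacid pairs named earlier in this section; the class and method names are hypothetical, and no chemistry beyond the bookkeeping is represented.

```python
# Amino acid / α-ketoacid pairs taken from the reactions in this section.
KETOACID_OF = {
    "aspartate": "oxaloacetate",
    "alanine": "pyruvate",
    "glutamate": "α-ketoglutarate",
}
AMINO_ACID_OF = {ketoacid: amino for amino, ketoacid in KETOACID_OF.items()}


class Aminotransferase:
    """Tracks which cofactor form the enzyme holds between half-reactions."""

    def __init__(self):
        self.cofactor = "PLP"

    def first_half(self, amino_acid):
        # Amino acid 1 + E-PLP -> α-ketoacid 1 + E-PMP
        if self.cofactor != "PLP":
            raise ValueError("enzyme must be in the PLP form")
        self.cofactor = "PMP"
        return KETOACID_OF[amino_acid]

    def second_half(self, ketoacid):
        # α-Ketoacid 2 + E-PMP -> amino acid 2 + E-PLP
        if self.cofactor != "PMP":
            raise ValueError("enzyme must be in the PMP form")
        self.cofactor = "PLP"
        return AMINO_ACID_OF[ketoacid]


def transaminate(enzyme, amino_acid_1, ketoacid_2):
    """Run both half-reactions and return (α-ketoacid 1, amino acid 2)."""
    ketoacid_1 = enzyme.first_half(amino_acid_1)
    amino_acid_2 = enzyme.second_half(ketoacid_2)
    return ketoacid_1, amino_acid_2


enzyme = Aminotransferase()
print(transaminate(enzyme, "aspartate", "α-ketoglutarate"))
```

After both halves the enzyme is back in its PLP form, so the cofactor does not appear in the summed reaction.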


Aspartate aminotransferase is an archetypal 
pyridoxal-dependent transaminase 


The mitochondrial enzyme aspartate aminotransferase provides an especially
well studied example of PLP as a coenzyme for transamination reactions 
(Figure 23.12). X-ray crystallographic studies provided detailed views of 
how PLP and substrates are bound and confirmed much of the proposed 
catalytic mechanism. Each of the identical 45-kd subunits of this dimer 
consists of a large domain and a small one. PLP is bound to the large 
domain, in a pocket near the subunit interface. In the absence of substrate, 
the aldehyde group of PLP is in a Schiff-base linkage with lysine 258, as 
expected. Adjacent to the coenzyme’s binding site is a conserved arginine 
residue that interacts with the α-carboxylate group of the amino acid
substrate, helping to orient the substrate appropriately in the active site. A
base is necessary to remove a proton from the α-carbon atom of the amino
acid and to transfer it to the aldehyde carbon atom of PLP (see Figure 23.11, 
steps 1 and 2). The lysine amino group that was initially in Schiff-base 
linkage with PLP appears to serve as the proton donor and acceptor. 

Pyridoxal phosphate enzymes catalyze a wide array of reactions


Transamination is just one of a wide range of amino acid transformations
that are catalyzed by PLP enzymes. The other reactions catalyzed by PLP
enzymes at the α-carbon atom of amino acids are decarboxylations,
deaminations, racemizations, and aldol cleavages (Figure 23.13). In addition,
PLP enzymes catalyze elimination and replacement reactions at the β-carbon
atom (e.g., tryptophan synthetase in the synthesis of tryptophan) and the
γ-carbon atom (e.g., cystathionine β-synthase in the synthesis of cysteine)
of amino acid substrates. Three common features of PLP catalysis underlie
these diverse reactions.


1. A Schiff base is formed by the amino acid substrate (the amine compo- 
nent) and PLP (the carbonyl component). 


2. The protonated form of PLP acts as an electron sink to stabilize catalytic 
intermediates that are negatively charged. Electrons from these intermedi- 
ates are attracted to the positive charge on the ring nitrogen atom. In other 
words, PLP is an electrophilic catalyst. 


3. The product Schiff base is cleaved at the completion of the reaction. 


Figure 23.12 Aspartate aminotransferase. The active site of this
prototypical PLP-dependent enzyme includes pyridoxal phosphate attached to
the enzyme by a Schiff-base linkage with lysine 258. An arginine residue in
the active site helps orient substrates by binding to their α-carboxylate
groups. Only one of the enzyme’s two subunits is shown. [Drawn from
1AAW.pdb.]

Figure 23.13 Bond cleavage by PLP enzymes. Pyridoxal phosphate enzymes
labilize one of three bonds at the α-carbon atom of an amino acid substrate.
For example, bond a is labilized by aminotransferases, bond b by
decarboxylases, and bond c by aldolases (such as threonine aldolases). PLP
enzymes also catalyze reactions at the β- and γ-carbon atoms of amino acids.

How does an enzyme selectively break a particular one of three bonds at
the α-carbon atom of an amino acid substrate? An important principle is
that the bond being broken must be perpendicular to the π orbitals of the
electron sink (Figure 23.14). An aminotransferase, for example, binds the
amino acid substrate so that the Cα-H bond is perpendicular to the PLP
ring (Figure 23.15). In serine hydroxymethyltransferase, the enzyme that
converts serine into glycine, the N-Cα bond is rotated so that the Cα-Cβ
bond is most nearly perpendicular to the plane of the PLP ring, favoring its
cleavage. This means of choosing one of several possible catalytic outcomes
is called stereoelectronic control.

Figure 23.14 Stereoelectronic effects. The orientation about the NH-Cα bond
determines the most favored reaction catalyzed by a pyridoxal phosphate
enzyme. The bond that is most nearly perpendicular to the plane of
delocalized π orbitals (represented by dashed lines) of the pyridoxal
phosphate electron sink is most easily cleaved.

Figure 23.15 Reaction choice. In aspartate aminotransferase, the Cα-H bond
is most nearly perpendicular to the π-orbital system and is cleaved. In
serine hydroxymethyltransferase, a small rotation about the N-Cα bond places
the Cα-Cβ bond perpendicular to the π system, favoring its cleavage.
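
The geometric rule behind stereoelectronic control can be made concrete with a little vector arithmetic: of the three bonds at the α-carbon atom, the one making the largest angle with the plane of the PLP ring is the one favored for cleavage. The coordinates below are an idealized, hypothetical tetrahedral arrangement chosen for illustration (ring in the xy plane, N-Cα bond along the y axis); they are not taken from any crystal structure, and the 120° rotation is simply what this idealized geometry needs to exchange two substituents.

```python
import math


def angle_to_plane(bond, plane_normal):
    """Angle in degrees between a bond vector and a plane, given the plane's normal."""
    dot = sum(b * n for b, n in zip(bond, plane_normal))
    length_b = math.sqrt(sum(b * b for b in bond))
    length_n = math.sqrt(sum(n * n for n in plane_normal))
    cos_to_normal = min(1.0, abs(dot) / (length_b * length_n))
    return 90.0 - math.degrees(math.acos(cos_to_normal))


def rotate_about_y(vec, degrees):
    """Rotate a vector about the y axis, which stands in for the N-Cα bond."""
    t = math.radians(degrees)
    x, y, z = vec
    return (x * math.cos(t) + z * math.sin(t), y, -x * math.sin(t) + z * math.cos(t))


def most_labile_bond(bonds, plane_normal):
    """Name of the bond most nearly perpendicular to the plane."""
    return max(bonds, key=lambda name: angle_to_plane(bonds[name], plane_normal))


RING_NORMAL = (0.0, 0.0, 1.0)  # PLP ring assumed to lie in the xy plane

# Hypothetical bond vectors from the α-carbon atom (arbitrary units).
bonds = {
    "Cα-H": (0.0, 0.33, 0.94),
    "Cα-COO-": (0.82, 0.33, -0.47),
    "Cα-Cβ": (-0.82, 0.33, -0.47),
}
rotated = {name: rotate_about_y(v, 120.0) for name, v in bonds.items()}

print(most_labile_bond(bonds, RING_NORMAL))
print(most_labile_bond(rotated, RING_NORMAL))
```

In the first orientation the Cα-H bond stands nearly upright on the ring plane, as in an aminotransferase; after the rotation the Cα-Cβ bond takes its place, which corresponds to the situation described for serine hydroxymethyltransferase.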


Many of the PLP enzymes that catalyze amino acid transformations,
such as serine hydroxymethyltransferase, have a similar structure and
are clearly related by divergent evolution. Others, such as tryptophan 
synthetase, have quite different overall structures. Nonetheless, the active 
sites of these enzymes are remarkably similar to that of aspartate amino- 
transferase, revealing the effects of convergent evolution. 


Serine and threonine can be directly deaminated 


The α-amino groups of serine and threonine can be directly converted into
NH4+ without first being transferred to α-ketoglutarate. These direct
deaminations are catalyzed by serine dehydratase and threonine dehydratase,
in which PLP is the prosthetic group.


Serine → pyruvate + NH4+
Threonine → α-ketobutyrate + NH4+


These enzymes are called dehydratases because dehydration precedes
deamination. Serine loses a hydrogen ion from its α-carbon atom and a
hydroxide ion group from its β-carbon atom to yield aminoacrylate. This
unstable compound reacts with H2O to give pyruvate and NH4+. Thus, the
presence of a hydroxyl group attached to the β-carbon atom in each of these
amino acids permits the direct deamination.


Peripheral tissues transport nitrogen to the liver 


Although most amino acid degradation takes place in the liver, other tissues 
can degrade amino acids. For instance, muscle uses branched-chain amino 
acids as a source of fuel during prolonged exercise and fasting. How is the 
nitrogen processed in these other tissues? As in the liver, the first step is 
the removal of the nitrogen from the amino acid. However, muscle lacks the 
enzymes of the urea cycle, and so the nitrogen must be released in a non- 
toxic form that can be absorbed by the liver and converted into urea. 
Nitrogen is transported from muscle to the liver in two principal trans- 
port forms. Glutamate is formed by transamination reactions, but the nitro- 
gen is then transferred to pyruvate to form alanine, which is released into 
the blood (Figure 23.16). The liver takes up the alanine and converts it back 
into pyruvate by transamination. The pyruvate can be used for
gluconeogenesis and the amino group eventually appears as urea. This
transport is referred to as the glucose-alanine cycle. It is reminiscent of
the Cori cycle discussed earlier (see Figure 16.33). However, in contrast
with the Cori cycle, pyruvate is not reduced to lactate by NADH, and thus
more high-energy electrons are available for oxidative phosphorylation.

[Figure 23.16 diagram. Active pathways: 1. Glycogen breakdown, Chapter 21;
2. Glycolysis, Chapter 16; 3. Citric acid cycle, Chapter 17; 4. Oxidative
phosphorylation, Chapter 18; 5. Gluconeogenesis, Chapter 16; 6. Urea cycle,
Chapter 23. Labels in the liver panel: glucose, pyruvate, glutamate,
alanine, NH4+, urea.]

Nitrogen can also be transported as glutamine. Glutamine synthetase
catalyzes the synthesis of glutamine from glutamate and NH4+ in an
ATP-dependent reaction:


NH4+ + glutamate + ATP → glutamine + ADP + Pi   (glutamine synthetase)


The nitrogens of glutamine can be converted into urea in the liver.


23.4 Ammonium Ion Is Converted into Urea in
Most Terrestrial Vertebrates 


Some of the NH4+ formed in the breakdown of amino acids is consumed in
the biosynthesis of nitrogen compounds. In most terrestrial vertebrates, the
excess NH4+ is converted into urea and then excreted. Such organisms are
referred to as ureotelic. 

In terrestrial vertebrates, urea is synthesized by the urea cycle
(Figure 23.17). The urea cycle, proposed by Hans Krebs and Kurt Henseleit in
1932, was the first cyclic metabolic pathway to be discovered. One of the
nitrogen atoms of urea is transferred from an amino acid, aspartate. The
other nitrogen atom is derived directly from free NH4+, and the carbon atom
comes from HCO3- (derived by the hydration of CO2; see Section 9.2).

[Scheme: the two nitrogen atoms of urea derive from NH4+ and from an amino
group (R-NH2); the carbon atom derives from CO2.]

Figure 23.16 PATHWAY INTEGRATION: The glucose-alanine cycle. During
prolonged exercise and fasting, muscle uses branched-chain amino acids as
fuel. The nitrogen removed is transferred (through glutamate) to alanine,
which is released into the bloodstream. In the liver, alanine is taken up
and converted into pyruvate for the subsequent synthesis of glucose.

Figure 23.17 The urea cycle. [Diagram labels: mitochondrial matrix:
carbamoyl phosphate, ornithine, citrulline; cytoplasm: citrulline,
aspartate, argininosuccinate, fumarate, arginine, H2O, urea, ornithine.]


The urea cycle begins with the formation of carbamoyl
phosphate


The urea cycle begins with the coupling of free NH3 with HCO3- to form
carbamoyl phosphate. Carbamoyl phosphate is a simple molecule, but its
synthesis is complex. Carbamoyl phosphate synthetase catalyzes the required
three steps.


Bicarbonate + ATP → carboxyphosphate + ADP
Carboxyphosphate + NH3 → carbamic acid + Pi
Carbamic acid + ATP → carbamoyl phosphate + ADP


Note that NH3, because it is a strong base, normally exists as NH4+ in
aqueous solution. However, carbamoyl phosphate synthetase uses only
NH3 as a substrate. The reaction begins with the phosphorylation of
HCO3- to form carboxyphosphate, which then reacts with NH3 to form
carbamic acid. Finally, a second molecule of ATP phosphorylates carbamic 
acid to form carbamoyl phosphate. The structure and mechanism of the 
enzyme that catalyzes these reactions will be presented in Chapter 25. The 
consumption of two molecules of ATP makes this synthesis of carbamoyl 
phosphate essentially irreversible. The mammalian enzyme requires 
N-acetylglutamate for activity, as will be described shortly. 

The carbamoyl group of carbamoyl phosphate has a high transfer 
potential because of its anhydride bond. The carbamoyl group is trans- 
ferred to ornithine to form citrulline, in a reaction catalyzed by ornithine 
transcarbamoylase. 


Ornithine + carbamoyl phosphate → citrulline + Pi   (ornithine transcarbamoylase)


Ornithine and citrulline are amino acids, but they are not used as building 
blocks of proteins. The formation of NH4+ by glutamate dehydrogenase, its
incorporation into carbamoyl phosphate as NH3, and the subsequent syn- 
thesis of citrulline take place in the mitochondrial matrix. In contrast, the 
next three reactions of the urea cycle, which lead to the formation of urea, 
take place in the cytoplasm. 

Citrulline is transported to the cytoplasm, where it condenses with 
aspartate, the donor of the second amino group of urea. This synthesis of 
argininosuccinate, catalyzed by argininosuccinate synthetase, is driven by the 
cleavage of ATP into AMP and pyrophosphate and by the subsequent 
hydrolysis of pyrophosphate. 


Citrulline + aspartate + ATP → argininosuccinate + AMP + PPi


Argininosuccinase cleaves argininosuccinate into arginine and fumarate. 
Thus, the carbon skeleton of aspartate is preserved in the form of 
fumarate. 


Argininosuccinate → arginine + fumarate   (argininosuccinase)


Finally, arginine is hydrolyzed to generate urea and ornithine in a reaction 
catalyzed by arginase. Ornithine is then transported back into the mito- 
chondrion to begin another cycle. The urea is excreted. Indeed, human 
beings excrete about 10 kg (22 pounds) of urea per year. 


Arginine + H2O → ornithine + urea   (arginase)
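
The figure of about 10 kg of urea per year can be put on a daily and molar footing. The sketch below does the conversion, assuming a 365-day year and standard atomic masses (urea, CH4N2O, is about 60.06 g/mol); the function name is a label chosen here, and the result is only as precise as the round 10-kg estimate.

```python
UREA_MOLAR_MASS = 60.06        # g/mol for CH4N2O
NITROGEN_ATOMIC_MASS = 14.007  # g/mol


def urea_excretion(kg_per_year=10.0, days_per_year=365.0):
    """Return (g urea per day, mol urea per day, g nitrogen per day)."""
    grams_per_day = kg_per_year * 1000.0 / days_per_year
    mol_per_day = grams_per_day / UREA_MOLAR_MASS
    # Each urea molecule carries two nitrogen atoms.
    grams_nitrogen_per_day = mol_per_day * 2 * NITROGEN_ATOMIC_MASS
    return grams_per_day, mol_per_day, grams_nitrogen_per_day


grams, moles, grams_n = urea_excretion()
print(f"{grams:.1f} g urea/day, {moles:.2f} mol/day, {grams_n:.1f} g N/day")
```

On these assumptions the yearly figure corresponds to roughly 27 g of urea, a little under half a mole, each day.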


The urea cycle is linked to gluconeogenesis 


The stoichiometry of urea synthesis is 


CO2 + NH4+ + 3 ATP + aspartate + 2 H2O →
urea + 2 ADP + 2 Pi + AMP + PPi + fumarate


Pyrophosphate is rapidly hydrolyzed, and so the equivalent of four molecules
of ATP is consumed in these reactions to synthesize one molecule
of urea. The synthesis of fumarate by the urea cycle is important because it 
is a precursor for glucose synthesis (Figure 23.18). Fumarate is hydrated to 
malate, which is in turn oxidized to oxaloacetate. Oxaloacetate can be con- 
verted into glucose by gluconeogenesis or transaminated to aspartate. 
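
The count of four ATP equivalents follows from how each ATP is cleaved, and the phosphate products can be checked against the three ATP consumed. The sketch below performs both tallies; it assumes two Pi among the products, which is what balancing phosphorus against 3 ATP requires, and the dictionary layout and function names are illustrative choices.

```python
# Phosphorus atoms per molecule.
PHOSPHORUS = {"ATP": 3, "ADP": 2, "AMP": 1, "PPi": 2, "Pi": 1}

# Nucleotide and phosphate terms of the urea-synthesis stoichiometry.
reactants = {"ATP": 3}
products = {"ADP": 2, "Pi": 2, "AMP": 1, "PPi": 1}


def phosphorus_count(side):
    """Total phosphorus atoms on one side of the equation."""
    return sum(PHOSPHORUS[species] * n for species, n in side.items())


def atp_equivalents(product_side):
    """High-energy phosphate bonds spent.

    ATP -> ADP + Pi costs one bond; ATP -> AMP + PPi, followed by hydrolysis
    of PPi to 2 Pi, costs two.
    """
    return product_side.get("ADP", 0) * 1 + product_side.get("AMP", 0) * 2


print(phosphorus_count(reactants), phosphorus_count(products))
print(atp_equivalents(products))
```

Two bonds are spent in making carbamoyl phosphate and two more in making argininosuccinate once the pyrophosphate is hydrolyzed.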


[Figure 23.18 diagram labels: carbamoyl phosphate, citrulline, ornithine,
argininosuccinate, aspartate, α-ketoacid, α-amino acid, oxaloacetate,
malate, glucose.]

In ancient Rome, urine was a valuable
commodity. Vessels were placed on street 
corners for passersby to urinate into. 
Bacteria would degrade the urea, releasing 
ammonium ion, which would act as a 
bleach to brighten togas. 


Figure 23.18 Metabolic integration of 
nitrogen metabolism. The urea cycle, 
gluconeogenesis, and the transamination of 
oxaloacetate are linked by fumarate and 
aspartate. 

Figure 23.19 Homologous enzymes.
The structure of the catalytic subunit of 
ornithine transcarbamoylase (blue) is quite 
similar to that of the catalytic subunit of 
aspartate transcarbamoylase (red), indicating 
that these two enzymes are homologs. 
[Drawn from 1RKM.pdb and 1RAI.pdb.] 


Urea-cycle enzymes are evolutionarily related to enzymes in
other metabolic pathways 


Carbamoyl phosphate synthetase generates carbamoyl phosphate for
both the urea cycle and the first step in pyrimidine biosynthesis 
(Section 25.1). In mammals, two distinct isozymes of the enzyme are present. 
The carbamoyl phosphate synthetase used in pyrimidine biosynthesis differs 
in two important ways from its urea-cycle counterpart. First, this enzyme 
utilizes glutamine as a nitrogen source rather than NH3. The side-chain 
amide of glutamine is hydrolyzed within one domain of the enzyme, and the
ammonia generated moves through a tunnel in the enzyme to a second 
active site, where it reacts with carboxyphosphate. Second, this enzyme is 
part of a large polypeptide called CAD that comprises three distinct 
enzymes: carbamoyl phosphate synthetase, aspartate transcarbamoylase, 
and dihydroorotase. All three enzymes catalyze steps in pyrimidine biosyn- 
thesis (Section 25.1). Interestingly, the domain in which glutamine hydrolysis 
takes place is largely preserved in the urea-cycle enzyme, although that 
domain is catalytically inactive. This site binds N-acetylglutamate, an allos- 
teric activator of the enzyme. N-Acetylglutamate is synthesized whenever 
the rate of amino acid catabolism increases and, consequently, signals that 
the ammonium ion generated in the catabolism of the free amino acids must 
be disposed of. A catalytic site in one isozyme has been adapted to act as an 
allosteric site in another isozyme having a different physiological role. 

Can we find homologs for the other enzymes in the urea cycle? Ornithine 
transcarbamoylase is homologous to aspartate transcarbamoylase, which 
catalyzes the first step in pyrimidine biosynthesis, and the structures of 
their catalytic subunits are quite similar (Figure 23.19). Thus, two consecu- 
tive steps in the pyrimidine biosynthetic pathway were adapted for urea 
synthesis. The next step in the urea cycle is the addition of aspartate to 
citrulline to form argininosuccinate, and the subsequent step is the removal 
of fumarate. These two steps together accomplish the net addition of an 
amino group to citrulline to form arginine. Remarkably, these steps are 
analogous to two consecutive steps in the purine biosynthetic pathway 
(Section 25.2). 


[Scheme: two consecutive steps of purine biosynthesis, in which aspartate is
added and fumarate is then removed.]

The enzymes that catalyze these steps are homologous to argininosuccinate
synthetase and argininosuccinase, respectively. Thus, four of the five 
enzymes in the urea cycle were adapted from enzymes taking part in nucle- 
otide biosynthesis. The remaining enzyme, arginase, appears to be an 
ancient enzyme found in all domains of life. 


Inherited defects of the urea cycle cause hyperammonemia and can 
lead to brain damage 


The synthesis of urea in the liver is the major route for the removal of
NH4+. A blockage of carbamoyl phosphate synthesis or of any of the
four steps of the urea cycle has devastating consequences because there is no
alternative pathway for the synthesis of urea. All defects in the urea cycle lead
to an elevated level of NH4+ in the blood (hyperammonemia). Some of these
genetic defects become evident a day or two after birth, when the afflicted
infant becomes lethargic and vomits periodically. Coma and irreversible
brain damage may soon follow. Why are high levels of NH4+ toxic? The
answer to this question is not yet known. Recent work, however, suggests
that NH4+ may inappropriately activate a sodium-potassium-chloride
cotransporter. This activation disrupts the osmotic balance of the nerve
cell, causing swelling that damages the cell and results in neurological
disorders.

Ingenious strategies for coping with deficiencies in urea synthesis have 
been devised on the basis of a thorough understanding of the underlying 
biochemistry. Consider, for example, argininosuccinase deficiency. This 
defect can be partly bypassed by providing a surplus of arginine in the diet 
and restricting the total protein intake. In the liver, arginine is split into urea 
and ornithine, which then reacts with carbamoyl phosphate to form citrulline
(Figure 23.20). This urea-cycle intermediate condenses with aspartate
to yield argininosuccinate, which is then excreted. Note that two nitrogen 
atoms—one from carbamoyl phosphate and the other from aspartate—are 
eliminated from the body per molecule of arginine provided in the diet. In 
essence, argininosuccinate substitutes for urea in carrying nitrogen out of the 
body. 
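The nitrogen bookkeeping behind this bypass can be checked with a short sketch. The per-molecule nitrogen counts come from the structures of the compounds; the dictionary and function names are illustrative assumptions, not part of any established tool.

```python
# Nitrogen atoms per molecule, read from the chemical structures.
N_ATOMS = {
    "arginine": 4,
    "urea": 2,
    "ornithine": 2,
    "carbamoyl phosphate": 1,
    "citrulline": 3,
    "aspartate": 1,
    "argininosuccinate": 4,
}


def net_waste_nitrogen_removed(arginine_molecules: int) -> int:
    """Waste nitrogen atoms eliminated when dietary arginine is split into
    urea and ornithine and the resulting argininosuccinate is excreted."""
    excreted = arginine_molecules * (
        N_ATOMS["urea"] + N_ATOMS["argininosuccinate"]
    )
    supplied_by_diet = arginine_molecules * N_ATOMS["arginine"]
    return excreted - supplied_by_diet


# Each step of the bypass conserves nitrogen.
assert N_ATOMS["arginine"] == N_ATOMS["urea"] + N_ATOMS["ornithine"]
assert N_ATOMS["ornithine"] + N_ATOMS["carbamoyl phosphate"] == N_ATOMS["citrulline"]
assert N_ATOMS["citrulline"] + N_ATOMS["aspartate"] == N_ATOMS["argininosuccinate"]

print(net_waste_nitrogen_removed(1))  # 2
```

The net figure of two matches the statement above: the nitrogen atoms supplied by the dietary arginine itself cancel, leaving one from carbamoyl phosphate and one from aspartate.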

The treatment of carbamoyl phosphate synthetase deficiency or ornithine 
transcarbamoylase deficiency illustrates a different strategy for circumvent- 
ing a metabolic block. Citrulline and argininosuccinate cannot be used to 
dispose of nitrogen atoms because their formation is impaired. Under these 
conditions, excess nitrogen accumulates in glycine and glutamine. The chal- 
lenge then is to rid the body of the nitrogen accumulating in these two 
amino acids. That goal is accomplished by supplementing a protein- 
restricted diet with large amounts of benzoate and phenylacetate. Benzoate is 
activated to benzoyl CoA, which reacts with glycine to form hippurate 
(Figure 23.21). Likewise, phenylacetate is activated to phenylacetyl CoA, 
which reacts with glutamine to form phenylacetylglutamine. These conju- 
gates substitute for urea in the disposal of nitrogen. Thus, latent biochemical 
pathways can be activated to partly bypass a genetic defect. 
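As a rough illustration of how much nitrogen these conjugates can carry, the sketch below counts nitrogen atoms per excreted molecule. It assumes, purely for illustration, that every molecule of benzoate and phenylacetate is conjugated and excreted; the function names are hypothetical.

```python
# Nitrogen atoms carried out of the body per molecule of each conjugate.
# Hippurate contains one glycine (1 N); phenylacetylglutamine contains one
# glutamine (2 N: the alpha-amino nitrogen and the side-chain amide nitrogen).
N_PER_CONJUGATE = {"hippurate": 1, "phenylacetylglutamine": 2}


def nitrogen_removed(mol_benzoate: float, mol_phenylacetate: float) -> float:
    """Moles of nitrogen excreted, assuming complete conjugation (an
    idealization, not a clinical estimate)."""
    return (
        mol_benzoate * N_PER_CONJUGATE["hippurate"]
        + mol_phenylacetate * N_PER_CONJUGATE["phenylacetylglutamine"]
    )


def urea_equivalents(mol_benzoate: float, mol_phenylacetate: float) -> float:
    """The same quantity expressed as moles of urea (2 N per molecule)."""
    return nitrogen_removed(mol_benzoate, mol_phenylacetate) / 2


print(nitrogen_removed(1.0, 1.0))  # 3.0
print(urea_equivalents(1.0, 1.0))  # 1.5
```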


Urea is not the only means of disposing of excess nitrogen 


As stated earlier, most terrestrial vertebrates are ureotelic; they excrete
excess nitrogen as urea. However, urea is not the only excretable form of
nitrogen. Ammoniotelic organisms, such as aquatic vertebrates and invertebrates,
release nitrogen as NH4+ and rely on the aqueous environment to
dilute this toxic substance. Interestingly, lungfish, which are normally
ammoniotelic, become ureotelic in time of drought, when they live out of
the water.


Figure 23.20 Treatment of argininosuccinase deficiency.
Argininosuccinase deficiency can be managed by supplementing the diet with
arginine. Nitrogen is excreted in the form of argininosuccinate.


Figure 23.21 Treatment of carbamoyl phosphate synthetase and ornithine
transcarbamoylase deficiencies. Both deficiencies can be treated by
supplementing the diet with benzoate and phenylacetate. Nitrogen is excreted
in the form of hippurate and phenylacetylglutamine.


Figure 23.22 Fates of the carbon skeletons of amino acids. Glucogenic
amino acids are shaded red, and ketogenic amino acids are shaded yellow.
Several amino acids are both glucogenic and ketogenic.

Both ureotelic and ammoniotelic organisms depend on sufficient water, 
to varying degrees, for nitrogen excretion. In contrast, uricotelic organisms, 
such as birds and reptiles, secrete nitrogen as the purine uric acid. Uric acid is 
secreted as an almost solid slurry requiring little water. The secretion of uric 
acid also has the advantage of removing four atoms of nitrogen per mole- 
cule. The pathway for nitrogen excretion developed in the course of evolu- 
tion clearly depends on the habitat of the organism. 
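The nitrogen carried per molecule by each excretory form can be read directly from its molecular formula. The sketch below is a minimal illustration; the formulas are for the neutral species (ammonia is written as NH3 rather than NH4+), and the small parser is an assumption that handles only simple formulas without parentheses.

```python
import re

# Molecular formulas of the three excretory forms (neutral species).
FORMULAS = {
    "ammonia": "NH3",
    "urea": "CH4N2O",
    "uric acid": "C5H4N4O3",
}


def count_atoms(formula: str) -> dict:
    """Parse a simple formula such as 'C5H4N4O3' into element counts."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


for name, formula in FORMULAS.items():
    print(name, count_atoms(formula)["N"])
# ammonia 1
# urea 2
# uric acid 4
```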


23.5 Carbon Atoms of Degraded Amino Acids Emerge As 
Major Metabolic Intermediates 


We now turn to the fates of the carbon skeletons of amino acids after the 
removal of the α-amino group. The strategy of amino acid degradation is to
transform the carbon skeletons into major metabolic intermediates that can be 
converted into glucose or oxidized by the citric acid cycle. The conversion 
pathways range from extremely simple to quite complex. The carbon skel- 
etons of the diverse set of 20 fundamental amino acids are funneled into 
only seven molecules: pyruvate, acetyl CoA, acetoacetyl CoA, α-ketoglutarate,
succinyl CoA, fumarate, and oxaloacetate. We see here an example of the 
remarkable economy of metabolic conversions. 

Amino acids that are degraded to acetyl CoA or acetoacetyl CoA are 
termed ketogenic amino acids because they can give rise to ketone bodies or 
fatty acids. Amino acids that are degraded to pyruvate, α-ketoglutarate,
succinyl CoA, fumarate, or oxaloacetate are termed glucogenic amino acids.
The net synthesis of glucose from these amino acids is feasible because these
citric acid cycle intermediates and pyruvate can be converted into phospho- 
enolpyruvate and then into glucose (Section 16.3). Recall that mammals 
lack a pathway for the net synthesis of glucose from acetyl CoA or ace- 
toacetyl CoA. 

Of the basic set of 20 amino acids, only leucine and lysine are solely 
ketogenic (Figure 23.22). Isoleucine, phenylalanine, tryptophan, and 
tyrosine are both ketogenic and glucogenic. Some of their carbon atoms
emerge in acetyl CoA or acetoacetyl CoA, whereas others appear in poten- 
tial precursors of glucose. The other 14 amino acids are classed as solely 
glucogenic. This classification is not universally accepted, because different 
quantitative criteria are applied. Whether an amino acid is regarded as being
glucogenic, ketogenic, or both depends partly on the eye of the beholder, 
although the majority of amino acid carbons clearly end up in glucose or 
glycogen. We will identify the degradation pathways by the entry point into 
metabolism. 
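The classification just described can be captured as a small lookup. This is a sketch of the scheme as stated here (two solely ketogenic, four in both classes, the rest solely glucogenic); as noted, other sources apply different criteria, and the set and function names are illustrative.

```python
# Classification as stated in the text (see Figure 23.22); other sources
# may draw the lines differently.
SOLELY_KETOGENIC = {"leucine", "lysine"}
BOTH = {"isoleucine", "phenylalanine", "tryptophan", "tyrosine"}
ALL_AMINO_ACIDS = {
    "alanine", "arginine", "asparagine", "aspartate", "cysteine",
    "glutamate", "glutamine", "glycine", "histidine", "isoleucine",
    "leucine", "lysine", "methionine", "phenylalanine", "proline",
    "serine", "threonine", "tryptophan", "tyrosine", "valine",
}
# Everything not named above is treated as solely glucogenic.
SOLELY_GLUCOGENIC = ALL_AMINO_ACIDS - SOLELY_KETOGENIC - BOTH


def classify(amino_acid: str) -> str:
    """Return 'ketogenic', 'glucogenic', or 'both' for one of the 20 amino acids."""
    name = amino_acid.lower()
    if name not in ALL_AMINO_ACIDS:
        raise ValueError(f"not one of the 20 fundamental amino acids: {amino_acid}")
    if name in SOLELY_KETOGENIC:
        return "ketogenic"
    if name in BOTH:
        return "both"
    return "glucogenic"


print(len(SOLELY_GLUCOGENIC))  # 14
print(classify("Leucine"))     # ketogenic
print(classify("tyrosine"))    # both
```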


Pyruvate is an entry point into metabolism for a number 
of amino acids 


Pyruvate is the entry point of the three-carbon amino acids—alanine, 
serine, and cysteine—into the metabolic mainstream (Figure 23.23). The 
transamination of alanine directly yields pyruvate. 


Alanine + α-ketoglutarate ⇌ pyruvate + glutamate


As mentioned earlier in the chapter, glutamate is then oxidatively deaminated,
yielding NH4+ and regenerating α-ketoglutarate. The sum of these
reactions is


Alanine + NAD(P)+ + H2O → pyruvate + NH4+ + NAD(P)H + H+
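The cancellation that produces this sum can be made explicit by adding the two reactions as signed stoichiometries. The sketch below assumes the glutamate dehydrogenase reaction in its usual form (glutamate + NAD(P)+ + H2O gives α-ketoglutarate + NH4+ + NAD(P)H + H+); the helper names are illustrative.

```python
from collections import Counter


def reaction(substrates, products):
    """Signed stoichiometry: substrates count -1, products count +1."""
    net = Counter()
    for species in substrates:
        net[species] -= 1
    for species in products:
        net[species] += 1
    return net


def add_reactions(*reactions):
    """Sum reactions and drop species whose coefficients cancel to zero."""
    total = Counter()
    for rxn in reactions:
        for species, coeff in rxn.items():
            total[species] += coeff
    return {species: coeff for species, coeff in total.items() if coeff != 0}


transamination = reaction(
    ["alanine", "alpha-ketoglutarate"],
    ["pyruvate", "glutamate"],
)
oxidative_deamination = reaction(
    ["glutamate", "NAD(P)+", "H2O"],
    ["alpha-ketoglutarate", "NH4+", "NAD(P)H", "H+"],
)

net = add_reactions(transamination, oxidative_deamination)
print(net)
```

Glutamate and α-ketoglutarate appear on both sides and drop out, leaving exactly the species in the summed equation.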


Another simple reaction in the degradation of amino acids is the deami- 
nation of serine to pyruvate by serine dehydratase (p. 684). 


Serine → pyruvate + NH4+


Cysteine can be converted into pyruvate by several pathways, with its
sulfur atom emerging in H2S, SCN-, or SO3^2-.

The carbon atoms of three other amino acids can be converted into pyruvate.
Glycine can be converted into serine by the enzymatic addition of a
hydroxymethyl group or it can be cleaved to give CO2, NH4+, and an activated
one-carbon unit. Threonine can give rise to pyruvate through the
intermediate 2-amino-3-ketobutyrate. Three carbon atoms of tryptophan
can emerge in alanine, which can be converted into pyruvate.
Figure 23.23 Pyruvate formation from amino acids. Pyruvate is the point of entry for alanine,
serine, cysteine, glycine, threonine, and tryptophan. 
Aspartate and asparagine are converted into oxaloacetate, a citric acid cycle
intermediate. Aspartate, a four-carbon amino acid, is directly transaminated
to oxaloacetate.


Aspartate + α-ketoglutarate ⇌ oxaloacetate + glutamate


Asparagine is hydrolyzed by asparaginase to NH4+ and aspartate, which is
then transaminated.

Recall that aspartate can also be converted into fumarate by the urea 
cycle (see Figure 23.18). Fumarate is a point of entry for half the carbon 
atoms of tyrosine and phenylalanine, as will be discussed shortly. 
Figure 23.24 α-Ketoglutarate formation from amino acids. α-Ketoglutarate is the
point of entry of several five-carbon amino acids that are first converted
into glutamate.


Histidine is converted into 4-imidazolone 5-propionate (Figure 23.25).
The amide bond in the ring of this intermediate is hydrolyzed to the
N-formimino derivative of glutamate, which is then converted into glutamate
by the transfer of its formimino group to tetrahydrofolate, a carrier of
activated one-carbon units (see Figure 24.9).
Figure 23.25 Histidine degradation. Conversion of histidine into glutamate.


Glutamine is hydrolyzed to glutamate and NH4+ by glutaminase. Proline
and arginine are each converted into glutamate γ-semialdehyde, which is
then oxidized to glutamate (Figure 23.26).


Figure 23.26 Glutamate formation. Conversion of proline and arginine into
glutamate.
Succinyl coenzyme A is a point of entry for several nonpolar
amino acids 


Succinyl CoA is a point of entry for some of the carbon atoms of methio- 
nine, isoleucine, and valine. Propionyl CoA and then methylmalonyl CoA 
are intermediates in the breakdown of these three nonpolar amino acids 
(Figure 23.27). The mechanism for the interconversion of propionyl CoA 
and methylmalonyl CoA was presented in Section 22.3. This pathway from 
propionyl CoA to succinyl CoA is also used in the oxidation of fatty acids 
that have an odd number of carbon atoms (Section 22.3). 


Methionine degradation requires the formation of a key methyl donor, 
S-adenosylmethionine 


Methionine is converted into succinyl CoA in nine steps (Figure 23.28).
The first step is the adenylation of methionine to form S-adenosylmethionine
(SAM), a common methyl donor in the cell (Section 24.2). Loss of the
methyl and adenosyl groups yields homocysteine, which is eventually processed
to α-ketobutyrate. This α-ketoacid is oxidatively decarboxylated by
the α-ketoacid dehydrogenase complex to propionyl CoA, which is processed
to succinyl CoA, as described in Section 22.3.


The branched-chain amino acids yield acetyl CoA, acetoacetate, 
or propionyl CoA 


The branched-chain amino acids are degraded by reactions that we have 
already encountered in the citric acid cycle and fatty acid oxidation. Leucine 
is transaminated to the corresponding α-ketoacid, α-ketoisocaproate. This
α-ketoacid is oxidatively decarboxylated to isovaleryl CoA by the
branched-chain α-ketoacid dehydrogenase complex.


Figure 23.27 Succinyl CoA formation. Conversion of methionine,
isoleucine, and valine into succinyl CoA.


Figure 23.28 Methionine metabolism.
The pathway for the conversion of methionine
into succinyl CoA. S-Adenosylmethionine,
formed along this pathway, is an important
molecule for transferring methyl groups.


Leucine → α-Ketoisocaproate → Isovaleryl CoA


The α-ketoacids of valine and isoleucine, the other two branched-chain
aliphatic amino acids, also are substrates (as is α-ketobutyrate derived from
methionine). The oxidative decarboxylation of these α-ketoacids is analogous
to that of pyruvate to acetyl CoA and of α-ketoglutarate to succinyl
CoA. The branched-chain α-ketoacid dehydrogenase, a multienzyme
complex, is a homolog of pyruvate dehydrogenase (Section 17.1) and
α-ketoglutarate dehydrogenase (Section 17.2). Indeed, the E3 components
of these enzymes, which regenerate the oxidized form of lipoamide, are
identical.

The isovaleryl CoA derived from leucine is dehydrogenated to yield
β-methylcrotonyl CoA. This oxidation is catalyzed by isovaleryl CoA
dehydrogenase. The hydrogen acceptor is FAD, as in the analogous reaction
in fatty acid oxidation that is catalyzed by acyl CoA dehydrogenase.
β-Methylglutaconyl CoA is then formed by the carboxylation of
β-methylcrotonyl CoA at the expense of the hydrolysis of a molecule of ATP. As
might be expected, the carboxylation mechanism of β-methylcrotonyl
CoA carboxylase is similar to that of pyruvate carboxylase and acetyl CoA
carboxylase.


Isovaleryl CoA + FAD → β-methylcrotonyl CoA + FADH2
β-Methylcrotonyl CoA + CO2 + ATP → β-methylglutaconyl CoA + ADP + Pi

β-Methylglutaconyl CoA is then hydrated to form 3-hydroxy-3-methylglutaryl
CoA, which is cleaved into acetyl CoA and acetoacetate. This reaction
has already been discussed in regard to the formation of ketone bodies
from fatty acids (Section 22.3).

β-Methylglutaconyl CoA + H2O → 3-hydroxy-3-methylglutaryl CoA
3-Hydroxy-3-methylglutaryl CoA → acetyl CoA + acetoacetate
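Following the carbon atoms through the leucine pathway shows why one decarboxylation and one carboxylation leave all six carbons in the final products. The sketch below counts only the carbons of the amino acid-derived skeleton (the carbons of coenzyme A are excluded); the data structure and names are illustrative assumptions.

```python
# Carbon count of the amino acid-derived skeleton (CoA carbons excluded).
# Each entry: (intermediate formed, change in carbon count, what happened).
PATHWAY = [
    ("alpha-ketoisocaproate", 0, "transamination"),
    ("isovaleryl CoA", -1, "oxidative decarboxylation, CO2 lost"),
    ("beta-methylcrotonyl CoA", 0, "dehydrogenation by FAD"),
    ("beta-methylglutaconyl CoA", +1, "carboxylation, CO2 added"),
    ("3-hydroxy-3-methylglutaryl CoA", 0, "hydration"),
]
PRODUCTS = {"acetyl CoA": 2, "acetoacetate": 4}  # carbons in each product


def trace_carbons(start_carbons: int = 6) -> list:
    """Return (intermediate, carbons) pairs, starting from leucine (6 C)."""
    carbons = start_carbons
    trace = [("leucine", carbons)]
    for name, delta, _note in PATHWAY:
        carbons += delta
        trace.append((name, carbons))
    return trace


trace = trace_carbons()
for name, carbons in trace:
    print(f"{name}: {carbons} C")

# The final cleavage must account for every carbon of the last intermediate.
assert trace[-1][1] == sum(PRODUCTS.values())
```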


The degradative pathways of valine and isoleucine resemble that of leucine.
After transamination and oxidative decarboxylation to yield a CoA derivative,
the subsequent reactions are like those of fatty acid oxidation.
Isoleucine yields acetyl CoA and propionyl CoA, whereas valine yields
CO2 and propionyl CoA. The degradation of leucine, valine, and isoleucine
validates a point made earlier (Chapter 15): the number of reactions in
metabolism is large, but the number of kinds of reactions is relatively small.
It also provides a striking illustration of the underlying simplicity and
elegance of metabolism.


Oxygenases are required for the degradation of aromatic amino acids 


The degradation of the aromatic amino acids yields the common intermedi- 
ates acetoacetate, fumarate, and pyruvate. The degradation pathway is not 
as straightforward as that of the amino acids previously discussed. For the 
aromatic amino acids, molecular oxygen is used to break an aromatic ring. 

The degradation of phenylalanine begins with its hydroxylation to 
tyrosine, a reaction catalyzed by phenylalanine hydroxylase. This enzyme is 
called a monooxygenase (or mixed-function oxygenase) because one atom of O2
appears in the product and the other in H2O. 


Phenylalanine + O2 + tetrahydrobiopterin → tyrosine + H2O + quinonoid dihydrobiopterin
(catalyzed by phenylalanine hydroxylase)


The reductant here is tetrahydrobiopterin, an electron carrier that has 
not been previously discussed and is derived from the cofactor biopterin. 
Because biopterin is synthesized in the body, it is not a vitamin. 
Tetrahydrobiopterin is initially formed by the reduction of dihydrobiop- 
terin by NADPH in a reaction catalyzed by dihydrofolate reductase 
(Figure 23.29). The quinonoid form of dihydrobiopterin is produced in the 
hydroxylation of phenylalanine. It is reduced back to tetrahydrobiopterin 
by NADPH in a reaction catalyzed by dihydropteridine reductase. The sum 
of the reactions catalyzed by phenylalanine hydroxylase and dihydropteri- 
dine reductase is 


Phenylalanine + O2 + NADPH + H+ → tyrosine + NADP+ + H2O


Note that these reactions can also be used to synthesize tyrosine from 
phenylalanine. 
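A quick element and charge balance confirms that this net equation is consistent. The sketch below uses the molecular formulas of phenylalanine (C9H11NO2) and tyrosine (C9H11NO3) and, as a bookkeeping convention that is an assumption of this example, treats NADPH as NADP+ plus a hydride, so that the nicotinamide cofactor is tracked as a single pseudo-element "NADP".

```python
from collections import Counter

# (element counts, charge) for each species. "NADP" is a pseudo-element
# standing for the whole cofactor; NADPH differs from NADP+ by one hydride.
SPECIES = {
    "phenylalanine": (Counter(C=9, H=11, N=1, O=2), 0),
    "tyrosine": (Counter(C=9, H=11, N=1, O=3), 0),
    "O2": (Counter(O=2), 0),
    "H2O": (Counter(H=2, O=1), 0),
    "H+": (Counter(H=1), +1),
    "NADPH": (Counter(NADP=1, H=1), 0),
    "NADP+": (Counter(NADP=1), +1),
}


def totals(side):
    """Sum element counts and charge over one side of an equation."""
    atoms, charge = Counter(), 0
    for name in side:
        species_atoms, species_charge = SPECIES[name]
        atoms += species_atoms
        charge += species_charge
    return atoms, charge


def is_balanced(lhs, rhs) -> bool:
    return totals(lhs) == totals(rhs)


LHS = ["phenylalanine", "O2", "NADPH", "H+"]
RHS = ["tyrosine", "NADP+", "H2O"]
print(is_balanced(LHS, RHS))  # True
```

One oxygen atom of O2 ends up in tyrosine and the other in water, which is what the oxygen tally on each side (four atoms) reflects.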

The next step in the degradation of phenylalanine and tyrosine is the
transamination of tyrosine to p-hydroxyphenylpyruvate (Figure 23.30). This
α-ketoacid then reacts with O2 to form homogentisate. The enzyme catalyzing
this complex reaction, p-hydroxyphenylpyruvate hydroxylase, is called a
dioxygenase because both atoms of O2 become incorporated into the product, one on
the ring and one in the carboxyl group. The aromatic ring of homogentisate
is then cleaved by O2, which yields 4-maleylacetoacetate. This reaction is
catalyzed by homogentisate oxidase, another dioxygenase. 4-Maleylacetoacetate
is then isomerized to 4-fumarylacetoacetate by an enzyme that uses glutathione
as a cofactor. Finally, 4-fumarylacetoacetate is hydrolyzed to fumarate
and acetoacetate.


Dihydrobiopterin + NADPH + H+ → tetrahydrobiopterin + NADP+   (dihydrofolate reductase)
Quinonoid dihydrobiopterin + NADPH + H+ → tetrahydrobiopterin + NADP+   (dihydropteridine reductase)


Figure 23.29 Formation of tetrahydrobiopterin, an important electron carrier.
Tetrahydrobiopterin can be formed by the reduction of either of two forms of dihydrobiopterin.


Figure 23.30 Phenylalanine and tyrosine degradation. The pathway for the conversion of
phenylalanine into acetoacetate and fumarate.

Tryptophan degradation requires several oxygenases (Figure 23.31). 
Tryptophan 2,3-dioxygenase cleaves the pyrrole ring, and kynurenine
3-monooxygenase hydroxylates the remaining benzene ring, a reaction 
similar to the hydroxylation of phenylalanine to form tyrosine. Alanine is 
removed and the 3-hydroxyanthranilate is cleaved by another dioxygenase 
and subsequently processed to acetoacetyl CoA. Nearly all cleavages of 
aromatic rings in biological systems are catalyzed by dioxygenases, a class of 
enzymes discovered by Osamu Hayaishi. The active sites of these enzymes 
contain iron that is not part of heme or an iron-sulfur cluster. 


[Scheme: tryptophan is converted by a dioxygenase (O2) into N-formylkynurenine
and then kynurenine; a monooxygenase (O2, releasing H2O) acts next.]
Figure 23.31 Tryptophan degradation. The pathway for the conversion of tryptophan into
alanine and acetoacetate. 


23.6 Inborn Errors of Metabolism Can Disrupt 
Amino Acid Degradation 


Errors in amino acid metabolism provided some of the first examples
of biochemical defects linked to pathological conditions. For instance,
alcaptonuria is an inherited metabolic disorder caused by the absence of
homogentisate oxidase. In 1902, Archibald Garrod showed that alcaptonuria
is transmitted as a single recessive Mendelian trait. Furthermore, he
recognized that homogentisate is a normal intermediate in the degradation
of phenylalanine and tyrosine (see Figure 23.30) and that it accumulates in
alcaptonuria because its degradation is blocked. He concluded that “the
splitting of the benzene ring in normal metabolism is the work of a special
enzyme, that in congenital alcaptonuria this enzyme is wanting.”
Homogentisate accumulates and is excreted in the urine, which turns dark
on standing as homogentisate is oxidized and polymerized to a melanin-like
substance.

Although alcaptonuria is a relatively harmless condition, such is not the
case with other errors in amino acid metabolism. In maple syrup urine disease,
the oxidative decarboxylation of α-ketoacids derived from valine, isoleucine,
and leucine is blocked because the branched-chain dehydrogenase
is missing or defective. Hence, the levels of these α-ketoacids and the
branched-chain amino acids that give rise to them are markedly elevated in
both blood and urine. The urine of patients has the odor of maple syrup,
hence the name of the disease (also called branched-chain ketoaciduria).
Maple syrup urine disease usually leads to mental and physical retardation
unless the patient is placed on a diet low in valine, isoleucine, and leucine
early in life. The disease can be readily detected in newborns by screening urine
samples with 2,4-dinitrophenylhydrazine, which reacts with α-ketoacids to
form 2,4-dinitrophenylhydrazone derivatives. A definitive diagnosis can be
made by mass spectrometry.


α-Ketoacid + 2,4-dinitrophenylhydrazine → 2,4-dinitrophenylhydrazone derivative


Phenylketonuria is perhaps the best known of the diseases of amino acid 
metabolism. Phenylketonuria is caused by an absence or deficiency of phenyl- 
alanine hydroxylase or, more rarely, of its tetrahydrobiopterin cofactor. 
Phenylalanine accumulates in all body fluids because it cannot be converted into 
tyrosine. Normally, three-quarters of phenylalanine molecules are converted 
into tyrosine, and the other quarter become incorporated into proteins. 
Because the major outflow pathway is blocked in phenylketonuria, the 
blood level of phenylalanine is typically at least 20-fold as high as in nor- 
mal people. Minor fates of phenylalanine in normal people, such as the 
formation of phenylpyruvate, become major fates in phenylketonurics.
Indeed, the initial description of phenylketonuria in 1934 was made by
observing the reaction of phenylpyruvate in the urine of phenylketonurics
with FeCl3, which turns the urine olive green.

Almost all untreated phenylketonurics are severely mentally retarded. In 
fact, about 1% of patients in mental institutions have phenylketonuria. The 
brain weight of these people is below normal, myelination of their nerves is 
defective, and their reflexes are hyperactive. The life expectancy of untreat- 
ed phenylketonurics is drastically shortened. Half die by age 20 and three- 
quarters by age 30. The biochemical basis of their mental retardation is an 
enigma. 

Phenylketonurics appear normal at birth but are severely defective by 
age 1 if untreated. The therapy for phenylketonuria is a low-phenylalanine 
diet, supplemented with tyrosine because tyrosine is normally synthesized 
from phenylalanine. The aim is to provide just enough phenylalanine to 
meet the needs for growth and replacement. Proteins that have a low content 
of phenylalanine, such as casein from milk, are hydrolyzed and phenylala- 
nine is removed by adsorption. A low-phenylalanine diet must be started 
very soon after birth to prevent irreversible brain damage. In one study, the 
average IQ of phenylketonurics treated within a few weeks after birth was 
93; a control group treated starting at age 1 had an average IQ of 53. 

Early diagnosis of phenylketonuria is essential and has been accom- 
plished by mass screening programs. The phenylalanine level in the blood 
is the preferred diagnostic criterion because it is more sensitive and reliable 
than the FeCl3 test. Prenatal diagnosis of phenylketonuria with DNA
probes has become feasible because the gene has been cloned and the 
exact locations of many mutations have been discovered in the protein. 
Interestingly, whereas some mutations lower the activity of the enzyme, 
others decrease the enzyme concentration instead. These latter mutations
lead to degradation of the enzyme, at least in part by the
ubiquitin-proteasome pathway (Section 23.2).

The incidence of phenylketonuria is about 1 in 20,000 newborns. The 
disease is inherited as an autosomal recessive. Heterozygotes, who make up 
about 1.5% of a typical population, appear normal. Carriers of the phe- 
nylketonuria gene have a reduced level of phenylalanine hydroxylase, as 
indicated by an increased level of phenylalanine in the blood. However, this 
criterion is not absolute, because the blood levels of phenylalanine in carri- 
ers and normal people overlap to some extent. The measurement of the 
kinetics of the disappearance of intravenously administered phenylalanine 
is a more definitive test for the carrier state. It should be noted that a high 
blood level of phenylalanine in a pregnant woman can result in abnormal 
development of the fetus. This is a striking example of maternal-fetal
relationships at the molecular level. Table 23.4 lists some other diseases of
amino acid metabolism. 


Table 23.4 Inborn errors of amino acid metabolism


Disease          Enzyme deficiency                          Symptoms
Citrullinemia    Argininosuccinate lyase                    Lethargy, seizures, reduced muscle tension
Tyrosinemia      Various enzymes of tyrosine degradation    Weakness, self-mutilation, liver damage, mental retardation
Albinism         Tyrosinase                                 Absence of pigmentation
Homocystinuria   Cystathionine β-synthase                   Scoliosis, muscle weakness, mental retardation, thin blond hair
Hyperlysinemia   α-Aminoadipic semialdehyde dehydrogenase   Seizures, mental retardation, lack of muscle tone, ataxia
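
The incidence and carrier figures quoted for phenylketonuria can be related to each other with a short calculation. The sketch below assumes Hardy-Weinberg equilibrium (random mating, no selection), which is an assumption of this example and is not stated in the text; under it, a recessive disease incidence of q^2 implies a carrier frequency of 2pq.

```python
import math


def carrier_frequency(incidence: float) -> float:
    """Heterozygote frequency 2pq for an autosomal recessive disorder,
    assuming Hardy-Weinberg equilibrium with incidence = q**2."""
    q = math.sqrt(incidence)  # frequency of the disease allele
    p = 1.0 - q               # frequency of the normal allele
    return 2.0 * p * q


freq = carrier_frequency(1 / 20_000)
print(f"{freq:.2%}")  # 1.40%
```

The result, about 1.4%, is close to the figure of roughly 1.5% given for heterozygotes in a typical population.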


Summary


23.1 Proteins Are Degraded to Amino Acids

Dietary protein is digested in the intestine, producing amino acids that are 
transported throughout the body. Cellular proteins are degraded at widely 
variable rates, ranging from minutes to the life of the organism. 


23.2 Protein Turnover Is Tightly Regulated

The turnover of cellular proteins is a regulated process requiring com- 
plex enzyme systems. Proteins to be degraded are conjugated with 
ubiquitin, a small conserved protein, in a reaction driven by ATP 
hydrolysis. The ubiquitin-conjugating system is composed of three 
distinct enzymes. A large, barrel-shaped complex called the protea- 
some digests the ubiquitinated proteins. The proteasome also requires 
ATP hydrolysis to function. The resulting amino acids provide a 
source of precursors for protein, nucleotide bases, and other nitroge- 
nous compounds. 


23.3 The First Step in Amino Acid Degradation Is the Removal of Nitrogen
Surplus amino acids are used as building blocks and as metabolic fuel.
The first step in their degradation is the removal of their α-amino
groups by transamination to α-ketoacids. Pyridoxal phosphate is the
coenzyme in all aminotransferases and in many other enzymes catalyzing
amino acid transformations. The α-amino group funnels into
α-ketoglutarate to form glutamate, which is then oxidatively deaminated
by glutamate dehydrogenase to give NH4+ and α-ketoglutarate.
NAD+ or NADP+ is the electron acceptor in this reaction.


23.4 Ammonium Ion Is Converted into Urea in Most Terrestrial Vertebrates
The first step in the synthesis of urea is the formation of carbamoyl
phosphate, which is synthesized from HCO3-, NH3, and two molecules
of ATP by carbamoyl phosphate synthetase. Ornithine is then
carbamoylated to citrulline by ornithine transcarbamoylase. These 
two reactions take place in mitochondria. Citrulline leaves the mito- 
chondrion and condenses with aspartate to form argininosuccinate, 
which is cleaved into arginine and fumarate. The other nitrogen atom 
of urea comes from aspartate. Urea is formed by the hydrolysis of argi- 
nine, which also regenerates ornithine. 


23.5 Carbon Atoms of Degraded Amino Acids Emerge as Major
Metabolic Intermediates
The carbon atoms of degraded amino acids are converted into pyruvate,
acetyl CoA, acetoacetate, or an intermediate of the citric acid
cycle. Most amino acids are solely glucogenic, two are solely ketogenic,
and a few are both ketogenic and glucogenic. Alanine, serine,
cysteine, glycine, threonine, and tryptophan are degraded to pyruvate.
Asparagine and aspartate are converted into oxaloacetate.
α-Ketoglutarate is the point of entry for glutamate and four amino
acids (glutamine, histidine, proline, and arginine) that can be converted
into glutamate. Succinyl CoA is the point of entry for some of
the carbon atoms of three amino acids (methionine, isoleucine, and
valine) that are degraded through the intermediate methylmalonyl
CoA. Leucine is degraded to acetoacetate and acetyl CoA. The breakdown
of valine and isoleucine is like that of leucine. Their α-ketoacid
derivatives are oxidatively decarboxylated by the branched-chain
α-ketoacid dehydrogenase.

The rings of aromatic amino acids are degraded by oxygenases.
Phenylalanine hydroxylase, a monooxygenase, uses tetrahydrobiopterin
as the reductant. One of the oxygen atoms of O2 emerges in
tyrosine and the other in water. Subsequent steps in the degradation
of these aromatic amino acids are catalyzed by dioxygenases, which
catalyze the insertion of both atoms of O2 into organic products. Four
of the carbon atoms of phenylalanine and tyrosine are converted into
fumarate, and four emerge in acetoacetate.


23.6 Inborn Errors of Metabolism Can Disrupt Amino Acid Degradation
Errors in amino acid metabolism were sources of some of the first
insights into the correlation between pathology and biochemistry.
Phenylketonuria is the best known of the many hereditary errors of
amino acid metabolism. This condition results from the accumulation
of high levels of phenylalanine in the body fluids. By unknown
mechanisms, this accumulation leads to mental retardation unless the
afflicted are placed on low-phenylalanine diets immediately after
birth.


Key Terms


ubiquitin (p. 675)
degron (p. 676)
proteasome (p. 677)
aminotransferase (transaminase) (p. 680)
glutamate dehydrogenase (p. 680)
pyridoxal phosphate (PLP) (p. 681)
pyridoxamine phosphate (PMP) (p. 682)
glucose-alanine cycle (p. 685)
urea cycle (p. 685)
carbamoyl phosphate synthetase (p. 686)
N-acetylglutamate (p. 686)
ketogenic amino acid (p. 690)
glucogenic amino acid (p. 690)
biopterin (p. 695)
phenylketonuria (p. 697)


Problems


1. Getting exposure. Proteins are denatured by acid in the
stomach. This denaturation makes them better substrates
for proteolysis. Explain why this is the case.


2. Targeting for destruction. What are the steps required to
attach ubiquitin to a target protein?


3. Not the dating service. Match the description on the right
with the term on the left.


a. Pepsin                              1. Requires an adenylate intermediate
b. N-terminal rule                     2. Marks a protein for destruction
c. Ubiquitin                           3. 19S regulatory subunit
d. PEST sequence                       4. Determines half-life of a protein
e. Threonine nucleophiles              5. 20S core
f. ATP-dependent protein unfolding     6. Substrate for ligase
g. Proteasome                          7. Stomach proteolytic enzyme
h. Ubiquitin-activating enzyme         8. Recognizes protein to be degraded
i. Ubiquitin-conjugating enzyme        9. Protein degrading machine
j. Ubiquitin-ligase                    10. Pro-Glu-Ser-Thr


4. Wasted energy? Protein hydrolysis is an exergonic pro- 
cess, yet the 26S proteasome is dependent on ATP hydroly- 
sis for activity. 


(a) Although the exact function of the ATPase activity is 
not known, suggest some likely functions. 


(b) Small peptides can be hydrolyzed without the expendi- 
ture of ATP. How does this information concur with your 
answer to part a? 


5. Keto counterparts. Name the α-ketoacid that is formed by
the transamination of each of the following amino acids:

(a) Alanine          (d) Leucine
(b) Aspartate        (e) Phenylalanine
(c) Glutamate        (f) Tyrosine


6. A versatile building block. (a) Write a balanced equation 
for the conversion of aspartate into glucose through the 
intermediate oxaloacetate. Which coenzymes participate in 
this transformation? (b) Write a balanced equation for the 
conversion of aspartate into oxaloacetate through the inter- 
mediate fumarate. 


7. The benefits of specialization. The archaeal proteasome
contains 14 identical active β subunits, whereas the eukaryotic
proteasome has 7 distinct β subunits. What are the
potential benefits of having several distinct active subunits?


8. Propose a structure. The 19S subunit of the proteasome 
contains six subunits that are members of the AAA ATPase 
family. Other members of this large family are associated 
into homohexamers with sixfold symmetry. Propose a structure
for the AAA ATPases within the 19S proteasome. How
might you test and refine your prediction? 


9. Effective electron sinks. Pyridoxal phosphate stabilizes
carbanionic intermediates by serving as an electron sink.
Which other prosthetic group catalyzes reactions in this
way?


10. Cooperation. How do aminotransferases and glutamate 
dehydrogenase cooperate in the metabolism of the amino 
group of amino acids? 


11. Taking away the nitrogen. What amino acids yield citric 
acid cycle components and glycolysis intermediates when 
deaminated? 


12. One reaction only. What amino acids can be deaminated 
directly? 


13. Useful products. What are the common features of the 
breakdown products of the carbon skeletons of amino 
acids? 

14. Helping hand. Propose a role for the positively charged 
guanidinium nitrogen atom in the cleavage of argininosuc- 
cinate into arginine and fumarate. 


15. Nitrogen sources. What are the immediate biochemical 
sources for the two nitrogen atoms in urea? 


16. Counterparts. Match the biochemical on the right with 
the property on the left. 


a. Formed from NH₄⁺                1. Aspartate 
b. Hydrolyzed to yield urea        2. Urea 
c. A second source of nitrogen     3. Ornithine 
d. Reacts with aspartate           4. Carbamoyl phosphate 
e. Cleavage yields fumarate        5. Arginine 
f. Accepts the first nitrogen      6. Citrulline 
g. Final product                   7. Argininosuccinate 


17. Line up. Identify structures A—D, and place them in the 
order that they appear in the urea cycle. 

[Structures A–D: figure not reproduced.] 


18. Completing the cycle. Four high-transfer-potential 
phosphoryl groups are consumed in the synthesis of urea 
according to the stoichiometry given on page 687. In this 
reaction, aspartate is converted into fumarate. Suppose that 
fumarate is converted into oxaloacetate. What is the 
resulting stoichiometry of urea synthesis? How many 
high-transfer-potential phosphoryl groups are spent? 


19. A good bet. A friend bets you a bazillion dollars that you 
can’t prove that the urea cycle is linked to the citric acid 
cycle and other metabolic pathways. Can you collect? 


20. Inhibitor design. Compound A has been synthesized as 
a potential inhibitor of a urea-cycle enzyme. Which enzyme 
do you think compound A might inhibit? 


[Structure of compound A: figure not reproduced.] 


21. Ammonia toxicity. Glutamate is an important neu- 
rotransmitter whose levels must be carefully regulated in 
the brain. Explain how a high concentration of ammonia 
might disrupt this regulation. How might a high concentra- 
tion of ammonia alter the citric acid cycle? 


22. A precise diagnosis. The urine of an infant gives a posi- 
tive reaction with 2,4-dinitrophenylhydrazine. Mass spec- 
trometry shows abnormally high blood levels of pyruvate, 
α-ketoglutarate, and the α-ketoacids of valine, isoleucine, 
and leucine. Identify a likely molecular defect and propose 
a definitive test of your diagnosis. 


23. Therapeutic design. How would you treat an infant who 
is deficient in argininosuccinate synthetase? Which mole- 
cules would carry nitrogen out of the body? 


24. Damaged liver. As we will see later (Chapter 27), liver 
damage (cirrhosis) often results in ammonia poisoning. 
Explain why this is the case. 




25. Argininosuccinic aciduria. Argininosuccinic aciduria is 
a condition that results when the urea-cycle enzyme argini- 
nosuccinase is deficient. Argininosuccinate is present in the 
blood and urine. Suggest how this condition might be 
treated while still removing nitrogen from the body. 


26. Sweet hazard. Why should phenylketonurics avoid 
using aspartame, an artificial sweetener? (Hint: Aspartame 
is L-aspartyl-L-phenylalanine methyl ester.) 


27. Déjà vu. N-Acetylglutamate is required as a cofactor 
in the synthesis of carbamoyl phosphate. How might 
N-acetylglutamate be synthesized from glutamate? 


28. Negative nitrogen balance. A deficiency of even one 
amino acid results in a negative nitrogen balance. In this 
state, more protein is degraded than is synthesized, and so 
more nitrogen is excreted than is ingested. Why would pro- 
tein be degraded if one amino acid were missing? 


29. Precursors. Differentiate between ketogenic amino acids 
and glucogenic amino acids. 


30. Closely related. Pyruvate dehydrogenase complex and 
α-ketoglutarate dehydrogenase complex are huge enzymes 
consisting of three discrete enzymatic activities. Which 
amino acids require a related enzyme complex, and what is 
the name of the enzyme? 


31. Supply lines. The carbon skeletons of the 20 common 
amino acids can be degraded into a limited number of end 
products. What are the end products and in what metabolic 
pathway are they commonly found? 


Mechanism Problems 


32. Serine dehydratase. Write out a complete mechanism 
for the conversion of serine into aminoacrylate catalyzed by 
serine dehydratase. 


33. Serine racemase. The nervous system contains a sub- 
stantial amount of D-serine, which is generated from L-serine 
by serine racemase, a PLP-dependent enzyme. Propose a 
mechanism for this reaction. What is the equilibrium constant 
for the reaction L-serine ⇌ D-serine? 


Chapter Integration Problems 


34. Double duty. Degradation signals are commonly located 
in protein regions that also facilitate protein-protein inter- 
actions. Explain why this coexistence of two functions in 
the same domain might be useful. 


35. Fuel choice. Within a few days after a fast begins, nitro- 
gen excretion accelerates to a higher-than-normal level. 
After a few weeks, the rate of nitrogen excretion falls to a 
lower level and continues at this low rate. However, after 
the fat stores have been depleted, nitrogen excretion rises to 


a high level. 


(a) What events trigger the initial surge of nitrogen 
excretion? 


(b) Why does nitrogen excretion fall after several weeks of 
fasting? 


(c) Explain the increase in nitrogen excretion when the 
lipid stores have been depleted. 


36. Isoleucine degradation. Isoleucine is degraded to 
acetyl CoA and succinyl CoA. Suggest a plausible reaction 
sequence, based on reactions discussed in the text, for this 
degradation pathway. 


37. Many roles. Pyridoxal phosphate is an important coen- 
zyme in transamination reactions. We have seen this coen- 
zyme before, in glycogen metabolism. Which enzyme in 
glycogen metabolism requires pyridoxal phosphate and 
what role does the coenzyme play in this enzyme? 


38. Enough cycles to have a race. The glucose–alanine cycle 
is reminiscent of the Cori cycle, but the glucose–alanine 
cycle can be said to be more energy efficient. Explain why 
this is so. 


Data Interpretation Problem 


39. Another helping hand. In eukaryotes, the 20S proteasome 
component in conjunction with the 19S component 
degrades ubiquitinated proteins with the hydrolysis of a 
molecule of ATP. Archaea lack ubiquitin and the 26S proteasome 
but do contain a 20S proteasome. Some archaea 
also contain an ATPase that is homologous to the ATPases 
of the eukaryotic 19S component. This archaeal ATPase 
activity was isolated as a 650-kd complex (called PAN) 
from the archaeon Thermoplasma, and experiments were 
performed to determine if PAN could enhance the activity 
of the 20S proteasome from Thermoplasma as well as other 
20S proteasomes. 


Protein degradation was measured as a function of time 
and in the presence of various combinations of components. 
Graph A shows the results. 

(a) What is the effect of PAN on archaeal proteasome 
activity in the absence of nucleotides? 


(b) What is the nucleotide requirement for protein digestion? 


(c) What evidence suggests that ATP hydrolysis, and not 
just the presence of ATP, is required for digestion? 


A similar experiment was performed with a small peptide as 
a substrate for the proteasome instead of a protein. The 
results obtained are shown in graph B. 


(d) How do the requirements for peptide digestion differ 
from those of protein digestion? 


(e) Suggest some reasons for the difference. 


The ability of PAN from the archaeon Thermoplasma to 
support protein degradation by the 20S proteasomes from 
the archaeon Methanosarcina and rabbit muscle was then 
examined. 


Percentage of digestion of protein substrate (20S proteasome source) 

Additions     Thermoplasma   Methanosarcina   Rabbit muscle 
None          11             10               10 
PAN           8              8                8 
PAN + ATP     100            40               30 
PAN + ADP     12             9                10 

[Data from P. Zwickl, D. Ng, K. M. Woo, H.-P. Klenk, and A. L. Goldberg. An 
archaebacterial ATPase, homologous to ATPase in the eukaryotic 26S proteasome, 
activates protein breakdown by 20S proteasomes. J. Biol. Chem. 
274(1999): 26008-26014.] 


(f) Can the Thermoplasma PAN augment protein digestion 
by the proteasomes from other organisms? 


(g) What is the significance of the stimulation of rabbit 
muscle proteasome by Thermoplasma PAN? 

The Biosynthesis of Amino Acids 


The assembly of biological molecules, including proteins and nucleic 
acids, requires the generation of appropriate starting materials. We have 
already considered the assembly of carbohydrates in discussions of the 
Calvin cycle and the pentose phosphate pathway (Chapter 20). The present 
chapter and the next two examine the assembly of the other important 
building blocks, namely amino acids, nucleotides, and lipids. 

The pathways for the biosynthesis of these molecules are extremely 
ancient, going back to the last common ancestor of all living things. Indeed, 
these pathways probably predate many of the pathways of energy transduc- 
tion discussed in Part II and may have provided key selective advantages in 
early evolution. Many of the intermediates in energy-transduction pathways 
play a role in biosynthesis as well. These common intermediates allow 
efficient interplay between energy-transduction (catabolic) and biosyn- 
thetic (anabolic) pathways. Thus, cells are able to balance the degradation of 
compounds for energy mobilization and the synthesis of starting materials 
for macromolecular construction. 

We begin our consideration of biosynthesis with amino acids, the 
building blocks of proteins and the nitrogen source for many other important 
molecules, including nucleotides, neurotransmitters, and prosthetic groups 
such as porphyrins. Amino acid biosynthesis is intimately connected with 
nutrition because many higher organisms, including human beings, have 
lost the ability to synthesize some amino acids and must therefore obtain 
adequate quantities of these essential amino acids in their diets. Furthermore, 
because some amino acid biosynthetic enzymes are absent in mammals 
but present in plants and microorganisms, they are useful targets for herbicides 
and antibiotics. 

Nitrogen is a key component of amino acids. The 
atmosphere is rich in nitrogen gas (N₂), a very unreactive 
molecule. Certain organisms, such as bacteria that live in the 
root nodules of yellow clover, can convert nitrogen gas into 
ammonia (NH₃), which can then be used to synthesize, first, 
glutamate and then other amino acids. [(Left) Runk/ 
Schoenburg/Grant Heilman Photography.] 

Amino Acids Are Precursors of 
Many Biomolecules 

Anabolism 
Biosynthetic processes. 

Catabolism 
Degradative processes. 
Derived from the Greek ana, “up”; kata, “down”; ballein, “to throw.” 


Amino acid synthesis requires solutions to three key 
biochemical problems 


Nitrogen is an essential component of amino acids. Earth has an abundant 
supply of nitrogen, but it is primarily in the form of atmospheric nitrogen 
gas, a remarkably inert molecule. Thus, a fundamental problem for 
biological systems is to obtain nitrogen in a more usable form. This problem 
is solved by certain microorganisms capable of reducing the inert N≡N 
molecule of nitrogen gas to two molecules of ammonia in one of the most 
amazing reactions in biochemistry. Nitrogen in the form of ammonia is the 
source of nitrogen for all the amino acids. The carbon backbones come from 
the glycolytic pathway, the pentose phosphate pathway, or the citric acid 
cycle. 

In amino acid production, we encounter an important problem in 
biosynthesis—namely, stereochemical control. Because all amino acids 
except glycine are chiral, biosynthetic pathways must generate the correct 
isomer with high fidelity. In each of the 19 pathways for the generation of 
chiral amino acids, the stereochemistry at the α-carbon atom is established 
by a transamination reaction that includes pyridoxal phosphate (PLP). 
Almost all the transaminases that catalyze these reactions descend from a 
common ancestor, illustrating once again that effective solutions to bio- 
chemical problems are retained throughout evolution. 

Biosynthetic pathways are often highly regulated such that building 
blocks are synthesized only when supplies are low. Very often, a high con- 
centration of the final product of a pathway inhibits the activity of enzymes 
that function early in the pathway. Often present are allosteric enzymes 
capable of sensing and responding to concentrations of regulatory species. 
These enzymes are similar in functional properties to aspartate transcar- 
bamylase and its regulators (Section 10.1). Feedback and allosteric mecha- 
nisms ensure that all 20 amino acids are maintained in sufficient amounts 
for protein synthesis and other processes. 


24.1 Nitrogen Fixation: Microorganisms Use ATP and 
a Powerful Reductant to Reduce Atmospheric 
Nitrogen to Ammonia 


The nitrogen in amino acids, purines, pyrimidines, and other biomolecules 
ultimately comes from atmospheric nitrogen, N₂. The biosynthetic process 
starts with the reduction of N₂ to NH₃ (ammonia), a process called 
nitrogen fixation. The extremely strong N≡N bond, which has a bond 
energy of 940 kJ mol⁻¹ (225 kcal mol⁻¹), is highly resistant to chemical 
attack. Indeed, Antoine Lavoisier named nitrogen gas “azote,” from Greek 
words meaning “without life,” because it is so unreactive. Nevertheless, the 
conversion of nitrogen and hydrogen to form ammonia is thermodynamically 
favorable; the reaction is difficult kinetically because intermediates 
along the reaction pathway are unstable. 

Although higher organisms are unable to fix nitrogen, this conversion is 
carried out by some bacteria and archaea. Symbiotic Rhizobium bacteria 
invade the roots of leguminous plants and form root nodules in which they 


fix nitrogen, supplying both the bacteria and the plants. The importance of 
nitrogen fixation by diazotrophic (nitrogen-fixing) microorganisms to the 
metabolism of all higher eukaryotes cannot be overstated: the amount of N₂ 
fixed by these species has been estimated to be 10¹¹ kilograms per year, 
about 60% of Earth’s newly fixed nitrogen. Lightning and ultraviolet radiation 
fix another 15%; the other 25% is fixed by industrial processes. The 
industrial process for nitrogen fixation devised by Fritz Haber in 1910 is 
still being used in fertilizer factories. 


N₂ + 3 H₂ → 2 NH₃ 


The fixation of N₂ is typically carried out by mixing with 
H₂ gas over an iron catalyst at about 500°C and a pressure 
of 300 atmospheres. 
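
The percentages quoted above can be turned into rough absolute amounts. The short Python sketch below assumes that the 10¹¹ kg figure corresponds exactly to the 60% biological share; the variable names are illustrative only and the results are order-of-magnitude estimates.

```python
# Rough estimate of annual nitrogen fixation by source, assuming the
# biological figure quoted in the text (1e11 kg per year) is exactly
# 60% of all newly fixed nitrogen.
biological_kg = 1e11
shares = {"biological": 0.60, "lightning_and_uv": 0.15, "industrial": 0.25}

total_kg = biological_kg / shares["biological"]
fixed_kg = {source: share * total_kg for source, share in shares.items()}

for source, kg in fixed_kg.items():
    print(f"{source}: {kg:.2e} kg per year")
```

Under this assumption, the total comes to somewhat less than 2 × 10¹¹ kg per year.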
To meet the kinetic challenge, the biological process of 
nitrogen fixation requires a complex enzyme with multiple 
redox centers. The nitrogenase complex, which carries out 
this fundamental transformation, consists of two proteins: 
a reductase, which provides electrons with high reducing 
power, and nitrogenase, which uses these electrons to 
reduce N₂ to NH₃. The transfer of electrons from the 
reductase to the nitrogenase component is coupled to the 
hydrolysis of ATP by the reductase (Figure 24.1). The 
nitrogenase complex is exquisitely sensitive to inactivation 
by O₂. Leguminous plants maintain a very low concentration 
of free O₂ in their root nodules by binding O₂ to 
leghemoglobin, a homolog of hemoglobin. 

Figure 24.1 Nitrogen fixation. Electrons flow from ferredoxin to 
the reductase (iron protein, or Fe protein) to nitrogenase 
(molybdenum–iron protein, or MoFe protein) to reduce nitrogen to 
ammonia. ATP hydrolysis within the reductase drives conformational 
changes necessary for the efficient transfer of electrons. 
[Diagram labels: electrons from reduced ferredoxin; reductase (Fe protein); 
ATP; ADP; nitrogenase (MoFe protein).] 

In principle, the reduction of N₂ to NH₃ is a six-electron process. 


N₂ + 6 e⁻ + 6 H⁺ → 2 NH₃ 


However, the biological reaction always generates at least 1 mol of H₂ in 
addition to 2 mol of NH₃ for each mol of N≡N. Hence, an input of two 
additional electrons is required. 


N₂ + 8 e⁻ + 8 H⁺ → 2 NH₃ + H₂ 


In most nitrogen-fixing microorganisms, the eight high-potential electrons 
come from reduced ferredoxin, generated by photosynthesis or oxidative 
processes. Two molecules of ATP are hydrolyzed for each electron transferred. 
Thus, at least 16 molecules of ATP are hydrolyzed for each molecule of N₂ 
reduced. 


N₂ + 8 e⁻ + 8 H⁺ + 16 ATP + 16 H₂O → 
2 NH₃ + H₂ + 16 ADP + 16 Pᵢ 


Again, ATP hydrolysis is not required to make nitrogen reduction favorable 
thermodynamically. Rather, it is essential to reduce the heights of activation 
barriers along the reaction pathway, thus making the reaction kinetically 
feasible. 
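
The electron and ATP bookkeeping above can be checked mechanically. The Python sketch below is only an accounting aid, not a model of the enzyme: it confirms that the six-electron and eight-electron half-reactions balance in atoms and charge, and multiplies the eight electrons by the two ATP per electron given in the text. The species table and function names are illustrative assumptions.

```python
from collections import Counter

# Each species is (element counts, charge). An electron carries charge -1
# and no atoms. Names and layout are illustrative, not from the text.
SPECIES = {
    "N2":  (Counter({"N": 2}), 0),
    "H+":  (Counter({"H": 1}), +1),
    "e-":  (Counter(), -1),
    "NH3": (Counter({"N": 1, "H": 3}), 0),
    "H2":  (Counter({"H": 2}), 0),
}

def totals(side):
    """Sum atoms and charge for one side of a reaction, {species: coefficient}."""
    atoms, charge = Counter(), 0
    for name, coeff in side.items():
        species_atoms, species_charge = SPECIES[name]
        for element, n in species_atoms.items():
            atoms[element] += coeff * n
        charge += coeff * species_charge
    return atoms, charge

def is_balanced(reactants, products):
    """True when both sides carry the same atoms and the same net charge."""
    return totals(reactants) == totals(products)

six_electron = ({"N2": 1, "e-": 6, "H+": 6}, {"NH3": 2})
eight_electron = ({"N2": 1, "e-": 8, "H+": 8}, {"NH3": 2, "H2": 1})

ATP_PER_ELECTRON = 2  # stated in the text
atp_per_n2 = ATP_PER_ELECTRON * eight_electron[0]["e-"]

print(is_balanced(*six_electron), is_balanced(*eight_electron), atp_per_n2)
```

The two extra electrons in the biological reaction account exactly for the one H₂ released alongside the two NH₃.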


The iron–molybdenum cofactor of nitrogenase binds and reduces 
atmospheric nitrogen 


Both the reductase and the nitrogenase components of the complex are 
iron–sulfur proteins, in which iron is bonded to the sulfur atom of a cysteine 
residue and to inorganic sulfide. Recall that iron–sulfur clusters act as electron 
carriers (Section 18.3). The reductase (also called the iron protein or the Fe 
protein) is a dimer of identical 30-kd subunits bridged by a 4Fe-4S cluster 
(Figure 24.2). 


Figure 24.2 Fe protein. This protein is 
a dimer composed of two polypeptide chains 
linked by a 4Fe-4S cluster. Notice that each 
monomer is a member of the P-loop NTPase 
family and contains an ATP-binding site. 
[Drawn from 1N2C.pdb.] 

The role of the reductase is to transfer electrons from a suitable donor, 
such as reduced ferredoxin, to the nitrogenase component. The 4Fe-4S 
cluster carries the electrons, one at a time, to nitrogenase. The binding and 
hydrolysis of ATP triggers a conformational change that moves the 
reductase closer to the nitrogenase component, whence it is able to transfer 
its electron to the center of nitrogen reduction. The structure of the ATP- 
binding region reveals it to be a member of the P-loop NTPase family 
(Section 9.4) that is clearly related to the nucleotide-binding regions found 
in G proteins and related proteins. Thus, we see another example of how 
this domain has been recruited in evolution because of its ability to couple 
nucleoside triphosphate hydrolysis to conformational changes. 

The nitrogenase component is an α₂β₂ tetramer (240 kd), in which the α 
and β subunits are homologous to each other and structurally quite similar 
(Figure 24.3). Because molybdenum is present in its FeMo cofactor, the nitrogenase 
component is also called the molybdenum–iron protein (MoFe protein). The 
FeMo cofactor consists of two M-3Fe-3S clusters, in which molybdenum 
occupies the M site in one cluster and iron occupies it in the other. The two 
clusters are joined by three sulfide ions and a central atom, the identity of 
which has not yet been conclusively established. The FeMo cofactor is also 
coordinated to a homocitrate moiety and to the α subunit through one 
histidine residue and one cysteinate residue. This cofactor is distinct from 
those of apparently all other molybdenum-containing enzymes. 

Electrons from the reductase enter at the P clusters, which are located at 
the α–β interface. The role of the P clusters is to store electrons until they 
can be used productively to reduce nitrogen at the FeMo cofactor. The 
FeMo cofactor is the site of nitrogen fixation. One face of the FeMo cofactor 
is likely to be the site of nitrogen reduction. The electron-transfer reactions 
from the P cluster take place in concert with the binding of hydrogen ions to 
nitrogen as it is reduced. Further studies are under way to elucidate the 
mechanism of this remarkable reaction. 

Figure 24.3 MoFe protein. This protein is a heterotetramer composed of two α subunits 
(red) and two β subunits (blue). Notice that the protein contains two copies each of two types of 
clusters: P clusters and FeMo cofactors. Each P cluster contains eight iron atoms (green) and seven 
sulfides linked to the protein by six cysteinate residues. Each FeMo cofactor contains one 
molybdenum atom, seven iron atoms, nine sulfides, a central atom, and a homocitrate, and is 
linked to the protein by one cysteinate residue and one histidine residue. [Drawn from 1M1N.pdb.] 


Ammonium ion is assimilated into an amino acid 
through glutamate and glutamine 


The next step in the assimilation of nitrogen into biomolecules is the entry 
of NH₄⁺ into amino acids. The amino acids glutamate and glutamine play 
pivotal roles in this regard, acting as nitrogen donors for most amino acids. 
The α-amino group of most amino acids comes from the α-amino group of 
glutamate by transamination (Section 23.3). Glutamine, the other major 
nitrogen donor, contributes its side-chain nitrogen atom in the biosynthesis 
of a wide range of important compounds, including the amino acids 
tryptophan and histidine. 

Glutamate is synthesized from NH₄⁺ and α-ketoglutarate, a citric acid 
cycle intermediate, by the action of glutamate dehydrogenase. We have 
already encountered this enzyme in the degradation of amino acids 
(Section 23.3). Recall that NAD⁺ is the oxidant in catabolism, whereas 
NADPH is the reductant in biosyntheses. Glutamate dehydrogenase is 
unusual in that it does not discriminate between NADH and NADPH, at 
least in some species. 


NH₄⁺ + α-ketoglutarate + NADPH + H⁺ ⇌ 
glutamate + NADP⁺ + H₂O 


The reaction proceeds in two steps. First, a Schiff base forms between 
ammonia and α-ketoglutarate. The formation of a Schiff base between an 
amine and a carbonyl compound is a key reaction that takes place at many 
stages of amino acid biosynthesis and degradation. 


[Reaction scheme: a carbonyl compound (R₁–CO–R₂) reacts with an amino donor 
(R₃–NH₂) to give a Schiff base and H₂O; addition of H⁺ gives the protonated 
Schiff base.] 


Schiff bases are easily protonated. In the second step, the protonated Schiff 
base is reduced by the transfer of a hydride ion from NADPH to form 
glutamate. 


[Reaction scheme: α-ketoglutarate is converted into a protonated Schiff base, 
which is reduced by NAD(P)H to glutamate, with formation of NAD(P)⁺.] 


This reaction is crucial because it establishes the stereochemistry of the 
α-carbon atom (S absolute configuration) in glutamate. The enzyme binds 
the α-ketoglutarate substrate in such a way that hydride transferred from 
NAD(P)H is added to form the L isomer of glutamate (Figure 24.4). As 
we shall see, this stereochemistry is established for other amino acids by 
transamination reactions that rely on pyridoxal phosphate. 

A second ammonium ion is incorporated into glutamate to form 
glutamine by the action of glutamine synthetase. This amidation is driven by 
the hydrolysis of ATP. ATP participates directly in the reaction by 
phosphorylating the side chain of glutamate to form an acyl-phosphate 
intermediate, which then reacts with ammonia to form glutamine. 


[Reaction scheme: the acyl-phosphate intermediate reacts with NH₃ to give 
glutamine.] 


Figure 24.4 Establishment of chirality. In 
the active site of glutamate dehydrogenase, 
hydride transfer (green) from NAD(P)H to a 
specific face of the achiral protonated Schiff 
base of α-ketoglutarate establishes the 
L configuration of glutamate. 

A high-affinity ammonia-binding site is formed in the enzyme only 
after the formation of the acyl-phosphate intermediate. A specific site for 
ammonia binding is required to prevent attack by water from hydrolyz- 
ing the intermediate and wasting a molecule of ATP. The regulation of 
glutamine synthetase plays a critical role in controlling nitrogen metabo- 
lism (Section 24.3). 

Glutamate dehydrogenase and glutamine synthetase are present in 
all organisms. Most prokaryotes also contain an evolutionarily unrelated 
enzyme, glutamate synthase, which catalyzes the reductive amination of 
α-ketoglutarate to glutamate. Glutamine is the nitrogen donor. 


α-Ketoglutarate + glutamine + NADPH + H⁺ ⇌ 
2 glutamate + NADP⁺ 


The side-chain amide of glutamine is hydrolyzed to generate ammonia 
within the enzyme, a recurring theme throughout nitrogen metabolism. 
When NH₄⁺ is limiting, most of the glutamate is made by the sequential 
action of glutamine synthetase and glutamate synthase. The sum of these 
reactions is 


NH₄⁺ + α-ketoglutarate + NADPH + ATP → 
glutamate + NADP⁺ + ADP + Pᵢ 
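
The summed equation can be reproduced by adding the two enzyme reactions and cancelling species that appear on both sides. The Python sketch below does this with simple coefficient bookkeeping. The glutamate synthase reaction is taken from the text; the glutamine synthetase stoichiometry (glutamate + NH₄⁺ + ATP → glutamine + ADP + Pᵢ + H⁺) is not written out in this section and is supplied here as an assumption, as are all function and variable names.

```python
from collections import Counter

def net_reaction(*reactions):
    """Add reactions given as (reactants, products) dicts and cancel
    any species that appears on both sides of the sum."""
    net = Counter()
    for reactants, products in reactions:
        for name, coeff in reactants.items():
            net[name] -= coeff
        for name, coeff in products.items():
            net[name] += coeff
    consumed = {name: -c for name, c in net.items() if c < 0}
    produced = {name: c for name, c in net.items() if c > 0}
    return consumed, produced

# Assumed stoichiometry for glutamine synthetase (not stated in this section).
glutamine_synthetase = (
    {"glutamate": 1, "NH4+": 1, "ATP": 1},
    {"glutamine": 1, "ADP": 1, "Pi": 1, "H+": 1},
)
# Glutamate synthase, as written in the text.
glutamate_synthase = (
    {"alpha-ketoglutarate": 1, "glutamine": 1, "NADPH": 1, "H+": 1},
    {"glutamate": 2, "NADP+": 1},
)

consumed, produced = net_reaction(glutamine_synthetase, glutamate_synthase)
print("consumed:", consumed)
print("produced:", produced)
```

Glutamine and H⁺ cancel, and one of the two glutamate molecules produced replaces the one consumed, leaving a net gain of a single glutamate at the cost of one ATP.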


Note that this stoichiometry differs from that of the glutamate 
dehydrogenase reaction in that ATP is hydrolyzed. Why do prokaryotes sometimes 
use this more expensive pathway? The answer is that the value of K_M of 
glutamate dehydrogenase for NH₄⁺ is high (~1 mM), and so this enzyme 
is not saturated when NH₄⁺ is limiting. In contrast, glutamine synthetase 
has very high affinity for NH₄⁺. Thus, ATP hydrolysis is required to 
capture ammonia when it is scarce. 
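
The consequence of a high K_M can be illustrated with the Michaelis–Menten expression v/V_max = [S]/(K_M + [S]). In the Python sketch below, the 1 mM value for glutamate dehydrogenase comes from the text. The 0.02 mM value for glutamine synthetase is a hypothetical placeholder standing in for "very high affinity", and treating both enzymes as simple Michaelis–Menten catalysts is itself an assumption.

```python
def fractional_saturation(substrate_mM, km_mM):
    """Michaelis-Menten fraction of maximal velocity: [S] / (K_M + [S])."""
    return substrate_mM / (km_mM + substrate_mM)

KM_GDH_MM = 1.0   # ~1 mM for NH4+, from the text
KM_GS_MM = 0.02   # hypothetical value, chosen only to represent high affinity

for nh4_mM in (0.01, 0.1, 1.0, 10.0):
    gdh = fractional_saturation(nh4_mM, KM_GDH_MM)
    gs = fractional_saturation(nh4_mM, KM_GS_MM)
    print(f"[NH4+] = {nh4_mM:5.2f} mM   GDH {gdh:.2f}   GS {gs:.2f}")
```

At NH₄⁺ concentrations far below 1 mM, the dehydrogenase runs at a small fraction of its maximal rate, whereas an enzyme with a much lower K_M remains substantially active.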


24.2 Amino Acids Are Made from Intermediates of the 
Citric Acid Cycle and Other Major Pathways 


Thus far, we have considered the conversion of N₂ into NH₄⁺ and the 
assimilation of NH₄⁺ into glutamate and glutamine. We turn now to the 
biosynthesis of the other amino acids, the majority of which obtain their nitrogen 
from glutamate or glutamine. The pathways for the biosynthesis of amino 
acids are diverse. However, they have an important common feature: their 
carbon skeletons come from intermediates of glycolysis, the pentose phosphate 
pathway, or the citric acid cycle. On the basis of these starting materials, amino 
acids can be grouped into six biosynthetic families (Figure 24.5). 

Figure 24.5 Biosynthetic families of amino acids in bacteria and plants. Major metabolic 
precursors are shaded blue. Amino acids that give rise to other amino acids are shaded yellow. 
Essential amino acids are in boldface type. 


Human beings can synthesize some amino acids but 
must obtain others from the diet 


Most microorganisms, such as E. coli, can synthesize the entire basic set of 
20 amino acids, whereas human beings cannot make 9 of them. The amino 
acids that must be supplied in the diet are called essential amino acids, 
whereas the others are termed nonessential amino acids (Table 24.1). These 
designations refer to the needs of an organism under a particular set of 
conditions. For example, enough arginine is synthesized by the urea cycle 
to meet the needs of an adult but perhaps not those of a growing child. A 
deficiency of even one amino acid results in a negative nitrogen balance. In 
this state, more protein is degraded than is synthesized, and so more nitro- 
gen is excreted than is ingested. 
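
The classification in Table 24.1 lends itself to a simple lookup. The Python sketch below encodes the two columns of the table as sets and reports which essential amino acids a given supply lacks; the function name and the example supply are hypothetical, and the sketch ignores the condition-dependent cases (arginine, tyrosine) discussed in the text.

```python
# The two columns of Table 24.1 (human beings).
NONESSENTIAL = {
    "alanine", "arginine", "asparagine", "aspartate", "cysteine", "glutamate",
    "glutamine", "glycine", "proline", "serine", "tyrosine",
}
ESSENTIAL = {
    "histidine", "isoleucine", "leucine", "lysine", "methionine",
    "phenylalanine", "threonine", "tryptophan", "valine",
}

def missing_essential(dietary_amino_acids):
    """Return, in alphabetical order, the essential amino acids absent from the diet."""
    return sorted(ESSENTIAL - set(dietary_amino_acids))

# Hypothetical diet that supplies every essential amino acid except lysine.
example_diet = ESSENTIAL - {"lysine"}
print(missing_essential(example_diet))
```

Any nonempty result corresponds to the situation described above, in which the lack of even one amino acid leads to a negative nitrogen balance.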

The nonessential amino acids are synthesized by quite simple reactions, 
whereas the pathways for the formation of the essential amino acids are 
quite complex. For example, the nonessential amino acids alanine and 
aspartate are synthesized in a single step from pyruvate and oxaloacetate, 
respectively. In contrast, the pathways for the essential amino acids require 
from 5 to 16 steps (Figure 24.6). The sole exception to this pattern is 
arginine, inasmuch as the synthesis of this nonessential amino acid de novo 
requires 10 steps. Typically, though, it is made in only 3 steps from ornithine 
as part of the urea cycle. Tyrosine, classified as a nonessential amino acid 
because it can be synthesized in 1 step from phenylalanine, requires 10 steps 
to be synthesized from scratch and is essential if phenylalanine is not 
abundant. We begin with the biosynthesis of nonessential amino acids. 


Table 24.1 Basic set of 20 amino acids 

Nonessential     Essential 
Alanine          Histidine 
Arginine         Isoleucine 
Asparagine       Leucine 
Aspartate        Lysine 
Cysteine         Methionine 
Glutamate        Phenylalanine 
Glutamine        Threonine 
Glycine          Tryptophan 
Proline          Valine 
Serine 
Tyrosine 


[Figure 24.6: bar chart of the number of amino acids (nonessential and 
essential) against the number of steps in the pathway, 1 to 16.] 

Figure 24.6 Essential and nonessential 
amino acids. Some amino acids are 
nonessential to human beings because they 
can be biosynthesized in a small number of 
steps. Those amino acids requiring a large 
number of steps for their synthesis are 
essential in the diet because some of the 
enzymes for these steps have been lost in the 
course of evolution. 


Figure 24.7 Amino acid biosynthesis by 
transamination. (1) Within a transaminase, 
the internal aldimine is converted into 
pyridoxamine phosphate (PMP) by reaction 
with glutamate in a multistep process not 
shown. (2) PMP then reacts with an 
α-ketoacid to generate a ketimine. (3) This 
intermediate is converted into a quinonoid 
intermediate (4), which in turn yields an 
external aldimine. (5) The aldimine is cleaved 
to release the newly formed amino acid to 
complete the cycle. 


Aspartate, alanine, and glutamate are formed by the 
addition of an amino group to an alpha-ketoacid 


Three α-ketoacids (α-ketoglutarate, oxaloacetate, and pyruvate) can be 
converted into amino acids in one step through the addition of an amino 
group. We have seen that α-ketoglutarate can be converted into glutamate 
by reductive amination (p. 709). The amino group from glutamate can be 
transferred to other α-ketoacids by transamination reactions. Thus, aspartate 
and alanine can be made from the addition of an amino group to 
oxaloacetate and pyruvate, respectively. 


Oxaloacetate + glutamate ⇌ aspartate + α-ketoglutarate 
Pyruvate + glutamate ⇌ alanine + α-ketoglutarate 


These reactions are carried out by pyridoxal phosphate-dependent transami- 
nases. Transamination reactions are required for the synthesis of most 
amino acids. 
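
The pairing of each α-ketoacid with its amino acid can be written as a small lookup. The Python sketch below encodes only the three pairs named in this section and returns both products of a transamination with glutamate as the default donor; the function name and data layout are illustrative assumptions.

```python
# Alpha-ketoacid -> amino acid pairs named in this section.
KETOACID_TO_AMINO_ACID = {
    "alpha-ketoglutarate": "glutamate",
    "oxaloacetate": "aspartate",
    "pyruvate": "alanine",
}

def transaminate(ketoacid, donor="glutamate"):
    """Return (amino acid formed from the ketoacid, ketoacid left from the donor)."""
    amino_to_keto = {aa: keto for keto, aa in KETOACID_TO_AMINO_ACID.items()}
    return KETOACID_TO_AMINO_ACID[ketoacid], amino_to_keto[donor]

print(transaminate("oxaloacetate"))  # aspartate and alpha-ketoglutarate
print(transaminate("pyruvate"))      # alanine and alpha-ketoglutarate
```

Because the lookup is symmetric, the same table also describes the reverse direction, in which an amino acid gives up its amino group and leaves its ketoacid.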

In Section 23.3, we considered the mechanism of transaminases as 
applied to the metabolism of amino acids. Let us review the transaminase 
mechanism as it operates in the biosynthesis of amino acids (see Figure 23.11). 
The reaction pathway begins with pyridoxal phosphate in a Schiff-base 
linkage with lysine at the transaminase active site, forming an internal aldimine 
(Figure 24.7). An amino group is transferred from glutamate to form 
pyridoxamine phosphate (PMP), the actual amino donor, in a multistep process. 
PMP then reacts with an incoming α-ketoacid to form a ketimine. Proton 
loss forms a quinonoid intermediate that then accepts a proton at a different 
site to form an external aldimine. The newly formed amino acid is released 
with the concomitant formation of the internal aldimine. 
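
The order of intermediates in this cycle can be kept straight with a small ordered list. The Python sketch below simply encodes the sequence described above and in Figure 24.7 and steps through it; it is a mnemonic, not a chemical model, and the names are illustrative.

```python
# Cofactor states in the order given in the text; the cycle closes when the
# amino acid is released and the internal aldimine re-forms.
TRANSAMINASE_CYCLE = [
    "internal aldimine",
    "pyridoxamine phosphate (PMP)",
    "ketimine",
    "quinonoid intermediate",
    "external aldimine",
]

def next_state(state):
    """Return the state that follows `state`, wrapping around at the end."""
    index = TRANSAMINASE_CYCLE.index(state)
    return TRANSAMINASE_CYCLE[(index + 1) % len(TRANSAMINASE_CYCLE)]

state = "internal aldimine"
for _ in range(len(TRANSAMINASE_CYCLE)):
    following = next_state(state)
    print(f"{state} -> {following}")
    state = following
```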


A common step determines the chirality of all amino acids 


Aspartate aminotransferase is the prototype of a large 

family of PLP-dependent enzymes. Comparisons of 
amino acid sequences as well as several three-dimensional 
structures reveal that almost all transaminases having roles in 
amino acid biosynthesis are related to aspartate aminotrans- 
ferase by divergent evolution. An examination of the aligned 
amino acid sequences reveals that two residues are completely 
conserved. These residues are the lysine residue that forms the 
Schiff base with the PLP cofactor (lysine 258 in aspartate ami- 
notransferase) and an arginine residue that interacts with the 
a-carboxylate group of the ketoacid (see Figure 23.12). 

An essential step in the transamination reaction is the protonation of 
the quinonoid intermediate to form the external aldimine. The chirality of 
the amino acid formed is determined by the direction from which this proton is 
added to the quinonoid form (Figure 24.8). The interaction between the 
conserved arginine residue and the α-carboxylate group helps orient the
substrate so that the lysine residue transfers a proton to the bottom face of
the quinonoid intermediate, generating an aldimine with an L configuration
at the Cα center.


The formation of asparagine from aspartate requires an
adenylated intermediate

The formation of asparagine from aspartate is chemically analogous to the
formation of glutamine from glutamate. Both transformations are amidation
reactions and both are driven by the hydrolysis of ATP. The actual
reactions are different, however. In bacteria, the reaction for asparagine
synthesis is


Aspartate + NH₄⁺ + ATP → asparagine + AMP + PPᵢ + H⁺

Thus, the products of ATP hydrolysis are AMP and PPᵢ rather than ADP
and Pᵢ. Aspartate is activated by adenylation rather than by phosphorylation.


We have encountered this mode of activation in fatty acid degradation and
will see it again in lipid and protein synthesis. 
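
Written out as two half-reactions, the adenylation route can be sketched as follows. The name β-aspartyl-AMP for the activated intermediate is not given in the text above; it follows the usual description of asparagine synthetase and should be read as an added label. The two steps sum to the overall bacterial reaction already stated.

```latex
\begin{align*}
\text{Aspartate} + \text{ATP} &\longrightarrow \beta\text{-aspartyl-AMP} + \text{PP}_i \\
\beta\text{-aspartyl-AMP} + \text{NH}_4^+ &\longrightarrow \text{asparagine} + \text{AMP} + \text{H}^+ \\[4pt]
\text{Sum:}\quad \text{Aspartate} + \text{NH}_4^+ + \text{ATP} &\longrightarrow \text{asparagine} + \text{AMP} + \text{PP}_i + \text{H}^+
\end{align*}
```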

In mammals, the nitrogen donor for asparagine is glutamine rather than
ammonia as in bacteria. Ammonia is generated by hydrolysis of the side
chain of glutamine and directly transferred to activated aspartate, bound in
the active site. An advantage is that the cell is not directly exposed to NH₄⁺,
which is toxic at high levels to human beings and other mammals. The use
of glutamine hydrolysis as a mechanism for generating ammonia for use within
the same enzyme is a motif common throughout biosynthetic pathways.

Figure 24.8 Stereochemistry of proton
addition. In a transaminase active site, the
addition of a proton from the lysine residue to
the bottom face of the quinonoid
intermediate determines the L configuration of
the amino acid product. The conserved
arginine residue interacts with the
α-carboxylate group and helps establish the
appropriate geometry of the quinonoid
intermediate.

The synthesis of glutamate by the reductive amination of α-ketoglutarate
has already been discussed, as has the conversion of glutamate into glutamine
(p. 710). Glutamate is the precursor of two other nonessential amino
acids: proline and arginine. First, the γ-carboxyl group of glutamate reacts
with ATP to form an acyl phosphate. This mixed anhydride is then reduced
by NADPH to an aldehyde.
[Reaction scheme: glutamate → acyl-phosphate intermediate → glutamic γ-semialdehyde; the reduction step is labeled H⁺ + NADPH → NADP⁺ + Pᵢ.]


Glutamic γ-semialdehyde cyclizes with a loss of H₂O in a nonenzymatic
process to give Δ¹-pyrroline 5-carboxylate, which is reduced by NADPH
to proline. Alternatively, the semialdehyde can be transaminated to ornithine,
which is converted in several steps into arginine (see Figure 23.17).


3-Phosphoglycerate is the precursor of serine, cysteine, and glycine


Serine is synthesized from 3-phosphoglycerate, an intermediate in glycolysis.
The first step is an oxidation to 3-phosphohydroxypyruvate. This
α-ketoacid is transaminated to 3-phosphoserine, which is then hydrolyzed
to serine.

[Reaction scheme: 3-phosphoglycerate → 3-phosphohydroxypyruvate (NAD⁺ → NADH + H⁺) → 3-phosphoserine (glutamate → α-ketoglutarate) → serine]


Serine is the precursor of cysteine and glycine. As we shall see, the conversion
of serine into cysteine requires the substitution of a sulfur atom derived
from methionine for the side-chain oxygen atom. In the formation of
glycine, the side-chain methylene group of serine is transferred to
tetrahydrofolate, a carrier of one-carbon units that will be discussed shortly.


Serine + tetrahydrofolate → glycine + N⁵,N¹⁰-methylenetetrahydrofolate + H₂O


This interconversion is catalyzed by serine hydroxymethyltransferase, a PLP
enzyme that is homologous to aspartate aminotransferase. The formation of
the Schiff base of serine renders the bond between its α- and β-carbon
atoms susceptible to cleavage, enabling the transfer of the β-carbon to
tetrahydrofolate and producing the Schiff base of glycine.


Tetrahydrofolate carries activated one-carbon units at 
several oxidation levels 


Tetrahydrofolate (also called tetrahydropteroylglutamate) is a highly versatile 
carrier of activated one-carbon units. This cofactor consists of three groups: 
a substituted pteridine, p-aminobenzoate, and a chain of one or more gluta- 
mate residues (Figure 24.9). Mammals can synthesize the pteridine ring, 
but they are unable to conjugate it to the other two units. They obtain tetra- 
hydrofolate from their diets or from microorganisms in their intestinal tracts. 


[Structure of tetrahydrofolate, with its three units labeled: pteridine, p-aminobenzoate, and glutamate.]


The one-carbon group carried by tetrahydrofolate is bonded to its N-5
or N-10 nitrogen atom (denoted as N⁵ and N¹⁰) or to both. This unit can
exist in three oxidation states (Table 24.2). The most-reduced form carries
a methyl group, whereas the intermediate form carries a methylene group.
More-oxidized forms carry a formyl, formimino, or methenyl group. The
fully oxidized one-carbon unit, CO₂, is carried by biotin rather than by
tetrahydrofolate.
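
The grouping into three oxidation levels can be checked with formal oxidation numbers. The sketch below is an illustration only: it assumes the usual bookkeeping rule (each bond from carbon to H counts -1, each bond to N or O counts +1, a double bond counting twice), and the bond counts in the table are read off the standard structures of each group rather than taken from the text.

```python
def carbon_oxidation_number(bonds_to_h: int, bonds_to_n_or_o: int) -> int:
    """Formal oxidation number of a carbon atom that has no C-C bonds."""
    if bonds_to_h + bonds_to_n_or_o != 4:
        raise ValueError("carbon must have four bonds in total")
    return bonds_to_n_or_o - bonds_to_h


# name: (bonds to H, bonds to N or O, counting a double bond as two)
ONE_CARBON_UNITS = {
    "methyl (N5-CH3)": (3, 1),
    "methylene (N5-CH2-N10)": (2, 2),
    "formyl (N-CHO)": (1, 3),
    "formimino (N-CH=NH)": (1, 3),
    "methenyl (N5-CH=N10)": (1, 3),
    "CO2 (carried by biotin)": (0, 4),
}

OXIDATION_NUMBERS = {
    name: carbon_oxidation_number(h, x) for name, (h, x) in ONE_CARBON_UNITS.items()
}

for name, value in OXIDATION_NUMBERS.items():
    print(f"{name:26s} {value:+d}")
```

Under this rule the formyl, formimino, and methenyl groups all come out at the same value, which is why they are treated as a single oxidation level, and CO₂ sits one level higher still.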

The one-carbon units carried by tetrahydrofolate are interconvertible
(Figure 24.10). N⁵,N¹⁰-Methylenetetrahydrofolate can be reduced to
N⁵-methyltetrahydrofolate or oxidized to N⁵,N¹⁰-methenyltetrahydrofolate.
N⁵,N¹⁰-Methenyltetrahydrofolate can be converted into
N⁵-formiminotetrahydrofolate or N¹⁰-formyltetrahydrofolate, both of which
are at the same oxidation level. N¹⁰-Formyltetrahydrofolate can also be
synthesized from tetrahydrofolate, formate, and ATP.

Formate + ATP + tetrahydrofolate → N¹⁰-formyltetrahydrofolate + ADP + Pᵢ

N⁵-Formyltetrahydrofolate can be reversibly isomerized to
N¹⁰-formyltetrahydrofolate or it can be converted into
N⁵,N¹⁰-methenyltetrahydrofolate.

Figure 24.9 Tetrahydrofolate. This cofactor includes three components: a
pteridine ring, p-aminobenzoate, and one or more glutamate residues.

Table 24.2 One-carbon groups carried by tetrahydrofolate

Oxidation state                   Group formula   Group name
Most reduced (= methanol)         -CH₃            Methyl
Intermediate (= formaldehyde)     -CH₂-           Methylene
Most oxidized (= formic acid)     -CHO            Formyl
                                  -CHNH           Formimino
                                  -CH=            Methenyl

Figure 24.10 Conversions of one-carbon units attached to tetrahydrofolate.

These tetrahydrofolate derivatives serve as donors of one-carbon units in a
variety of biosyntheses. Methionine is regenerated from homocysteine by
transfer of the methyl group of N⁵-methyltetrahydrofolate, as will be discussed
shortly. We shall see in Chapter 25 that some of the carbon atoms of
purines are acquired from derivatives of N¹⁰-formyltetrahydrofolate. The
methyl group of thymine, a pyrimidine, comes from
N⁵,N¹⁰-methylenetetrahydrofolate. This tetrahydrofolate derivative can also donate
a one-carbon unit in an alternative synthesis of glycine that starts with CO₂
and NH₄⁺, a reaction catalyzed by glycine synthase (called the glycine cleavage
enzyme when it operates in the reverse direction).


CO₂ + NH₄⁺ + N⁵,N¹⁰-methylenetetrahydrofolate + NADH ⇌
glycine + tetrahydrofolate + NAD⁺


Thus, one-carbon units at each of the three oxidation levels are utilized 
in biosyntheses. Furthermore, tetrahydrofolate serves as an acceptor of one- 
carbon units in degradative reactions. The major source of one-carbon units 
is the facile conversion of serine into glycine by serine hydroxymethyltransferase
(p. 715), which yields N⁵,N¹⁰-methylenetetrahydrofolate. Serine can
be derived from 3-phosphoglycerate, and so this pathway enables one-carbon 
units to be formed de novo from carbohydrates. 


S-Adenosylmethionine is the major donor of methyl groups 


Tetrahydrofolate can carry a methyl group on its N-5 atom, but its transfer
potential is not sufficiently high for most biosynthetic methylations. Rather,
the activated methyl donor is usually S-adenosylmethionine (SAM), which is
synthesized by the transfer of an adenosyl group from ATP to the sulfur
atom of methionine.

[Reaction scheme: methionine + ATP → S-adenosylmethionine (SAM)]


The methyl group of the methionine unit is activated by the positive charge
on the adjacent sulfur atom, which makes the molecule much more reactive
than N⁵-methyltetrahydrofolate. The synthesis of S-adenosylmethionine is
unusual in that the triphosphate group of ATP is split into pyrophosphate
and orthophosphate; the pyrophosphate is subsequently hydrolyzed to two
molecules of Pᵢ. S-Adenosylhomocysteine is formed when the methyl group of
S-adenosylmethionine is transferred to an acceptor. S-Adenosylhomocysteine
is then hydrolyzed to homocysteine and adenosine.

[Reaction scheme: S-adenosylmethionine (SAM) → S-adenosylhomocysteine → homocysteine + adenosine (H₂O consumed in the second step)]


Methionine can be regenerated by the transfer of a methyl group to homocysteine
from N⁵-methyltetrahydrofolate, a reaction catalyzed by methionine
synthase (also known as homocysteine methyltransferase).


[Reaction scheme: homocysteine + N⁵-methyltetrahydrofolate → methionine + tetrahydrofolate]

The coenzyme that mediates this transfer of a methyl group is methylcobalamin,
derived from vitamin B₁₂. In fact, this reaction and the rearrangement
of L-methylmalonyl CoA to succinyl CoA (p. 650), catalyzed by a homologous
enzyme, are the only two B₁₂-dependent reactions known to take place
in mammals. Another enzyme that converts homocysteine into methionine
without vitamin B₁₂ also is present in many organisms.

These reactions constitute the activated methyl cycle (Figure 24.11).
Methyl groups enter the cycle in the conversion of homocysteine into
methionine and are then made highly reactive by the addition of an adenosyl
group, which makes the sulfur atoms positively charged and the methyl
groups much more electrophilic. The high transfer potential of the S-methyl
group enables it to be transferred to a wide variety of acceptors.

Figure 24.11 Activated methyl cycle. The methyl group of methionine is
activated by the formation of S-adenosylmethionine.

Figure 24.12 DNA methylation. The structure of a DNA methylase bound to an
oligonucleotide target shows that the base to be methylated is flipped out of
the DNA helix into the active site of a SAM-dependent methylase.
[Drawn from 10MH.pdb.]
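
One turn of the activated methyl cycle can be tallied by summing its four steps and letting the intermediates cancel. The sketch below is bookkeeping only: the species names are plain labels chosen for this example, "acceptor" stands for any methyl acceptor, and the water consumed in the first step is an assumption implied by the splitting of ATP into pyrophosphate and orthophosphate.

```python
from collections import Counter

# Each step is (reactants, products) with stoichiometric coefficients.
STEPS = [
    ({"methionine": 1, "ATP": 1, "H2O": 1}, {"SAM": 1, "PPi": 1, "Pi": 1}),
    ({"SAM": 1, "acceptor": 1}, {"SAH": 1, "methyl-acceptor": 1}),
    ({"SAH": 1, "H2O": 1}, {"homocysteine": 1, "adenosine": 1}),
    ({"homocysteine": 1, "N5-methyl-THF": 1}, {"methionine": 1, "THF": 1}),
]


def net_reaction(steps):
    """Sum the steps; negative counts are net reactants, positive are net products."""
    balance = Counter()
    for reactants, products in steps:
        for species, n in reactants.items():
            balance[species] -= n
        for species, n in products.items():
            balance[species] += n
    # Drop species that cancel exactly (the cycle intermediates).
    return {species: n for species, n in balance.items() if n != 0}


NET = net_reaction(STEPS)
CONSUMED = {species: -n for species, n in NET.items() if n < 0}
PRODUCED = {species: n for species, n in NET.items() if n > 0}
print("consumed:", CONSUMED)
print("produced:", PRODUCED)
```

Methionine, SAM, S-adenosylhomocysteine, and homocysteine drop out of the sum, which is what makes this a cycle: the net effect is the transfer of a methyl group from N⁵-methyltetrahydrofolate to the acceptor at the cost of one ATP.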

Among the acceptors modified by S-adenosylmethionine are specific 
bases in DNA. The methylation of DNA protects bacterial DNA from 
cleavage by restriction enzymes (Section 9.3). The base to be methylated 
is flipped out of the DNA double helix into the active site of a DNA meth- 
ylase, where it can accept a methyl group from S-adenosylmethionine 
(Figure 24.12). A recurring S-adenosylmethionine-binding domain is pres- 
ent in many SAM-dependent methylases. 

S-Adenosylmethionine is also the precursor of ethylene, a gaseous plant 
hormone that induces the ripening of fruit. S-Adenosylmethionine is cycl- 
ized to a cyclopropane derivative that is then oxidized to form ethylene. The 
Greek philosopher Theophrastus recognized more than 2000 years ago that 
sycamore figs do not ripen unless they are scraped with an iron claw. The 
reason is now known: wounding triggers ethylene production, which in turn 
induces ripening. Much effort is being made to understand this biosynthetic 
pathway because ethylene is a culprit in the spoilage of fruit. 


[Reaction scheme: S-adenosylmethionine → 1-aminocyclopropane-1-carboxylate (ACC), catalyzed by ACC synthase; ACC → ethylene, catalyzed by ACC oxidase]


Cysteine is synthesized from serine and homocysteine 


In addition to being a precursor of methionine in the activated methyl cycle,
homocysteine is an intermediate in the synthesis of cysteine. Serine and
homocysteine condense to form cystathionine. This reaction is catalyzed by
cystathionine β-synthase. Cystathionine is then deaminated and cleaved to
cysteine and α-ketobutyrate by cystathionine γ-lyase, or cystathionase. Both
of these enzymes utilize PLP and are homologous to aspartate aminotransferase.
The net reaction is


Homocysteine + serine ⇌ cysteine + α-ketobutyrate + NH₄⁺


Note that the sulfur atom of cysteine is derived from homocysteine, whereas the
carbon skeleton comes from serine.

[Reaction scheme: homocysteine + serine → cystathionine → α-ketobutyrate + cysteine]


High homocysteine levels correlate with vascular disease 


People with elevated serum levels of homocysteine or the disulfide-linked
dimer homocystine have an unusually high risk for coronary
heart disease and arteriosclerosis. The most common genetic cause of high
homocysteine levels is a mutation within the gene encoding cystathionine
β-synthase. The molecular basis of homocysteine’s action has not been
clearly identified, although it appears to damage cells lining blood vessels
and to increase the growth of vascular smooth muscle. The amino acid
raises oxidative stress as well. Vitamin treatments are effective in reducing
homocysteine levels in some people. Treatment with vitamins maximizes
the activity of the two major metabolic pathways processing homocysteine.
Pyridoxal phosphate, a vitamin B₆ derivative, is necessary for the activity of
cystathionine β-synthase, which converts homocysteine into cystathionine;
tetrahydrofolate, as well as vitamin B₁₂, supports the methylation of
homocysteine to methionine.


Shikimate and chorismate are intermediates in 
the biosynthesis of aromatic amino acids 


We turn now to the biosynthesis of essential amino acids. These amino acids 
are synthesized by plants and microorganisms, and those in the human diet 
are ultimately derived primarily from plants. The essential amino acids are 
formed by much more complex routes than are the nonessential amino 
acids. The pathways for the synthesis of aromatic amino acids in bacteria 
have been selected for discussion here because they are well understood and 
exemplify recurring mechanistic motifs. 

Phenylalanine, tyrosine, and tryptophan are synthesized by a com- 
mon pathway in E. coli (Figure 24.13). The initial step is the condensation 
of phosphoenolpyruvate (a glycolytic intermediate) with erythrose 
4-phosphate (a pentose phosphate pathway intermediate). The resulting 
seven-carbon open-chain sugar is oxidized, loses its phosphoryl group, and 
cyclizes to 3-dehydroquinate. Dehydration then yields 3-dehydroshikimate, 
which is reduced by NADPH to shikimate. The phosphorylation of shiki- 
mate by ATP gives shikimate 3-phosphate, which condenses with a second 
molecule of phosphoenolpyruvate. The resulting 5-enolpyruvyl intermedi- 
ate loses its phosphoryl group, yielding chorismate, the common precursor 
of all three aromatic amino acids. The importance of this pathway is revealed 
by the effectiveness of glyphosate (commercially known as Roundup), a 
broad-spectrum herbicide. This compound is an uncompetitive inhibitor of 
the enzyme that produces 5-enolpyruvylshikimate 3-phosphate. It blocks
aromatic amino acid biosynthesis in plants but is fairly nontoxic in animals
because they lack the enzyme.

Figure 24.13 Pathway to chorismate. Chorismate is an intermediate in the
biosynthesis of phenylalanine, tyrosine, and tryptophan. [Figure labels:
phosphoenolpyruvate, erythrose 4-phosphate, 5-enolpyruvylshikimate
3-phosphate, chorismate.]
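
For reference, the uncompetitive inhibition attributed to glyphosate has a compact kinetic signature. The rate law below is the standard textbook form, not something derived in this section; the symbols v₀, V_max, K_M, [S], [I], and K_i′ (the dissociation constant of the inhibitor from the enzyme-substrate complex) are introduced here for illustration.

```latex
\begin{align*}
v_0 &= \frac{V_{\max}\,[S]}{K_M + \alpha'\,[S]},
\qquad \alpha' = 1 + \frac{[I]}{K_i'} \\[4pt]
V_{\max}^{\text{app}} &= \frac{V_{\max}}{\alpha'},
\qquad K_M^{\text{app}} = \frac{K_M}{\alpha'}
\end{align*}
```

Because the inhibitor binds only the enzyme-substrate complex, both the apparent V_max and the apparent K_M fall by the same factor, and raising the substrate concentration does not relieve the inhibition.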


The pathway bifurcates at chorismate. Let us first follow the prephenate
branch (Figure 24.14). A mutase converts chorismate into prephenate, the
immediate precursor of the aromatic ring of phenylalanine and tyrosine.
This fascinating conversion is a rare example of a pericyclic reaction
in biochemistry, mechanistically similar to the well-known Diels–Alder
reaction in organic chemistry. Dehydration and decarboxylation yield
phenylpyruvate. Alternatively, prephenate can be oxidatively decarboxylated
to p-hydroxyphenylpyruvate. These α-ketoacids are then transaminated to
form phenylalanine and tyrosine.

The branch starting with anthranilate leads to the synthesis of tryptophan
(Figure 24.15). Chorismate acquires an amino group derived from the
hydrolysis of the side chain of glutamine and releases pyruvate to form
anthranilate. Then anthranilate condenses with 5-phosphoribosyl-1-pyrophosphate
(PRPP), an activated form of ribose phosphate. PRPP is also an
important intermediate in the synthesis of histidine, pyrimidine nucleotides,
and purine nucleotides (Sections 25.1 and 25.2). The C-1 atom of
ribose 5-phosphate becomes bonded to the nitrogen atom of anthranilate in
a reaction that is driven by the release and hydrolysis of pyrophosphate. The
ribose moiety of phosphoribosylanthranilate undergoes rearrangement to
yield 1-(o-carboxyphenylamino)-1-deoxyribulose 5-phosphate. This intermediate
is dehydrated and then decarboxylated to indole-3-glycerol phosphate,
which is cleaved to indole. Then indole reacts with serine to form
tryptophan. In these final steps, which are catalyzed by tryptophan synthase,
the side chain of indole-3-glycerol phosphate is removed as glyceraldehyde
3-phosphate and replaced by the carbon skeleton of serine.

Figure 24.14 Synthesis of phenylalanine and tyrosine. Chorismate can be
converted into prephenate, which is subsequently converted into
phenylalanine and tyrosine.

Figure 24.15 Synthesis of tryptophan. Chorismate can be converted into
anthranilate, which is subsequently converted into tryptophan.

Figure 24.17 Substrate channeling. A 25-Å tunnel runs from the active site
of the α subunit of tryptophan synthase (yellow) to the PLP cofactor (red)
in the active site of the β subunit (blue).


Tryptophan synthase illustrates substrate channeling in 
enzymatic catalysis 


The E. coli enzyme tryptophan synthase, an α₂β₂ tetramer, can be dissociated
into two α subunits and a β₂ dimer (Figure 24.16). The α subunit catalyzes
the formation of indole from indole-3-glycerol phosphate, whereas each β
subunit has a PLP-containing active site that catalyzes the condensation of
indole and serine to form tryptophan. Serine forms a Schiff base with this
PLP, which is then dehydrated to give the Schiff base of aminoacrylate. This
reactive intermediate is attacked by indole to give tryptophan. The overall
three-dimensional structure of this enzyme is distinct from that of aspartate
aminotransferase and the other PLP enzymes already discussed.


Figure 24.16 Structure of
tryptophan synthase. The structure
of the complex formed by one α
subunit (yellow) and one β subunit
(blue). Notice that pyridoxal phosphate
(PLP) is bound deeply inside the
β subunit, a considerable distance from
the α subunit. [Drawn from 1BKS.pdb.]


The synthesis of tryptophan poses a challenge. Indole, a hydrophobic
molecule, readily traverses membranes and would be lost from the cell if it
were allowed to diffuse away from the enzyme. This problem is solved in an
ingenious way. A 25-Å-long channel connects the active site of the α subunit
with that of the adjacent β subunit in the α₂β₂ tetramer (Figure 24.17).
Thus, indole can diffuse from one active site to the other without being
released into bulk solvent. Isotopic-labeling experiments showed that
indole formed by the α subunit does not leave the enzyme when serine is
present. Furthermore, the two partial reactions are coordinated. Indole is
not formed by the α subunit until the highly reactive aminoacrylate is ready
and waiting in the β subunit. We see here a clear-cut example of substrate
channeling in catalysis by a multienzyme complex. Channeling substantially
increases the catalytic rate. Furthermore, a deleterious side reaction (in this
case, the potential loss of an intermediate) is prevented. We shall encounter
other examples of substrate channeling in Chapter 25.


24.3 Feedback Inhibition Regulates Amino 
Acid Biosynthesis 


The rate of synthesis of amino acids depends mainly on the amounts of the 
biosynthetic enzymes and on their activities. We now consider the control of 
enzymatic activity. The regulation of enzyme synthesis will be discussed in 
Chapter 31. 

In a biosynthetic pathway, the first irreversible reaction, called the
committed step, is usually an important regulatory site. The final product
of the pathway (Z) often inhibits the enzyme that catalyzes the committed
step (A → B).


A → B → C → D → E → Z     (the step A → B is inhibited by Z)
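
The effect of this kind of control on the level of the end product can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not a model of any real pathway: the committed step is taken to run at a rate v_max/(1 + [Z]/k_i), Z is consumed at a first-order rate k_use·[Z], the intermediate steps are treated as instantaneous, and all parameter values are arbitrary.

```python
def steady_state_product(v_max=1.0, k_use=0.1, k_i=None, dt=0.01, t_end=200.0):
    """Euler-integrate d[Z]/dt = v_commit - k_use*[Z] and return the final [Z].

    v_commit = v_max / (1 + [Z]/k_i) when feedback is on (k_i is a number),
    or simply v_max when feedback is off (k_i is None).
    """
    z = 0.0
    for _ in range(int(t_end / dt)):
        v_commit = v_max if k_i is None else v_max / (1.0 + z / k_i)
        z += (v_commit - k_use * z) * dt
    return z


Z_OPEN_LOOP = steady_state_product(k_i=None)  # no feedback inhibition
Z_FEEDBACK = steady_state_product(k_i=1.0)    # committed step inhibited by Z

print(f"[Z] without feedback: {Z_OPEN_LOOP:.3f}")
print(f"[Z] with feedback:    {Z_FEEDBACK:.3f}")
```

With these arbitrary numbers, the feedback loop holds the end product well below the level it would reach if the committed step ran unchecked, so less of the starting material A is drawn into the pathway.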


This kind of control is essential for the conservation of building blocks 
and metabolic energy. Consider the biosynthesis of serine (p. 714). The com- 
mitted step in this pathway is the oxidation of 3-phosphoglycerate, catalyzed 
by the enzyme 3-phosphoglycerate dehydrogenase. The E. coli enzyme is a 
tetramer of four identical subunits, each comprising a catalytic domain and 
a serine-binding regulatory domain (Figure 24.18). The binding of serine to 
a regulatory site reduces the value of Vmax for the enzyme; an enzyme bound
to four molecules of serine is essentially inactive. Thus, if serine is abundant 
in the cell, the enzyme activity is inhibited, and so 3-phosphoglycerate, a 
key building block that can be used for other processes, is not wasted. 
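
A reduction in Vmax with an unchanged substrate affinity can be sketched with a simple rate function. The model below is hypothetical: it assumes four identical, independent serine sites, assumes that each bound serine removes one quarter of the maximal activity, and uses arbitrary constants (`k_m`, `k_ser`); the real enzyme need not behave this simply.

```python
def pgdh_rate(substrate, serine, v_max=1.0, k_m=1.0, k_ser=0.5, n_sites=4):
    """Michaelis-Menten rate whose Vmax falls as serine fills the regulatory sites.

    Hypothetical model: n_sites identical, independent sites; each bound serine
    removes 1/n_sites of the maximal activity, so a fully loaded tetramer is inactive.
    """
    occupancy = serine / (k_ser + serine)   # fraction of regulatory sites filled
    mean_bound = n_sites * occupancy        # average number of serines per tetramer
    v_max_app = v_max * (1.0 - mean_bound / n_sites)
    return v_max_app * substrate / (k_m + substrate)


for serine in (0.0, 0.5, 5.0, 500.0):
    print(f"[serine] = {serine:6.1f}  rate = {pgdh_rate(1.0, serine):.4f}")
```

In this sketch the rate at a fixed 3-phosphoglycerate concentration falls steadily as serine accumulates and approaches zero when all four sites are occupied.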


Branched pathways require sophisticated regulation 


The regulation of branched pathways is more compli- 
cated because the concentration of two products must 
be accounted for. In fact, several intricate feedback 
mechanisms have been found in branched biosynthetic 
pathways. 


Feedback inhibition and activation. Two pathways with
a common initial step may each be inhibited by its own
product and activated by the product of the other pathway.
Consider, for example, the biosynthesis of the amino
acids valine, leucine, and isoleucine. A common intermediate,
hydroxyethyl thiamine pyrophosphate (hydroxyethyl-TPP;
Section 17.1), initiates the pathways leading to
all three of these amino acids. Hydroxyethyl-TPP reacts
with α-ketobutyrate in the initial step for the synthesis of
isoleucine. Alternatively, hydroxyethyl-TPP reacts with
pyruvate in the committed step for the pathways leading to valine and
leucine. Thus, the relative concentrations of α-ketobutyrate and pyruvate
determine how much isoleucine is produced compared with valine and
leucine. Threonine deaminase, the PLP enzyme that catalyzes the formation of
α-ketobutyrate, is allosterically inhibited by isoleucine (Figure 24.19). This
enzyme is also allosterically activated by valine. Thus, this enzyme is inhibited
by the end product of the pathway that it initiates and is activated by the end
product of a competing pathway. This mechanism balances the amounts of
different amino acids that are synthesized.

Figure 24.18 Structure of 3-phosphoglycerate dehydrogenase. This
enzyme, which catalyzes the committed step in the serine biosynthetic
pathway, is inhibited by serine. Notice the two serine-binding dimeric
regulatory domains, one at the top and the other at the bottom of the
structure. [Figure labels: dimeric regulatory domain; catalytic domain.]
[Drawn from 1PSD.pdb.]

Figure 24.19 Regulation of threonine deaminase. Threonine is converted into
α-ketobutyrate in the committed step, leading to the synthesis of isoleucine.
The enzyme that catalyzes this step, threonine deaminase, is inhibited by
isoleucine and activated by valine, the product of a parallel pathway.

Figure 24.20 A recurring regulatory domain. The regulatory domain formed by
two subunits of 3-phosphoglycerate dehydrogenase is structurally related to
the single-chain regulatory domain of threonine deaminase. Notice that both
structures have four α helices and eight β strands in similar locations.
Sequence analyses have revealed this amino acid-binding regulatory domain to
be present in other enzymes as well. [Drawn from 1PSD and 1TDJ.pdb.]
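
The opposing effects of isoleucine and valine on threonine deaminase can be put into a toy rate function. The functional form is an assumption chosen for simplicity (a Michaelis-Menten term multiplied by a hyperbolic inhibition factor and a saturating activation factor); the constants `k_ile`, `k_val`, and `max_boost` are hypothetical, and the cooperativity of the real allosteric enzyme is ignored.

```python
def threonine_deaminase_rate(threonine, isoleucine, valine,
                             v_max=1.0, k_m=1.0,
                             k_ile=0.5, k_val=0.5, max_boost=2.0):
    """Toy rate: inhibited by isoleucine, activated by valine (assumed forms)."""
    base = v_max * threonine / (k_m + threonine)
    inhibition = 1.0 / (1.0 + isoleucine / k_ile)  # 1 with no Ile, toward 0 at high Ile
    activation = 1.0 + (max_boost - 1.0) * valine / (k_val + valine)  # 1 up to max_boost
    return base * inhibition * activation


BASELINE = threonine_deaminase_rate(1.0, 0.0, 0.0)
WITH_ILE = threonine_deaminase_rate(1.0, 0.5, 0.0)
WITH_VAL = threonine_deaminase_rate(1.0, 0.0, 0.5)
print(BASELINE, WITH_ILE, WITH_VAL)
```

When isoleucine is plentiful the flux toward α-ketobutyrate drops, and when valine is plentiful it rises, which is the balancing behavior described above.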


The regulatory domain in threonine deaminase is very similar in
structure to the regulatory domain in 3-phosphoglycerate dehydroge- 
nase (Figure 24.20). In the latter enzyme, regulatory domains of two 
subunits interact to form a dimeric serine-binding regulatory unit, and so 
the tetrameric enzyme contains two such regulatory units. Each unit is 
capable of binding two serine molecules. In threonine deaminase, the two 
regulatory domains are fused into a single unit with two differentiated 
amino acid-binding sites, one for isoleucine and the other for valine. 
Sequence analysis shows that similar regulatory domains are present in 
other amino acid biosynthetic enzymes. The similarities suggest that 
feedback-inhibition processes may have evolved by the linkage of specific 
regulatory domains to the catalytic domains of biosynthetic enzymes. 


Enzyme multiplicity. The committed step can be catalyzed by two or 
more enzymes with different regulatory properties. For example, the 
phosphorylation of aspartate is the committed step in the biosynthesis of 
threonine, methionine, and lysine. Three distinct aspartokinases catalyze 
this reaction in E. coli (Figure 24.21). The catalytic domains of these
enzymes show approximately 30% sequence identity. Although the 
mechanisms of catalysis are essentially identical, their activities are regu- 
lated differently: one enzyme is not subject to feedback inhibition, another 
is inhibited by threonine, and the third is inhibited by lysine. Thus, sophis- 
ticated regulation can also evolve by duplication of the genes encoding the 
biosynthetic enzymes. 
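
The consequence of enzyme multiplicity is that no single end product can shut down the shared step. A minimal sketch, assuming three isozymes of equal capacity and a simple hyperbolic inhibition term (both assumptions made for illustration, with an arbitrary `k_i`):

```python
ISOZYMES = [
    # (label, capacity, inhibiting amino acid or None)
    ("unregulated", 1.0, None),
    ("threonine sensitive", 1.0, "threonine"),
    ("lysine sensitive", 1.0, "lysine"),
]


def total_aspartokinase_activity(levels, k_i=1.0):
    """Sum the activities of the isozymes given a dict of amino acid levels."""
    total = 0.0
    for _label, capacity, inhibitor in ISOZYMES:
        if inhibitor is None:
            total += capacity
        else:
            total += capacity / (1.0 + levels.get(inhibitor, 0.0) / k_i)
    return total


print(total_aspartokinase_activity({}))
print(total_aspartokinase_activity({"threonine": 100.0}))
print(total_aspartokinase_activity({"threonine": 100.0, "lysine": 100.0}))
```

Excess threonine removes only the contribution of the threonine-sensitive enzyme, so aspartyl phosphate can still be made for the other end products.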


Figure 24.21 Domain structures of three aspartokinases. Each catalyzes the
committed step in the biosynthesis of a different amino acid: (top) methionine,
(middle) threonine, and (bottom) lysine. They have a catalytic domain in
common but differ in their regulatory domains. [Figure labels: aspartokinase
domain; unregulated; threonine sensitive; lysine sensitive.]


Cumulative feedback inhibition. A common step is partly inhibited by 
each of the final products, acting independently. The regulation of glu- 
tamine synthetase in E. coli is a striking example of cumulative feedback 
inhibition. Recall that glutamine is synthesized from glutamate, NH₄⁺, and
ATP. Glutamine synthetase consists of 12 identical 50-kd subunits arranged 
in two hexagonal rings that face each other. Earl Stadtman showed that this 
enzyme regulates the flow of nitrogen and hence plays a key role in control- 
ling bacterial metabolism. The amide group of glutamine is a source of 
nitrogen in the biosyntheses of a variety of compounds, such as tryptophan, 
histidine, carbamoyl phosphate, glucosamine 6-phosphate, cytidine triphos- 
phate, and adenosine monophosphate. Glutamine synthetase is cumula- 
tively inhibited by each of these final products of glutamine metabolism, as 
well as by alanine and glycine. In cumulative inhibition, each inhibitor can 
reduce the activity of the enzyme, even when other inhibitors are bound at 
saturating levels. The enzymatic activity of glutamine synthetase is switched 
off almost completely when all final products are bound to the enzyme. 
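
The arithmetic of cumulative inhibition is easy to make concrete. The sketch below assumes that each bound inhibitor removes a fixed fraction of whatever activity remains, independently of the others, so that the residual activities multiply; the fractional values used are hypothetical and are not measurements for glutamine synthetase.

```python
from math import prod


def residual_activity(fractional_inhibitions):
    """Fraction of activity left when each inhibitor independently removes
    its own fraction of the activity that remains."""
    return prod(1.0 - f for f in fractional_inhibitions)


# Two hypothetical inhibitors that alone inhibit by 50% and 20%.
print(residual_activity([0.5, 0.2]))

# Eight hypothetical inhibitors, each inhibiting by 30% on its own.
print(residual_activity([0.3] * 8))
```

No single inhibitor in this scheme can turn the enzyme off, yet eight partial inhibitors acting together leave only a few percent of the activity, which matches the near-complete shutdown described above.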


An enzymatic cascade modulates the activity of glutamine synthetase 


The activity of glutamine synthetase is also controlled by reversible covalent 
modification—the attachment of an AMP unit by a phosphodiester bond 
to the hydroxyl group of a specific tyrosine residue in each subunit 
(Figure 24.22). This adenylylated enzyme is less active and more susceptible to 
cumulative feedback inhibition than is the de-adenylylated form. The cova- 
lently attached AMP unit is removed from the adenylylated enzyme by 
phosphorolysis. The attachment of an AMP unit is the final step in an enzy- 
matic cascade that is initiated several steps back by reactants and immediate 
products in glutamine synthesis. 

The adenylylation and phosphorolysis reactions are catalyzed by the
same enzyme, adenylyl transferase. Sequence analysis indicates that this
adenylyl transferase comprises two homologous halves, suggesting that one
half catalyzes the adenylylation reaction and the other half the phosphorolytic
de-adenylylation reaction. What determines whether an AMP unit is added
or removed? The specificity of adenylyl transferase is controlled by a regulatory
protein (designated P or P_II), a trimeric protein that can exist in two
forms, P_A and P_D. The complex of P_A and adenylyl transferase catalyzes the
attachment of an AMP unit to glutamine synthetase, which reduces its
activity. Conversely, the complex of P_D and adenylyl transferase removes
AMP from the adenylylated enzyme.

Figure 24.22 Regulation by adenylylation. (A) A specific tyrosine residue in
each subunit in glutamine synthetase is modified by adenylylation.
(B) Adenylylation of tyrosine is catalyzed by a complex of adenylyl transferase
(AT) and one form of a regulatory protein (P_A). The same enzyme catalyzes
de-adenylylation when it is complexed with the other form (P_D) of the
regulatory protein. [Figure label: adenylylated glutamine synthetase (less
active).]

Figure 24.23 A higher level in the regulatory cascade of glutamine
synthetase. P_A and P_D, the regulatory proteins that control the specificity
of adenylyl transferase, are interconvertible. P_A is converted into P_D by
uridylylation, which is reversed by hydrolysis. The enzymes catalyzing these
reactions are regulated by the concentrations of metabolic intermediates.

Figure 24.24 Selected biomolecules derived from amino acids. The atoms
contributed by amino acids are shown in blue. [Figure label: thyroxine
(tetraiodothyronine).]
This brings us to another level of reversible covalent modification. P_A is
converted into P_D by the attachment of uridine monophosphate to a specific
tyrosine residue (Figure 24.23). This reaction, which is catalyzed by uridylyl
transferase, is stimulated by ATP and α-ketoglutarate, whereas it is inhibited
by glutamine. In turn, the UMP units on P_D are removed by hydrolysis,
a reaction promoted by glutamine and inhibited by α-ketoglutarate.
These opposing catalytic activities are present on a single polypeptide chain,
homologous to adenylyl transferase, and are controlled so that the enzyme
does not simultaneously catalyze uridylylation and hydrolysis.
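
The logic of the two-level cascade can be summarized as a small decision function. This is a qualitative sketch only: the inputs are reduced to booleans, ATP is assumed to be present, the names P_A and P_D stand for the two forms of the regulatory protein, and the "mixed" outcome for ambiguous signals is an assumption added to make the function total.

```python
def glutamine_synthetase_state(glutamine_high: bool, akg_high: bool) -> dict:
    """Qualitative outcome of the cascade for given metabolite signals."""
    # Uridylylation (P_A -> P_D): stimulated by alpha-ketoglutarate, inhibited by glutamine.
    # Hydrolysis of UMP (P_D -> P_A): promoted by glutamine, inhibited by alpha-ketoglutarate.
    if glutamine_high and not akg_high:
        regulator = "P_A"
    elif akg_high and not glutamine_high:
        regulator = "P_D"
    else:
        regulator = "mixed"

    # P_A directs adenylyl transferase to add AMP; P_D directs it to remove AMP.
    action = {
        "P_A": "adenylylation",
        "P_D": "de-adenylylation",
        "mixed": "no net change",
    }[regulator]
    activity = {
        "P_A": "less active",
        "P_D": "more active",
        "mixed": "intermediate",
    }[regulator]
    return {
        "regulator": regulator,
        "adenylyl_transferase": action,
        "glutamine_synthetase": activity,
    }


print(glutamine_synthetase_state(glutamine_high=True, akg_high=False))
print(glutamine_synthetase_state(glutamine_high=False, akg_high=True))
```

Abundant glutamine therefore feeds back to switch its own synthesis down, whereas abundant α-ketoglutarate, a sign that carbon skeletons are awaiting nitrogen, switches it up.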

Why is an enzymatic cascade used to regulate glutamine synthetase? 
One advantage of a cascade is that it amplifies signals, as in blood clotting 
and the control of glycogen metabolism. Another advantage is that the 
potential for allosteric control is markedly increased when each enzyme in the 
cascade is an independent target for regulation. The integration of nitrogen 
metabolism in a cell requires that a large number of input signals be 
detected and processed. In addition, the regulatory protein P also participates 
in regulating the transcription of genes for glutamine synthetase and 
other enzymes taking part in nitrogen metabolism. The evolution of a 
cascade provided many more regulatory sites and made possible a finer 
tuning of the flow of nitrogen in the cell. 


24.4 Amino Acids Are Precursors of Many Biomolecules 


In addition to being the building blocks of proteins and peptides, amino 
acids serve as precursors of many kinds of small molecules that have 
important and diverse biological roles. Let us briefly survey some of the 
biomolecules that are derived from amino acids (Figure 24.24). 

Purines and pyrimidines are derived largely from amino acids. The 
biosynthesis of these precursors of DNA, RNA, and numerous coenzymes will 
be discussed in detail in Chapter 25. The reactive terminus of sphingosine, 
an intermediate in the synthesis of sphingolipids, comes from serine. 
Histamine, a potent vasodilator, is derived from histidine by 
decarboxylation. Tyrosine is a precursor of the hormones thyroxine 
(tetraiodothyronine) and epinephrine and of melanin, a complex polymeric 
pigment. The neurotransmitter serotonin (5-hydroxytryptamine) and the 
nicotinamide ring of NAD⁺ are synthesized from tryptophan. Let us now 
consider in more detail three particularly important biochemicals derived 
from amino acids. 


[Structures shown in Figure 24.24: sphingosine, histamine, epinephrine, serotonin, and the nicotinamide unit of NAD⁺.] 


Glutathione, a gamma-glutamyl peptide, serves as a sulfhydryl buffer 
and an antioxidant 

Glutathione, a tripeptide containing a sulfhydryl group, is a highly 
distinctive amino acid derivative with several important roles (Figure 24.25). 


[Structure of glutathione, residues labeled: γ-glutamate, cysteine, glycine.] 


Figure 24.25 Glutathione. This tripeptide consists of a cysteine residue flanked by a glycine 
residue and a glutamate residue that is linked to cysteine by an isopeptide bond between 
glutamate’s side-chain carboxylate group and cysteine’s amino group. 


For example, glutathione, present at high levels (~5 mM) in animal cells, 
protects red blood cells from oxidative damage by serving as a sulfhydryl 
buffer (Section 20.5). It cycles between a reduced thiol form (GSH) and an 
oxidized form (GSSG) in which two tripeptides are linked by a disulfide 
bond. 


$$2\ \mathrm{GSH} + \mathrm{RO{-}OH} \rightleftharpoons \mathrm{GSSG} + \mathrm{H_2O} + \mathrm{ROH}$$


GSSG is reduced to GSH by glutathione reductase, a flavoprotein that uses 
NADPH as the electron source. The ratio of GSH to GSSG in most cells is 
greater than 500. Glutathione plays a key role in detoxification by reacting 
with hydrogen peroxide and organic peroxides, the harmful by-products of 
aerobic life. 

Glutathione peroxidase, the enzyme catalyzing this reaction, is 
remarkable in having a modified amino acid containing a selenium (Se) atom 
(Figure 24.26). Specifically, its active site contains the selenium analog of 
cysteine, in which selenium has replaced sulfur. The selenolate (E-Se⁻) 
form of this residue reduces the peroxide substrate to an alcohol and is in 
turn oxidized to selenenic acid (E-SeOH). Glutathione then comes into 
action by forming a selenosulfide adduct (E-Se-S-G). A second molecule of 
glutathione then regenerates the active form of the enzyme by attacking the 
selenosulfide to form oxidized glutathione (Figure 24.27). 


Nitric oxide, a short-lived signal molecule, is formed from arginine 


Nitric oxide (NO) is an important messenger in many vertebrate signal- 
transduction processes. For instance, NO stimulates mitochondrial biogenesis. 


Figure 24.27 Catalytic cycle of glutathione peroxidase. The cycle passes 
through the selenolate (E-Se⁻), selenenic acid (E-SeOH), and selenosulfide 
(E-Se-S-G) forms of the enzyme. [After O. Epp, R. Ladenstein, and A. Wendel. 
Eur. J. Biochem. 133(1983):51-69.] 
Figure 24.26 Structure of 
glutathione peroxidase. This enzyme, 
which has a role in peroxide detoxification, 
contains a selenocysteine residue in its active 
site. [Drawn from 1GP1.pdb.] 


Figure 24.28 Formation 
of nitric oxide. NO is 
generated by the 
oxidation of arginine. 


Arginine 


¹⁵N labeling: A pioneer’s account 

“Myself as a Guinea Pig 

...in 1944, I undertook, together with 
David Rittenberg, an investigation on the 
turnover of blood proteins of man. To this 
end I synthesized 66 g of glycine labeled 
with 35 percent ¹⁵N at a cost of $1000 for 
the ¹⁵N. On 12 February 1945, I started the 
ingestion of the labeled glycine. Since we did 
not know the effect of relatively large doses 
of the stable isotope of nitrogen and since 
we believed that the maximum incorporation 
into the proteins could be achieved by the 
administration of glycine in some continual 
manner, I ingested 1 g samples of glycine at 
hourly intervals for the next 66 hours.... 
At stated intervals, blood was withdrawn 
and after proper preparation the ¹⁵N 
concentrations of different blood proteins 
were determined.” 

David Shemin, 
Bioessays 10(1989):30 
Figure 24.29 Heme labeling. The origins of 
atoms in heme revealed by the results of 
isotopic labeling studies. 
This free-radical gas is produced endogenously from arginine in a complex 
reaction that is catalyzed by nitric oxide synthase. NADPH and O₂ are 
required for the synthesis of nitric oxide (Figure 24.28). Nitric oxide acts by 
binding to and activating soluble guanylate cyclase, an important enzyme in 
signal transduction (Section 32.3). This enzyme is homologous to adenylate 
cyclase but includes a heme-containing domain that binds NO. 


Porphyrins are synthesized from glycine and succinyl coenzyme A 


The participation of an amino acid in the biosynthesis of the porphyrin 
rings of hemes and chlorophylls was first revealed by isotope-labeling 
experiments carried out by David Shemin and his colleagues. In 1945, they 
showed that the nitrogen atoms of heme were labeled after the feeding of 
[¹⁵N]glycine to human subjects (of whom Shemin was the first), whereas 
the ingestion of [¹⁵N]glutamate resulted in very little labeling. 

Using ¹⁴C, which had just become available, they discovered that 8 of 
the carbon atoms of heme in nucleated duck erythrocytes are derived from 
the α-carbon atom of glycine and none from the carboxyl carbon atom. 
Subsequent studies demonstrated that the other 26 carbon atoms of heme 
can arise from acetate. Moreover, the ¹⁴C in methyl-labeled acetate emerged 
in 24 of these carbon atoms, whereas the ¹⁴C in carboxyl-labeled acetate 
appeared only in the other 2 (Figure 24.29). 

This highly distinctive labeling pattern led Shemin to propose that prior 
to incorporation into heme, acetate is converted to succinyl-CoA through 
enzymes from the citric acid cycle (Section 17.2). Shemin further posited 
that a heme precursor is formed by the condensation of glycine with 
succinyl-CoA. In fact, the first step in the biosynthesis of porphyrins in 
mammals is the condensation of glycine and succinyl CoA to form 
δ-aminolevulinate. 


$$\text{Succinyl CoA} + \text{glycine} + \mathrm{H^+} \longrightarrow \delta\text{-aminolevulinate} + \text{CoA} + \mathrm{CO_2}$$

This reaction is catalyzed by δ-aminolevulinate synthase, a PLP enzyme present 
in mitochondria. Consistent with the labeling studies performed by Shemin 
and his coworkers, the carbon atom from the carboxyl group of glycine is 
lost as carbon dioxide, while the α-carbon remains in δ-aminolevulinate. 
Two molecules of δ-aminolevulinate condense to form porphobilinogen, 
the next intermediate. Four molecules of porphobilinogen then condense 
head to tail to form a linear tetrapyrrole in a reaction catalyzed by 
porphobilinogen deaminase. The enzyme-bound linear tetrapyrrole then 
cyclizes to form uroporphyrinogen III, which has an asymmetric arrangement 
of side chains. This reaction requires a cosynthase. In the presence of 
synthase alone, uroporphyrinogen I, the nonphysiological symmetric isomer, 
is produced. Uroporphyrinogen III is also a key intermediate in the synthesis 
of vitamin B₁₂ by bacteria and that of chlorophyll by bacteria and plants 
(Figure 24.30). 

Figure 24.30 Heme biosynthetic pathway. The pathway for 
the formation of heme starts with eight molecules of 
δ-aminolevulinate. 

Figure 24.31 Heme degradation. The 
formation of the heme-degradation products 
biliverdin and bilirubin is responsible for the 
color of bruises. Abbreviations: M, methyl; V, 
vinyl. 

The porphyrin skeleton is now formed. Subsequent reactions alter 
the side chains and the degree of saturation of the porphyrin ring (see 
Figure 24.29). Coproporphyrinogen III is formed by the decarboxylation of 
the acetate side chains. The desaturation of the porphyrin ring and the 
conversion of two of the propionate side chains into vinyl groups yield 
protoporphyrin IX. The chelation of iron finally gives heme, the prosthetic 
group of proteins such as myoglobin, hemoglobin, catalase, peroxidase, and 
cytochrome c. The insertion of the ferrous form of iron is catalyzed by 
ferrochelatase. Iron is transported in the plasma by transferrin, a protein 
that binds two ferric ions, and is stored in tissues inside molecules of 
ferritin. The large internal cavity (~80 Å in diameter) of ferritin can hold 
as many as 4500 ferric ions (Section 32.4). 
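
The carbon counts reported from the labeling studies (8 carbons from the α-carbon of glycine, 26 from acetate by way of succinyl CoA) can be checked against the pathway with simple bookkeeping. The sketch below is illustrative only. It assumes that uroporphyrinogen III carries four acetate side chains, each losing one carbon as CO₂, and that each of the two propionate-to-vinyl conversions also releases one carbon as CO₂; the function and constant names are hypothetical.

```python
# Carbon bookkeeping for heme biosynthesis (illustrative sketch, not a model
# of the enzymology). Names below are hypothetical.

GLYCINE_CARBONS = 2       # alpha-carbon + carboxyl carbon
SUCCINYL_CARBONS = 4      # succinyl unit donated by succinyl CoA
ALA_PER_HEME = 8          # eight delta-aminolevulinate molecules per heme

# Assumptions stated in the lead-in: one CO2 lost per side chain modified.
ACETATE_DECARBOXYLATIONS = 4     # uroporphyrinogen III -> coproporphyrinogen III
PROPIONATE_TO_VINYL = 2          # two propionates converted into vinyl groups


def heme_carbon_budget():
    """Return the origin of the carbon atoms that end up in heme."""
    # ALA synthase: the glycine carboxyl carbon leaves as CO2,
    # so each ALA keeps 1 glycine carbon and 4 succinyl carbons.
    glycine_carbons_kept = GLYCINE_CARBONS - 1
    from_glycine = ALA_PER_HEME * glycine_carbons_kept

    # All side-chain carbons removed later come from the succinyl units.
    lost_later = ACETATE_DECARBOXYLATIONS + PROPIONATE_TO_VINYL
    from_succinyl = ALA_PER_HEME * SUCCINYL_CARBONS - lost_later

    return {
        "from_glycine_alpha_carbon": from_glycine,
        "from_succinyl_coa": from_succinyl,
        "total": from_glycine + from_succinyl,
    }


if __name__ == "__main__":
    for origin, count in heme_carbon_budget().items():
        print(f"{origin}: {count}")
```

Under these assumptions the tally reproduces the 8 + 26 split described for the labeling experiments.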

The normal human erythrocyte has a life span of about 120 days, as was 
first shown by the time course of ¹⁵N in Shemin’s own hemoglobin after he 
ingested ¹⁵N-labeled glycine. The first step in the degradation of the heme 
group is the cleavage of its α-methine bridge to form the green pigment 
biliverdin, a linear tetrapyrrole. The central methine bridge of biliverdin 
is then reduced by biliverdin reductase to form bilirubin, a red pigment 
(Figure 24.31). The changing color of a bruise is a highly graphic indicator 
of these degradative reactions. 


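[Scheme for Figure 24.31: heme is converted into biliverdin, with 2 O₂ + NADPH entering and CO + H₂O + NADP⁺ and Fe²⁺ leaving; biliverdin is then converted into bilirubin, with NADPH + H⁺ entering and NADP⁺ leaving.] 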


Porphyrins accumulate in some inherited disorders of 
porphyrin metabolism 


Porphyrias are inherited or acquired disorders caused by a deficiency 
of enzymes in the heme biosynthetic pathway. Porphyrin is synthesized 
in both the erythroblasts and the liver, and either one may be the site 
of a disorder. Congenital erythropoietic porphyria, for example, prematurely 
destroys erythrocytes. This disease results from insufficient cosynthase. In 
this porphyria, the synthesis of the required amount of uroporphyrinogen III 
is accompanied by the formation of very large quantities of 
uroporphyrinogen I, the useless symmetric isomer. Uroporphyrin I, 
coproporphyrin I, and other symmetric derivatives also accumulate. The urine 
of patients having this disease is red because of the excretion of large 
amounts of uroporphyrin I. Their teeth exhibit a strong red fluorescence 
under ultraviolet light because of the deposition of porphyrins. Furthermore, 
their skin is usually very sensitive to light because photoexcited porphyrins 
are quite reactive. 
Acute intermittent porphyria is the most prevalent of the porphyrias affecting 
the liver. This porphyria is characterized by the overproduction of 
porphobilinogen and δ-aminolevulinate, which results in severe abdominal pain 
and neurological dysfunction. The “madness” of George III, king of 
England during the American Revolution, is believed to have been due to 
this porphyria. 


Summary 


24.1 Nitrogen Fixation: Microorganisms Use ATP and a Powerful 
Reductant to Reduce Atmospheric Nitrogen to Ammonia 
Microorganisms use ATP and reduced ferredoxin, a powerful 
reductant, to reduce N₂ to NH₃. An iron–molybdenum cluster in 
nitrogenase deftly catalyzes the fixation of N₂, a very inert molecule. Higher 
organisms consume the fixed nitrogen to synthesize amino acids, 
nucleotides, and other nitrogen-containing biomolecules. The major 
points of entry of NH₄⁺ into metabolism are glutamine or glutamate. 


24.2 Amino Acids Are Made from Intermediates of the Citric Acid Cycle 
and Other Major Pathways 
Human beings can synthesize 11 of the basic set of 20 amino acids. 
These amino acids are called nonessential, in contrast with the 
essential amino acids, which must be supplied in the diet. The pathways for 
the synthesis of nonessential amino acids are quite simple. Glutamate 
dehydrogenase catalyzes the reductive amination of α-ketoglutarate to 
glutamate. A transamination reaction takes place in the synthesis of 
most amino acids. At this step, the chirality of the amino acid is 
established. Alanine and aspartate are synthesized by the transamination of 
pyruvate and oxaloacetate, respectively. Glutamine is synthesized 
from NH₄⁺ and glutamate, and asparagine is synthesized similarly. 
Proline and arginine are derived from glutamate. Serine, formed from 
3-phosphoglycerate, is the precursor of glycine and cysteine. Tyrosine 
is synthesized by the hydroxylation of phenylalanine, an essential 
amino acid. The pathways for the biosynthesis of essential amino acids 
are much more complex than those for the nonessential ones. 
Tetrahydrofolate, a carrier of activated one-carbon units, plays an 
important role in the metabolism of amino acids and nucleotides. This 
coenzyme carries one-carbon units at three oxidation states, which are 
interconvertible: most reduced (methyl); intermediate (methylene); 
and most oxidized (formyl, formimino, and methenyl). The major 
donor of activated methyl groups is S-adenosylmethionine, which is 
synthesized by the transfer of an adenosyl group from ATP to the 
sulfur atom of methionine. S-Adenosylhomocysteine is formed when the 
activated methyl group is transferred to an acceptor. It is hydrolyzed 
to adenosine and homocysteine, and the latter is then methylated to 
methionine to complete the activated methyl cycle. 


24.3 Feedback Inhibition Regulates Amino Acid Biosynthesis 
Most of the pathways of amino acid biosynthesis are regulated by 
feedback inhibition, in which the committed step is allosterically 
inhibited by the final product. The regulation of branched pathways 
requires extensive interaction among the branches that includes both 
negative and positive regulation. The regulation of glutamine synthetase 
in E. coli is a striking demonstration of cumulative feedback inhibition 
and of control by a cascade of reversible covalent modifications. 


24.4 Amino Acids Are Precursors of Many Biomolecules 


Amino acids are precursors of a variety of biomolecules. Glutathione 
(γ-Glu-Cys-Gly) serves as a sulfhydryl buffer and detoxifying agent. 
Glutathione peroxidase, a selenoenzyme, catalyzes the reduction of 
hydrogen peroxide and organic peroxides by glutathione. Nitric oxide, 
a short-lived messenger, is formed from arginine. Porphyrins are 
synthesized from glycine and succinyl CoA, which condense to give 
δ-aminolevulinate. Two molecules of this intermediate become linked 
to form porphobilinogen. Four molecules of porphobilinogen combine 
to form a linear tetrapyrrole, which cyclizes to uroporphyrinogen III. 
Oxidation and side-chain modifications lead to the synthesis of 
protoporphyrin IX, which acquires an iron atom to form heme. 


Key Terms 


nitrogen fixation (p. 706) 
nitrogenase complex (p. 707) 
essential amino acids (p. 711) 
nonessential amino acids (p. 711) 
pyridoxal phosphate (p. 712) 
tetrahydrofolate (p. 715) 
S-adenosylmethionine (SAM) (p. 716) 
activated methyl cycle (p. 717) 
substrate channeling (p. 722) 
committed step (p. 723) 
enzyme multiplicity (p. 724) 
cumulative feedback inhibition (p. 725) 
glutathione (p. 727) 
nitric oxide (NO) (p. 727) 
porphyria (p. 730) 


Problems 


1. Out of thin air. Define nitrogen fixation. What organisms 
are capable of nitrogen fixation? 


2. From few, many. What are the seven precursors of the 
20 amino acids? 


3. Vital, in the truest sense. Why are certain amino acids 
defined as essential for human beings? 


4. From sugar to amino acid. Write a balanced equation for 
the synthesis of alanine from glucose. 


5. From air to blood. What are the intermediates in the flow 
of nitrogen from N₂ to heme? 


6. The fix is in. “The mechanistic complexity of nitrogenase 
is necessary because nitrogen fixation is a thermodynamically 
unfavorable process.” True or false? Explain. 


7. Common component. What cofactor is required by all 
transaminases (aminotransferases)? 


8. Here, hold this. In this chapter, we considered three 
different cofactors/cosubstrates that act as carriers of 
one-carbon units. Name them. 


9. One-carbon transfers. Which derivative of folate is a 
reactant in the conversion of (a) glycine into serine? 
(b) homocysteine into methionine? 


10. Telltale tag. In the reaction catalyzed by glutamine 
synthetase, an oxygen atom is transferred from the side chain of 
glutamate to orthophosphate, as shown by the results of 
¹⁸O-labeling studies. Account for this finding. 


11. Telltale tag, redux. In contrast to the production of 
glutamine by glutamine synthetase (see Problem 10), the 
generation of asparagine from ¹⁸O-labeled aspartate does not 
result in the transfer of an ¹⁸O atom to orthophosphate. In 
what molecule do you expect to find one of the ¹⁸O atoms? 


12. Therapeutic glycine. Isovaleric acidemia is an inherited 
disorder of leucine metabolism caused by a deficiency of 
isovaleryl CoA dehydrogenase. Many infants having this 
disease die in the first month of life. The administration 
of large amounts of glycine sometimes leads to marked 
clinical improvement. Propose a mechanism for the 
therapeutic action of glycine. 


13. Lending a hand. The atoms from tryptophan shaded 
below are derived from two other amino acids. Name 
them. 


Tryptophan 


14. Deprived bacteria. Blue-green algae (cyanobacteria) 
form heterocysts when deprived of ammonia and nitrate. In 
this form, the cyanobacteria lack nuclei and are attached to 
adjacent vegetative cells. Heterocysts have photosystem I 
activity but are entirely devoid of photosystem II activity. 
What is their role? 


15. Cysteine and cystine. Most cytoplasmic proteins lack 
disulfide bonds, whereas extracellular proteins usually 
contain them. Why? 


16. Through the looking-glass. Suppose that aspartate 
aminotransferase were chemically synthesized with the use of 
D-amino acids only. What products would you expect if this 
mirror-image enzyme were treated with (a) L-aspartate and 
α-ketoglutarate; (b) D-aspartate and α-ketoglutarate? 


17. To and fro. The synthesis of δ-aminolevulinate takes 
place in the mitochondrial matrix, whereas the formation 
of porphobilinogen takes place in the cytoplasm. Propose a 
reason for the mitochondrial location of the first step in 
heme synthesis. 


18. Direct synthesis. Which of the 20 amino acids can be 
synthesized directly from a common metabolic intermediate 
by a transamination reaction? 


19. Alternative route to proline. Certain species of bacteria 
possess an enzyme, ornithine cyclodeaminase, that can 
catalyze the conversion of L-ornithine into L-proline in a 
single catalytic cycle. 


[Reaction scheme: ornithine → proline] 


The enzyme lysine cyclodeaminase has also been identified. 
Predict the product of the reaction catalyzed by lysine 
cyclodeaminase. 


20. Lines of communication. For the following example of a 
branched pathway, propose a feedback inhibition scheme 
that would result in the production of equal amounts of 
Y and Z. 


[Diagram: a branched pathway that begins with A → B and divides into separate branches ending in Y and Z.] 


21. Cumulative feedback inhibition. Consider the branched 
pathway in Problem 20. The first common step (A → B) is 
partly inhibited by both of the final products, each acting 
independently of the other. Suppose that a high level of 
Y alone decreased the rate of the A → B step from 100 to 
60 s⁻¹ and that a high level of Z alone decreased the rate 
from 100 to 40 s⁻¹. What would the rate be in the presence 
of high levels of both Y and Z? 


22. Recovered activity. Free sulfhydryl groups can be 
alkylated with 2-bromoethylamine to the corresponding 
thioether. 


[Reaction scheme: a free sulfhydryl group reacts with 2-bromoethylamine to give the corresponding thioether, with release of HBr.] 


Researchers prepared a mutant form of aspartate 
aminotransferase in which lysine 258 was replaced by cysteine 
(Lys258Cys). This mutant protein has no observable 
catalytic activity. However, treatment of Lys258Cys with 
2-bromoethylamine yielded a protein with ~7% activity 
relative to the wild-type enzyme. Explain why alkylation 
recovered some enzyme activity. 


Mechanism Problems 


23. Ethylene formation. Propose a mechanism for the 
conversion of S-adenosylmethionine into 
1-aminocyclopropane-1-carboxylate (ACC) by ACC synthase, a PLP 
enzyme. What is the other product? 


24. Mirror-image serine. Brain tissue contains substantial 
amounts of D-serine, which is generated from L-serine by 
serine racemase, a PLP enzyme. Propose a mechanism for 
the interconversion of L- and D-serine. What is the 
equilibrium constant for the reaction L-serine ⇌ D-serine? 


25. An unusual amino acid. Elongation factor-2 (eEF-2), a 
protein taking part in translation, contains a histidine 
residue that is modified posttranslationally in several steps to a 
complex side chain known as diphthamide. An intermediate 
along this pathway is referred to as diphthine. 


(a) Labeling experiments indicate that the diphthine 
intermediate is formed by the modification of histidine with four 
molecules of S-adenosylmethionine (indicated by the four 
colors on page 734). Propose a mechanism for the formation 
of diphthine. 


(b) The final conversion of diphthine into diphthamide is 
known to be ATP dependent. Propose two possible 
mechanisms for the final amidation step. 


[Structures shown: diphthine and diphthamide.] 


Chapter Integration Problems 


26. Connections. How might increased synthesis of 
aspartate and glutamate affect energy production in a cell? How 
would the cell respond to such an effect? 


27. Protection required. Suppose that a mutation in 
bacteria resulted in the diminished activity of methionine 
adenosyltransferase, the enzyme responsible for the 
synthesis of SAM from methionine and ATP. Predict how 
this diminished activity might affect the stability of the 
mutated bacteria’s DNA. 


28. Heme biosynthesis. Shemin and coworkers used 
acetate-labeling experiments to conclude that succinyl-CoA is a key 
intermediate in the biosynthesis of heme. Identify the 
intermediates in the conversion of acetate into succinyl-CoA. 


Chapter Integration and Data Interpretation Problem 


29. Light effects. The adjoining graph shows the 
concentration of several free amino acids in light- and dark-adapted 
plants. 


[Graph: bar chart of free amino acids (pmol per leaf, scaled by a power of ten) for Asp, Gly, Ser, Asn, Thr, and Gln in light-adapted and dark-adapted plants.] 


[After B. B. Buchanan, W. Gruissem, and R. L. Jones, 
Biochemistry and Molecular Biology of Plants (American 
Society of Plant Physiology, 2000), Fig. 8.3, p. 363.] 


(a) Of the amino acids shown, which are most affected by 
light-dark adaptation? 


(b) Suggest a plausible biochemical explanation for the 
difference observed. 


(c) White asparagus, a culinary delicacy, is the result of 
growing asparagus plants in the dark. What chemical might 
you think enhances the taste of white asparagus? 


Nucleotide Biosynthesis 


CHAPTER 25 


Methotrexate 


Nucleotides are required for cell growth and replication. A key enzyme for the synthesis of one nucleotide is dihydrofolate 
reductase (right). Cells grown in the presence of methotrexate, a reductase inhibitor, respond by increasing the number of 
copies of the reductase gene. The bright yellow regions visible on three of the chromosomes in the fluorescence micrograph 
(left), which were grown in the presence of methotrexate, contain hundreds of copies of the reductase gene. [(Left) Courtesy of 
Dr. Barbara Trask and Dr. Joyce Hamlin.] 


Nucleotides are key biomolecules required for a variety of life processes. 
First, nucleotides are the activated precursors of nucleic acids, necessary 
for the replication of the genome and the transcription of the genetic 
information into RNA. Second, an adenine nucleotide, ATP, is the universal 
currency of energy. A guanine nucleotide, GTP, also serves as an energy 
source for a more select group of biological processes. Third, nucleotide 
derivatives such as UDP-glucose participate in biosynthetic processes such 
as the formation of glycogen. Fourth, nucleotides are essential components of 
signal-transduction pathways. Cyclic nucleotides such as cyclic AMP and 
cyclic GMP are second messengers that transmit signals both within and 
between cells. Furthermore, ATP acts as the donor of phosphoryl groups 
transferred by protein kinases in a variety of signaling pathways and, in 
some cases, ATP is secreted as a signal molecule. 

In this chapter, we continue along the path begun in Chapter 24, which 
described the incorporation of nitrogen into amino acids from inorganic 
sources such as nitrogen gas. The amino acids glycine and aspartate are the 
scaffolds on which the ring systems present in nucleotides are assembled. 
Furthermore, aspartate and the side chain of glutamine serve as sources of 
NH₂ groups in the formation of nucleotides. 
Purine Bases Can Be Synthesized de Novo or Recycled by Salvage Pathways 

Deoxyribonucleotides Are Synthesized by the Reduction of Ribonucleotides Through a Radical Mechanism 

Key Steps in Nucleotide Biosynthesis Are Regulated by Feedback Inhibition 

Disruptions in Nucleotide Metabolism Can Cause Pathological Conditions 


SALVAGE PATHWAY 
Activated ribose (PRPP) + base → Nucleotide 

DE NOVO PATHWAY 
Activated ribose (PRPP) + amino acids + ATP + CO₂ + ... → Nucleotide 

Figure 25.1 Salvage and de novo 
pathways. In a salvage pathway, a base is 
reattached to a ribose, activated in the form of 
5-phosphoribosyl-1-pyrophosphate (PRPP). In 
de novo synthesis, the base itself is 
synthesized from simpler starting materials, 
including amino acids. ATP hydrolysis is 
required for de novo synthesis. 
Figure 25.2 De novo pathway for 
pyrimidine nucleotide synthesis. The C-2 
and N-3 atoms in the pyrimidine ring come 
from carbamoyl phosphate, whereas the other 
atoms of the ring come from aspartate. 


Table 25.1 Nomenclature of bases, nucleosides, and nucleotides 

RNA 
Base            Ribonucleoside         Ribonucleotide (5'-monophosphate) 
Adenine (A)     Adenosine              Adenylate (AMP) 
Guanine (G)     Guanosine              Guanylate (GMP) 
Uracil (U)      Uridine                Uridylate (UMP) 
Cytosine (C)    Cytidine               Cytidylate (CMP) 

DNA 
Base            Deoxyribonucleoside    Deoxyribonucleotide (5'-monophosphate) 
Adenine (A)     Deoxyadenosine         Deoxyadenylate (dAMP) 
Guanine (G)     Deoxyguanosine         Deoxyguanylate (dGMP) 
Thymine (T)     Thymidine              Thymidylate (TMP) 
Cytosine (C)    Deoxycytidine          Deoxycytidylate (dCMP) 


Nucleotide biosynthetic pathways are tremendously important as 
intervention points for therapeutic agents. Many of the most widely used drugs 
in the treatment of cancer block steps in nucleotide biosynthesis, 
particularly steps in the synthesis of DNA precursors. 


Nucleotides can be synthesized by de novo or salvage pathways 


The pathways for the biosynthesis of nucleotides fall into two classes: de 
novo pathways and salvage pathways (Figure 25.1). In de novo (from 
scratch) pathways, the nucleotide bases are assembled from simpler 
compounds. The framework for a pyrimidine base is assembled first and then 
attached to ribose. In contrast, the framework for a purine base is 
synthesized piece by piece directly onto a ribose-based structure. These 
pathways each comprise a small number of elementary reactions that are 
repeated with variation to generate different nucleotides, as might be 
expected for pathways that appeared very early in evolution. In salvage 
pathways, preformed bases are recovered and reconnected to a ribose unit. 

De novo pathways lead to the synthesis of ribonucleotides. However, 
DNA is built from deoxyribonucleotides. Consistent with the notion that 
RNA preceded DNA in the course of evolution, all deoxyribonucleotides 
are synthesized from the corresponding ribonucleotides. The deoxyribose 
sugar is generated by the reduction of ribose within a fully formed 
nucleotide. Furthermore, the methyl group that distinguishes the thymine of 
DNA from the uracil of RNA is added at the last step in the pathway. 

The nomenclature of nucleotides and their constituent units was 
presented in Chapter 4. Recall that a nucleoside is a purine or pyrimidine base 
linked to a sugar and that a nucleotide is a phosphate ester of a nucleoside. 
The names of the major bases of RNA and DNA, and of their nucleoside 
and nucleotide derivatives, are given in Table 25.1. 
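
For readers who like to keep such naming tables in a form that can be queried, the entries of Table 25.1 can be held in a small lookup structure. The sketch below is only an illustration; the dictionary layout and the function name `nucleotide_name` are hypothetical choices, and the entries are copied from the table.

```python
# Table 25.1 as a lookup structure (illustrative sketch; layout is hypothetical).
# Each entry maps a base to (nucleoside, nucleotide, abbreviation).
NOMENCLATURE = {
    "RNA": {
        "adenine": ("adenosine", "adenylate", "AMP"),
        "guanine": ("guanosine", "guanylate", "GMP"),
        "uracil": ("uridine", "uridylate", "UMP"),
        "cytosine": ("cytidine", "cytidylate", "CMP"),
    },
    "DNA": {
        "adenine": ("deoxyadenosine", "deoxyadenylate", "dAMP"),
        "guanine": ("deoxyguanosine", "deoxyguanylate", "dGMP"),
        "thymine": ("thymidine", "thymidylate", "TMP"),
        "cytosine": ("deoxycytidine", "deoxycytidylate", "dCMP"),
    },
}


def nucleotide_name(base, nucleic_acid="RNA"):
    """Return 'nucleotide (abbreviation)' for a base in RNA or DNA."""
    bases = NOMENCLATURE[nucleic_acid]
    key = base.lower()
    if key not in bases:
        raise KeyError(f"{base} is not a major base of {nucleic_acid}")
    _nucleoside, nucleotide, abbreviation = bases[key]
    return f"{nucleotide} ({abbreviation})"


if __name__ == "__main__":
    print(nucleotide_name("uracil"))           # RNA is the default
    print(nucleotide_name("thymine", "DNA"))
```

Asking for a base that does not belong to the chosen nucleic acid, such as thymine in RNA, raises an error, which mirrors the thymine/uracil distinction noted above.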


25.1 The Pyrimidine Ring Is Assembled de Novo or 
Recovered by Salvage Pathways 


In de novo synthesis of pyrimidines, the ring is synthesized first and then it 
is attached to a ribose phosphate to form a pyrimidine nucleotide (Figure 25.2). 
Pyrimidine rings are assembled from bicarbonate, aspartate, and ammonia. 
Although an ammonia molecule already present in solution can be used, 
the ammonia is usually produced from the hydrolysis of the side chain of 
glutamine. 


Bicarbonate and other oxygenated carbon compounds are 
activated by phosphorylation 


The first step in de novo pyrimidine biosynthesis is the synthesis of 
carbamoyl phosphate from bicarbonate and ammonia in a multistep process, 
requiring the cleavage of two molecules of ATP. This reaction is catalyzed 
by carbamoyl phosphate synthetase (CPS; Section 23.4). Analysis of the 
structure of CPS reveals two homologous domains, each of which catalyzes 
an ATP-dependent step (Figure 25.3). 

In the first step, bicarbonate is phosphorylated by ATP to form 
carboxyphosphate and ADP. Ammonia then reacts with carboxyphosphate 
to form carbamic acid and inorganic phosphate. 


$$\text{Bicarbonate} + \text{ATP} \longrightarrow \text{carboxyphosphate} + \text{ADP}$$
$$\text{Carboxyphosphate} + \mathrm{NH_3} \longrightarrow \text{carbamic acid} + \mathrm{P_i}$$


The active site for this reaction lies in a domain formed by the amino-terminal 
third of CPS. This domain forms a structure called an ATP-grasp fold, 
which surrounds ATP and holds it in an orientation suitable for 
nucleophilic attack at the γ phosphoryl group. Proteins containing ATP-grasp 
folds catalyze the formation of carbon–nitrogen bonds through acyl-phosphate 
intermediates. Such ATP-grasp folds are widely used in nucleotide 
biosynthesis. 

In the second step catalyzed by carbamoyl phosphate synthetase, 
carbamic acid is phosphorylated by another molecule of ATP to form 
carbamoyl phosphate. 


$$\text{Carbamic acid} + \text{ATP} \longrightarrow \text{carbamoyl phosphate} + \text{ADP}$$

This reaction takes place in a second ATP-grasp domain within the enzyme. 
The active sites leading to carbamic acid formation and carbamoyl 
phosphate formation are very similar, revealing that this enzyme evolved by a 
gene duplication event. Indeed, duplication of a gene encoding an 
ATP-grasp domain followed by specialization was central to the evolution of 
nucleotide biosynthetic processes (p. 741). 


The side chain of glutamine can be hydrolyzed to generate ammonia 


Glutamine is the primary source of ammonia for carbamoyl phosphate syn- 
thetase. In this case, a second polypeptide component of the enzyme hydro- 
lyzes glutamine to form ammonia and glutamate. The active site of the 
glutamine-hydrolyzing component contains a catalytic dyad comprising a 
cysteine and a histidine residue. Such a catalytic dyad, reminiscent of the 
active site of cysteine proteases (see Figure 9.16), is conserved in a family of 
amidotransferases, including CTP synthetase and GMP synthetase. 


Intermediates can move between active sites by channeling 


Carbamoyl phosphate synthetase contains three different active sites (see 
Figure 25.3), separated from one another by a total of 80 Å. Intermediates 
generated at one site move to the next without leaving the enzyme. These 
intermediates move within the enzyme by means of substrate channeling, 
similar to the process described for tryptophan synthetase (Figure 25.4; also 
Figure 24.17). The ammonia generated in the glutamine-hydrolysis active 
site travels 45 Å through a channel within the enzyme to reach the site at 
which carboxyphosphate has been generated. The carbamic acid generated 
at this site diffuses an additional 35 Å through an extension of the channel 
to reach the site at which carbamoyl phosphate is generated. This channeling 
serves two roles: (1) intermediates generated at one active site are captured 
with no loss caused by diffusion and (2) labile intermediates, such as 
carboxyphosphate and carbamic acid (which decompose in less than 1 s at 
pH 7), are protected from hydrolysis. We will see additional examples of 
substrate channeling later in this chapter. 

Figure 25.3 Structure of carbamoyl phosphate synthetase. Notice that the 
enzyme contains sites for three reactions. This enzyme consists of two chains. 
The smaller chain (yellow) contains a site for glutamine hydrolysis to generate 
ammonia. The larger chain includes two ATP-grasp domains (blue and red). In 
one ATP-grasp domain (blue), bicarbonate is phosphorylated to 
carboxyphosphate, which then reacts with ammonia to generate carbamic acid. 
In the other ATP-grasp domain, the carbamic acid is phosphorylated to produce 
carbamoyl phosphate. [Drawn from 1JDB.pdb.] 

Figure 25.4 Substrate channeling. The three active sites of carbamoyl 
phosphate synthetase are linked by a channel (yellow) through which 
intermediates pass. Glutamine enters one active site, and carbamoyl 
phosphate, which includes the nitrogen atom from the glutamine side chain, 
leaves another 80 Å away. [Drawn from 1JDB.pdb.] 
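
The distances quoted for the channel can be checked against the stated total with a few lines of Python. This is only a bookkeeping sketch: the tuple layout and the names `CHANNEL_SEGMENTS`, `total_channel_length`, and `is_contiguous` are illustrative assumptions, and the numbers are those given in the text.

```python
# Channel segments in carbamoyl phosphate synthetase, using the distances
# quoted in the text (in angstroms). The tuple layout is an illustrative choice.
CHANNEL_SEGMENTS = [
    # (intermediate, from_site, to_site, distance_in_angstroms)
    ("ammonia", "glutamine hydrolysis", "bicarbonate phosphorylation", 45.0),
    ("carbamic acid", "bicarbonate phosphorylation",
     "carbamic acid phosphorylation", 35.0),
]


def total_channel_length(segments):
    """Sum the lengths of consecutive channel segments."""
    return sum(distance for _, _, _, distance in segments)


def is_contiguous(segments):
    """Check that each segment starts at the site where the previous one ends."""
    return all(prev[2] == nxt[1] for prev, nxt in zip(segments, segments[1:]))


if __name__ == "__main__":
    print(total_channel_length(CHANNEL_SEGMENTS))  # total path in angstroms
    print(is_contiguous(CHANNEL_SEGMENTS))
```

The two segments add up to the 80 Å separation stated for the three active sites, and each intermediate is handed over at the site where the previous one was consumed.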


Orotate acquires a ribose ring from PRPP to form a pyrimidine 
nucleotide and is converted into uridylate 


Carbamoyl phosphate reacts with aspartate to form carbamoylaspartate 
in a reaction catalyzed by aspartate transcarbamoylase (Section 10.1). 
Carbamoylaspartate then cyclizes to form dihydroorotate, which is then 
oxidized by NAD* to form orotate. 


Carbamoyl phosphate + aspartate → carbamoylaspartate + Pᵢ 
Carbamoylaspartate + H⁺ → dihydroorotate + H₂O 
Dihydroorotate + NAD⁺ → orotate + NADH + H⁺ 


In mammals, the enzymes that form orotate are part of a single large polypeptide 
chain called CAD, for carbamoyl phosphate synthetase, aspartate 
transcarbamoylase, and dihydroorotase. 

At this stage, orotate couples to ribose, in the form of 5-phosphoribosyl- 
1-pyrophosphate (PRPP), a form of ribose activated to accept nucleotide 
bases. 5-Phosphoribosyl-1-pyrophosphate synthetase synthesizes PRPP by 
adding a pyrophosphate from ATP to ribose-5-phosphate, which is formed 
by the pentose phosphate pathway. 


Ribose 5-phosphate + ATP → PRPP + AMP (catalyzed by PRPP synthetase) 


Orotate reacts with PRPP to form orotidylate, a pyrimidine nucleotide. This 
reaction is driven by the hydrolysis of pyrophosphate. The enzyme that 
catalyzes this addition, pyrimidine phosphoribosyltransferase, is homologous 
to a number of other phosphoribosyltransferases that add different groups 
to PRPP to form the other nucleotides. Orotidylate is then decarboxylat- 
ed to form uridylate (UMP), a major pyrimidine nucleotide that is a precur- 
sor to RNA. This reaction is catalyzed by orotidylate decarboxylase. 


Orotidylate + H⁺ → uridylate + CO₂ 


Orotidylate decarboxylase is one of the most proficient enzymes known. 
In its absence, decarboxylation is extremely slow and is estimated to take 
place once every 78 million years; with the enzyme present, it takes place 
approximately once per second, a rate enhancement of 10¹⁷-fold. 


Nucleotide mono-, di-, and triphosphates are interconvertible 


How is the other major pyrimidine ribonucleotide, cytidine, formed? It is 
synthesized from the uracil base of UMP, but the synthesis can take place 
only after UMP has been converted into UTP. Recall that the diphosphates 
and triphosphates are the active forms of nucleotides in biosynthesis and 
energy conversions. Nucleoside monophosphates are converted into nucle- 
oside triphosphates in stages. First, nucleoside monophosphates are con- 
verted into diphosphates by specific nucleoside monophosphate kinases that 
utilize ATP as the phosphoryl-group donor. For example, UMP is phos- 
phorylated to UDP by UMP kinase. 


UMP + ATP ⇌ UDP + ADP 


Nucleoside diphosphates and triphosphates are interconverted by 
nucleoside diphosphate kinase, an enzyme that has broad specificity, in con- 
trast with the monophosphate kinases. X and Y represent any of several 
ribonucleosides or even deoxyribonucleosides: 


XDP + YTP ⇌ XTP + YDP 


CTP is formed by amination of UTP 


After uridine triphosphate has been formed, it can be transformed into cyti- 
dine triphosphate by the replacement of a carbonyl group by an amino 
group, a reaction catalyzed by cytidine triphosphate synthetase. 
Like the synthesis of carbamoyl phosphate, this reaction requires ATP and 
uses glutamine as the source of the amino group. The reaction proceeds 
through an analogous mechanism in which the O-4 atom is phosphorylated 
to form a reactive intermediate, and then the phosphate is displaced by 
ammonia, freed from glutamine by hydrolysis. CTP can then be used in 
many biochemical processes, including lipid and RNA synthesis. 

Figure 25.5 De novo pathway for purine nucleotide synthesis. The origins of the 
atoms in the purine ring are indicated. 
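
The order of conversions from UMP to CTP can be written down as a small lookup, which makes explicit that amination happens only at the triphosphate level. This is a minimal sketch: the enzyme names and donors are those named in the text, while the names `CONVERSIONS` and `route` are hypothetical and chosen only for illustration.

```python
# Conversions named in the text: (substrate, product, enzyme, donor).
CONVERSIONS = [
    ("UMP", "UDP", "UMP kinase", "ATP"),
    ("UDP", "UTP", "nucleoside diphosphate kinase",
     "a nucleoside triphosphate (YTP)"),
    ("UTP", "CTP", "cytidine triphosphate synthetase", "ATP and glutamine"),
]


def route(start, goal, conversions=CONVERSIONS):
    """Return the enzymes used, in order, to convert start into goal."""
    by_substrate = {s: (p, e) for s, p, e, _ in conversions}
    enzymes = []
    current = start
    while current != goal:
        if current not in by_substrate:
            raise ValueError(f"no conversion listed from {current!r}")
        current, enzyme = by_substrate[current]
        enzymes.append(enzyme)
    return enzymes


if __name__ == "__main__":
    for enzyme in route("UMP", "CTP"):
        print(enzyme)
```

Because the table lists only forward steps, asking for a route from CTP back to UMP raises an error rather than inventing a reverse pathway.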


Salvage pathways recycle pyrimidine bases 


Pyrimidine bases can be recovered from the breakdown products of DNA 
and RNA by the use of salvage pathways. In these pathways, a preformed 
base is reincorporated into a nucleotide. We will consider the salvage for the 
pyrimidine base thymine. Thymine is found in DNA and base-pairs with 
adenine in the DNA double helix. Thymine released from degraded DNA 
is salvaged in two steps. First, thymine is converted into nucleoside thymi- 
dine by thymidine phosphorylase. 


Thymine + deoxyribose-1-phosphate ⇌ thymidine + Pᵢ 

Thymidine is then converted into a nucleotide by thymidine kinase. 

Thymidine + ATP ⇌ TMP + ADP 


The activity of thymidine kinase fluctuates with the cell cycle, displaying 
peak activity during S phase when DNA synthesis is occurring. Viral thymi- 
dine kinase differs from the mammalian enzyme and thus provides a thera- 
peutic target. For instance, herpes simplex infections are treated with acy- 
clovir, which viral thymidine kinase converts into a suicide inhibitor that 
terminates DNA synthesis. As we will see shortly, thymidine kinase also 
plays a role in the de novo synthesis of thymidylate. 


25.2 Purine Bases Can Be Synthesized de Novo or 
Recycled by Salvage Pathways 


Like pyrimidine nucleotides, purine nucleotides can be synthesized de novo 
or by a salvage pathway. When synthesized de novo, purine synthesis 
begins with simple starting materials such as amino acids and bicarbonate 
(Figure 25.5). Unlike the bases of pyrimidines, the purine bases are assem- 
bled already attached to the ribose ring. Alternatively, purine bases, released 
by the hydrolytic degradation of nucleic acids and nucleotides, can be 
salvaged and recycled. Purine salvage pathways are especially noted for the 
energy that they save and the remarkable effects of their absence (p. 752). 


The purine ring system is assembled on ribose phosphate 


De novo purine biosynthesis, like pyrimidine biosynthesis, requires PRPP 
but, for purines, PRPP provides the foundation on which the bases are 
constructed step by step. The initial committed step is the displacement of 
pyrophosphate by ammonia, rather than by a preassembled base, to produce 
5-phosphoribosyl-1-amine, with the amine in the β configuration. 
Glutamine phosphoribosyl amidotransferase catalyzes this reaction. This 
enzyme comprises two domains: the first is homologous to the phosphori- 
bosyltransferases in purine salvage pathways (p. 744), whereas the second 
produces ammonia from glutamine by hydrolysis. However, this glutamine-hydrolysis 
domain is distinct from the domain that performs the same function 
in carbamoyl phosphate synthetase. In glutamine phosphoribosyl 
amidotransferase, a cysteine residue located at the amino terminus facilitates 
glutamine hydrolysis. To prevent wasteful hydrolysis of either substrate, 
the amidotransferase assumes the active configuration only on binding of 
both PRPP and glutamine. As is the case with carbamoyl phosphate syn- 
thetase, the ammonia generated at the glutamine-hydrolysis active site passes 
through a channel to reach PRPP without being released into solution. 


The purine ring is assembled by successive steps of activation by 
phosphorylation followed by displacement 


Nine additional steps are required to assemble the purine ring. 
Remarkably, the first six steps are analogous reactions. Most of these 
steps are catalyzed by enzymes with ATP-grasp domains that are homolo- 
gous to those in carbamoyl phosphate synthetase. Each step consists of the 
activation of a carbon-bound oxygen atom (typically a carbonyl oxygen atom) 
by phosphorylation, followed by the displacement of the phosphoryl group by 
ammonia or an amine group acting as a nucleophile (Nu). 


C=O + ATP → C-OPO₃²⁻ + ADP (phosphorylation) 
C-OPO₃²⁻ + Nu → C-Nu + Pᵢ (displacement) 


De novo purine biosynthesis proceeds as shown in Figure 25.6. Table 25.2 
lists the enzymes that catalyze each step of the reaction. 


1. The carboxylate group of a glycine residue is activated by phosphoryla- 
tion and then coupled to the amino group of phosphoribosylamine. A new 
amide bond is formed, and the amino group of glycine is free to act as a 
nucleophile in the next step. 


2. Formate is activated and then added to this amino group to form for- 
mylglycinamide ribonucleotide. 


Table 25.2 The enzymes of de novo purine synthesis 

Step  Enzyme 
1     Glycinamide ribonucleotide (GAR) synthetase 
2     GAR transformylase 
3     Formylglycinamidine synthase 
4     Aminoimidazole ribonucleotide synthetase 
5     Carboxyaminoimidazole ribonucleotide synthase 
6     Succinylaminoimidazole carboxamide ribonucleotide synthetase 
7     Adenylosuccinate lyase 
8     Aminoimidazole carboxamide ribonucleotide transformylase 
9     Inosine monophosphate cyclohydrolase 

Figure 25.6 De novo purine biosynthesis. (1) Glycine is coupled to the amino group of 
phosphoribosylamine. (2) N¹⁰-Formyltetrahydrofolate (THF) transfers a formyl group to the amino 
group of the glycine residue. (3) The inner amide group is phosphorylated and converted into an 
amidine by the addition of ammonia derived from glutamine. (4) An intramolecular coupling 
reaction forms the five-membered imidazole ring. (5) Bicarbonate adds first to the exocyclic amino 
group and then to a carbon atom of the imidazole ring. (6) The imidazole carboxylate is 
phosphorylated, and the phosphate is displaced by the amino group of aspartate. (7) Fumarate 
is released. (8) A second formyl group is donated from N¹⁰-formyltetrahydrofolate (THF). 
(9) Cyclization completes the synthesis of inosinate, a purine nucleotide. 


3. The inner carbonyl group is activated by phosphorylation and then 
converted into an amidine by the addition of ammonia derived from 
glutamine. 


4. The product of this reaction, formylglycinamidine ribonucleotide, cy- 
clizes to form the five-membered imidazole ring found in purines. Although 
this cyclization is likely to be favorable thermodynamically, a molecule of 
ATP is consumed to ensure irreversibility. The familiar pattern is repeated: 
a phosphoryl group from the ATP molecule activates the carbonyl group 
and is displaced by the nitrogen atom attached to the ribose molecule. 
Cyclization is thus an intramolecular reaction in which the nucleophile and 
phosphate-activated carbon atom are present within the same molecule. In 
higher eukaryotes, the enzymes catalyzing steps 1, 2, and 4 (see Table 25.2) 
are components of a single polypeptide chain. 


5. Bicarbonate is activated by phosphorylation and then attacked by the 
exocyclic amino group. The product of the reaction in step 5 rearranges to 
transfer the carboxylate group to the imidazole ring. Interestingly, mam- 
mals do not require ATP for this step; bicarbonate apparently attaches 
directly to the exocyclic amino group and is then transferred to the imida- 
zole ring. 


6. The imidazole carboxylate group is phosphorylated again and the phos- 
phate group is displaced by the amino group of aspartate. Once again, in 
higher eukaryotes, the enzymes catalyzing steps 5 and 6 (see Table 25.2) 
share a single polypeptide chain. 


7. Fumarate, an intermediate in the citric acid cycle, is eliminated, leaving 
the nitrogen atom from aspartate joined to the imidazole ring. The use of 
aspartate as an amino-group donor and the concomitant release of fumarate 
are reminiscent of the conversion of citrulline into arginine in the urea cycle, 
and these steps are catalyzed by homologous enzymes in the two pathways 
(Section 23.4). 


8. A formyl group from N¹⁰-formyltetrahydrofolate is added to this 
nitrogen atom to form a final 5-formaminoimidazole-4-carboxamide 
ribonucleotide. 


9. 5-Formaminoimidazole-4-carboxamide ribonucleotide cyclizes with the 
loss of water to form inosinate. 


Many of the intermediates in the de novo purine biosynthesis pathway 
degrade rapidly in water. Their instability in water suggests that the product 
of one enzyme must be channeled directly to the next enzyme along the 
pathway. Recent evidence shows that the enzymes do indeed form com- 
plexes when purine synthesis is required. 
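
The step-to-enzyme assignments and the groupings into shared polypeptide chains can be collected in one place. The sketch below assumes that the enzymes in Table 25.2 are listed in step order, 1 through 9. The groupings (steps 1, 2, and 4; steps 5 and 6) are those stated in the text for higher eukaryotes. The names `PURINE_ENZYMES`, `SHARED_CHAINS`, and `chain_partners` are hypothetical.

```python
# Enzymes of de novo purine synthesis, keyed by step number.
# Assumption: Table 25.2 lists the enzymes in step order.
PURINE_ENZYMES = {
    1: "Glycinamide ribonucleotide (GAR) synthetase",
    2: "GAR transformylase",
    3: "Formylglycinamidine synthase",
    4: "Aminoimidazole ribonucleotide synthetase",
    5: "Carboxyaminoimidazole ribonucleotide synthase",
    6: "Succinylaminoimidazole carboxamide ribonucleotide synthetase",
    7: "Adenylosuccinate lyase",
    8: "Aminoimidazole carboxamide ribonucleotide transformylase",
    9: "Inosine monophosphate cyclohydrolase",
}

# Steps whose enzymes share one polypeptide chain in higher eukaryotes,
# as stated in the text.
SHARED_CHAINS = [frozenset({1, 2, 4}), frozenset({5, 6})]


def chain_partners(step):
    """Return the other steps catalyzed by the same polypeptide chain."""
    if step not in PURINE_ENZYMES:
        raise KeyError(step)
    for chain in SHARED_CHAINS:
        if step in chain:
            return sorted(chain - {step})
    return []


if __name__ == "__main__":
    for step, enzyme in PURINE_ENZYMES.items():
        print(step, enzyme, chain_partners(step))
```

Steps with no listed partners return an empty list, which here means only that the text does not mention a shared chain for them.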


AMP and GMP are formed from IMP 


A few steps convert inosinate into either AMP or GMP (Figure 25.7). 
Adenylate is synthesized from inosinate by the substitution of an amino 
group for the carbonyl oxygen atom at C-6. Again, the addition of aspartate 
followed by the elimination of fumarate contributes the amino group. GTP, 
rather than ATP, is the phosphoryl-group donor in the synthesis of the 
adenylosuccinate intermediate from inosinate and aspartate. In accord with 
the use of GTP, the enzyme that promotes this conversion, adenylosuccinate 
synthase, is structurally related to the G-protein family and does not contain 
an ATP-grasp domain. 

Figure 25.7 Generating AMP and GMP. Inosinate is the precursor of AMP and GMP. 
AMP is formed by the addition of aspartate followed by the release of fumarate. 
GMP is generated by the addition of water, dehydrogenation by NAD⁺, and the 
replacement of the carbonyl oxygen atom by -NH₂ derived by the hydrolysis of 
glutamine. 

Figure 25.8 Formation of purinosomes. A gene construct encoding a fusion protein 
consisting of formylglycinamidine synthase and GFP was transfected into and 
expressed in HeLa cells, a human cell line. (A) In the presence of purines (the 
absence of purine synthesis), the GFP was seen as a diffuse stain throughout the 
cytoplasm. (B) When the cells were shifted to a purine-free medium, purinosomes 
formed, seen as cytoplasmic granules, and purine synthesis occurred. [An, S., 
Kumar, R., Sheets, E. D., and Benkovic, S. J. 2008. Science 320:103-106. 
Figure 2, C and D] 

Guanylate is synthesized by the oxidation of inosinate to xanthylate 
(XMP), followed by the incorporation of an amino group at C-2. NAD⁺ is 
the hydrogen acceptor in the oxidation of inosinate. Xanthylate is activated 
by the transfer of an AMP group (rather than a phosphoryl group) from 
ATP to the oxygen atom in the newly formed carbonyl group. Ammonia, 
generated by the hydrolysis of glutamine, then displaces the AMP group to 
form guanylate, in a reaction catalyzed by GMP synthetase. Note that the 
synthesis of adenylate requires GTP, whereas the synthesis of guanylate 
requires ATP. This reciprocal use of nucleotides by the pathways creates an 
important regulatory opportunity (Section 25.4). 


Enzymes of the purine synthesis pathway associate with 
one another in vivo 


Biochemists believe that the enzymes of many metabolic pathways, such as 
glycolysis and the citric acid cycle, are physically associated with one anoth- 
er. Such associations would increase the efficiency of pathways by facilitat- 
ing the movement of the product of one enzyme to the active site of the next 
enzyme in the pathway. The evidence for such associations comes primarily 
from experiments in which one component of a pathway, carefully isolated 
from the cell, is found to be bound to other components of the pathway. 
However, these observations raise the question, do enzymes associate with 
one another in vivo or do they spuriously associate during the isolation pro- 
cedure? Recent in vivo evidence shows that the enzymes of the purine syn- 
thesis pathway associate with one another when purine synthesis is required. 
Various enzymes of the pathway were fused with the green fluorescent pro- 
tein (see Figure 2.65) and transfected into cells. When cells were grown in 
the presence of purine, the GFP was spread diffusely throughout the cyto- 
plasm (Figure 25.8A). When the cells were switched to growth media with- 
out purines, purine synthesis began and the enzymes became associated 
with one another, forming complexes dubbed purinosomes (Figure 25.8B). 
The experiments were repeated with other enzymes of the purine synthesis 
pathway bearing the GFP, and the results were the same: purine synthesis 
occurs when the enzymes form the purinosomes. What actually causes 
complex formation? While the results are not yet established, it appears that 
a phosphatase, presumably somehow responding to the absence of purines, 
instigates complex formation, while a kinase, responding to the presence of 
purines, causes disassembly of the purinosome. 


Salvage pathways economize intracellular energy expenditure 


As we have seen, the de novo synthesis of purines requires a substantial 
investment of ATP. Purine salvage pathways provide a more economical 
means of generating purines. Free purine bases, derived from the turnover 
of nucleotides or from the diet, can be attached to PRPP to form purine 
nucleoside monophosphates, in a reaction analogous to the formation of 
orotidylate. Two salvage enzymes with different specificities recover purine 
bases. Adenine phosphoribosyltransferase catalyzes the formation of adenyl- 
ate (AMP): 


Adenine + PRPP → adenylate + PPᵢ 


whereas hypoxanthine-guanine phosphoribosyltransferase (HGPRT) catalyz- 
es the formation of guanylate (GMP) as well as inosinate (inosine mono- 
phosphate, IMP), a precursor of guanylate and adenylate. 


25.3 Deoxyribonucleotides Are Synthesized by the 
Reduction of Ribonucleotides Through a 
Radical Mechanism 


We turn now to the synthesis of deoxyribonucleotides. These precursors of 
DNA are formed by the reduction of ribonucleotides; specifically, the 
2'-hydroxyl group on the ribose moiety is replaced by a hydrogen atom. The 
substrates are ribonucleoside diphosphates, and the ultimate reductant is 
NADPH. The enzyme ribonucleotide reductase is responsible for the reduction 
reaction for all four ribonucleotides. The ribonucleotide reductases of 
different organisms are a remarkably diverse set of enzymes. Yet detailed 
studies have revealed that they have a common reaction mechanism, and 
their three-dimensional structural features indicate that these enzymes are 
homologous. We will focus on the best understood of these enzymes, that of 
E. coli living aerobically. 

[Margin figure: ribonucleotide reductase converts ADP, GDP, CDP, and UDP into 
dADP, dGDP, dCDP, and dUDP; further processing yields the dNTPs dATP, dGTP, 
dCTP, and TTP.] 


Mechanism: A tyrosyl radical is critical to the action of 
ribonucleotide reductase 


The ribonucleotide reductase of E. coli consists of two subunits: R1 (an 
87-kd dimer) and R2 (a 43-kd dimer). The R1 subunit contains the active 
site as well as two allosteric control sites (Section 25.4). This subunit 
includes three conserved cysteine residues and a glutamate residue, all four 
of which participate in the reduction of ribose to deoxyribose (Figure 25.9). 
The R2 subunit’s role in catalysis is to generate a remarkable free radical in 
each of its two chains. Each R2 chain contains a stable tyrosyl radical with 
an unpaired electron delocalized onto its aromatic ring (Figure 25.10). This 
very unusual free radical is generated by a nearby iron center consisting of 
two ferric (Fe³⁺) ions bridged by an oxide (O²⁻) ion. 

In the synthesis of a deoxyribonucleotide, the OH bonded to C-2' of the 
ribose ring is replaced by H, with retention of the configuration at the C-2' 
carbon atom (Figure 25.11). 

Figure 25.9 Ribonucleotide reductase. Ribonucleotide reductase reduces ribonucleotides to 
deoxyribonucleotides in its active site, which contains three key cysteine residues and one 
glutamate residue. Each R2 subunit contains a tyrosyl radical that accepts an electron from one of 
the cysteine residues in the active site to initiate the reduction reaction. Two R1 subunits come 
together to form a dimer as do two R2 subunits. 

Figure 25.10 Ribonucleotide reductase R2 subunit. The R2 subunit contains a 
stable free radical on a tyrosine residue. This radical is generated by the 
reaction of oxygen (not shown) at a nearby site containing two iron atoms. 
Two R2 subunits come together to form a dimer. [Drawn from 1RIB.pdb.] 


Figure 25.11 Ribonucleotide reductase mechanism. (1) An electron is transferred from a 
cysteine residue on R1 to a tyrosine radical on R2, generating a highly reactive cysteine thiyl radical. 
(2) This radical abstracts a hydrogen atom from C-3’ of the ribose unit. (3) The radical at C-3’ 
releases OH⁻ from the C-2' carbon atom. Combined with a proton from a second cysteine residue, 
the OH⁻ is eliminated as water. (4) A hydride ion is transferred from a third cysteine residue with 
the concomitant formation of a disulfide bond. (5) The C-3' radical recaptures the originally 
abstracted hydrogen atom. (6) An electron is transferred from R2 to reduce the thiyl radical, which 
also accepts a proton. The deoxyribonucleotide is free to leave R1. The disulfide formed in the 
active site must be reduced to begin another cycle. 

1. The reaction begins with the transfer of an electron from a cysteine residue 
on R1 to the tyrosyl radical on R2. The loss of an electron generates a 
highly reactive cysteine thiyl radical within the active site of R1. 


2. This radical then abstracts a hydrogen atom from C-3' of the ribose unit, 
generating a radical at that carbon atom. 


3. The radical at C-3' promotes the release of the OH⁻ from the C-2' carbon 
atom. Protonated by a second cysteine residue, the departing OH⁻ 
leaves as a water molecule. 


4. A hydride ion (a proton with two electrons) is then transferred from a 
third cysteine residue to complete the reduction of the C-2' position, form a 
disulfide bond, and re-form a radical at C-3'. 


5. This C-3' radical recaptures the same hydrogen atom originally abstract- 
ed by the first cysteine residue, and the deoxyribonucleotide is free to leave 
the enzyme. 


6. R2 provides an electron to reduce the thiyl radical. The disulfide bond 
generated in the enzyme’s active site must then be reduced to regenerate the 
active enzyme. 


The electrons for this reduction come from NADPH, but not directly. 
One carrier of reducing power linking NADPH with the reductase is thiore- 
doxin, a 12-kd protein with two exposed cysteine residues near each other. 
These sulfhydryls are oxidized to a disulfide in the reaction catalyzed by 
ribonucleotide reductase itself. In turn, reduced thioredoxin is regenerated 
by electron flow from NADPH. This reaction is catalyzed by thioredoxin 
reductase, a flavoprotein. Electrons flow from NADPH to bound FAD of 
the reductase, to the disulfide of oxidized thioredoxin, and then to ribonu- 
cleotide reductase and finally to the ribose unit. 


NADPH → FAD of thioredoxin reductase (TR) → thioredoxin (T) → 
ribonucleotide reductase (RR) → ribose unit 

In each of the three proteins, a pair of cysteine sulfhydryl groups (SH, SH) 
cycles with the corresponding disulfide (S-S). 


Stable radicals other than tyrosyl radical are employed by other 
ribonucleotide reductases 


Ribonucleotide reductases that do not contain tyrosyl radicals have 
been characterized in other organisms. Instead, these enzymes contain 
other stable radicals that are generated by other processes. For example, in 
one class of reductases, the coenzyme adenosylcobalamin (vitamin B₁₂) is 
the radical source. Despite differences in the stable radical employed, the 
active sites of these enzymes are similar to that of the E. coli ribonucleotide 
reductase, and they appear to act by the same mechanism, based on the 
exceptional reactivity of cysteine radicals. Thus, these enzymes have a com- 
mon ancestor but evolved a range of mechanisms for generating stable 
radical species that function well under different growth conditions. 
The primordial enzymes appear to have been inactivated by oxygen, whereas 
enzymes such as the E. coli enzyme make use of oxygen to generate the initial 
tyrosyl radical. Note that the reduction of ribonucleotides to 
deoxyribonucleotides is a difficult reaction chemically, likely to require a 
sophisticated catalyst. The existence of a common protein enzyme framework 
for this process strongly suggests that proteins joined the RNA world before the 
evolution of DNA as a stable storage form for genetic information. 

Figure 25.12 Thymidylate synthesis. Thymidylate synthase catalyzes the addition 
of a methyl group (derived from N⁵,N¹⁰-methylenetetrahydrofolate) to dUMP to form 
TMP. The addition of a thiolate from the enzyme activates dUMP. Opening the 
five-membered ring of the THF derivative prepares the methylene group for 
nucleophilic attack by the activated dUMP. The reaction is completed by the 
transfer of a hydride ion to form dihydrofolate. 


Thymidylate is formed by the methylation of deoxyuridylate 


Uracil, produced by the pyrimidine synthesis pathway, is not a component 
of DNA. Rather, DNA contains thymine, a methylated analog of uracil. 
Another step is required to generate thymidylate from uracil. Thymidylate 
synthase catalyzes this finishing touch: deoxyuridylate (dUMP) is methyl- 
ated to thymidylate (TMP). Recall that thymidylate synthase also functions 
in the thymine salvage pathways. As will be described in Chapter 28, the 
methylation of this nucleotide marks sites of DNA damage for repair and, 
hence, helps preserve the integrity of the genetic information stored in 
DNA. The methyl donor in this reaction is N⁵,N¹⁰-methylenetetrahydrofolate 
rather than S-adenosylmethionine (Section 24.2). 

The methyl group becomes attached to the C-5 atom within the aromatic 
ring of dUMP, but this carbon atom is not a good nucleophile and 
cannot itself attack the appropriate group on the methyl donor. Thymidylate 
synthase promotes methylation by adding a thiolate from a cysteine side 
chain to this ring to generate a nucleophilic species that can attack the 
methylene group of N⁵,N¹⁰-methylenetetrahydrofolate (Figure 25.12). This 
methylene group, in turn, is activated by distortions imposed by the enzyme 
that favor opening the five-membered ring. The activated dUMP’s attack 
on the methylene group forms the new carbon-carbon bond. The intermediate 
formed is then converted into product: a hydride ion is transferred 
from the tetrahydrofolate ring to transform the methylene group into a 
methyl group, and a proton is abstracted from the carbon atom bearing the 
methyl group to eliminate the cysteine and regenerate the aromatic ring. 
The tetrahydrofolate derivative loses both its methylene group and a 
hydride ion and, hence, is oxidized to dihydrofolate. For the synthesis of 
more thymidylate, tetrahydrofolate must be regenerated. 


Dihydrofolate reductase catalyzes the regeneration of 
tetrahydrofolate, a one-carbon carrier 


Tetrahydrofolate is regenerated from the dihydrofolate that is produced in 
the synthesis of thymidylate. This regeneration is accomplished by dihydro- 
folate reductase with the use of NADPH as the reductant. 


Dihydrofolate + NADPH + H⁺ → tetrahydrofolate + NADP⁺ 

A hydride ion is directly transferred from the nicotinamide ring of NADPH 
to the pteridine ring of dihydrofolate. The bound dihydrofolate and 
NADPH are held in close proximity to facilitate the hydride transfer. 


Several valuable anticancer drugs block the synthesis 
of thymidylate 


Rapidly dividing cells require an abundant supply of thymidylate for 
the synthesis of DNA. The vulnerability of these cells to the inhibi- 
tion of TMP synthesis has been exploited in the treatment of cancer. 
Thymidylate synthase and dihydrofolate reductase are choice targets of 
chemotherapy (Figure 25.13). 

Fluorouracil, an anticancer drug, is converted in vivo into fluorodeoxyuridylate 
(F-dUMP). This analog of dUMP irreversibly inhibits thymidylate 
synthase after acting as a normal substrate through part of the catalytic 
cycle. Recall that the formation of TMP requires the removal of a proton 
(H⁺) from C-5 of the bound nucleotide (see Figure 25.12). However, the 
enzyme cannot abstract F⁺ from F-dUMP, and so catalysis is blocked at the 
stage of the covalent complex formed by F-dUMP, methylenetetrahydrofolate, 
and the sulfhydryl group of the enzyme (Figure 25.14). We see here an 
example of suicide inhibition, in which an enzyme converts a substrate into 
a reactive inhibitor that halts the enzyme’s catalytic activity (Section 8.5). 

[Figure 25.13 Anticancer drug targets. Labeled species include 
N⁵,N¹⁰-methylenetetrahydrofolate, dihydrofolate, glycine, and NADPH + H⁺.] 

The synthesis of TMP can also be blocked by inhibiting the regenera- 
tion of tetrahydrofolate. Analogs of dihydrofolate, such as aminopterin and 
methotrexate (amethopterin), are potent competitive inhibitors (Ki < 1 nM)
of dihydrofolate reductase. 
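
To give a feel for what a Ki below 1 nM implies, the following Python sketch applies the standard competitive-inhibition form of the Michaelis-Menten equation, v = Vmax[S]/(Km(1 + [I]/Ki) + [S]). The Km of 1000 nM, the substrate concentration, and the inhibitor concentrations are hypothetical values chosen only for illustration; Ki is set to 1 nM, the upper bound quoted above. Because inhibitors of this potency bind so tightly, the simplifying assumption that free inhibitor equals total inhibitor may not hold in a real assay, so this is a qualitative sketch and not a model of the drug.

```python
def competitive_rate(s, i, vmax=1.0, km=1.0, ki=1.0):
    """Michaelis-Menten rate in the presence of a competitive inhibitor.

    s, i, km, and ki must share one concentration unit (nM here).
    All default values are placeholders, not measured constants.
    """
    km_apparent = km * (1.0 + i / ki)  # a competitive inhibitor raises the apparent Km
    return vmax * s / (km_apparent + s)


# Hypothetical numbers: Km = 1000 nM, substrate held at Km, Ki = 1 nM.
KM_NM, KI_NM, S_NM = 1000.0, 1.0, 1000.0

for inhibitor_nm in (0.0, 1.0, 10.0, 100.0):
    v = competitive_rate(S_NM, inhibitor_nm, km=KM_NM, ki=KI_NM)
    print(f"[I] = {inhibitor_nm:6.1f} nM  ->  v/Vmax = {v:.3f}")
```

With these assumed numbers, an inhibitor concentration equal to Ki is already enough to lower the rate from one-half to one-third of Vmax, and 100 nM inhibitor reduces it to less than 1% of Vmax.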


Aminopterin (R = H) or methotrexate (R = CH3)


Methotrexate is a valuable drug in the treatment of many rapidly growing 
tumors, such as those in acute leukemia and choriocarcinoma, a cancer 
derived from placental cells. However, methotrexate kills rapidly replicat- 
ing cells whether they are malignant or not. Stem cells in bone marrow, 
epithelial cells of the intestinal tract, and hair follicles are vulnerable to the 
action of this folate antagonist, accounting for its toxic side effects, which 
include weakening of the immune system, nausea, and hair loss. 

Folate analogs such as trimethoprim have potent antibacterial and
antiprotozoal activity. Trimethoprim binds 10^5-fold less tightly to mammalian
dihydrofolate reductase than it does to reductases of susceptible microor- 
ganisms. Small differences in the active-site clefts of these enzymes account 
for the highly selective antimicrobial action. The combination of trimethop- 
rim and sulfamethoxazole (an inhibitor of folate synthesis) is widely used 
to treat infections. 
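
A ratio of binding affinities can be restated as a difference in binding free energy through the relation ΔΔG = RT ln(ratio). The sketch below does this arithmetic in Python, taking the selectivity of trimethoprim as 10^5-fold and assuming a temperature of 25°C; the function name and the choice of temperature are illustrative assumptions, not values given in the text.

```python
import math

R = 8.314  # gas constant in J mol^-1 K^-1


def binding_energy_difference(fold_weaker, temp_kelvin=298.15):
    """Free-energy difference (kJ/mol) corresponding to a ratio of dissociation constants."""
    return R * temp_kelvin * math.log(fold_weaker) / 1000.0


# Assumed selectivity of 10^5-fold at an assumed temperature of 25 degrees C.
ddg = binding_energy_difference(1e5)
print(f"Difference in binding free energy: {ddg:.1f} kJ/mol")
```

Under these assumptions the difference comes to roughly 28.5 kJ/mol (about 6.8 kcal/mol). That is comparable to the energy of a few hydrogen bonds, which fits the statement that small differences in the active-site clefts are enough to account for the selectivity.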


25.4 Key Steps in Nucleotide Biosynthesis Are Regulated 
by Feedback Inhibition 


Nucleotide biosynthesis is regulated by feedback inhibition in a manner 
similar to the regulation of amino acid biosynthesis (Section 24.3). These
regulatory pathways ensure that the various nucleotides are produced in the
required quantities. 


Pyrimidine biosynthesis is regulated by aspartate transcarbamoylase 


Aspartate transcarbamoylase, one of the key enzymes for the regulation of 
pyrimidine biosynthesis in bacteria, was described in detail in Chapter 10. 
Recall that ATCase is inhibited by CTP, the final product of pyrimidine bio- 
synthesis, and stimulated by ATP. 


Aspartate + carbamoyl phosphate → carbamoylaspartate → → UMP → UDP → UTP → CTP
(ATCase, which catalyzes the first step, is inhibited by CTP and stimulated by ATP.)


Carbamoyl phosphate synthetase is also a site of feedback inhibition in both
prokaryotes and eukaryotes. 


The synthesis of purine nucleotides is controlled by 
feedback inhibition at several sites 


The regulatory scheme for purine nucleotides is more complex than that for 
pyrimidine nucleotides (Figure 25.15).


1. The committed step in purine nucleotide biosynthesis is the conversion 
of PRPP into phosphoribosylamine by glutamine phosphoribosyl amidotrans- 
ferase. This important enzyme is feedback-inhibited by many purine ribo- 
nucleotides. It is noteworthy that AMP and GMP, the final products of the 
pathway, are synergistic in inhibiting the amidotransferase. 


2. Inosinate is the branch point in the synthesis of AMP and GMP. The 
reactions leading away from inosinate are sites of feedback inhibition. AMP
inhibits the conversion of inosinate into adenylosuccinate, its immediate 
precursor. Similarly, GMP inhibits the conversion of inosinate into xanthyl- 
ate, its immediate precursor. 


3. As already noted, GTP is a substrate in the synthesis of AMP, whereas 
ATP is a substrate in the synthesis of GMP. This reciprocal substrate relation
tends to balance the synthesis of adenine and guanine ribonucleotides. 


Note that the synthesis of PRPP by PRPP synthetase is highly regulated 
even though it is not the committed step in purine synthesis. Mutations 
have been identified in PRPP synthetase that result in a loss of allosteric 
response to nucleotides without any effect on catalytic activity of the 
enzyme. A consequence of this mutation is an overabundance of purine 
nucleotides that can result in gout, a pathological condition discussed later 
in the chapter. 
Figure 25.15 Control of purine
biosynthesis. Feedback inhibition controls
both the overall rate of purine biosynthesis
and the balance between AMP and GMP
production.


Figure 25.16 Regulation of
ribonucleotide reductase. (A)
Each subunit in the R1 dimer
contains two allosteric sites in
addition to the active site. One site
regulates the overall activity and
the other site regulates substrate
specificity. (B) The patterns of
regulation with regard to different
nucleoside diphosphates
demonstrated by ribonucleotide
reductase.


The synthesis of deoxyribonucleotides is controlled by the
regulation of ribonucleotide reductase 


The reduction of ribonucleotides to deoxyribonucleotides is precisely con- 
trolled by allosteric interactions. Each polypeptide of the R1 subunit of the 
aerobic E. coli ribonucleotide reductase contains two allosteric sites: one of 
them controls the overall activity of the enzyme, and the other regulates 
substrate specificity (Figure 25.16). The overall catalytic activity of ribonu- 
cleotide reductase is diminished by the binding of dATP, which signals an 
abundance of deoxyribonucleotides. The binding of ATP reverses this 
feedback inhibition. The binding of dATP or ATP to the substrate-speci- 
ficity control site enhances the reduction of UDP and CDP, the pyrimidine 
nucleotides. The binding of thymidine triphosphate (TTP) promotes the 
reduction of GDP and inhibits the further reduction of pyrimidine
ribonucleotides. The subsequent increase in the level of dGTP stimulates the
reduction of ADP to dADP, the precursor of dATP. This complex pattern of regulation supplies
the appropriate balance of the four deoxyribonucleotides needed for the 
synthesis of DNA. 
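
The regulatory pattern just described can be summarized as two small lookup tables, one for each allosteric site. The Python sketch below is only a bookkeeping aid: the dictionary and function names are hypothetical, the enzyme is treated as simply on or off, and the substrates are written as nucleoside diphosphates, as in the chapter summary. Real allosteric regulation is graded and not binary.

```python
# Overall-activity site: effector -> state of the enzyme.
ACTIVITY_SITE = {
    "ATP": "active",      # ATP reverses the feedback inhibition
    "dATP": "inhibited",  # dATP signals an abundance of deoxyribonucleotides
}

# Substrate-specificity site: effector -> substrates whose reduction is favored.
SPECIFICITY_SITE = {
    "ATP": ("UDP", "CDP"),
    "dATP": ("UDP", "CDP"),
    "TTP": ("GDP",),
    "dGTP": ("ADP",),
}


def favored_substrates(activity_effector, specificity_effector):
    """Return the substrates favored for a given pair of bound effectors."""
    if ACTIVITY_SITE[activity_effector] == "inhibited":
        return ()  # enzyme switched off regardless of the specificity site
    return SPECIFICITY_SITE[specificity_effector]


for effector in ("ATP", "TTP", "dGTP"):
    print(effector, "->", favored_substrates("ATP", effector))
```

Reading the specificity table from top to bottom reproduces the sequence in the text: pyrimidine substrates are reduced first, TTP then shifts the enzyme toward GDP, and dGTP in turn shifts it toward the adenine nucleotide.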


25.5 Disruptions in Nucleotide Metabolism Can 
Cause Pathological Conditions 


Nucleotides are vital to a host of biochemical processes. It is not surprising, 
then, that disruption of nucleotide metabolism would have a variety of 
physiological effects. The nucleotides of a cell undergo continual turnover. 
Nucleotides are hydrolytically degraded to nucleosides by nucleotidases. 
The phosphorolytic cleavage of nucleosides to free bases and ribose 1-phos- 
phate (or deoxyribose 1-phosphate) is catalyzed by nucleoside phosphorylases. 
Ribose 1-phosphate is isomerized by phosphoribomutase to ribose 5-phos- 
phate, a substrate in the synthesis of PRPP. Some of the bases are reused to 
form nucleotides by salvage pathways. Others are degraded to products 
that are excreted (Figure 25.17). A deficiency of an enzyme can disrupt 
these pathways, leading to a pathological condition. 


The loss of adenosine deaminase activity results in severe combined 
immunodeficiency 


The pathway for the degradation of AMP includes an extra step
because adenosine is not a substrate for nucleoside phosphorylase. 
First, the phosphate is removed by a nucleotidase to yield the nucleoside 
adenosine (see Figure 25.17). In the extra step, adenosine is deaminated by
adenosine deaminase to form inosine. 

A deficiency in adenosine deaminase activity is associated with some 
forms of severe combined immunodeficiency (SCID), an immunological disor- 
der. Persons with the disorder have severe recurring infections, often lead- 
ing to death at an early age. SCID is characterized by a loss of T cells, which 
are crucial to the immune response (Section 34.5). Although the biochemi- 
cal basis of the disorder is not clearly established, a lack of adenosine 
deaminase results in an increase of 50 to 100 times the normal level of 
dATP, which inhibits ribonucleotide reductase and, consequently, DNA 
synthesis. Moreover, adenosine itself is a powerful signal molecule with a 
role in a number of regulatory pathways. Disruption in the levels of adeno- 
sine may also be deleterious. SCID is often called the “bubble boy disease” 
because its treatment may include complete isolation of the patient from the 
environment. Adenosine deaminase deficiency has been successfully treated 
by gene therapy. 


Gout is induced by high serum levels of urate 


Inosine generated by adenosine deaminase is subsequently metabolized
by nucleoside phosphorylase to hypoxanthine. Xanthine oxidase,
a molybdenum- and iron-containing flavoprotein, oxidizes hypoxanthine 
to xanthine and then to uric acid. Molecular oxygen, the oxidant in both 
reactions, is reduced to H2O2, which is decomposed to H2O and O2 by 
catalase. Uric acid loses a proton at physiological pH to form urate. In 
human beings, urate is the final product of purine degradation and is excreted 
in the urine. 

High serum levels of urate (hyperuricemia) induce the painful joint dis- 
ease gout. In this disease, the sodium salt of urate crystallizes in the fluid and 
lining of the joints (Figure 25.18). The small joint at the base of the big toe 
is a common site for sodium urate buildup, although the salt accumulates
at other joints also. Painful inflammation results when cells of the immune
system engulf the sodium urate crystals. The kidneys, too, may be damaged
by the deposition of urate crystals. Gout is a common medical problem,
affecting 1% of the population of Western countries. It is nine times as
common in men as in women.

Figure 25.18 Micrograph of sodium urate
crystals. The accumulation of these crystals
damages joints and kidneys. [Courtesy of Dr.
James McGuire.]

Administration of allopurinol, an analog of hypoxanthine, is one treat- 
ment for gout. The mechanism of action of allopurinol is interesting: it acts 
first as a substrate and then as an inhibitor of xanthine oxidase. The oxidase 
hydroxylates allopurinol to alloxanthine (oxipurinol), which then remains 
tightly bound to the active site. The binding of alloxanthine keeps the 
molybdenum atom of xanthine oxidase in the +4 oxidation state and prevents
its return to the +6 oxidation state, as in a normal catalytic cycle. We see
here another example of suicide inhibition. 

The synthesis of urate from hypoxanthine and xanthine decreases soon 
after the administration of allopurinol. The serum concentrations of hypox- 
anthine and xanthine rise, and that of urate drops. 


The average serum level of urate in human beings is close to the solubility
limit. In contrast, prosimians (such as lemurs) have 10-fold lower
levels. A striking increase in urate levels occurred in the evolution of pri- 
mates. What is the selective advantage of a urate level so high that it teeters 
on the brink of gout in many people? It turns out that urate has a markedly 
beneficial action. Urate is a highly effective scavenger of reactive oxygen 
species. Indeed, urate is about as effective as ascorbate (vitamin C) as an 
antioxidant. The increased level of urate in human beings compared with 
prosimians and other lower primates may contribute significantly to the 
longer human life span and to lowering the incidence of human cancer. 


Lesch-Nyhan syndrome is a dramatic consequence of mutations
in a salvage-pathway enzyme 


Mutations in genes that encode nucleotide biosynthetic enzymes can
reduce levels of needed nucleotides and can lead to an accumulation 
of intermediates. A nearly total absence of hypoxanthine-guanine phospho- 
ribosyltransferase has unexpected and devastating consequences. The most 
striking expression of this inborn error of metabolism, called the
Lesch-Nyhan syndrome, is compulsive self-destructive behavior. At age 2 or 3, children
with this disease begin to bite their fingers and lips and will chew them off 
if unrestrained. These children also behave aggressively toward others. 
Mental deficiency and spasticity are other characteristics of the Lesch-Nyhan 
syndrome. Elevated levels of urate in the serum lead to the formation of 
kidney stones early in life, followed by the symptoms of gout years later. 
The disease is inherited as a sex-linked recessive disorder. 

The biochemical consequences of the virtual absence of hypoxanthine-guanine
phosphoribosyltransferase are an elevated concentration of PRPP,
a marked increase in the rate of purine biosynthesis by the de novo pathway, 
and an overproduction of urate. The relation between the absence of the 
transferase and the bizarre neurological signs is an enigma, although recent 
evidence suggests that the lack of hypoxanthine-guanine phosphoribosyl- 
transferase results, in some undetermined fashion, in an imbalance of 
key neurotransmitters. The Lesch-Nyhan syndrome demonstrates that 
the salvage pathway for the synthesis of IMP and GMP is not gratuitous. 
Moreover, the Lesch-Nyhan syndrome reveals that abnormal behavior such 
as self-mutilation and extreme hostility can be caused by the absence of a single 
enzyme. Psychiatry will no doubt benefit from the unraveling of the molecu- 
lar basis of such mental disorders. 


Folic acid deficiency promotes birth defects such as spina bifida 


Spina bifida is one of a class of birth defects characterized by the
incomplete or incorrect formation of the neural tube early in develop- 
ment. In the United States, the prevalence of neural-tube defects is approx- 
imately 1 case per 1000 births. A variety of studies have demonstrated that 
the prevalence of neural-tube defects is reduced by as much as 70% when 
women take folic acid as a dietary supplement before and during the first 
trimester of pregnancy. One hypothesis is that more folate derivatives are 
needed for the synthesis of DNA precursors when cell division is frequent 
and substantial amounts of DNA must be synthesized. 
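
The two figures quoted above can be combined into a rough estimate of absolute risk. The Python sketch below does this arithmetic, treating the 70% figure as the best case ("as much as") and assuming, purely for illustration, that it applies uniformly across the population; the function name is hypothetical.

```python
def cases_per_1000(baseline_per_1000, relative_reduction):
    """Prevalence remaining after a given relative reduction."""
    return baseline_per_1000 * (1.0 - relative_reduction)


BASELINE = 1.0          # approximately 1 case per 1000 births (from the text)
BEST_REDUCTION = 0.70   # "as much as 70%" (from the text)

with_folate = cases_per_1000(BASELINE, BEST_REDUCTION)
prevented_per_100k = (BASELINE - with_folate) * 100.0

print(f"Prevalence with supplementation: {with_folate:.1f} per 1000 births")
print(f"Cases prevented: about {prevented_per_100k:.0f} per 100,000 births")
```

In the best case, then, supplementation would lower the prevalence from about 1 to about 0.3 cases per 1000 births, which corresponds to roughly 70 cases prevented per 100,000 births.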


Summary 


25.1 The Pyrimidine Ring Is Assembled de Novo or Recovered by
Salvage Pathways

The pyrimidine ring is assembled first and then linked to ribose 
phosphate to form a pyrimidine nucleotide. 5-Phosphoribosyl-1- 
pyrophosphate is the donor of the ribose phosphate moiety. The syn- 
thesis of the pyrimidine ring starts with the formation of carbamoylas- 
partate from carbamoyl phosphate and aspartate, a reaction catalyzed 
by aspartate transcarbamoylase. Dehydration, cyclization, and oxida- 
tion yield orotate, which reacts with PRPP to give orotidylate. 
Decarboxylation of this pyrimidine nucleotide yields UMP. CTP is 
then formed by the amination of UTP. 


25.2 Purine Bases Can Be Synthesized de Novo or Recycled by
Salvage Pathways

The purine ring is assembled from a variety of precursors: glutamine, 
glycine, aspartate, N10-formyltetrahydrofolate, and CO2. The committed
step in the de novo synthesis of purine nucleotides is the formation
of 5-phosphoribosylamine from PRPP and glutamine. The purine ring 
is assembled on ribose phosphate, in contrast with the de novo synthesis 
of pyrimidine nucleotides. The addition of glycine, followed by formyla- 
tion, amination, and ring closure, yields 5-aminoimidazole ribonucle- 
otide. This intermediate contains the completed five-membered ring 
of the purine skeleton. The addition of CO2, the nitrogen atom of aspartate,
and a formyl group, followed by ring closure, yields inosinate, a
purine ribonucleotide. AMP and GMP are formed from IMP. Purine 
ribonucleotides can also be synthesized by a salvage pathway in which 
a preformed base reacts directly with PRPP. 


25.3 Deoxyribonucleotides Are Synthesized by the Reduction of 
Ribonucleotides Through a Radical Mechanism 
Deoxyribonucleotides, the precursors of DNA, are formed in E. coli 
by the reduction of ribonucleoside diphosphates. These conversions 
are catalyzed by ribonucleotide reductase. Electrons are transferred 
from NADPH to sulfhydryl groups at the active sites of this enzyme 
by thioredoxin. A tyrosyl free radical generated by an iron center in 
the reductase initiates a radical reaction on the sugar, leading to the 
exchange of H for OH at C-2'. TMP is formed by the methylation of 
dUMP. The donor of a methylene group and a hydride in this reaction 
is N5,N10-methylenetetrahydrofolate, which is converted into dihydrofolate.
Tetrahydrofolate is regenerated by the reduction of dihydrofolate
by NADPH. Dihydrofolate reductase, which catalyzes this
reaction, is inhibited by folate analogs such as aminopterin and 
methotrexate. These compounds and fluorouracil, an inhibitor of
thymidylate synthase, are used as anticancer drugs.


25.4 Key Steps in Nucleotide Biosynthesis Are Regulated by
Feedback Inhibition

Pyrimidine biosynthesis in E. coli is regulated by the feedback
inhibition of aspartate transcarbamoylase, the enzyme that catalyzes
the committed step. CTP inhibits and ATP stimulates this enzyme.
The feedback inhibition of glutamine-PRPP amidotransferase by
purine nucleotides is important in regulating their biosynthesis.


25.5 Disruptions in Nucleotide Metabolism Can Cause
Pathological Conditions

Severe combined immunodeficiency results from the absence of
adenosine deaminase, an enzyme in the purine degradation pathway.
Purines are degraded to urate in human beings. Gout, a disease that
affects joints and leads to arthritis, is associated with an excessive
accumulation of urate. The Lesch-Nyhan syndrome, a genetic disease
characterized by self-mutilation, mental deficiency, and gout, is
caused by the absence of hypoxanthine-guanine
phosphoribosyltransferase. This enzyme is essential for the synthesis
of purine nucleotides by the salvage pathway. Neural-tube defects are
more frequent when a pregnant woman is deficient in folate derivatives
early in pregnancy, possibly because of the important role of these
derivatives in the synthesis of DNA precursors.


Key Terms


pyrimidine nucleotide (p. 736)
carbamoyl phosphate synthetase (CPS) (p. 737)
ATP-grasp fold (p. 737)
5-phosphoribosyl-1-pyrophosphate (PRPP) (p. 738)
orotidylate (p. 739)
salvage pathway (p. 740)
purine nucleotide (p. 740)
glutamine phosphoribosyl amidotransferase (p. 741)
ribonucleotide reductase (p. 745)
thymidylate synthase (p. 748)
dihydrofolate reductase (p. 749)
severe combined immunodeficiency (SCID) (p. 753)
gout (p. 753)
Lesch-Nyhan syndrome (p. 754)
spina bifida (p. 755)
neural-tube defect (p. 755)


Problems


1. From the beginning or extract and save and reuse.
Differentiate between the de novo synthesis of nucleotides
and salvage pathway synthesis.


2. Finding their roots 1. Identify the source of the atoms in
the pyrimidine ring.

3. Finding their roots 2. Identify the source of the atoms in
the purine ring.

4. Multifaceted. List some of the biochemical roles played
by nucleotides.


5. An s instead of a t? Differentiate between a nucleoside
and a nucleotide.


6. Associate ’em.

(a) Excessive urate
(b) Lack of adenosine deaminase
(c) Lack of HGPRT
(d) Carbamoyl phosphate
(e) Inosinate
(f) Ribonucleotide reductase
(g) Lack of folic acid
(h) Glutamine phosphoribosyl transferase
(i) Single ring
(j) Bicyclic ring
(k) Precursor to CTP

1. Spina bifida
2. Precursor to both ATP and GTP
3. Purine
4. Deoxynucleotide synthesis
5. UTP
6. Lesch-Nyhan disease
7. Immunodeficiency
8. Pyrimidine
9. Gout
10. First step in pyrimidine synthesis
11. Committed step in purine synthesis


7. Safe passage. What is substrate channeling? How does it 
affect enzyme efficiency? 


8. Activated ribose phosphate. Write a balanced equation for 
the synthesis of PRPP from glucose through the oxidative 
branch of the pentose phosphate pathway. 


9. Making a pyrimidine. Write a balanced equation for the 
synthesis of orotate from glutamine, CO2, and aspartate.


10. Identifying the donor. What is the activated reactant in 
the biosynthesis of each of the following compounds? 


(a) Phosphoribosylamine
(b) Carbamoylaspartate
(c) Orotidylate (from orotate)
(d) Phosphoribosylanthranilate


11. Inhibiting purine biosynthesis. Amidotransferases are 
inhibited by the antibiotic azaserine (O-diazoacetyl-L- 
serine), which is an analog of glutamine. 




Azaserine 


Which intermediates in purine biosynthesis would accu- 
mulate in cells treated with azaserine? 


12. The price of methylation. Write a balanced equation for 
the synthesis of TMP from dUMP that is coupled to the
conversion of serine into glycine.


13. Sulfa action. Bacterial growth is inhibited by sulfanil- 
amide and related sulfa drugs, and there is a concomitant 
accumulation of 5-aminoimidazole-4-carboxamide ribonu- 
cleotide. This inhibition is reversed by the addition of 
p-aminobenzoate. 




Sulfanilamide 


Propose a mechanism for the inhibitory effect of 
sulfanilamide. 


14. HAT medium. Mutant cells unable to synthesize nucle- 
otides by salvage pathways are very useful tools in molecu- 
lar and cell biology. Suppose that cell A lacks thymidine 
kinase, the enzyme catalyzing the phosphorylation of thymidine
to thymidylate, and that cell B lacks hypoxanthine-guanine
phosphoribosyltransferase.


(a) Cell A and cell B do not proliferate in a HAT medium
containing hypoxanthine, aminopterin or amethopterin 
(methotrexate), and thymine. However, cell C, formed by 
the fusion of cells A and B, grows in this medium. Why? 


(b) Suppose that you want to introduce foreign genes into 
cell A. Devise a simple means of distinguishing between 
cells that have taken up foreign DNA and those that 
have not. 


15. Bringing equilibrium. What is the reciprocal substrate 
relation in the synthesis of ATP and GTP? 


16. Find the label. Suppose that cells are grown on amino
acids that have all been labeled at the α carbons with 13C.
Identify the atoms in cytosine and guanine that will be
labeled with 13C.


17. Different strokes. Human beings contain two different 
carbamoyl phosphate synthetase enzymes. One uses glu- 
tamine as a substrate, whereas the other uses ammonia. 
What are the functions of these two enzymes? 


18. Adjunct therapy. Allopurinol is sometimes given to 
patients with acute leukemia who are being treated with 
anticancer drugs. Why is allopurinol used? 


19. A hobbled enzyme. Both side-chain oxygen atoms of 
aspartate 27 at the active site of dihydrofolate reductase form 
hydrogen bonds with the pteridine ring of folates. The impor- 
tance of this interaction was assessed by studying two mutants 
at this position, Asn 27 and Ser 27. The dissociation constant 
of methotrexate was 0.07 nM for the wild type, 1.9 nM for 
the Asn 27 mutant, and 210 nM for the Ser 27 mutant, at 
25°C. Calculate the standard free energy of the binding of 
methotrexate by these three proteins. What is the decrease in 
binding energy resulting from each mutation? 


20. Correcting deficiencies. Suppose that a person is found 
who is deficient in an enzyme required for IMP synthesis. 
How might this person be treated? 

21. Labeled nitrogen. Purine biosynthesis is allowed to take
place in the presence of [15N]aspartate, and the newly
synthesized GTP and ATP are isolated. What positions are
labeled in the two nucleotides?


22. On the trail of carbons. Tissue culture cells were
incubated with glutamine labeled with 15N in the amide group.
Subsequently, IMP was isolated and found to contain some
15N. Which atoms in IMP were labeled?


23. Mechanism of action. What is the biochemical basis of 
allopurinol treatment for gout? 


24. Changed inhibitor. Xanthine oxidase treated with 
allopurinol results in the formation of a new compound that 
is an extremely potent inhibitor of the enzyme. Propose a 
structure for this compound. 


25. Calculate the ATP footprint. How many molecules 
of ATP are required to synthesize one molecule of CTP 
from scratch? 


26. Blockages. What intermediate in purine synthesis will 
accumulate if a strain of bacteria is lacking each of the fol- 
lowing biochemicals? 


(a) Aspartate
(b) Tetrahydrofolate
(c) Glycine
(d) Glutamine


Mechanism Problems


27. The same and not the same. Write out mechanisms for 
the conversion of phosphoribosylamine into glycinamide 
ribonucleotide and of xanthylate into guanylate. 


28. Closing the ring. Propose a mechanism for the conver- 
sion of 5-formamidoimidazole-4-carboxamide ribonucle- 
otide into inosinate. 


Chapter Integration Problems 


29. A generous donor. What major biosynthetic reactions 
utilize PRPP? 


30. They’re everywhere! Nucleotides play a variety of roles 
in the cell. Give an example of a nucleotide that acts in 
each of the following roles or processes. 


(a) Second messenger
(b) Phosphoryl-group transfer
(c) Activation of carbohydrates
(d) Activation of acetyl groups
(e) Transfer of electrons
(f) DNA sequencing
(g) Chemotherapy
(h) Allosteric effector


31. Pernicious anemia. Purine biosynthesis is impaired by
vitamin B12 deficiency. Why? How might fatty acid and
amino acid metabolism also be affected by a vitamin B12
deficiency?


32. Folate deficiency. Suppose someone was suffering from a 
folate deficiency. What cells would you think might be most 
affected? Symptoms may include diarrhea and anemia. 


33. Hyperuricemia. Many patients with glucose 6-phos- 
phatase deficiency have high serum levels of urate. 
Hyperuricemia can be induced in normal people by the 
ingestion of alcohol or by strenuous exercise. Propose a 
common mechanism that accounts for these findings. 


34. Labeled carbon. Succinate uniformly labeled with 14C is
added to cells actively engaged in pyrimidine biosynthesis.
Propose a mechanism by which carbon atoms from succinate
could be incorporated into a pyrimidine. At what positions 
is the pyrimidine labeled? 


35. Something funny going on here. Cells were incubated
with glucose labeled with 14C in carbon 2 (see the structure
below). Later, uracil was isolated and found to contain 14C
in carbons 4 and 6. Account for this labeling pattern.


   CHO
HC—OH
HO—CH
HC—OH
HC—OH
   CH2OH


36. Exercising muscle. Some interesting reactions take 
place in muscle tissue to facilitate the generation of ATP for 
contraction. 

PURINE NUCLEOTIDE CYCLE

AMP + H2O → IMP + NH3   (AMP deaminase)
IMP + aspartate + GTP → adenylosuccinate + GDP + Pi   (adenylosuccinate synthetase)
Adenylosuccinate → AMP + fumarate   (adenylosuccinate lyase)

2 ADP → ATP + AMP   (adenylate kinase)


In muscle contraction, ATP is converted into ADP. 
Adenylate kinase converts two molecules of ADP into a 
molecule of ATP and AMP. 


(a) Why is this reaction beneficial to contracting muscle? 


(b) Why is the equilibrium for the adenylate kinase approx- 
imately equal to 1? 


Muscle can metabolize AMP by using the purine nucle- 
otide cycle. The initial step in this cycle, catalyzed by AMP 
deaminase, is the conversion of AMP into IMP. 


(c) Why might the deamination of AMP facilitate ATP
formation in muscle?


(d) How does the purine nucleotide cycle assist the aero- 
bic generation of ATP? 


37. A common step. What three reactions transfer an amino 
group from aspartate to yield the aminated product and 
fumarate? 


38. Your pet duck. You suspect that your pet duck has gout. 
Why should you think twice before administering a dose of 
allopurinol-laced bread? 


The Biosynthesis of Membrane 
Lipids and Steroids 


This chapter examines the biosynthesis of three important components of
biological membranes: phospholipids, sphingolipids, and cholesterol
(Chapter 12). Triacylglycerols also are considered here because the pathway 
for their synthesis overlaps that of phospholipids. Cholesterol is of interest 
both as a membrane component and as a precursor of many signal mole- 
cules, including the steroid hormones progesterone, testosterone, estrogen, 
and cortisol. 

The transport and uptake of cholesterol vividly illustrate a recurring 
mechanism for the entry of metabolites and signal molecules into cells. 
Cholesterol is transported in blood by the low-density lipoprotein (LDL) 
and taken up into cells by the LDL receptor on the cell surface. The LDL 
receptor is absent in people with familial hypercholesterolemia, a genetic 
disease. People lacking the receptor have markedly elevated cholesterol 
levels in the blood and cholesterol deposits on blood vessels, and they are 
prone to childhood heart attacks. Indeed, cholesterol is implicated in the 
development of atherosclerosis in people who do not have genetic defects. 
Thus, the regulation of cholesterol synthesis and transport can be a source 
of especially clear insight into the role that our understanding of biochem- 
istry plays in medicine. 


CHAPTER 26 


Fats such as the triacylglycerol molecule (below) are widely 
used to store excess energy for later use and to fulfill other 
purposes, illustrated by the insulating blubber of whales. The 
natural tendency of fats to exist in nearly water-free forms
makes these molecules well suited to these roles. [(Left) 
Francois Cohier/Photo Researchers.] 


Glycerol 3-phosphate 


Figure 26.1 PATHWAY INTEGRATION: 
Sources of intermediates in the synthesis 
of triacylglycerols and phospholipids. 
Phosphatidate, synthesized from 
dihydroxyacetone phosphate (DHAP) 
produced in gluconeogenesis and fatty acids, 
can be further processed to produce 
triacylglycerol or phospholipids. Phospholipids 
and other membrane lipids are continuously 
produced in all cells. 


26.1 Phosphatidate Is a Common Intermediate in the 
Synthesis of Phospholipids and Triacylglycerols 


Lipid synthesis requires the coordinated action of gluconeogenesis and fatty 
acid metabolism, as illustrated in Figure 26.1. The first step in the synthe- 
sis of both phospholipids for membranes and triacylglycerols for energy 
storage is the synthesis of phosphatidate (diacylglycerol 3-phosphate). In 
mammalian cells, phosphatidate is synthesized in the endoplasmic reticu- 
lum and the outer mitochondrial membrane. The pathway begins with 
glycerol 3-phosphate, which is formed primarily by the reduction of 
dihydroxyacetone phosphate (DHAP) synthesized by the gluconeogenic 
pathway, and to a lesser extent by the phosphorylation of glycerol. The addition
of two fatty acids to glycerol 3-phosphate yields phosphatidate. First,
acyl coenzyme A contributes a fatty acid chain to form lysophosphatidate 
and, then, a second acyl CoA contributes a fatty acid chain to yield 
phosphatidate. 


Glycerol 3-phosphate + R1CO-CoA → lysophosphatidate + CoA
Lysophosphatidate + R2CO-CoA → phosphatidate + CoA

(R1 is usually saturated; R2 is usually unsaturated.)


These acylations are catalyzed by glycerol phosphate acyltransferase. In most 
phosphatidates, the fatty acid chain attached to the C-1 atom is saturated, 
whereas the one attached to the C-2 atom is unsaturated. Phosphatidate can 
also be synthesized from diacylglycerol, in what is essentially a salvage 
pathway, by the action of diacylglycerol kinase: 


Diacylglycerol + ATP → phosphatidate + ADP


Figure 26.1 (labels). Active pathways: 1. Gluconeogenesis, Chapter 16;
2. Triacylglycerol breakdown, Chapter 22; 3. Triacylglycerol synthesis,
Chapter 26; 4. Phospholipid synthesis, Chapter 26. *For phospholipid
synthesis, either phosphatidate or the alcohol must be activated by
reaction with an NTP.


The phospholipid and triacylglycerol pathways diverge at phosphatidate.
In the synthesis of triacylglycerols, a key enzyme in the regulation of
lipid synthesis, phosphatidic acid phosphatase, hydrolyzes phosphatidate to
give a diacylglycerol (DAG). This intermediate is acylated to a triacylglycerol
through the addition of a third fatty acid chain in a reaction that is catalyzed
by diglyceride acyltransferase. Both enzymes are associated in a triacylglycerol
synthetase complex that is bound to the endoplasmic reticulum membrane.


Phosphatidate + H2O → diacylglycerol (DAG) + Pi

Diacylglycerol (DAG) + R3CO-CoA → triacylglycerol + CoA


The liver is the primary site of triacylglycerol synthesis. From the liver, the 
triacylglycerols are transported to the muscles for energy conversion or to 
the adipose cells for storage. 
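
The route from glycerol 3-phosphate to triacylglycerol described above can be sketched as a small reaction graph. The Python below is only an illustration: the dictionary layout and the `route` helper are assumptions made for this example, whereas the compounds, enzymes, and cosubstrates are the ones named in the text.

```python
from collections import deque

# substrate -> list of (product, enzyme, cosubstrate consumed)
REACTIONS = {
    "glycerol 3-phosphate": [
        ("lysophosphatidate", "glycerol phosphate acyltransferase", "acyl CoA"),
    ],
    "lysophosphatidate": [
        ("phosphatidate", "glycerol phosphate acyltransferase", "acyl CoA"),
    ],
    "phosphatidate": [
        ("diacylglycerol", "phosphatidic acid phosphatase", "H2O"),
    ],
    "diacylglycerol": [
        ("triacylglycerol", "diglyceride acyltransferase", "acyl CoA"),
        ("phosphatidate", "diacylglycerol kinase", "ATP"),  # salvage route
    ],
}


def route(start, goal):
    """Breadth-first search for the shortest series of steps from start to goal."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        compound, steps = queue.popleft()
        if compound == goal:
            return steps
        for product, enzyme, cosubstrate in REACTIONS.get(compound, []):
            if product not in seen:
                seen.add(product)
                step = (compound, product, enzyme, cosubstrate)
                queue.append((product, steps + [step]))
    return None  # goal cannot be reached from start


steps = route("glycerol 3-phosphate", "triacylglycerol")
acyl_coa_used = sum(1 for step in steps if step[3] == "acyl CoA")

for substrate, product, enzyme, cosubstrate in steps:
    print(f"{substrate} + {cosubstrate} -> {product}  [{enzyme}]")
```

Counting the acyl CoA entries along the returned route gives three, one for each fatty acid chain of the triacylglycerol.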


The synthesis of phospholipids requires an activated intermediate 


Membrane-lipid synthesis continues in the endoplasmic reticulum and in 
the Golgi apparatus. Phospholipid synthesis requires the combination of a 
diacylglycerol with an alcohol. As in most anabolic reactions, one of the 
components must be activated. In this case, either the diacylglycerol or the 
alcohol may be activated, depending on the source of the reactants. 


Synthesis from an activated diacylglycerol. The de novo pathway starts 
with the reaction of phosphatidate with cytidine triphosphate (CTP) to 
form the activated diacylglycerol, cytidine diphosphodiacylglycerol (CDP- 
diacylglycerol; Figure 26.2). This reaction, like those of many biosyntheses, 
is driven forward by the hydrolysis of pyrophosphate. 


Figure 26.2 Structure of 
CDP-diacylglycerol. A key intermediate in 
the synthesis of phospholipids consists of 
phosphatidate and cytidine monophosphate 
joined by a pyrophosphate linkage. 
Phosphatidate + CTP → CDP-diacylglycerol + PPi


The activated phosphatidyl unit then reacts with the hydroxyl group of 
an alcohol to form a phosphodiester linkage. If the alcohol is inositol, the 
products are phosphatidylinositol and cytidine monophosphate (CMP). 
Subsequent phosphorylations catalyzed by specific kinases lead to the
synthesis of phosphatidylinositol 4,5-bisphosphate, the precursor of two 
intracellular messengers—diacylglycerol and inositol 1,4,5-trisphosphate 
(Section 14.2). If the alcohol is phosphatidylglycerol, the products are 
diphosphatidylglycerol (cardiolipin) and CMP. In eukaryotes, cardiolipin is 
located exclusively in inner mitochondrial membranes and plays an impor- 
tant role in the organization of the protein components of oxidative phos- 
phorylation. For example, cardiolipin is required for the full activity of 
cytochrome oxidase. 
[Structure: diphosphatidylglycerol (cardiolipin)]


The fatty acid components of phospholipids may vary, and thus cardiolipin,
as well as most other phospholipids, represents a class of molecules
rather than a single species. As a result, a single mammalian cell may
contain thousands of distinct phospholipids. Phosphatidylinositol is unusual in
that it has a nearly fixed fatty acid composition. Stearic acid usually
occupies the C-1 position and arachidonic acid the C-2 position.


Synthesis from an activated alcohol. Phosphatidylethanolamine, the 
major phospholipid of the inner leaflet of cell membranes, is synthesized 
from the alcohol ethanolamine. To activate the alcohol, ethanolamine is 
phosphorylated by ATP to form the precursor, phosphorylethanolamine. 
This precursor then reacts with CTP to form the activated alcohol, CDP- 
ethanolamine. The phosphorylethanolamine unit of CDP-ethanolamine is 
transferred to a diacylglycerol to form phosphatidylethanolamine. 

The most common phospholipid in mammals is phosphatidylcholine. 
In this case, dietary choline is activated in a series of reactions analogous to 
those in the activation of ethanolamine. Interestingly, the liver possesses an 
enzyme, phosphatidylethanolamine methyltransferase, which synthesizes 
phosphatidylcholine from phosphatidylethanolamine when dietary choline 
is insufficient. The amino group of this phosphatidylethanolamine is 
methylated three times to form phosphatidylcholine. S-Adenosylmethionine is
the methyl donor.


Phosphatidylethanolamine + 3 S-adenosylmethionine →
phosphatidylcholine + 3 S-adenosylhomocysteine


Thus, phosphatidylcholine can be produced by two distinct pathways in 
mammals, ensuring that this phospholipid can be synthesized even if the 
components for one pathway are in limited supply. 

Phosphatidylserine makes up 10% of the phospholipids in mammals. 
This phospholipid is synthesized in a base-exchange reaction of serine with 
phosphatidylcholine or phosphatidylethanolamine. In the reaction, serine 
replaces choline or ethanolamine. 


Phosphatidylcholine + serine → choline + phosphatidylserine


Phosphatidylethanolamine + serine → ethanolamine + phosphatidylserine


Phosphatidylserine is normally located in the inner leaflet of the plasma 
membrane bilayer but is moved to the outer leaflet in apoptosis (Section 18.6). 
There, it may serve to attract phagocytes to consume the cell remnants
after apoptosis is complete. 

Note that a cytidine nucleotide plays the same role in the synthesis of 
these phosphoglycerides as a uridine nucleotide does in the formation of 
glycogen (Section 21.4). In all of these biosyntheses, an activated inter- 
mediate (UDP-glucose, CDP-diacylglycerol, or CDP-alcohol) is formed 
from a phosphorylated substrate (glucose 1-phosphate, phosphatidate, or a 
phosphorylalcohol) and a nucleoside triphosphate (UTP or CTP). The 
activated intermediate then reacts with a hydroxyl group (the terminus of 
glycogen, the side chain of serine, or a diacylglycerol). 
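
The synthesis routes described in this section can be collected into a small lookup table. The sketch below is an illustration only: the `ROUTES` layout and the two helper functions are assumptions of this example, and CDP-choline is named by analogy with CDP-ethanolamine, as the text implies, rather than being spelled out there.

```python
# Routes to the phosphoglycerides discussed above, grouped by strategy.
# "donor" is the activated or incoming partner; "acceptor" is what it reacts with.
ROUTES = {
    "phosphatidylinositol": [
        {"strategy": "activated diacylglycerol",
         "donor": "CDP-diacylglycerol", "acceptor": "inositol"},
    ],
    "cardiolipin": [
        {"strategy": "activated diacylglycerol",
         "donor": "CDP-diacylglycerol", "acceptor": "phosphatidylglycerol"},
    ],
    "phosphatidylethanolamine": [
        {"strategy": "activated alcohol",
         "donor": "CDP-ethanolamine", "acceptor": "diacylglycerol"},
    ],
    "phosphatidylcholine": [
        {"strategy": "activated alcohol",
         "donor": "CDP-choline", "acceptor": "diacylglycerol"},
        {"strategy": "methylation",
         "donor": "S-adenosylmethionine (3 per product)",
         "acceptor": "phosphatidylethanolamine"},
    ],
    "phosphatidylserine": [
        {"strategy": "base exchange",
         "donor": "serine", "acceptor": "phosphatidylcholine"},
        {"strategy": "base exchange",
         "donor": "serine", "acceptor": "phosphatidylethanolamine"},
    ],
}


def strategies(lipid):
    """Distinct strategies by which a given phospholipid is made."""
    return sorted({r["strategy"] for r in ROUTES[lipid]})


def lipids_using(strategy):
    """Phospholipids that can be made by the given strategy."""
    return sorted(
        lipid for lipid, routes in ROUTES.items()
        if any(r["strategy"] == strategy for r in routes)
    )


print(strategies("phosphatidylcholine"))
print(lipids_using("activated diacylglycerol"))
```

Looking up phosphatidylcholine returns two strategies, which mirrors the point made above that this lipid can still be synthesized when the components of one pathway are limiting.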


Sphingolipids are synthesized from ceramide 


We now turn from glycerol-based phospholipids to another class of
membrane lipid: the sphingolipids. These lipids are found in the plasma
membranes of all eukaryotic cells, although the concentration is highest in
the cells of the central nervous system. The backbone of a sphingolipid is
sphingosine, rather than glycerol. Palmitoyl CoA and serine condense to
form 3-ketosphinganine. This reaction, catalyzed by serine palmitoyl
transferase, is the rate-limiting step in the pathway and requires pyridoxal
phosphate, revealing again the dominant role of this cofactor in
transformations that include amino acids. Ketosphinganine is then reduced to
dihydrosphingosine before conversion into ceramide, a lipid consisting of a
fatty acid chain attached to the amino group of a sphingosine backbone
(Figure 26.3).

Figure 26.3 Synthesis of ceramide. Palmitoyl CoA and serine
combine to initiate the synthesis of ceramide.

In all sphingolipids, the amino group of ceramide is acylated (Figure 26.4).
The terminal hydroxyl group also is substituted. In sphingomyelin, a com- 
ponent of the myelin sheath covering many nerve fibers, the substituent is 
phosphorylcholine, which comes from phosphatidylcholine. In a cerebroside, 
the substituent is glucose or galactose. UDP-glucose or UDP-galactose is 
the sugar donor. 


Gangliosides are carbohydrate-rich sphingolipids that 
contain acidic sugars 


Gangliosides are the most complex sphingolipids. In a ganglioside, an oligo- 
saccharide chain is linked to the terminal hydroxyl group of ceramide by a 
glucose residue (Figure 26.5). This oligosaccharide chain contains at least 
one acidic sugar, either N-acetylneuraminate or N-glycolylneuraminate. 
These acidic sugars are called sialic acids. Their nine-carbon backbones 
are synthesized from phosphoenolpyruvate (a three-carbon unit) and 
N-acetylmannosamine 6-phosphate (a six-carbon unit). 

Gangliosides are synthesized by the ordered, step-by-step addition of 
sugar residues to ceramide. The synthesis of these complex lipids requires 
the activated sugars UDP-glucose, UDP-galactose, and UDP-N-acetylga- 
lactosamine, as well as the CMP derivative of N-acetylneuraminate. CMP- 
N-acetylneuraminate is synthesized from CTP and N-acetylneuraminate. 
The sugar composition of the resulting ganglioside is determined by the 
specificity of the glycosyltransferases in the cell. More than 60 different
gangliosides have been characterized (see Figure 26.5 for the composition of
ganglioside GM1). Ganglioside binding by cholera toxin is the first step in
the development of cholera, a pathological condition characterized by
severe diarrhea. Gangliosides are also crucial for binding immune-system
cells to sites of injury in the inflammatory response.

Figure 26.4 Synthesis of sphingolipids. Ceramide is the starting point for
the formation of sphingomyelin and gangliosides.


Sphingolipids confer diversity on lipid structure and function 


The structures of sphingolipids and the more abundant glycerophospholipids
are very similar. Given the structural similarity of these
two types of lipids, why are sphingolipids required at all? Indeed, the prefix 
“sphingo” was applied to capture the “sphinxlike” properties of this enig- 
matic class of lipids. Although the precise role of sphingolipids is not 
firmly established, progress toward solving the riddle of their function is 
being made. As discussed in Chapter 12, sphingolipids are important com- 
ponents of lipid rafts, highly organized regions of the plasma membrane 
that are important in signal transduction. Sphingosine, sphingosine 1-phos- 
phate, and ceramide serve as second messengers in the regulation of cell 
growth, differentiation, and death. For instance, ceramide derived from a 
sphingolipid initiates programmed cell death in some cell types and may 
contribute to the development of type 2 diabetes (Chapter 27). 


Respiratory distress syndrome and Tay-Sachs disease result from the 
disruption of lipid metabolism 


Respiratory distress syndrome is a pathological condition resulting from
a failure in the biosynthetic pathway for dipalmitoyl phosphatidylcholine.
This phospholipid, in conjunction with specific proteins and other
phospholipids, is found in the extracellular fluid that surrounds the alveoli of
the lung. Its function is to decrease the surface tension of the fluid to prevent
lung collapse at the end of the expiration phase of breathing. Premature
infants may suffer from respiratory distress syndrome because their immature
lungs do not synthesize enough dipalmitoyl phosphatidylcholine.

Figure 26.5 Ganglioside GM1. This
ganglioside consists of five monosaccharides
linked to ceramide: one glucose (Glc)
molecule, two galactose (Gal) molecules, one
N-acetylgalactosamine (GalNAc) molecule,
and one N-acetylneuraminate (NAN)
molecule.

Figure 26.6 Lysosome with lipids. An
electron micrograph of a lysosome containing
an abnormal amount of lipid. [Courtesy of
Dr. George Palade.]

Figure 26.7 Regulation of lipid synthesis.
Phosphatidic acid phosphatase is the key
regulatory enzyme in lipid synthesis. When
active, PAP generates diacylglycerol, which can
react with activated alcohols to form
phospholipids or with fatty acyl CoA to form
triacylglycerols. When PAP is inactive,
phosphatidate is converted into CDP-DAG for
the synthesis of different phospholipids. PAP
also controls the amount of DAG and
phosphatidate, both of which function as
second messengers.

Tay-Sachs disease is caused by a failure of lipid degradation: an inability
to degrade gangliosides. Gangliosides are found in highest concentration
in the nervous system, particularly in gray matter, where they constitute 6%
of the lipids. Gangliosides are normally degraded inside lysosomes by the
sequential removal of their terminal sugars but, in Tay-Sachs disease, this
degradation does not take place. As a consequence, neurons become
significantly swollen with lipid-filled lysosomes (Figure 26.6). An affected
infant displays weakness and retarded psychomotor skills before 1 year of
age. The child is demented and blind by age 2 and usually dies before age 3.

The ganglioside content of the brain of an infant with Tay-Sachs disease
is greatly elevated. The concentration of ganglioside GM2 is many times higher
than normal because its terminal N-acetylgalactosamine residue is removed
very slowly or not at all. The missing or deficient enzyme is a specific
β-N-acetylhexosaminidase.


Ganglioside GM2 + H2O → ganglioside GM3 + GalNAc


Tay-Sachs disease can be diagnosed in the course of fetal development.
Amniotic fluid is obtained by amniocentesis and assayed for
β-N-acetylhexosaminidase activity.
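
The blocked step in Tay-Sachs disease can be illustrated with simple sugar bookkeeping. The sketch below starts from the GM1 composition given in the Figure 26.5 legend. The function names are hypothetical, and the statement that GM2 is GM1 without its terminal galactose comes from general ganglioside nomenclature rather than from this passage, so treat it as an assumption of the example.

```python
from collections import Counter

# Sugar composition of ganglioside GM1 (Figure 26.5 legend).
GM1 = Counter({"Glc": 1, "Gal": 2, "GalNAc": 1, "NAN": 1})


def remove_sugar(composition, sugar):
    """Return a new composition with one residue of `sugar` removed."""
    if composition[sugar] < 1:
        raise ValueError(f"no {sugar} residue to remove")
    result = composition.copy()
    result[sugar] -= 1
    return +result  # unary plus drops entries whose count fell to zero


# Assumption (general nomenclature): GM2 is GM1 minus its terminal galactose.
GM2 = remove_sugar(GM1, "Gal")
# Removing the terminal GalNAc of GM2 gives GM3.
GM3 = remove_sugar(GM2, "GalNAc")


def hexosaminidase_step(composition, enzyme_active):
    """GM2 loses its terminal GalNAc only when the enzyme is active."""
    if enzyme_active:
        return remove_sugar(composition, "GalNAc")
    return composition  # substrate persists and accumulates


print(dict(hexosaminidase_step(GM2, enzyme_active=True)))
print(dict(hexosaminidase_step(GM2, enzyme_active=False)))
```

With the enzyme inactive, the function returns GM2 unchanged, which corresponds to the accumulation of ganglioside GM2 described above.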


Phosphatidic acid phosphatase is a key regulatory
enzyme in lipid metabolism 


Although the details of the regulation of lipid synthesis remain to be
elucidated, evidence suggests that phosphatidic acid phosphatase (PAP), working
in concert with diacylglycerol kinase, plays a key role in the regulation of
lipid synthesis. PAP, also called lipin 1 in mammals, controls the extent to
which triacylglycerols are synthesized relative to phospholipids and regulates
the type of phospholipid synthesized (Figure 26.7). For instance, when PAP
activity is high, phosphatidate is dephosphorylated and diacylglycerol is
produced, which can react with the appropriate activated alcohols to yield
phosphatidylethanolamine, phosphatidylserine, or phosphatidylcholine.
Diacylglycerol can also be converted into triacylglycerols, and evidence
suggests that the formation of triacylglycerols may act as a fatty acid buffer
that helps to regulate the levels of diacylglycerol and sphingolipids, both of
which serve signaling functions.

When PAP activity is lower, phosphatidate is used as a precursor for 
different phospholipids, such as phosphatidylinositol and cardiolipin. 
Moreover, phosphatidate is a signal molecule itself. Phosphatidate regulates 
the growth of endoplasmic reticulum and nuclear membranes and acts as a 
cofactor that stimulates the expression of genes in phospholipid synthesis. 

What are the signal molecules that regulate the activity of PAP? CDP- 
diacylglycerol, phosphatidylinositol, and cardiolipin enhance PAP activity, 
and sphingosine and dihydrosphingosine inhibit it. 

Studies in mice clearly show the importance of PAP for the regulation of 
fatty acid synthesis. The loss of PAP function prevents normal adipose- 
tissue development, leading to lipodystrophy (severe loss of body fat) and 
insulin resistance. Excess PAP activity results in obesity. Understanding 
the regulation of phospholipid synthesis is an exciting area of research that 
will be active for some time to come. 
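
The qualitative picture of PAP regulation given above can be written down as a pair of lookups. This is a sketch only: real regulation is graded rather than on or off, and the function names and the two-state simplification are assumptions of this example.

```python
# Signal molecules named in the text as regulators of PAP activity.
PAP_ACTIVATORS = {"CDP-diacylglycerol", "phosphatidylinositol", "cardiolipin"}
PAP_INHIBITORS = {"sphingosine", "dihydrosphingosine"}


def effect_on_pap(molecule):
    """Report the effect of a molecule on PAP activity, as stated in the text."""
    if molecule in PAP_ACTIVATORS:
        return "enhances"
    if molecule in PAP_INHIBITORS:
        return "inhibits"
    return "not described"


def favored_products(pap_activity_high):
    """Lipids favored when PAP activity is high or low (two-state simplification)."""
    if pap_activity_high:
        # Phosphatidate is dephosphorylated to diacylglycerol.
        return ["phosphatidylethanolamine", "phosphatidylserine",
                "phosphatidylcholine", "triacylglycerol"]
    # Phosphatidate itself is used as the precursor.
    return ["phosphatidylinositol", "cardiolipin"]


print(effect_on_pap("cardiolipin"))
print(favored_products(pap_activity_high=False))
```

One feature the lookups make visible is that the lipids favored at low PAP activity, phosphatidylinositol and cardiolipin, are themselves listed as enhancers of PAP activity.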


26.2 Cholesterol Is Synthesized from Acetyl 
Coenzyme A in Three Stages 
We now turn our attention to the synthesis of the fundamental
lipid cholesterol. This steroid modulates the fluidity of animal
cell membranes (Section 12.5) and is the precursor of steroid
hormones such as progesterone, testosterone, estradiol, and cortisol.
All 27 carbon atoms of cholesterol are derived from acetyl
CoA in a three-stage synthetic process (Figure 26.8).

1. Stage one is the synthesis of isopentenyl pyrophosphate, an activated
isoprene unit that is the key building block of cholesterol.

2. Stage two is the condensation of six molecules of isopentenyl
pyrophosphate to form squalene.

3. In stage three, squalene cyclizes and the tetracyclic product
is subsequently converted into cholesterol.

The first stage takes place in the cytoplasm, and the second and third
in the endoplasmic reticulum.

Figure 26.8 Labeling of cholesterol. Isotope-labeling
experiments reveal the source of carbon atoms in cholesterol
synthesized from acetate labeled in its methyl group (blue) or
carboxylate atom (red).


The synthesis of mevalonate, which is activated as isopentenyl 
pyrophosphate, initiates the synthesis of cholesterol 


The first stage in the synthesis of cholesterol is the formation of isopente- 
nyl pyrophosphate from acetyl CoA. This set of reactions starts with the 
formation of 3-hydroxy-3-methylglutaryl CoA (HMG CoA) from acetyl 
CoA and acetoacetyl CoA. This intermediate is reduced to mevalonate for 
the synthesis of cholesterol (Figure 26.9). Recall that, alternatively, 
3-hydroxy-3-methylglutaryl CoA may be generated in the mitochondria 
and processed to form ketone bodies, which are subsequently secreted to 
provide fuel for other tissues, notably the brain under starvation conditions 
(Section 22.3). 

The synthesis of mevalonate is the committed step in cholesterol formation. 
The enzyme catalyzing this irreversible step, 3-hydroxy-3-methylglutaryl 
CoA reductase (HMG-CoA reductase), is an important control site in 
cholesterol biosynthesis, as will be discussed shortly. 


Cholesterol 


“Cholesterol is the most highly decorated 
small molecule in biology. Thirteen Nobel 
Prizes have been awarded to scientists who 
devoted major parts of their careers to 
cholesterol. Ever since it was isolated from 
gallstones in 1784, cholesterol has exerted 
an almost hypnotic fascination for scientists 
from the most diverse areas of science and 
medicine. . . . Cholesterol is a Janus-faced
molecule. The very property that makes it
useful in cell membranes, namely its
absolute insolubility in water, also makes it
lethal.”
—Michael Brown and Joseph Goldstein, 
on the occasion of their receipt of a 
Nobel Prize for elucidating the control 
of blood levels of cholesterol. 
Nobel Lectures (1985) 
© The Nobel Foundation, 1985 
Figure 26.9 Fates of 3-hydroxy-3-methylglutaryl CoA. In the cytoplasm,
HMG-CoA is converted into mevalonate. In
mitochondria, it is converted into acetyl CoA
and acetoacetate.


3-Hydroxy-3-methylglutaryl CoA + 2 NADPH + 2 H+ →
mevalonate + 2 NADP+ + CoA


HMG-CoA reductase is an integral membrane protein in the endoplasmic
reticulum.

Mevalonate is converted into 3-isopentenyl pyrophosphate in three
consecutive reactions requiring ATP (Figure 26.10). In the last step, the
release of CO2 yields isopentenyl pyrophosphate, an activated isoprene unit
that is a key building block for many important biomolecules throughout
the kingdoms of life.

[Structure: isoprene]


Mevalonate → 5-phosphomevalonate → 5-pyrophosphomevalonate →
3-isopentenyl pyrophosphate


Figure 26.10 Synthesis of isopentenyl
pyrophosphate. This activated intermediate
is formed from mevalonate in three steps
requiring ATP, followed by a decarboxylation.


Squalene (C30) is synthesized from six molecules of isopentenyl
pyrophosphate (C5)


Squalene is synthesized from isopentenyl pyrophosphate by the reaction
sequence

C5 → C10 → C15 → C30


This stage in the synthesis of cholesterol starts with the isomerization of
isopentenyl pyrophosphate to dimethylallyl pyrophosphate.


Isopentenyl pyrophosphate ⇌ dimethylallyl pyrophosphate
Figure 26.11 Condensation mechanism in cholesterol synthesis. The mechanism for joining
dimethylallyl pyrophosphate and isopentenyl pyrophosphate to form geranyl pyrophosphate. The
same mechanism is used to add an additional isopentenyl pyrophosphate to form farnesyl
pyrophosphate.


These two isomeric C5 units (one of each type) condense
to form a C10 compound: isopentenyl pyrophosphate
attacks an allylic carbocation formed from dimethylallyl
pyrophosphate to yield geranyl pyrophosphate
(Figure 26.11). The same kind of reaction takes place
again: geranyl pyrophosphate is converted into an allylic
carbocation, which is attacked by isopentenyl pyrophosphate.
The resulting C15 compound is called farnesyl
pyrophosphate. The same enzyme, geranyl transferase,
catalyzes each of these condensations.

The last step in the synthesis of squalene is a reductive 
tail-to-tail condensation of two molecules of farnesyl 
pyrophosphate catalyzed by the endoplasmic reticulum 
enzyme squalene synthase. 


2 Farnesyl pyrophosphate (C15) + NADPH →
squalene (C30) + 2 PPi + NADP+ + H+


The reactions leading from C5 units to squalene, a C30
isoprenoid, are summarized in Figure 26.12.
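
The C5 → C10 → C15 → C30 sequence is easy to verify by counting carbons. The sketch below tracks carbon numbers only and ignores the chemistry; the `Isoprenoid` class and the `condense` helper are hypothetical names introduced for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Isoprenoid:
    name: str
    carbons: int


IPP = Isoprenoid("isopentenyl pyrophosphate", 5)
DMAPP = Isoprenoid("dimethylallyl pyrophosphate", 5)


def condense(first, second, name):
    """Join two units; only the carbon count is tracked."""
    return Isoprenoid(name, first.carbons + second.carbons)


# Dimethylallyl pyrophosphate + isopentenyl pyrophosphate -> C10
geranyl = condense(DMAPP, IPP, "geranyl pyrophosphate")
# Geranyl pyrophosphate + isopentenyl pyrophosphate -> C15
farnesyl = condense(geranyl, IPP, "farnesyl pyrophosphate")
# Reductive tail-to-tail coupling of two farnesyl pyrophosphates -> C30
squalene = condense(farnesyl, farnesyl, "squalene")

c5_units_per_squalene = squalene.carbons // IPP.carbons

for compound in (geranyl, farnesyl, squalene):
    print(f"{compound.name}: C{compound.carbons}")
print("C5 units per squalene:", c5_units_per_squalene)
```

The final count of six C5 units per squalene matches stage two of the pathway as stated at the start of this section.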


Squalene cyclizes to form cholesterol 


The final stage of cholesterol biosynthesis starts with the
cyclization of squalene (Figure 26.13). Squalene is first
activated by conversion into squalene epoxide (2,3-oxidosqualene)
in a reaction that uses O2 and NADPH.
Squalene epoxide is then cyclized to lanosterol by
oxidosqualene cyclase. This remarkable transformation
proceeds in a concerted fashion. The enzyme holds squalene
epoxide in an appropriate conformation and initiates the
reaction by protonating the epoxide oxygen. The carbocation
formed spontaneously rearranges to produce
lanosterol. Lanosterol is converted into cholesterol
in a multistep process by the removal
of three methyl groups, the reduction of one
double bond by NADPH, and the migration of the other double bond
(Figure 26.14).

Figure 26.12 Squalene synthesis. One molecule of
dimethylallyl pyrophosphate and two molecules of
isopentenyl pyrophosphate condense to form farnesyl
pyrophosphate. The tail-to-tail coupling of two molecules
of farnesyl pyrophosphate yields squalene.

Figure 26.13 Squalene cyclization. The
formation of the steroid nucleus from
squalene begins with the formation of
squalene epoxide. This intermediate is
protonated to form a carbocation that cyclizes
to form a tetracyclic structure, which
rearranges to form lanosterol.

Figure 26.14 Cholesterol formation.
Lanosterol is converted into cholesterol in a
complex process.
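
The stoichiometries quoted in this section can be combined into a rough budget for one molecule of squalene and a carbon check for cholesterol. The sketch below is a bookkeeping exercise under stated assumptions: acetoacetyl CoA is counted as two acetyl CoA (general knowledge, not stated in this passage), and the NADPH used after squalene, in epoxidation and in the conversion of lanosterol, is left out. The function and constant names are hypothetical.

```python
IPP_PER_SQUALENE = 6            # six C5 units condense to form squalene
ACETYL_COA_PER_MEVALONATE = 3   # acetyl CoA + acetoacetyl CoA (assumed = 2 acetyl CoA)
NADPH_PER_MEVALONATE = 2        # HMG-CoA reductase reaction
ATP_PER_IPP = 3                 # three ATP-requiring steps from mevalonate
CO2_PER_IPP = 1                 # decarboxylation in the last of those steps
NADPH_FOR_SQUALENE_SYNTHASE = 1 # reductive coupling of two farnesyl pyrophosphates
CARBONS_PER_ACETYL_UNIT = 2
METHYL_GROUPS_REMOVED = 3       # lanosterol -> cholesterol


def squalene_requirements():
    """Inputs consumed and CO2 released in making one squalene from acetyl CoA."""
    return {
        "acetyl CoA": IPP_PER_SQUALENE * ACETYL_COA_PER_MEVALONATE,
        "ATP": IPP_PER_SQUALENE * ATP_PER_IPP,
        "NADPH": IPP_PER_SQUALENE * NADPH_PER_MEVALONATE
                 + NADPH_FOR_SQUALENE_SYNTHASE,
        "CO2 released": IPP_PER_SQUALENE * CO2_PER_IPP,
    }


requirements = squalene_requirements()

# Carbon balance: acetyl carbons in, minus carbons lost as CO2.
squalene_carbons = (requirements["acetyl CoA"] * CARBONS_PER_ACETYL_UNIT
                    - requirements["CO2 released"])
cholesterol_carbons = squalene_carbons - METHYL_GROUPS_REMOVED

print(requirements)
print("squalene carbons:", squalene_carbons)
print("cholesterol carbons:", cholesterol_carbons)
```

The balance returns 30 carbons for squalene and 27 for cholesterol, consistent with the statement that all 27 carbon atoms of cholesterol derive from acetyl CoA.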


26.3 The Complex Regulation of Cholesterol Biosynthesis 
Takes Place at Several Levels 


Cholesterol can be obtained from the diet or it can be synthesized de novo. 
Cholesterol biosynthesis is one of the most highly regulated metabolic path- 
ways known. Biosynthetic rates may vary several hundredfold, depending 
on how much cholesterol is consumed in the diet. An adult on a low- 
cholesterol diet typically synthesizes about 800 mg of cholesterol per day. 
The liver is the major site of cholesterol synthesis in mammals, although the 
intestine also forms significant amounts. The rate of cholesterol formation 
by these organs is highly responsive to the cellular level of cholesterol. This 
feedback regulation is mediated primarily by changes in the amount and 
activity of 3-hydroxy-3-methylglutaryl CoA reductase. As described earlier 
(p. 767), this enzyme catalyzes the formation of mevalonate, the committed 
step in cholesterol biosynthesis. HMG CoA reductase is controlled in 
multiple ways: 


1. The rate of synthesis of reductase mRNA is controlled by the sterol regula- 
tory element binding protein (SREBP). This transcription factor binds to a 
short DNA sequence called the sterol regulatory element (SRE) on the 5’ side 
of the reductase gene. It binds to the SRE when cholesterol levels are low 
and enhances transcription. In its inactive state, the SREBP resides in the 
endoplasmic reticulum membrane, where it is associated with the SREBP 
cleavage activating protein (SCAP), an integral membrane protein. SCAP 
is the cholesterol sensor. When cholesterol levels fall, SCAP escorts SREBP 
in small membrane vesicles to the Golgi complex, where it is released from 
the membrane by two specific proteolytic cleavages (Figure 26.15). The 
first cleavage frees a fragment of SREBP from SCAP, whereas the second
cleavage releases the regulatory domain from the membrane. The released
protein migrates to the nucleus and binds the SRE of the HMG-CoA
reductase gene, as well as several other genes in the cholesterol biosynthetic
pathway, to enhance transcription. When cholesterol levels rise, the
proteolytic release of the SREBP is blocked, and the SREBP in the nucleus is
rapidly degraded. These two events halt the transcription of genes of the
cholesterol biosynthetic pathway.

Figure 26.15 The SREBP pathway. SREBP resides in the endoplasmic reticulum, where it is
bound to SCAP by its regulatory (Reg) domain. When cholesterol levels fall, SCAP and SREBP
move to the Golgi complex, where SREBP undergoes successive proteolytic cleavages by a
serine protease and a metalloprotease. The released DNA-binding domain moves to the
nucleus to alter gene expression. [After an illustration provided by Dr. Michael Brown and
Dr. Joseph Goldstein.]

What is the molecular mechanism that retains SCAP-SREBP in the 
ER when cholesterol is present but allows movement to the Golgi complex 
when cholesterol concentration is low? When cholesterol is low, SCAP 
binds to vesicular proteins that facilitate the transport of SCAP-SREBP to 
the Golgi apparatus, as described above. When cholesterol is present,
SCAP binds cholesterol, which causes a structural change in SCAP so that 
it binds to another endoplasmic reticulum protein called Insig (Figure 26.16). 
Insig is the anchor that retains SCAP and thus SREBP in the endoplasmic 
reticulum in the presence of cholesterol. The interactions between SCAP 
and Insig can also be forged when Insig binds 25-hydroxycholesterol, a 
metabolite of cholesterol. Thus, two distinct steroid-protein interactions
serve to prevent the inappropriate movement of SCAP-SREBP to the Golgi 
complex. 


2. The rate of translation of reductase mRNA is inhibited by nonsterol 
metabolites derived from mevalonate. 


3. The degradation of the reductase is stringently controlled. The enzyme is
bipartite: its cytoplasmic domain carries out catalysis and its membrane
domain senses signals that lead to its degradation. The membrane domain
may undergo structural changes in response to increasing concentrations of
sterols such as lanosterol and 25-hydroxycholesterol. Under these conditions,
the reductase appears to bind to a subset of Insigs that are also associated
with the ubiquitinating enzymes (Figure 26.17). The reductase is
polyubiquitinated and subsequently extracted from the membrane in a process that
requires geranylgeraniol. The extracted reductase is then degraded by the
proteasome. The combined regulation at the levels of transcription,
translation, and degradation can alter the amount of enzyme in the cell more than
200-fold.

Figure 26.16 Insig regulates SCAP-SREBP movement. (A) In the presence of cholesterol, Insig
interacts with SCAP-SREBP and prevents the activation of SREBP. Cholesterol binding to SCAP or
25-hydroxycholesterol binding to Insig facilitates the interaction of Insig and SCAP, retaining
SCAP-SREBP in the endoplasmic reticulum. (B) In the absence of cholesterol or its derivatives, SCAP
interacts with transport proteins and shepherds SREBP to the Golgi apparatus for activation. [After
M. S. Brown and J. L. Goldstein. Cholesterol feedback: From Schoenheimer's bottle to Scap's
MELADL. J. Lipid Res. 50:S15-S27, 2009.]


4. Phosphorylation decreases the activity of the reductase. This enzyme, 
like acetyl CoA carboxylase (which catalyzes the committed step in fatty 
acid synthesis, Section 22.5), is switched off by an AMP-activated 
protein kinase. Thus, cholesterol synthesis ceases when the ATP level 
is low. 
Figure 26.17 Insig facilitates the degradation of HMG-CoA reductase. In the presence
of sterols, a subclass of Insig associated with ubiquitinating enzymes binds HMG-CoA
reductase. This interaction results in the ubiquitination of the enzyme. This modification and
the presence of geranylgeraniol result in extraction of the enzyme from the membrane and
degradation by the proteasome. [After R. A. DeBose-Boyd. Feedback regulation of cholesterol
synthesis: Sterol-accelerated ubiquitination and degradation of HMG CoA reductase. Cell Res.
18:609-621, 2008.]


As we will see shortly, all four regulatory mechanisms are modulated by 
receptors that sense the presence of cholesterol in the blood. 
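
The transcriptional control described in item 1 reduces to a simple piece of logic: either sterol-protein interaction is enough to hold SCAP-SREBP in the endoplasmic reticulum. The sketch below is a boolean simplification of that description. The function names and the yes-or-no treatment of sterol binding are assumptions of this example, and the other three levels of control are not modeled.

```python
def scap_srebp_location(cholesterol_on_scap, hydroxycholesterol_on_insig):
    """Where the SCAP-SREBP complex is found.

    Cholesterol bound to SCAP, or 25-hydroxycholesterol bound to Insig,
    promotes the SCAP-Insig interaction that anchors the complex in the ER.
    """
    retained = cholesterol_on_scap or hydroxycholesterol_on_insig
    return "endoplasmic reticulum" if retained else "Golgi complex"


def reductase_gene_transcription(cholesterol_on_scap, hydroxycholesterol_on_insig):
    """Transcription of the HMG-CoA reductase gene under the same conditions."""
    location = scap_srebp_location(cholesterol_on_scap, hydroxycholesterol_on_insig)
    # Only in the Golgi complex is SREBP cleaved and released to reach the nucleus.
    return "enhanced" if location == "Golgi complex" else "halted"


for chol in (False, True):
    for oxy in (False, True):
        print(chol, oxy,
              scap_srebp_location(chol, oxy),
              reductase_gene_transcription(chol, oxy))
```

Only the case in which neither sterol is bound sends the complex to the Golgi complex and enhances transcription, which is the low-cholesterol response described above.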


Lipoproteins transport cholesterol and triacylglycerols 
throughout the organism 


Cholesterol and triacylglycerols are transported in body fluids in the form 
of lipoprotein particles. This transport is important for a number of reasons. 
First, lipoprotein particles are the means by which triacylglycerols are deliv- 
ered to tissues, from the intestine or liver, for use as fuel or for storage. 
Second, the fatty acid constituents of the triacylglycerol components of the 
lipoprotein particles are incorporated into phospholipids for membrane 
synthesis. Likewise, cholesterol is a vital component of membranes and is a 
precursor to the powerful signal molecules, the steroid hormones. Finally, 
cells are not able to degrade the steroid nucleus. Consequently, the choles- 
terol must be used biochemically or excreted by the liver. Excess choles- 
terol plays a role in the development of atherosclerosis. Lipoprotein particles 
function in cholesterol homeostasis, transporting the molecule from sites of 
synthesis to sites of use, and finally to the liver for excretion. 

Each lipoprotein particle consists of a core of hydrophobic lipids 
surrounded by a shell of more-polar lipids and proteins. The protein 
components of these macromolecular aggregates, called apoproteins, have 
two roles: they solubilize hydrophobic lipids and contain cell-targeting signals. 
Apolipoproteins are synthesized and secreted by the liver and the intes- 
tine. Lipoprotein particles are classified according to increasing density 
(Table 26.1): chylomicrons, chylomicron remnants, very low density lipopro- 
teins (VLDLs), intermediate-density lipoproteins (IDLs), low-density lipopro- 
teins (LDLs), and high-density lipoproteins (HDLs). These classes have 
numerous subtypes. Moreover, lipoprotein particles can shift between 
classes as they release or pick up cargo, thereby changing their density. 

Triacylglycerols, cholesterol, and other lipids obtained from the diet 
are carried away from the intestine in the form of large chylomicrons 
(Section 22.1). These particles have a very low density because triacylglyc- 
erols constitute about 90% of their content. Apolipoprotein B-48 (apo 
B-48), a large protein (240 kd), forms an amphipathic spherical shell around 
the fat globule; the external face of this shell is hydrophilic. The triacylglyc- 
erols in chylomicrons are released through hydrolysis by lipoprotein lipases. 
These enzymes are located on the lining of blood vessels in muscle and other 
tissues that use fatty acids as fuels or in the synthesis of lipids. The liver then 
takes up the cholesterol-rich residues, known as chylomicron remnants. 

Table 26.1 Properties of plasma lipoproteins

                                                                                                                  Composition (%)
Plasma lipoprotein                Density (g ml⁻¹)  Diameter (nm)  Apolipoprotein  Physiological role             TAG    CE     C    PL     P
Chylomicron                       <0.95             75–1200        B-48, C, E      Dietary fat transport          86     3      1    8      2
Very low density lipoprotein      0.95–1.006        30–80          B-100, C, E     Endogenous fat transport       52     14     7    18     8
Intermediate-density lipoprotein  1.006–1.019       15–35          B-100, E        LDL precursor                  38     30     8    23     11
Low-density lipoprotein           1.019–1.063       18–25          B-100           Cholesterol transport          10     38     8    22     21
High-density lipoprotein          1.063–1.21        7.5–20         A               Reverse cholesterol transport  5–10   14–21  3–7  19–29  33–57

Abbreviations: TAG, triacylglycerol; CE, cholesteryl ester; C, free cholesterol; PL, phospholipid; P, protein.


Figure 26.18 Site of cholesterol
synthesis. Electron micrograph of a part of a
liver cell actively engaged in the synthesis and
secretion of very low density lipoprotein
(VLDL). The arrow points to a vesicle that is
releasing its content of VLDL particles.
Scale bar, 500 nm.
[Courtesy of Dr. George Palade.]


Figure 26.19 Schematic model of low-
density lipoprotein. The LDL particle is
approximately 22 nm (220 Å) in diameter.


Lipoprotein particles are also crucial for the transport of lipids from the
liver, which is a major site of triacylglycerol and cholesterol synthesis, to
other tissues in the body (Figure 26.18). Triacylglycerols and cholesterol in
excess of the liver’s own needs are exported into the blood in the form 
of very low density lipoproteins. These particles are stabilized by two 
apolipoproteins—apo B-100 and apo E (34 kd). Apo B-100, one of the larg- 
est proteins known (513 kd), is a longer version of apo B-48. Both apo B 
proteins are encoded by the same gene and produced from the same initial 
RNA transcript. In the intestine, RNA editing (Section 29.3) modifies the 
transcript to generate the mRNA for apo B-48, the truncated form. 
Triacylglycerols in very low density lipoproteins, as in chylomicrons, are 
hydrolyzed by lipases on capillary surfaces, with the released fatty acids 
being taken up by the muscle and other tissues. The resulting remnants, 
which are rich in cholesteryl esters, are called intermediate-density lipoproteins.
These particles have two fates. Half of them are taken up by the liver
for processing, and half are converted into low-density lipoprotein by the 
removal of more triacylglycerol by tissue lipases that absorb the released 
fatty acids. 

Low-density lipoprotein is the major carrier of cholesterol in blood 
(Figure 26.19). It contains a core of some 1500 cholesterol molecules esteri- 
fied to fatty acids; the most common fatty acid chain in these esters is 
linoleate, a polyunsaturated fatty acid. A shell of phospholipids and unes- 
terified cholesterol molecules surrounds this highly hydrophobic core. The 
shell also contains a single copy of apo B-100, which is recognized by target 
cells. The role of LDL is to transport cholesterol to peripheral tissues and regu- 
late de novo cholesterol synthesis at these sites, as described on page 775. A 
different purpose is served by high-density lipoprotein, which picks up cho- 
lesterol released into the plasma from dying cells and from membranes 
undergoing turnover and delivers the cholesterol to the liver for excretion. 
An acyltransferase in HDL esterifies these cholesterols, which are then 
returned by HDL to the liver (Figure 26.20).


The blood levels of certain lipoproteins can serve diagnostic purposes


High serum levels of cholesterol cause disease and death by contributing
to the formation of atherosclerotic plaques in arteries throughout
the body. This excess cholesterol is present in the form of the low-density
lipoprotein particle, so-called “bad cholesterol.”

High-density lipoprotein is sometimes referred to as “good cholesterol.”
HDL functions as a shuttle that moves cholesterol throughout the body.
HDL binds and esterifies cholesterol released from macrophages and the
peripheral tissues and then transfers cholesteryl esters to tissues that use
cholesterol to synthesize steroid hormones or to the liver, where the cholesterol
is converted into bile salts or excreted. A specific receptor mediates the
docking of the HDL to these tissues. The exact nature of the protective
effect of HDL levels is not known; however, possible mechanisms are
examined on page 776.


Figure 26.20 An overview of lipoprotein
particle metabolism. Fatty acids are
abbreviated FFA. [After J. G. Hardman (Ed.),
L. L. Limbird (Ed.), and A. G. Gilman (Consult.
Ed.), Goodman and Gilman's The
Pharmacological Basis of Therapeutics, 10th
ed. (McGraw-Hill, 2001), p. 975, Fig. 36.1.]

The ratio of cholesterol in the form of LDL to that in the form of HDL 
can be used to evaluate susceptibility to the development of heart disease. 
For a healthy person, the LDL/HDL ratio is 3.5.
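As a small illustration of how this ratio is computed, the sketch below divides the cholesterol carried as LDL by the cholesterol carried as HDL, following the definition given in the first sentence of the paragraph above. The function name and the sample concentrations are hypothetical values chosen for the example, not clinical data from the text.

```python
def ldl_to_hdl_ratio(ldl_mg_per_dl, hdl_mg_per_dl):
    """Ratio of LDL cholesterol to HDL cholesterol (same units for both)."""
    if hdl_mg_per_dl <= 0:
        raise ValueError("HDL cholesterol must be a positive number")
    return ldl_mg_per_dl / hdl_mg_per_dl


# Hypothetical sample: 140 mg/dl LDL cholesterol and 40 mg/dl HDL cholesterol.
ratio = ldl_to_hdl_ratio(140, 40)
print(ratio)  # 3.5
```

Because the result is a ratio, the units cancel; only the relative amounts of the two lipoprotein classes matter.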


Low-density lipoproteins play a central role in cholesterol metabolism 


Cholesterol metabolism must be precisely regulated to prevent atherosclerosis.
The mode of control in the liver, the primary site of cholesterol
synthesis, has already been considered: dietary cholesterol reduces the
activity and amount of 3-hydroxy-3-methylglutaryl CoA reductase, the
enzyme catalyzing the committed step. Studies by Michael Brown and Joseph
Goldstein are sources of insight into the control of cholesterol metabolism in
nonhepatic cells. In general, cells outside the liver and intestine obtain
cholesterol from the plasma rather than synthesizing it de novo. Specifically,
their primary source of cholesterol is the low-density lipoprotein. The process
of LDL uptake, called receptor-mediated endocytosis, serves as a paradigm for
the uptake of many molecules (Figure 26.21).


Figure 26.21 Receptor-mediated endocytosis. The process of
receptor-mediated endocytosis is illustrated for the cholesterol-
carrying complex, low-density lipoprotein (LDL): (1) LDL binds to a
specific receptor; (2) the complex invaginates to form an internal
vesicle; (3) after separation from its receptor, the
LDL-containing vesicle fuses with a lysosome, leading to the
degradation of the LDL and the release of the cholesterol.
[Stages shown: LDL binding → Internalization → Lysosomal hydrolysis.]


Figure 26.22 Endocytosis of LDL bound
to its receptor. (A) Electron micrograph
showing LDL (conjugated to ferritin for
visualization, dark spots) bound to a coated-pit
region on the surface of a cultured human
fibroblast cell. (B) Micrograph showing this
region invaginating and fusing to form an
endocytic vesicle. [From R. G. W. Anderson,
M. S. Brown, and J. L. Goldstein. Cell 10:351–364, 1977.]


Endocytosis begins when apolipoprotein B-100 on the surface of an LDL
particle binds to a specific receptor protein on the plasma membrane of
nonhepatic cells. The receptors for LDL are localized in specialized
regions called coated pits, which contain a specialized protein called
clathrin. The receptor-LDL complex is then internalized by endocytosis; that
is, the plasma membrane in the vicinity of the complex invaginates and then
fuses to form an endocytic vesicle called an endosome (Figure 26.22). The
endosome is acidified, which causes the receptor to release its cargo. The
receptor is returned to the cell membrane in a recycling vesicle. The
round-trip time for a receptor is about 10 minutes;
in its lifetime of about a day, each receptor may bring hundreds of
LDL particles into the cell. The vesicles containing LDL subsequently
fuse with lysosomes, acidic vesicles that carry a wide array of degradative 
enzymes. The protein component of LDL is hydrolyzed to free amino 
acids. The cholesteryl esters in LDL are hydrolyzed by a lysosomal acid 
lipase. The released unesterified cholesterol can then be used for membrane 
biosynthesis. Alternatively, it can be reesterified for storage inside the cell. 
In fact, free cholesterol activates acyl CoA:cholesterol acyltransferase 
(ACAT), the enzyme catalyzing this reaction. Reesterified cholesterol 
contains mainly oleate and palmitoleate, which are monounsaturated 
fatty acids, in contrast with the cholesteryl esters in LDL, which are rich 
in linoleate, a polyunsaturated fatty acid (see Table 12.1). It is impera- 
tive that the cholesterol be reesterified. High concentrations of unest- 
erified cholesterol disrupt the integrity of cell membranes. 
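The figures quoted above for the receptor cycle can be checked with simple arithmetic. The sketch below assumes a lifetime of exactly 24 hours and a round trip of exactly 10 minutes, which are rounded versions of the approximate values in the text; the function name is illustrative.

```python
def receptor_round_trips(lifetime_hours=24, round_trip_minutes=10):
    """Number of complete surface-to-endosome cycles in one receptor lifetime."""
    return (lifetime_hours * 60) // round_trip_minutes


trips = receptor_round_trips()
print(trips)  # 144
```

With roughly 144 cycles per receptor, delivering at least one LDL particle on most trips is enough to account for the hundreds of particles mentioned above.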

The synthesis of the LDL receptor is itself subject to feedback regulation. 
Studies of cultured fibroblasts show that, when cholesterol is abundant inside 
the cell, new LDL receptors are not synthesized, and so the uptake of additional 
cholesterol from plasma LDL is blocked. The gene for the LDL receptor, like 
that for the reductase, is regulated by SREBP, which binds to a sterol 
regulatory element that controls the rate of mRNA synthesis. 


The absence of the LDL receptor leads to hypercholesterolemia 
and atherosclerosis 


Brown and Goldstein’s pioneering studies of familial hypercholesterolemia
revealed the physiological importance of the LDL receptor.
The total concentration of cholesterol and LDL in the blood plasma is
markedly elevated in this genetic disorder, which results from a mutation at
a single autosomal locus. The cholesterol level in the plasma of homozygotes
is typically 680 mg dl⁻¹, compared with 300 mg dl⁻¹ in heterozygotes
(clinical assay results are often expressed in milligrams per deciliter, which
is equal to milligrams per 100 milliliters). A value of <200 mg dl⁻¹ is
regarded as desirable, but many people have higher levels. In familial hyper- 
cholesterolemia, cholesterol is deposited in various tissues because of the high 
concentration of LDL cholesterol in the plasma. Nodules of cholesterol called 
xanthomas are prominent in skin and tendons. Of particular concern is the 
oxidation of the excess blood LDL to form oxidized LDL (oxLDL). The 
oxLDL is taken up by immune-system cells called macrophages, which 
become engorged to form foam cells. These foam cells become trapped in 
the walls of the blood vessels and contribute to the formation of atheroscle- 
rotic plaques that cause arterial narrowing and lead to heart attacks 
(Figure 26.23). In fact, most homozygotes die of coronary artery disease in 
childhood. The disease in heterozygotes (1 in 500 people) has a milder and 
more variable clinical course. 
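Because clinical values are quoted in milligrams per deciliter, it can help to see them as molar concentrations. The sketch below converts the plasma cholesterol levels mentioned above into millimoles per liter. The molar mass of cholesterol (about 386.65 g/mol) is not given in the text and is supplied here as an outside value; the function name is illustrative.

```python
CHOLESTEROL_G_PER_MOL = 386.65  # molar mass of cholesterol, not from the text


def mg_per_dl_to_mmol_per_l(mg_per_dl):
    """Convert a cholesterol concentration from mg/dl to mmol/l."""
    mg_per_l = mg_per_dl * 10  # 1 liter = 10 deciliters
    return mg_per_l / CHOLESTEROL_G_PER_MOL  # mg/l divided by g/mol gives mmol/l


for level in (200, 300, 680):
    print(level, "mg/dl =", round(mg_per_dl_to_mmol_per_l(level), 1), "mmol/l")
```

On this scale the desirable upper limit of 200 mg per deciliter corresponds to a little over 5 mmol per liter.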

The molecular defect in most cases of familial hypercholesterolemia is an 
absence or deficiency of functional receptors for LDL. Receptor mutations that 
disrupt each of the stages in the endocytotic pathway have been identified. 
Homozygotes have almost no functional receptors for LDL, whereas 
heterozygotes have about half the normal number. Consequently, the entry 
of LDL into liver and other cells is impaired, leading to an increased level of 
LDL in the blood plasma. Furthermore, less IDL enters liver cells because 
IDL entry, too, is mediated by the LDL receptor. Consequently, IDL stays 
in the blood longer in familial hypercholesterolemia, and more of it is con- 
verted into LDL than in normal people. All deleterious consequences of an 
absence or deficiency of the LDL receptor can be attributed to the ensuing 
elevated level of LDL cholesterol in the blood.


Figure 26.23 The effects of excess cholesterol. Cross section of (A) a normal artery and 
(B) an artery blocked by a cholesterol-rich plaque. [SPL/Photo Researchers.] 


Mutations in the LDL receptor prevent LDL release and result 
in receptor destruction 


One class of mutations that results in familial hypercholesterolemia
generates receptors that are reluctant to give up the LDL cargo. Let
us begin by examining the makeup of the receptor. The human LDL receptor
is a 160-kd glycoprotein, which is composed of six different types of
domains (Figure 26.24A). The amino-terminal region of the receptor,
which is the site of LDL binding, consists of seven homologous LA
domains, with domains 4 and 5 most important for LDL binding. A second
type of domain is homologous to one found in the epidermal growth factor
(EGF). This domain is repeated three times, and between the second and
third repeats is a propeller structure consisting of six bladelike domains. This
part of the receptor is crucial in releasing LDL in the endosome. The fourth
domain, which is very rich in serine and threonine residues, contains
O-linked sugars. These oligosaccharides function as struts to keep the
receptor extended from the membrane so that the LDL-binding domain is
accessible to LDL. The fifth type of domain consists of 22 hydrophobic
residues that span the plasma membrane. The sixth and final domain consists
of 50 residues and emerges on the cytoplasmic side of the membrane,
where it controls the interaction of the receptor with coated pits and
participates in endocytosis.


Figure 26.24 LDL receptor releases LDL
in the endosome. (A) A schematic
representation of the domain structure of the
LDL receptor. (B) In the endosome, the
receptor converts from the open structure into
the closed structure, resulting in the release of
the LDL into the endosome. [After I. D.
Campbell, Biochem. Soc. Trans. 31(pt.
6p):1107–1114, 2003, Fig. 1A]

How does the LDL receptor relinquish its cargo on entering the endo- 
some? The receptor exists in two interconvertible states: an extended or 
open state, capable of binding LDL, and a closed state that results in release 
of the LDL in the endosome. The receptor maintains the open state while 
in the plasma membrane, on binding LDL, and throughout its journey to 
the endosome. The conversion from the open state into the closed state takes 
place on exposure to the acidic environment of the endosome (Figure 26.24B).
Three contiguous modules, LA7, EGFA, and EGFB, rigidly position the
propeller module to facilitate displacement of the LDL as the closed state is 
formed. Under neutral pH, aspartate residues of the propeller blades form 
hydrogen bonds that hold each blade to the rest of the propeller structure. 
Exposure to the low-pH environment of the endosome causes the propeller- 
like structures to interact with the LDL-binding domain. This interaction 
displaces the LDL, which is then digested by the lysosome. Under normal 
circumstances, the receptor is returned to the plasma membrane to again 
bind LDL. The importance of this process is highlighted by the fact that 
more than half of the point mutations that result in familial hypercholester- 
olemia are due to disruptions in the interconversion between the open state 
and the closed state. These mutations result in a failure to release the LDL 
cargo and loss of the receptor by degradation. 


HDL appears to protect against atherosclerosis


Although the events that result in atherosclerosis take place rapidly
in familial hypercholesterolemia, a similar sequence of events takes
place in people who develop atherosclerosis over decades. In particular, the
formation of foam cells and plaques is an especially hazardous occurrence.
HDL and its role in returning cholesterol to the liver appear to be important
in mitigating these life-threatening circumstances.

HDL has a number of antiatherogenic properties, including the inhibi- 
tion of LDL oxidation, but the best-characterized property is the removal 
of cholesterol from cells, especially macrophages. Earlier, we learned that 
HDL retrieves cholesterol from other tissues in the body to return the cho- 
lesterol to the liver for excretion as bile or in the feces. This transport, called 
reverse cholesterol transport, is especially important in regard to macrophages. 
Indeed, when the transport fails, macrophages become foam cells and 
facilitate the formation of plaques. Macrophages that collect cholesterol 
from LDL normally transport the cholesterol to HDL particles. The more 
HDL, the more readily this transport takes place and the less likely that 
the macrophages will develop into foam cells. Presumably, this robust 
reverse cholesterol transport accounts for the observation that higher HDL 
levels confer protection against atherosclerosis. Indeed, efforts are underway
to develop drugs that will boost the levels of HDL in the blood.

The importance of reverse cholesterol transport is illustrated by the 
occurrence of mutations that inactivate one of the cholesterol-transport 
proteins in macrophages. This transport is facilitated by members of the 
ABC class of transport proteins (see Figure 13.7) present in macrophage
membranes. Loss of activity of cholesterol-transport protein ABCA1
results in a very rare condition called Tangier disease, which is characterized 
by HDL deficiency, accumulation of cholesterol in macrophages, and 
premature atherosclerosis. 

Another antiatherogenic property of HDL is due to the association of a 
serum esterase, paraoxonase, with HDL. Paraoxonase may destroy oxLDL,
accounting for some of HDL’s ability to protect against coronary disease. 


The clinical management of cholesterol levels can be understood 
at a biochemical level 


Homozygous familial hypercholesterolemia can be treated only by a
liver transplant. A more generally applicable therapy is available for 
heterozygotes and others with high levels of cholesterol. The goal is to reduce 
the amount of cholesterol in the blood by stimulating the single normal gene to 
produce more than the customary number of LDL receptors. We have already 
observed that the production of LDL receptors is controlled by the cell’s 
need for cholesterol. The therapeutic strategy is to deprive the cell of ready 
sources of cholesterol. When cholesterol is required, the amount of mRNA 
for the LDL receptor rises and more receptors are found on the cell surface. 
This state can be induced by a two-pronged approach. First, the reabsorp- 
tion of bile salts from the intestine is inhibited. Bile salts are cholesterol 
derivatives that promote the absorption of dietary cholesterol and dietary 
fats. Second, de novo synthesis of cholesterol is blocked. 

The reabsorption of bile is impeded by oral administration of positively 
charged polymers, such as cholestyramine, that bind negatively charged 
bile salts and are not themselves absorbed. Cholesterol synthesis can be 
effectively blocked by a class of compounds called statins. A well-known 
example of such a compound is lovastatin, which is also called Mevacor
(Figure 26.25). These compounds are potent competitive inhibitors (Kᵢ = 1
nM) of HMG-CoA reductase, the essential control point in the biosynthetic
pathway. Plasma cholesterol levels decrease by 50% in many patients
given both lovastatin and inhibitors of bile-salt reabsorption. Lovastatin 
and other inhibitors of HMG-CoA reductase are widely used to lower the 
plasma-cholesterol level in people who have atherosclerosis, which is the 
leading cause of death in industrialized societies. The development of sta- 
tins as effective drugs is further described in Chapter 36. 
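The effect of a competitive inhibitor such as a statin can be sketched with the standard Michaelis-Menten rate law, in which the inhibitor raises the apparent Km by the factor (1 + [I]/Ki). Only the Ki of 1 nM comes from the text. The substrate concentration, the Km, and the inhibitor concentrations below are hypothetical values chosen for illustration, and the function names are not from any library.

```python
def rate(substrate, km, vmax=1.0, inhibitor=0.0, ki=1e-9):
    """Michaelis-Menten rate with a competitive inhibitor (concentrations in M)."""
    apparent_km = km * (1.0 + inhibitor / ki)
    return vmax * substrate / (apparent_km + substrate)


def fractional_activity(substrate, km, inhibitor, ki=1e-9):
    """Inhibited rate divided by the uninhibited rate at the same [S]."""
    return rate(substrate, km, inhibitor=inhibitor, ki=ki) / rate(substrate, km)


# Hypothetical conditions: substrate present at its Km (both set to 10 uM).
for inhibitor_nm in (0, 1, 10, 100):
    activity = fractional_activity(10e-6, 10e-6, inhibitor_nm * 1e-9)
    print(inhibitor_nm, "nM inhibitor ->", round(activity, 3))
```

Because the inhibition is competitive, raising the substrate concentration restores activity; the sketch makes this visible if the first argument is increased while the inhibitor concentration is held fixed.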


26.4 Important Derivatives of Cholesterol Include 
Bile Salts and Steroid Hormones 


Although cholesterol is well known in its own right as a contributor to the 
development of heart disease, metabolites of cholesterol—the steroid 
hormones—also appear in the news frequently. Indeed, steroid-hormone 
abuse seems to be as prominent in the sports pages as any athlete is. In addi- 
tion to steroid hormones, cholesterol is a precursor for two other important 
molecules: bile salts and vitamin D. We begin with a look at the bile salts, 
molecules crucial for the uptake of lipids in the diet. 


Bile salts. Bile salts are polar derivatives of cholesterol. These compounds
are highly effective detergents because they contain both polar and nonpolar
regions. Bile salts are synthesized in the liver, stored and concentrated in
the gall bladder, and then released into the small intestine. Bile salts, the
major constituent of bile, solubilize dietary lipids. Solubilization increases
the effective surface area of lipids with two consequences: (1) more surface
area is exposed to the digestive action of lipases and (2) lipids are more
readily absorbed by the intestine. Bile salts are also the major breakdown
products of cholesterol. The bile salts glycocholate, the primary bile salt,
and taurocholate are shown in Figure 26.26.


Figure 26.25 Lovastatin, a competitive
inhibitor of HMG-CoA reductase. The part
of the structure that resembles the
3-hydroxy-3-methylglutaryl moiety is shown
in red.


Figure 26.26 Synthesis of bile salts. The
OH groups in red are added to cholesterol,
as are the groups shown in blue.


Steroid hormones. Cholesterol is the precursor of the five major classes of
steroid hormones: progestogens, glucocorticoids, mineralocorticoids, androgens,
and estrogens (Figure 26.27). These hormones are powerful signal
molecules that regulate a host of organismal functions. Progesterone, a
progestogen, prepares the lining of the uterus for the implantation of an
ovum. Progesterone is also essential for the maintenance of pregnancy.
Androgens (such as testosterone) are responsible for the development of male
secondary sex characteristics, whereas estrogens (such as estradiol) are
required for the development of female secondary sex characteristics.
Estrogens, along with progesterone, also participate in the ovarian cycle.
Glucocorticoids (such as cortisol) promote gluconeogenesis and the formation of
glycogen, enhance the degradation of fat and protein, and inhibit the
inflammatory response. They enable animals to respond to stress; indeed, the
absence of glucocorticoids can be fatal. Mineralocorticoids (primarily
aldosterone) act on the distal tubules of the kidney to increase the
reabsorption of Na⁺ and the excretion of K⁺ and H⁺, which leads to an increase
in blood volume and blood pressure. The major sites of synthesis of these
classes of hormones are the corpus luteum, for progestogens; the testes, for
androgens; the ovaries, for estrogens; and the adrenal cortex, for
glucocorticoids and mineralocorticoids.

Steroid hormones bind to and activate receptor molecules that serve as
transcription factors to regulate gene expression (Section 32.2). These small
similar molecules are able to have greatly differing effects because the slight
structural differences among them allow interactions with specific receptor
molecules.


Figure 26.27 Biosynthetic relations of classes of steroid
hormones and cholesterol.


Figure 26.28 Cholesterol 
carbon numbering. The 
numbering scheme for the 
carbon atoms in cholesterol and 
other steroids. 


Letters identify the steroid rings and numbers identify the 
carbon atoms 


Carbon atoms in steroids are numbered, as shown for cholesterol in 
Figure 26.28. The rings in steroids are denoted by the letters A, B, C, and 
D. Cholesterol contains two angular methyl groups: the C-19 methyl group 
is attached to C-10, and the C-18 methyl group is attached to C-13. The 
C-18 and C-19 methyl groups of cholesterol lie above the plane containing 
the four rings. A substituent that is above the plane is termed β oriented,
whereas a substituent that is below the plane is α oriented.

If a hydrogen atom is attached to C-5, it can be either α or β oriented.
The A and B steroid rings are fused in a trans conformation if the C-5
hydrogen is α oriented, and cis if it is β oriented. The absence of a Greek
letter for the C-5 hydrogen atom on the steroid nucleus implies a trans
fusion. The C-5 hydrogen atom is α oriented in all steroid hormones that
contain a hydrogen atom in that position. In contrast, bile salts have a
β-oriented hydrogen atom at C-5. Thus, a cis fusion is characteristic of the
bile salts, whereas a trans fusion is characteristic of all steroid hormones that 
possess a hydrogen atom at C-5. A trans fusion yields a nearly planar struc- 
ture, whereas a cis fusion gives a buckled structure. 
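The relation between the orientation of the C-5 hydrogen and the A/B ring fusion amounts to a simple lookup, sketched below. The function name and the use of the strings "alpha" and "beta" are illustrative choices; passing None stands for a steroid with no hydrogen at C-5, for which the rule does not apply.

```python
def ab_ring_fusion(c5_hydrogen):
    """Map the C-5 hydrogen orientation to the A/B ring fusion.

    "alpha" (below the ring plane) gives a trans fusion, "beta" (above the
    plane) gives a cis fusion, and None means there is no C-5 hydrogen.
    """
    if c5_hydrogen is None:
        return None
    fusion = {"alpha": "trans", "beta": "cis"}
    if c5_hydrogen not in fusion:
        raise ValueError("orientation must be 'alpha', 'beta', or None")
    return fusion[c5_hydrogen]


print(ab_ring_fusion("alpha"))  # trans, as in steroid hormones with a C-5 hydrogen
print(ab_ring_fusion("beta"))   # cis, as in bile salts
```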


[Structures: 5α-hydrogen (trans fusion) and 5β-hydrogen (cis fusion).]


Steroids are hydroxylated by cytochrome P450 monooxygenases that 
use NADPH and O2


The addition of OH groups plays an important role in the synthesis of cho- 
lesterol from squalene and in the conversion of cholesterol into steroid hor- 
mones and bile salts. All these hydroxylations require NADPH and O2. The 
oxygen atom of the incorporated hydroxyl group comes from O2 rather
than from H2O. Whereas one oxygen atom of the O2 molecule goes into the
substrate, the other is reduced to water. The enzymes catalyzing these reac- 
tions are called monooxygenases (or mixed-function oxygenases). Recall that a 
monooxygenase also participates in the hydroxylation of aromatic amino 
acids (Section 23.5). 


RH + O2 + NADPH + H⁺ → ROH + H2O + NADP⁺

Hydroxylation requires the activation of oxygen. In the synthesis of steroid
hormones and bile salts, activation is accomplished by members of the 
cytochrome P450 family, a family of cytochromes that absorb light maxi- 
mally at 450 nm when complexed in vitro with exogenous carbon monoxide. 
These membrane-anchored proteins (~50 kd) contain a heme prosthetic 
group. Oxygen is activated through its binding to the iron atom in the 
heme group. 

Because the hydroxylation reactions promoted by P450 enzymes are 
oxidation reactions, it is at first glance surprising that they also consume the 
reductant NADPH. NADPH transfers its high-potential electrons to a 
flavoprotein, which transfers them, one at a time, to adrenodoxin, a non- 
heme iron protein. Adrenodoxin transfers one electron to reduce the ferric 
(Fe³⁺) form of P450 to the ferrous (Fe²⁺) form (Figure 26.29).

Without the addition of this electron, P450 will not bind oxygen. Recall
that only the ferrous form (Fe²⁺) of myoglobin binds oxygen (Section 7.1).
The binding of O2 to the heme is followed by the acceptance of a second
electron from adrenodoxin. The acceptance of this second electron leads to
cleavage of the O–O bond. One of the oxygen atoms is then protonated
and released as water. The remaining oxygen atom forms a highly reactive
ferryl Fe=O intermediate. This intermediate abstracts a hydrogen atom
from the substrate RH to form R•. This transient free radical captures the
OH group from the iron atom to form ROH, the hydroxylated product,
returning the iron atom to the ferric state.


Figure 26.29 Cytochrome P450 mechanism. These enzymes bind O2 and use one oxygen
atom to hydroxylate their substrates. 
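The electron bookkeeping of the cycle described above can be checked by adding two half-reactions. The LaTeX sketch below assumes the amsmath package. It treats NADPH as the source of the two electrons that adrenodoxin delivers one at a time, and it lumps the individual steps at the heme iron into a single substrate half-reaction, which is a simplification.

```latex
\begin{align*}
\text{NADPH} &\longrightarrow \text{NADP}^{+} + \text{H}^{+} + 2e^{-} \\
\text{RH} + \text{O}_2 + 2e^{-} + 2\,\text{H}^{+} &\longrightarrow \text{ROH} + \text{H}_2\text{O} \\[4pt]
\text{Sum:}\quad \text{RH} + \text{O}_2 + \text{NADPH} + \text{H}^{+} &\longrightarrow \text{ROH} + \text{H}_2\text{O} + \text{NADP}^{+}
\end{align*}
```

The sum reproduces the overall monooxygenase reaction: one oxygen atom ends up in the hydroxylated product and the other is reduced to water, which is why an oxidation reaction nonetheless consumes a reductant.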


The cytochrome P450 system is widespread and performs 
a protective function 


The cytochrome P450 system, which in mammals is located primarily in the
endoplasmic reticulum of the liver and small intestine, is also important in 
the detoxification of foreign substances (xenobiotic compounds). For example, 
the hydroxylation of phenobarbital, a barbiturate, increases its solubility and 
facilitates its excretion. Likewise, polycyclic aromatic hydrocarbons that are 
ingested by drinking contaminated water are hydroxylated by P450, providing
sites for conjugation with highly polar units (e.g., glucuronate or
sulfate) that markedly increase the solubility of the modified aromatic 
molecule. One of the most relevant functions of the cytochrome P450 
system to human beings is its role in metabolizing drugs such as caffeine 
and ibuprofen (Chapter 36). Some members of the cytochrome P450 system 
also metabolize ethanol (Section 27.6). The duration of action of many 
medications depends on their rate of inactivation by the P450 system. 
Despite its general protective role in the removal of foreign chemicals, the 
action of the P450 system is not always beneficial. Some of the most powerful 
carcinogens are generated from harmless compounds by the P450 system in vivo 
in the process of metabolic activation. In plants, the cytochrome P450 system 
plays a role in the synthesis of toxic compounds as well as the pigments of 
flowers. 


The cytochrome P450 system is a ubiquitous superfamily of monooxygenases
that is present in plants, animals, and prokaryotes. The
human genome encodes more than 50 members of the family, whereas the 
genome of the plant Arabidopsis encodes more than 250 members. All mem- 
bers of this large family arose by gene duplication followed by subsequent 
divergence, which generated a range of substrate specificity. The specificity 
of these enzymes is encoded in delimited regions of the primary structure, 
and the substrate specificity of closely related members is often defined by 
a few critical residues or even a single amino acid. 


Pregnenolone, a precursor of many other steroids, is formed 
from cholesterol by cleavage of its side chain 


Steroid hormones contain 21 or fewer carbon atoms, whereas cholesterol 
contains 27. Thus, the first stage in the synthesis of steroid hormones is 
the removal of a six-carbon unit from the side chain of cholesterol to form 
pregnenolone. The side chain of cholesterol is hydroxylated at C-20 and then 
at C-22, and the bond between these carbon atoms is subsequently cleaved 
by desmolase. Three molecules of NADPH and three molecules of O2 are
consumed in this remarkable six-electron oxidation. 
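The numbers in this paragraph can be tallied directly. The sketch below takes the carbon counts and the NADPH stoichiometry from the text, and it relies on the fact that each NADPH donates two electrons; the variable and function names are illustrative.

```python
CHOLESTEROL_CARBONS = 27
SIDE_CHAIN_CARBONS_REMOVED = 6
NADPH_CONSUMED = 3
ELECTRONS_PER_NADPH = 2  # each NADPH donates a pair of electrons


def pregnenolone_carbons():
    """Carbon atoms left after the six-carbon unit is cleaved from cholesterol."""
    return CHOLESTEROL_CARBONS - SIDE_CHAIN_CARBONS_REMOVED


def electrons_transferred():
    """Total electrons supplied by NADPH during side-chain cleavage."""
    return NADPH_CONSUMED * ELECTRONS_PER_NADPH


print(pregnenolone_carbons())   # 21
print(electrons_transferred())  # 6
```

The first result matches the statement that steroid hormones contain 21 or fewer carbon atoms, and the second accounts for the description of the cleavage as a six-electron oxidation.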


Cholesterol → 20α,22β-Dihydroxycholesterol → Pregnenolone


Progesterone and corticosteroids are synthesized from pregnenolone 


Progesterone is synthesized from pregnenolone in two steps. The 3-hydroxyl 
group of pregnenolone is oxidized to a 3-keto group, and the Δ⁵ double bond
is isomerized to a Δ⁴ double bond (Figure 26.30). Cortisol, the major glucocorticoid,
is synthesized from progesterone by hydroxylations at C-17,
C-21, and C-11; C-17 must be hydroxylated before C-21 is hydroxylated, 
whereas C-11 can be hydroxylated at any stage. The enzymes catalyzing 
these hydroxylations are highly specific, as shown by some inherited disor- 
ders. The initial step in the synthesis of aldosterone, the major mineralocor- 
ticoid, is the hydroxylation of progesterone at C-21. The resulting deoxy- 
corticosterone is hydroxylated at C-11. The oxidation of the C-18 angular 
methyl group to an aldehyde then yields aldosterone. 
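
The ordering rule for the three hydroxylations that lead to cortisol can be made concrete by enumerating the possibilities. The sketch below is illustrative only; the function and variable names are hypothetical, and the single rule it encodes is the one stated in the text (C-17 before C-21, C-11 at any stage).

```python
from itertools import permutations

# The three positions hydroxylated on the way from progesterone to cortisol.
SITES = ("C-17", "C-21", "C-11")


def is_allowed(order):
    """Return True if C-17 is hydroxylated before C-21 (the rule in the text)."""
    return order.index("C-17") < order.index("C-21")


# Keep only the orders that satisfy the constraint.
allowed_orders = [order for order in permutations(SITES) if is_allowed(order)]

for order in allowed_orders:
    print(" -> ".join(order))
```

Of the six conceivable orders, three satisfy the constraint. This list says nothing about which order predominates in a cell; it only reflects the rule as stated.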

Figure 26.30 Pathways for the formation of progesterone, cortisol, and 
aldosterone. 


Androgens and estrogens are synthesized from pregnenolone 


Androgens and estrogens also are synthesized from pregnenolone through 
the intermediate progesterone. Androgens contain 19 carbon atoms. 
The synthesis of androgens starts with the hydroxylation of progesterone at 
C-17 (Figure 26.31). The side chain consisting of C-20 and C-21 is then 
cleaved to yield androstenedione, an androgen. Testosterone, another andro- 
gen, is formed by the reduction of the 17-keto group of androstenedione. 
Testosterone, through its actions in the brain, is paramount in the 
development of male sexual behavior. It is also important for the 
maintenance of the testes and the development of muscle mass. Owing to the 
latter activity, testosterone is referred to as an anabolic steroid. 
Testosterone is reduced by 5α-reductase to yield dihydrotestosterone (DHT), 
a powerful embryonic androgen that instigates the development and 
differentiation of the male phenotype. Estrogens are synthesized from 
androgens by the loss of the C-19 angular methyl group and the formation of 
an aromatic A ring. Estrone, an estrogen, is derived from androstenedione, 
whereas estradiol, another estrogen, is formed from testosterone. The 
formation of the aromatic A ring is catalyzed by the P450 enzyme aromatase. 

Figure 26.31 Pathways for the formation of androgens and estrogens. 
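
The carbon counts of the steroid classes follow directly from the carbons lost at each stage. The short sketch below tallies them; the data structure and function name are illustrative assumptions, and the numbers of carbons lost are taken from the descriptions in this section.

```python
# Each entry: (product, carbon atoms lost in the step that forms it).
PATHWAY = [
    ("pregnenolone", 6),     # six-carbon unit removed from the side chain
    ("progesterone", 0),     # oxidation and double-bond isomerization only
    ("androstenedione", 2),  # side chain of C-20 and C-21 cleaved
    ("estrone", 1),          # C-19 angular methyl group lost
]


def carbon_counts(start_carbons, pathway):
    """Return a dict mapping each product to its number of carbon atoms."""
    counts = {}
    carbons = start_carbons
    for name, carbons_lost in pathway:
        carbons -= carbons_lost
        counts[name] = carbons
    return counts


counts = carbon_counts(27, PATHWAY)  # cholesterol has 27 carbon atoms
for name, carbons in counts.items():
    print(f"{name}: C{carbons}")
```

The tally reproduces the familiar pattern: progestogens are C21 steroids, androgens are C19, and estrogens are C18.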


Because breast and ovarian cancers frequently depend on estrogens 
for growth, aromatase inhibitors are often used as a treatment for 
these cancers. Anastrozole is a competitive inhibitor of the enzyme, whereas 
exemestane is a suicide inhibitor that covalently modifies and inactivates 
the enzyme. 


Vitamin D is derived from cholesterol by the ring-splitting activity 
of light 


Cholesterol is also the precursor of vitamin D, which plays an essential role 
in the control of calcium and phosphorus metabolism. 7-Dehydrocholesterol 
(provitamin D3) is photolyzed by the ultraviolet light of sunlight to previtamin 
D3, which spontaneously isomerizes to vitamin D3 (Figure 26.32). Vitamin D3 
(cholecalciferol) is converted into calcitriol (1,25-dihydroxycholecalciferol), 
the active hormone, by hydroxylation reactions in the liver and kidneys. 
Although not a steroid, vitamin D acts in an analogous fashion. It binds to 
a receptor, structurally similar to the steroid receptors, to form a complex 
that functions as a transcription factor, regulating gene expression. 

Figure 26.32 Vitamin D synthesis. The pathway for the conversion of 
7-dehydrocholesterol into vitamin D3 and then into calcitriol, the active 
hormone. 

Vitamin D deficiency in childhood produces rickets, a disease 
characterized by inadequate calcification of cartilage and bone. Rickets was 
so common in seventeenth-century England that it was called the 
“children’s disease of the English.” The 7-dehydrocholesterol in the skin of 
these children was not photolyzed to previtamin D3, because there was little 
sunlight for many months of the year. Furthermore, their diets provided 
little vitamin D, because most naturally occurring foods have a low content 
of this vitamin. Fish-liver oils are a notable exception. Cod-liver oil, 
abhorred by generations of children because of its unpleasant taste, was 
used in the past as a rich source of vitamin D. Today, the most reliable 
dietary sources of vitamin D are fortified foods. Milk, for example, is 
fortified to a level of 400 international units per quart (10 μg per quart). The 
recommended daily intake of vitamin D is 200 international units until 
age 50, after which it increases with age. In adults, vitamin D deficiency 
leads to softening and weakening of bones, a condition called osteomalacia. 
The occurrence of osteomalacia in Muslim women who are clothed so that 
only their eyes are exposed to sunlight is a striking reminder that vitamin D 
is needed by adults as well as by children. 
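
The unit relations quoted above lend themselves to a quick check. The following sketch uses only the figures given in the text (400 international units or 10 μg per quart of fortified milk, and a recommended intake of 200 international units until age 50); the variable names are illustrative.

```python
# Figures quoted in the text for fortified milk and recommended intake.
IU_PER_QUART = 400
MICROGRAMS_PER_QUART = 10
RECOMMENDED_IU_UNTIL_AGE_50 = 200

# International units of vitamin D per microgram, implied by the two
# equivalent fortification figures.
iu_per_microgram = IU_PER_QUART / MICROGRAMS_PER_QUART

# Recommended daily intake expressed in micrograms.
recommended_micrograms = RECOMMENDED_IU_UNTIL_AGE_50 / iu_per_microgram

# Volume of fortified milk that supplies the recommended intake.
quarts_of_milk_needed = RECOMMENDED_IU_UNTIL_AGE_50 / IU_PER_QUART

print(f"{iu_per_microgram:.0f} IU per microgram")
print(f"Recommended intake: {recommended_micrograms:.0f} micrograms per day")
print(f"Fortified milk needed: {quarts_of_milk_needed:.1f} quart per day")
```

On these figures, half a quart of fortified milk per day would meet the recommended intake for a person younger than 50.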

Research over the past few years indicates that vitamin D may play a 
much larger biochemical role than simply the regulation of bone metabo- 
lism. Muscle seems to be a target of vitamin D action. In muscle, vitamin D 
appears to affect a number of biochemical processes with the net effect being 
enhanced muscle performance. Studies also suggest that vitamin D prevents 
cardiovascular disease, reduces the incidence of a variety of cancers, and 
protects against autoimmune diseases including diabetes. Moreover, vita- 
min D deficiency appears to be more common than thought. In the United 
States, 90% of Blacks and many Hispanics and Asians have insufficient 
blood levels of vitamin D. This recent research on vitamin D shows again 
the dynamic nature of biochemical investigations. Vitamin D, a chemical 
whose biochemical role was believed to be well established, now offers new 
frontiers of biomedical research. 


Summary 


26.1 Phosphatidate Is a Common Intermediate in the Synthesis of 
Phospholipids and Triacylglycerols 
Phosphatidate is formed by successive acylations of glycerol 3-phos- 
phate by acyl CoA. Hydrolysis of its phosphoryl group followed by 
acylation yields a triacylglycerol. CDP-diacylglycerol, the activated 
intermediate in the de novo synthesis of several phospholipids, is 
formed from phosphatidate and CTP. The activated phosphatidyl 
unit is then transferred to the hydroxyl group of a polar alcohol, such 
as inositol, to form a phospholipid such as phosphatidylinositol. In 
mammals, phosphatidylethanolamine is formed from CDP-ethanolamine 
and diacylglycerol. Phosphatidylethanolamine is methylated by 
S-adenosylmethionine to form phosphatidylcholine. In mammals, 
this phosphoglyceride can also be synthesized by a pathway that uti- 
lizes dietary choline. CDP-choline is the activated intermediate in this 
route. 

Sphingolipids are synthesized from ceramide, which is formed 
by the acylation of sphingosine. Gangliosides are sphingolipids 
that contain an oligosaccharide unit having at least one residue of 
N-acetylneuraminate or a related sialic acid. They are synthesized by 
the step-by-step addition of activated sugars, such as UDP-glucose, 
to ceramide. 


26.2 Cholesterol Is Synthesized from Acetyl Coenzyme A in Three Stages 
Cholesterol is a steroid component of animal membranes and a 
precursor of steroid hormones. The committed step in its synthesis is the 
formation of mevalonate from 3-hydroxy-3-methylglutaryl CoA 
(derived from acetyl CoA and acetoacetyl CoA). Mevalonate is 
converted into isopentenyl pyrophosphate (C5), which condenses with 
its isomer, dimethylallyl pyrophosphate (C5), to form geranyl 
pyrophosphate (C10). The addition of a second molecule of isopentenyl 
pyrophosphate yields farnesyl pyrophosphate (C15), which condenses 
with itself to form squalene (C30). This intermediate cyclizes to 
lanosterol (C30), which is modified to yield cholesterol (C27). 
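
The carbon arithmetic of this pathway is simple enough to verify by hand, but a short script makes the pattern explicit. This is an illustrative sketch with hypothetical variable names; it tracks only carbon counts and ignores the pyrophosphate groups and all reaction mechanisms.

```python
# Carbon counts of the five-carbon building blocks.
ISOPENTENYL_PP = 5      # isopentenyl pyrophosphate
DIMETHYLALLYL_PP = 5    # its isomer, dimethylallyl pyrophosphate

# Successive condensations described in the summary.
geranyl_pp = ISOPENTENYL_PP + DIMETHYLALLYL_PP   # C5 + C5
farnesyl_pp = geranyl_pp + ISOPENTENYL_PP        # add a second isopentenyl unit
squalene = 2 * farnesyl_pp                       # farnesyl condenses with itself
lanosterol = squalene                            # cyclization keeps all carbons

CHOLESTEROL = 27                                 # final product
carbons_removed_after_cyclization = lanosterol - CHOLESTEROL

print(f"geranyl pyrophosphate: C{geranyl_pp}")
print(f"farnesyl pyrophosphate: C{farnesyl_pp}")
print(f"squalene and lanosterol: C{squalene}")
print(f"carbons removed on the way to cholesterol: "
      f"{carbons_removed_after_cyclization}")
```

The result shows that three carbon atoms must be removed in converting lanosterol into cholesterol.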


26.3 The Complex Regulation of Cholesterol Biosynthesis Takes Place at 
Several Levels 
In the liver, cholesterol synthesis is regulated by changes in the amount 
and activity of 3-hydroxy-3-methylglutaryl CoA reductase. Transcrip- 
tion of the gene, translation of the mRNA, and degradation of the 
enzyme are stringently controlled. In addition, the activity of the 
reductase is regulated by phosphorylation. 

Triacylglycerols exported by the intestine are carried by chylomi- 
crons and then hydrolyzed by lipases lining the capillaries of target 
tissues. Cholesterol and other lipids in excess of those needed by the 
liver are exported in the form of very low density lipoprotein. After 
delivering its content of triacylglycerols to adipose tissue and other 
peripheral tissue, VLDL is converted into intermediate-density lipo- 
protein and then into low-density lipoprotein. IDL and LDL carry 
cholesteryl esters, primarily cholesteryl linoleate. Liver and peripheral 
tissue cells take up LDL by receptor-mediated endocytosis. The LDL 
receptor, a protein spanning the plasma membrane of the target cell, 
binds LDL and mediates its entry into the cell. Absence of the LDL 
receptor in the homozygous form of familial hypercholesterolemia 
leads to a markedly elevated plasma level of LDL cholesterol and 
the deposition of cholesterol on blood-vessel walls, which in turn 
may result in childhood heart attacks. Apolipoprotein B, a very large 
protein, is a key structural component of chylomicrons, VLDL, 
and LDL. High-density lipoproteins transport cholesterol from the 
peripheral tissues to the liver. 


26.4 Important Derivatives of Cholesterol Include Bile Salts and 
Steroid Hormones 
In addition to bile salts, which facilitate the digestion of lipids, five 
major classes of steroid hormones are derived from cholesterol: proge- 
stogens, glucocorticoids, mineralocorticoids, androgens, and estrogens. 
Hydroxylations by P450 monooxygenases that use NADPH and O2 
play an important role in the synthesis of steroid hormones and bile 
salts from cholesterol. P450 enzymes, a large superfamily, also partici- 
pate in the detoxification of drugs and other foreign substances. 
Pregnenolone (C21) is an essential intermediate in the synthesis 
of steroids. This steroid is formed by scission of the side chain of 
cholesterol. Progesterone (C21), synthesized from pregnenolone, is 
the precursor of cortisol and aldosterone. Hydroxylation of 
progesterone and cleavage of its side chain yield androstenedione, an 
androgen (C19). Estrogens (C18) are synthesized from androgens by 
the loss of an angular methyl group and the formation of an aromatic 
A ring. Vitamin D, which is important in the control of calcium and 
phosphorus metabolism, is formed from a derivative of cholesterol by 
the action of light. 


Key Terms 


phosphatidate (p. 760) 

triacylglycerol (p. 761) 

phospholipid (p. 761) 

cytidine diphosphodiacylglycerol 
(CDP-diacylglycerol) (p. 761) 

sphingolipid (p. 763) 

ceramide (p. 764) 

cerebroside (p. 764) 

ganglioside (p. 764) 


cholesterol (p. 767) 

mevalonate (p. 767) 

3-hydroxy-3-methylglutaryl CoA 
reductase (HMG-CoA reductase) (p. 767) 

3-isopentenyl pyrophosphate (p. 768) 

sterol regulatory element binding protein 
(SREBP) (p. 770) 

lipoprotein particles (p. 773) 

low-density lipoprotein (LDL) (p. 773) 

high-density lipoprotein (HDL) (p. 773) 

receptor-mediated endocytosis (p. 775) 

reverse cholesterol transport (p. 778) 

bile salt (p. 779) 

steroid hormone (p. 780) 

cytochrome P450 monooxygenase (p. 782) 

pregnenolone (p. 783) 


Problems 


1. Different roles. Describe the roles of glycerol 3-phosphate, 
phosphatidate, and diacylglycerol in triacylglycerol 
and phospholipid synthesis. 


2. Needed supplies. How is the glycerol 3-phosphate 
required for phosphatidate synthesis generated? 


3. Making fat. Write a balanced equation for the synthesis 
of a triacylglycerol, starting from glycerol and fatty acids. 


4. Making a phospholipid. Write a balanced equation for 
the synthesis of phosphatidylethanolamine by the de novo 
pathway, starting from ethanolamine, glycerol, and fatty 
acids. 


5. ATP needs. How many high-phosphoryl-transfer-potential 
molecules are required to synthesize phosphatidylethanolamine 
from ethanolamine and diacylglycerol? Assume 
that the ethanolamine is the activated component. 


6. Identifying differences. Differentiate among sphingomyelin, 
a cerebroside, and a ganglioside. 


7. Let’s count the ways. There may be 50 ways to leave your 
lover, but, in principle, there are only three ways to make a 
glycerol-based phospholipid. Describe the three pathways. 


8. Activated donors. What is the activated reactant in each 
of the following biosyntheses? 

(a) Phosphatidylinositol from inositol 

(b) Phosphatidylethanolamine from ethanolamine 

(c) Ceramide from sphingosine 

(d) Sphingomyelin from ceramide 

(e) Cerebroside from ceramide 

(f) Ganglioside GM1 from ganglioside GM2 

(g) Farnesyl pyrophosphate from geranyl pyrophosphate 


9. No DAG, no TAG. What would be the effect of a 
mutation that decreased the activity of phosphatidic acid 
phosphatase? 


10. The Law of Three Stages. What are the three stages 
required for the synthesis of cholesterol? 

11. Many regulations to follow. Outline the mechanisms of 
the regulation of cholesterol biosynthesis. 


12. Telltale labels. What is the distribution of isotopic 
labeling in cholesterol synthesized from each of the follow- 
ing precursors? 


(a) Mevalonate labeled with ¹⁴C in its carboxyl carbon atom 
(b) Malonyl CoA labeled with ¹⁴C in its carboxyl carbon atom 


13. Too much, too soon. What is familial hypercholester- 
olemia and what are its causes? 


14. Familial hypercholesterolemia. Several classes of LDL- 
receptor mutations have been identified as causes of this 
disease. Suppose that you have been given cells from 
patients with different mutations, an antibody specific for 
the LDL receptor that can be seen with an electron micro- 
scope, and access to an electron microscope. What differ- 
ences in antibody distribution might you expect to find in 
the cells from different patients? 


15. Breakfast conversation. You and a friend are eating 
breakfast together. While eating, your friend is reading the 
back of her cereal box and comes across the following state- 
ment: “Cholesterol plays beneficial roles in your body, 
making cells, hormones, and tissues.” Knowing that you are 
taking biochemistry, she asks if the statement makes sense. 
What do you reply? 


16. A good thing. What are statins? What is their pharma- 
cological function? 


17. Too much of a good thing. Would the development of a 
“super statin” that inhibited all HMG CoA reductase 
activity be a useful drug? Explain. 


18. RNA editing. A shortened version (apo B-48) of apolipoprotein B is 
formed by the intestine, whereas the full-length protein (apo B-100) is 
synthesized by the liver. A 
glutamine codon (CAA) is changed into a stop codon. 
Propose a simple mechanism for this change. 


19. A means of entry. Describe the process of receptor- 
mediated endocytosis by using LDL as an example. 


20. Inspiration for drug design. Some actions of androgens 
are mediated by dihydrotestosterone, which is formed by 
the reduction of testosterone. This finishing touch is catalyzed 
by an NADPH-dependent 5α-reductase. 


Testosterone + NADPH + H⁺ → 5α-dihydrotestosterone + NADP⁺ (catalyzed by 5α-reductase) 


Chromosomal XY males with a genetic deficiency of this 
reductase are born with a male internal urogenital tract but 
predominantly female external genitalia. These people are 
usually reared as girls. At puberty, they masculinize because 
the testosterone level rises. The testes of these reductase- 
deficient men are normal, whereas their prostate glands 
remain small. How might this information be used to 
design a drug to treat benign prostatic hypertrophy, a common 
consequence of the normal aging process in men? A major- 
ity of men older than age 55 have some degree of prostatic 
enlargement, which often leads to urinary obstruction. 


21. Drug idiosyncrasies. Debrisoquine, a β-adrenergic 
blocking agent, has been used to treat hypertension. The 
optimal dose varies greatly (20-400 mg daily) in a popula- 
tion of patients. The urine of most patients taking the drug 
contains a high level of 4-hydroxydebrisoquine. However, 
those most sensitive to the drug (about 8% of the group 
studied) excrete debrisoquine and very little of the 4-hydroxy 
derivative. Propose a molecular basis for this drug idiosyn- 
crasy. Why should caution be exercised in giving other drugs 
to patients who are very sensitive to debrisoquine? 


[Structure of debrisoquine] 


22. Removal of odorants. Many odorant molecules are 
highly hydrophobic and concentrate within the olfactory 
epithelium. They would give a persistent signal indepen- 
dent of their concentration in the environment if they were 
not rapidly modified. Propose a mechanism for converting 
hydrophobic odorants into water-soluble derivatives that 
can be rapidly eliminated. 


23. Development difficulties. Propecia (finasteride) is a synthetic 
steroid that functions as a competitive and specific 
inhibitor of 5α-reductase, the enzyme responsible for the 
synthesis of dihydrotestosterone from testosterone. 


[Structure of finasteride] 

It is now widely used to retard the development of male pat- 
tern hair loss. Pregnant women are advised to avoid han- 
dling this drug. Why is it vitally important that pregnant 
women avoid contact with Propecia? 


24. Life-style consequences. Human beings and the plant 
Arabidopsis evolved from the same distant ancestor possessing 
a small number of cytochrome P450 genes. Human 
beings have approximately 50 such genes, whereas 
Arabidopsis has more than 250 of them. Propose a role for 
the large number of P450 isozymes in plants. 


25. Personalized medicine. The cytochrome P450 system 
metabolizes many medicinally useful drugs. Although all 
human beings have the same number of P450 genes, 
individual polymorphisms exist that alter the specificity 
and efficiency of the proteins encoded by the genes. How 
could knowledge of individual polymorphisms be useful 
clinically? 


26. Honey-bee crisis. In 2006, there was a sudden, unex- 
plained die-off of honey-bee colonies throughout the 
United States. The die-off was economically significant 
because one-third of the human diet comes from insect- 
pollinated plants, and honey bees are responsible for 80% of 
the pollination. In October of 2006, the sequence of the 
honey bee genome was reported. Interestingly, the genome 
was found to contain far fewer cytochrome P450 genes than 
do the genomes of other insects. Suggest how the die-off 
and the paucity of P450 genes may be related. 


27. Let the sun shine in. At a biochemical level, vitamin D 
functions like a steroid hormone (see Chapter 31). Therefore, 
it is sometimes referred to as an honorary steroid. Why is 
vitamin D not an actual steroid? 


Mechanism Problems 


28. An interfering phosphate. In the course of the overall 
reaction catalyzed by HMG-CoA reductase, a histidine 
residue protonates a coenzyme A thiolate, CoA—S⁻, generated 
in an earlier step. 


[Reaction scheme: a histidine residue (His), located near a serine residue (Ser), donates a proton to CoA—S⁻ to give CoA—SH.] 


A nearby serine residue can be phosphorylated by AMP-dependent 
kinase, which results in a loss of activity. Propose 
an explanation for why phosphorylation of the serine resi- 
due inhibits enzyme activity. 


29. Demethylation. Methyl amines are often demethylated 
by cytochrome P450 enzymes. Propose a mechanism for 
the formation of methylamine from dimethylamine cata- 
lyzed by cytochrome P450. What is the other product? 


Chapter Integration Problems 


30. Similarities. Compare the role of CTP in phosphoglyc- 
eride synthesis with the role of UTP in glycogen synthesis. 


31. Hold on tight or you might be thrown to the cytoplasm. 
Many proteins are modified by the covalent attachment of 
a farnesyl (C15) or a geranylgeranyl (C20) unit to the carboxyl-terminal 
cysteine residue of the protein. Suggest why 
this modification might occur. 


32. Fork in the road. 3-Hydroxy-3-methylglutaryl CoA is 
on the pathway for cholesterol biosynthesis. It is also a 
component of another pathway. Name the pathway. What 
determines which pathway 3-hydroxy-3-methylglutaryl 
CoA follows? 


33. Requires a club membership. How is methionine metab- 
olism related to the synthesis of phosphatidylcholine? 


34. ATP requirements. Explain how cholesterol synthesis 
depends on the activity of ATP-citrate lyase. 


Data Interpretation and Chapter Integration Problem 


35. Cholesterol feeding. Mice were divided into four groups, 
two of which were fed a normal diet and two of which were 
fed a cholesterol-rich diet. HMG-CoA reductase mRNA 
and protein from liver were then isolated and quantified. 
Graph A shows the results of the mRNA isolation. 


[Graph A: amount of mRNA (arbitrary units) for actin mRNA and HMG-CoA reductase mRNA in two groups of control mice and two groups of cholesterol-fed mice.] 

(a) What is the effect of cholesterol feeding on the amount 
of HMG-CoA reductase mRNA? 

(b) What is the purpose of also isolating the mRNA for the 
protein actin, which is not under the control of the sterol 
regulatory element? 


HMG-CoA reductase protein was isolated by precipitation 
with a monoclonal antibody to HMG-CoA reductase. The 
amount of HMG-CoA protein in each group is shown in 
graph B. 


[Graph B: amount of HMG-CoA reductase protein in control mice and cholesterol-fed mice.] 


(c) What is the effect of the cholesterol diet on the amount 
of HMG-CoA reductase protein? 


(d) Why is this result surprising in light of the results in 
graph A? 


(e) Suggest possible explanations for the results in graph B. 


The Integration of Metabolism 


The image at the left shows a detail of runners on a Greek amphora 
painted in the sixth century B.C. Athletic feats, as well as others as seemingly 
simple as the maintenance of blood-glucose levels, require elaborate 
metabolic integration. The representation at the right shows the organs that 
have essential roles in the metabolic integration that regulates blood-glucose 
levels during exercise and at rest. Insulin and leptin (secreted by adipocytes) 
are two of the hormones that modulate the metabolic pathways of organs 
throughout the body such that adequate energy is available to meet the 
demands of living. [(Left) Copyright © The Metropolitan Museum of Art/Art 
Resource, NY.] 


We have been examining the biochemistry of metabolism one pathway at 
a time. We have seen how useful energy is extracted from fuels and 
used to power biosynthetic reactions and signal-transduction pathways. 
In Chapters 28 through 30, we will extend our study of biosynthetic 
reactions to the synthesis of proteins and nucleic acids. Before we do that, 
however, in this chapter we will take a step back to examine large-scale bio- 
chemical interactions that constitute the physiology of the organisms. In 
keeping with a central theme of life—energy manipulations—we will look at 
the regulation of energy at the organismal level, which boils down to an 
apparently simple but actually quite complex question: At the biochemical 
level, how does an organism know when to eat and when to refrain from 
eating? The ability to maintain adequate but not excessive energy stores is 
called caloric homeostasis or energy homeostasis. 

Next, we will examine a significant perturbation of caloric homeostasis— 
obesity—and how this physiological condition affects insulin action, fre- 
quently resulting in diabetes. We then turn our attention to a biochemical 
examination of one of the most beneficial activities that humans can engage 
in—exercise—and see how exercise mitigates the effects of diabetes as well 
as how different forms of exercise use different sources of fuel. 

At the opposite end of the physiological spectrum from obesity and 
overnutrition are fasting and starvation, and we will examine the biochemi- 
cal responses to these challenges. The chapter ends with a consideration of 
another biochemical energy perturbation—excess alcohol consumption. 

We have already encountered instances of organismal energy regulation 
when we considered the actions of insulin and glucagon. Recall that insulin, 
secreted by the β cells of the pancreas, causes glucose to be removed from the 
blood and stimulates the synthesis of glycogen and lipids. Glucagon, secreted 
by the α cells of the pancreas, has effects opposite those of insulin. Glucagon 
increases the level of blood glucose by stimulating glycogen breakdown and 
gluconeogenesis. This chapter introduces two hormones that play key roles in 
caloric homeostasis. Leptin and adiponectin, secreted by the adipose tissue, 
work in concert with insulin to regulate caloric homeostasis. 


27.1 Caloric Homeostasis Is a Means of 
Regulating Body Weight 


By now in our study of biochemistry, we are well aware of the fact that many 
biochemicals, most notably carbohydrates and lipids, are potential sources 
of energy. We consume these energy sources as foods, convert the energy 
into ATP, and use the ATP to power our lives. Like all energy transforma- 
tions, our energy consumption and expenditure are governed by the laws of 
thermodynamics. Recall that the First Law of Thermodynamics states 
that energy can neither be created nor destroyed. Translated into the practi- 
cal terms of our diets, 


Energy consumed = energy expended + energy stored 


This simple equation has severe physiological and health implications: 
according to the First Law of Thermodynamics, if we consume more 
energy than we expend, we will become overweight or obese. Obesity is 
generally defined as a body mass index (BMI) of more than 30 kg m⁻², 
whereas overweight is defined as a BMI of more than 25 kg m⁻² (Figure 27.1). 
Recall that excess fat is stored in adipocytes as triacylglycerols. The num- 
ber of adipocytes becomes fixed in adults, and so obesity results in 
engorged adipocytes. Indeed, the cell may increase as much as 1000-fold 
in size. 
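
The BMI thresholds quoted above can be expressed as a small function. The sketch below assumes the standard definition of BMI as body mass in kilograms divided by the square of height in meters, which is consistent with the units kg m⁻² used in the text; the function names and the example heights are illustrative assumptions.

```python
def body_mass_index(mass_kg, height_m):
    """Body mass index in kg m^-2: mass divided by height squared."""
    return mass_kg / height_m ** 2


def classify(bmi):
    """Apply the thresholds given in the text (more than 25, more than 30)."""
    if bmi > 30:
        return "obese"
    if bmi > 25:
        return "overweight"
    return "neither overweight nor obese"


# Example values are hypothetical, chosen only to exercise each category.
for mass_kg, height_m in [(54.4, 1.65), (80.0, 1.75), (90.0, 1.70)]:
    bmi = body_mass_index(mass_kg, height_m)
    print(f"{mass_kg} kg, {height_m} m: BMI {bmi:.1f}, {classify(bmi)}")
```

Because BMI depends only on mass and height, it cannot distinguish muscle from fat, which is one reason the index is described as reliable for most people rather than for everyone.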

We are all aware that many of us, especially in the developed world, are 
becoming obese or have already attained that state. In the United States, obe- 
sity has become an epidemic, with nearly 30% of adults classified as such. 
Obesity is identified as a risk factor in a host of pathological conditions 
including diabetes mellitus, hypertension, and cardiovascular disease (Table 27.1). 
The cause of obesity is quite simple in most cases: more food is consumed than 
is needed, and the excess calories are stored as fat. We will consider the bio- 
chemical basis of pathologies caused by obesity later in the chapter. 

Before we undertake a biochemical analysis of the results of overcon- 
sumption, let us consider why the obesity epidemic is occurring in the first 
place. There are two complementary explanations. The first is a commonly 
held view that our bodies are programmed to rapidly store excess calories in 
times of plenty, an evolutionary adaptation from times past when humans 
were not assured of having ample food, as many of us are today. Consequently, 
we store calories as if a fast might begin tomorrow, but no such fast arrives. 
The second possible explanation is that we no longer face the risks of preda- 
tion. Evidence indicates that predation was a common cause of death for our 
ancestors. An obese individual would more likely have been culled from a 
group of our ancestors than would a more nimble, lean individual. As the 
risk of predation declined, leanness became less beneficial. Regardless of 
why we may have a propensity to gain weight, this propensity can be negat- 
ed behaviorally—by eating less and exercising more. However, genetic 
studies indicate that the tendency toward obesity may be highly heritable. 

Table 27.1 Health consequences of obesity or being overweight 

Coronary heart disease 
Type 2 diabetes 
Cancers (endometrial, breast, and colon) 
Hypertension (high blood pressure) 
Dyslipidemia (disruption of lipid metabolism, e.g., high cholesterol and triglycerides) 
Stroke 
Liver and gallbladder disease 
Sleep apnea and respiratory problems 
Osteoarthritis (degeneration of cartilage and underlying bone at a joint) 
Gynecological problems (abnormal menses, infertility) 

Source: Centers for Disease Control and Prevention Web site (www.cdc.gov). 

Figure 27.1 Body mass index (BMI). The BMI value for an individual person 
is a reliable indicator of obesity for most people. [Data from the Centers 
for Disease Control.] 

As disturbing as the obesity epidemic is, an equally intriguing, almost 
amazing observation is that many people are able to maintain an 
approximately constant weight throughout adult life. A few simple 
calculations of a simplified situation illustrate how remarkable this feat 
is. Consider a 120-pound woman whose weight does not change significantly 
between the ages of 25 and 65. Let us say that the woman requires 
2000 kcal day⁻¹. Over the 40 years under consideration, she will have 
consumed 


40 years × 365 days year⁻¹ × 2000 kcal day⁻¹ = 2.9 × 10⁷ kcal in 40 years 


For simplicity’s sake, let us assume that the woman’s diet consists predomi- 
nantly of fatty acids derived from lipids. The energy density of fatty acids is 
9 kcal g⁻¹. Thus, over the 40-year span, our subject has ingested 


2.9 × 10⁷ kcal / 9 kcal g⁻¹ = 3.2 × 10⁶ g = 3200 kg of food 


which is equivalent to more than 3 tons of food! Yet, remarkably, her weight 
has remained constant without her having to accurately and constantly 
calibrate and equalize her energy intake and energy output. Although will 
power, exercise, and a bathroom scale often play a role in this homeostasis, 
some biochemical signaling must be taking place to help with her energy 
regulation. Indeed, it is the case; but, before we consider this regulation, let 
us be cruel, but not too cruel, to our hypothetical subject and say that 
her weight increased 10% over the 40-year span, a percentage weight gain 
that most 65-year-olds would delightfully accept. Thus, at 65, she weighs 
132 pounds. On a daily basis, what increase in energy intake—food 
consumption—would have resulted in a 12-pound weight gain over 40 years? 
Again, let us consider only fats. 


12 pounds = 5.4 kg = 5.4 × 10³ g = total weight gain 
5.4 × 10³ g × (40 years × 365 days year⁻¹)⁻¹ = 0.37 g day⁻¹ 


Thus, to gain 12 pounds in 40 years, our subject needed to eat only 0.37 g of 
food per day more than she needed to meet her biochemical needs. How much 
food is it? Consider butter, which is essentially pure fat; 0.37 g of butter is 
approximately one-quarter of a pat. So, the simple excess of the equivalent of 
one-quarter pat of butter per day will lead to a (modest) weight gain of 
12 pounds over 40 years—a startling if depressing fact. However, even just a 
small sampling of your friends and family shows us that this hypothetical 
weight gain does not apply to everyone. People vary significantly in the way 
in which their bodies are able to regulate energy intake; an extra pat of butter 
a day may not make a difference in the long run to one person, whereas 
another person may end up with much more than a 10% weight gain over 
40 years. All things being equal, Problem 33 illustrates how little excess con- 
sumption is required to become obese over the 40-year time period. 
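
The calculations in this section can be reproduced in a few lines. The sketch below uses the same simplifications as the text (a constant requirement of 2000 kcal per day, a diet and a weight gain consisting only of fat at 9 kcal per gram); the variable names and the pound-to-gram conversion factor are our own additions.

```python
# Quantities stated in the text.
YEARS = 40
DAYS_PER_YEAR = 365
DAILY_REQUIREMENT_KCAL = 2000
FAT_ENERGY_DENSITY_KCAL_PER_G = 9
WEIGHT_GAIN_POUNDS = 12

# Conversion factor, not given in the text.
GRAMS_PER_POUND = 453.6

days = YEARS * DAYS_PER_YEAR

# Total energy consumed and the equivalent mass of fat.
total_kcal = days * DAILY_REQUIREMENT_KCAL
total_food_kg = total_kcal / FAT_ENERGY_DENSITY_KCAL_PER_G / 1000

# Daily excess that accounts for the weight gain.
weight_gain_g = WEIGHT_GAIN_POUNDS * GRAMS_PER_POUND
daily_excess_g = weight_gain_g / days
daily_excess_kcal = daily_excess_g * FAT_ENERGY_DENSITY_KCAL_PER_G
excess_fraction = daily_excess_kcal / DAILY_REQUIREMENT_KCAL

print(f"Energy consumed in {YEARS} years: {total_kcal:.2e} kcal")
print(f"Equivalent mass of fat: {total_food_kg:.0f} kg")
print(f"Daily excess: {daily_excess_g:.2f} g ({daily_excess_kcal:.1f} kcal)")
print(f"Excess as a fraction of daily intake: {excess_fraction:.2%}")
```

Carrying more significant figures than the text gives a slightly larger total mass of food, but the conclusion is unchanged: the daily imbalance behind a 12-pound gain is a fraction of a percent of the daily intake.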


27.2 The Brain Plays a Key Role in Caloric Homeostasis 


What makes this remarkable balance of energy input and output possible? 
As you might imagine, the answer is complicated, entailing many biochem- 
ical signals as well as a host of behavioral factors. We will focus on a few key 
biochemical signals, and divide our discussion into two parts: short-term 
signals that are active during a meal and long-term signals that report on the 
overall energy status of the body. These signals originate in the gastrointestinal 
tract, the β cells of the pancreas, and fat cells. The primary target of 
these signals is the brain, in particular a group of neurons in a region of the 
hypothalamus called the arcuate nucleus. 


Signals from the gastrointestinal tract induce feelings of satiety 


Short-term signals relay feelings of satiety from the gut to various regions of 
the brain and thus reduce the urge to eat (Figure 27.2). The best-studied 
short-term signal is cholecystokinin (CCK). Cholecystokinin is actually a 
family of peptide hormones of various lengths (from 8 to 58 amino acids in 
length, depending on posttranslational processing) secreted into the blood 
by cells in the duodenum and jejunum regions of the small intestine as a 
postprandial satiation signal. CCK binds to the CCK receptor, a 
G-protein-coupled receptor (p. 406) located in various peripheral neurons 
that relay signals to the brain. This binding initiates a signal-transduction 
pathway in the brain that generates a feeling of satiety. CCK also plays an 
important role in digestion, stimulating the secretion of pancreatic enzymes 
and bile salts from the gallbladder. 

[Figure 27.2 labels: Increased insulin secretion; increased insulin biosynthesis; increased β-cell proliferation; increased β-cell survival; increased satiety; decreased food intake; decreased body weight.] 

Another important gut signal is glucagon-like peptide 1 (GLP-1), a hor- 
mone of approximately 30 amino acids in length. GLP-1 is secreted by 
intestinal L cells, hormone-secreting cells located throughout the lining of 
the gastrointestinal tract, and has a variety of effects, all apparently facili- 
tated by binding to a GLP-1 receptor, another G-protein-coupled receptor. 
Like CCK, GLP-1 induces feelings of satiety that inhibit further eating. 
GLP-1 also potentiates glucose-induced insulin secretion by the β cells of 
the pancreas while inhibiting glucagon secretion. Although we have 
examined only two short-term signals, many others are believed to exist 
(Table 27.2). Most of the short-term signals thus far identified are appetite 
suppressants. Ghrelin is an exception: this peptide, 28 amino acids in length 
and secreted by the stomach, acts on regions of the hypothalamus to stimulate 
appetite through its receptor, a G-protein-coupled receptor. Ghrelin secretion 
increases before a meal and decreases afterward. 


Leptin and insulin regulate long-term control over caloric homeostasis 


Two key signal molecules regulate energy homeostasis over the time scale of 
hours or days: leptin, which is secreted by the adipocytes, and insulin, which 
is secreted by the β cells of the pancreas. Leptin reports on the status of the 
triacylglycerol stores, whereas insulin reports on the status of glucose in the 
blood—in other words, of carbohydrate availability. We will consider lep- 
tin first. 

Adipose tissue was formerly considered an inert depot of triacylglycer- 
ols. However, recent work has shown that adipose tissue is an active endo- 
crine tissue, secreting signal molecules called adipokines, such as leptin, 
that regulate a host of physiological processes. Leptin is secreted by the 
adipocytes in direct proportion to the amount of fat present. The more fat 
in a body, the more leptin is secreted. Leptin binding to its receptor 
throughout the body increases the sensitivity of muscle and the liver to 
insulin, stimulates β oxidation of fatty acids, and decreases triacylglycerol 
synthesis. 

Figure 27.2 Satiation signals. Cholecystokinin (CCK) and glucagon-like 
peptide 1 (GLP-1) are signal molecules that induce feelings of satiety in the 
brain. CCK is secreted by specialized cells of the small intestine in response 
to a meal and activates satiation pathways in the brain. GLP-1, secreted by 
L cells in the intestine, also activates satiation pathways in the brain and 
potentiates insulin action in the pancreas. [After S. C. Wood. Cell Metab. 
9:489-498, 2009, Fig. 1.] 


Table 27.2 Gastrointestinal peptides that regulate food intake 

Appetite-suppressing signals 
    Cholecystokinin 
    Glucagon-like peptide 1 
    Glucagon-like peptide 2 
    Amylin 
    Somatostatin 
    Bombesin 
    Enterostatin 
    Apolipoprotein A-IV 
    Gastric inhibitory peptide 

Appetite-enhancing peptides 
    Ghrelin 

Source: After M. H. Stipanuk, Ed., Biochemical, Physiological, Molecular 
Aspects of Human Nutrition, 2d ed. (Saunders/Elsevier, 2006), p. 627, Box 22-1. 


[Figure 27.3 diagram] 

(A) Decrease in fat cell mass 
    → Decrease in leptin expression 
    → Decrease in leptin action in hypothalamus, with two effects: 
        Activation of NPY- and AgRP-producing neuron → Increase in NPY and AgRP expression and release 
        Inhibition of POMC-producing neuron → Decrease in MSH expression 
    → Increase in food intake 

(B) Increase in fat cell mass 
    → Increase in leptin expression 
    → Increase in leptin action in hypothalamus, with two effects: 
        Inhibition of NPY- and AgRP-producing neuron → Decrease in NPY and AgRP expression and release 
        Activation of POMC-producing neuron → Increase in MSH expression 
    → Decrease in food intake 

Figure 27.3 The effects of leptin in the brain. Leptin is an adipokine 
secreted by adipose tissue in direct relation to fat mass. (A) When leptin 
levels fall, as in fasting, appetite-enhancing neuropeptides NPY and AgRP are 
secreted, whereas the secretion of appetite-suppressing signals such as MSH is 
inhibited. (B) When fat mass increases, leptin inhibits NPY and AgRP secretion 
while stimulating the release of appetite-suppressing hormone MSH. [After 
M. H. Stipanuk, Biochemical, Physiological, & Molecular Aspects of Human 
Nutrition, 2d ed. (Saunders-Elsevier, 2006), Fig. 22-2.] 

Let us consider the effects of leptin in the brain. Leptin exerts its effect 
by binding to membrane receptors in various regions of the brain, particu- 
larly in the arcuate nucleus of the hypothalamus. There, one population of 
neurons expresses appetite-stimulating (orexigenic) peptides, called neuro- 
peptide Y (NPY) and agouti-related peptide (AgRP). Leptin inhibits the 
NPY/AgRP neurons, preventing the release of NPY and AgRP and thus 
repressing the desire to eat. Fasting, on the other hand, stimulates the pro- 
duction of NPY and AgRP (Figure 27.3) owing to the decrease in leptin 
levels that results from diminishing adipose tissue. 

The second population of neurons containing leptin receptors expresses 
a large precursor polypeptide, proopiomelanocortin (POMC). In response to 
leptin binding to its receptor on POMC neurons, POMC is proteolytically 
processed to yield a variety of signal molecules, one of which, melanocyte- 
stimulating hormone (MSH), is especially important in this context. MSH, 
originally discovered as a stimulator of melanocytes (cells that synthesize 
the pigment melanin), activates appetite-suppressing (anorexigenic) neu- 
rons and thus inhibits food consumption. Fasting inhibits MSH activity 
and thus stimulates eating. AgRP inhibits MSH activity by acting as an 
antagonist, binding to the MSH receptor but failing to activate the receptor 
(see Figure 27.3). Thus, the net effect of leptin binding to its receptor is the 
initiation of a complex signal-transduction pathway that ultimately curtails 
food intake. 
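
The sign logic of Figure 27.3 can be summarized in a few lines of code. This is a qualitative sketch only, not a quantitative model: the function name `leptin_response` and the string labels are hypothetical, and each entry records nothing more than the direction of change described in the text.

```python
def leptin_response(fat_mass_change: str) -> dict:
    """Direction of change downstream of a change in fat mass.

    fat_mass_change must be "increase" or "decrease".
    """
    if fat_mass_change not in ("increase", "decrease"):
        raise ValueError("fat_mass_change must be 'increase' or 'decrease'")

    opposite = {"increase": "decrease", "decrease": "increase"}
    leptin = fat_mass_change  # leptin secretion tracks the amount of fat

    return {
        "leptin": leptin,
        "NPY_AgRP_release": opposite[leptin],  # leptin inhibits NPY/AgRP neurons
        "MSH_production": leptin,              # leptin activates POMC neurons
        "food_intake": opposite[leptin],       # net effect of leptin is anorexigenic
    }


if __name__ == "__main__":
    # Fasting: fat mass falls, so the appetite-stimulating branch dominates.
    print(leptin_response("decrease"))
```

Calling the function with "decrease" mirrors panel A of the figure, and calling it with "increase" mirrors panel B.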

Insulin receptors are also present in the hypothalamus, although the 
mechanism of insulin action in the brain is less clear than that of leptin. 
Insulin appears to inhibit NPY/AgRP-producing neurons, thus inhibiting 
food consumption. 


Leptin is one of several hormones secreted by adipose tissue 


Leptin was the first adipokine discovered because of the dramatic effects 
of its absence. Researchers discovered a strain of mice called ob/ob 
mice, which lack leptin and, as a result, are extremely obese. These mice 
display hyperphagia (overeating), hyperlipidemia (accumulation of triacyl- 
glycerides in muscle and liver), and an insensitivity to insulin. Since the 
discovery of leptin, other adipokines have been detected. For instance, adi- 
ponectin is another signal molecule produced by the adipocytes that acts in 
a similar fashion to leptin. Both leptin and adiponectin exert their effects 
through the key regulatory enzyme AMP-activated protein kinase 
(AMPK). Recall that this enzyme is active when AMP levels are elevated 
and ATP levels are diminished, and this activation leads to a decrease in 
anabolism and an increase in catabolism, most notably an increase in fatty 
acid oxidation. In insulin-resistant obese animals, 
leptin levels increase while those of adiponectin decrease. 

Adipocytes also produce two hormones, RBP4 and resistin, that promote 
insulin resistance. Although it is unclear why adipocytes secrete hormones 
that facilitate insulin resistance, a pathological condition, we can speculate 
on the answer. These signal molecules may help to fine-tune the actions of 
leptin and adiponectin or perhaps to act as “brakes” on the action of leptin 
and adiponectin to prevent hypoglycemia in the fasted state. Some evidence 
indicates that enlarged adipocytes that result from obesity may secrete 
higher levels of insulin-antagonizing hormones and thus contribute to insu- 
lin resistance. 


Leptin resistance may be a contributing factor to obesity 


If leptin is produced in proportion to body-fat mass and leptin inhibits eat- 
ing, why do people become obese? Obese people, in most cases, have both 
functioning leptin and a high blood concentration of leptin. The failure to 
respond to the anorexigenic effects of leptin is called leptin resistance. What 
is the basis of leptin resistance? 

As for most questions in the exciting area of energy homeostasis, the 
answer is not well worked out, but recent evidence suggests that a group 
of proteins called suppressors of cytokine signaling (SOCS) may take part. 
These proteins fine-tune some hormonal systems by inhibiting receptor 
action. SOCS proteins inhibit receptor signaling by a number of means 
(Figure 27.4). Consider, for example, the effect of SOCS proteins on the 
insulin receptor. Recall that insulin stimulates the autophosphorylation of 
tyrosine residues on the insulin receptor, initiating the insulin-signaling 
pathway (see Figure 27.5). SOCS proteins bind to phosphorylated tyrosine 
residues on receptors or other members of the signal-transduction pathway, 
thereby disrupting signal flow and thus altering the cell’s biochemical 
activity. In other cases, the binding of SOCS proteins to components of the 
signal-transduction pathway may also enhance proteolytic degradation of 
these components by the proteasome. Evidence in support of a role for 
SOCS in leptin resistance comes from mice that have had SOCS selectively 
deleted from POMC-expressing neurons. These mice display an enhanced 
sensitivity to leptin and are resistant to weight gain even when fed a high-fat 
diet. The reason why the activity of SOCS proteins increases, leading to 
leptin resistance, remains to be determined. 


Figure 27.4 Suppressors of cytokine signaling (SOCS) regulate receptor function. SOCS 
proteins disrupt interactions of components of the insulin-signaling pathway and thereby inhibit the 
pathway. The binding of a signal component by SOCS results in proteasomal degradation in some 
cases. (Akt, a protein kinase; IRS-1, insulin-receptor substrate 1; PDK1, PIP3-dependent protein 
kinase; PIP3, phosphatidylinositol 3,4,5-trisphosphate; SOCS, suppressor of cytokine signaling.) 


Dieting is used to combat obesity 


Given the obesity epidemic that we currently face and its associated disorders, 
much attention has been focused on determining the most effective 
weight-loss diet. In general, two categories of diet try to help us control our 
caloric intake: low-carbohydrate diets and low-fat diets. Low-carbohydrate 
diets usually emphasize protein consumption. Although studies of the 
effects of diets on humans are immensely complex, data are beginning to 
accumulate suggesting that low-carbohydrate, high-protein diets may be 
the most effective for losing weight. The exact reasons are not clear, but 
there are two common hypotheses. First, proteins seem to induce a feeling 
of satiation more effectively than do fats or carbohydrates. Second, proteins 
require more energy to digest than do fats or carbohydrates, and the 
increased energy expenditure contributes to weight loss. For instance, 
recent studies show that a diet that is 30% protein requires almost 30% more 
energy to digest than that required by a diet that is 10% protein. The mechanisms 
by which protein-rich diets enhance energy expenditure and feelings 
of satiation remain to be determined. Regardless of the type of diet, the 
adage “Eat less, exercise more” always applies. 


27.3 Diabetes Is a Common Metabolic Disease Often 
Resulting from Obesity 


Diabetes 
Named for the excessive urination in the disease. Aretaeus, a Cappadocian 
physician of the second century A.D., wrote: “The epithet diabetes has been 
assigned to the disorder, being something like passing of water by a siphon.” 
He perceptively characterized diabetes as “being a melting-down of the 
flesh and limbs into urine.” 

Mellitus 
From Latin, meaning “sweetened with honey.” Refers to the presence of sugar in 
the urine of patients having the disease. Mellitus distinguishes this disease 
from diabetes insipidus, which is caused by impaired renal reabsorption of water. 


Having taken an overview of the regulation of body weight, we now examine 
the biochemical results when regulation fails because of behavior, genetics, or 
a combination of the two. The most common result of such a failure is obe- 
sity, a condition in which excess energy is stored as triacylglycerides. Recall 
that all excess food consumption is ultimately converted into triacylglycer- 
ides. Humans maintain about a day’s worth of glycogen and, after these stores 
have been replenished, excess carbohydrates are converted into fats and then 
into triacylglycerols. Amino acids are not stored at all, and so excess amino 
acids are ultimately converted into fats also. Thus, regardless of the type of 
food consumed, excess consumption results in increased fat stores. 

We begin our consideration of the effects of disruptions in caloric 
homeostasis with diabetes mellitus, a complex disease characterized by 
grossly abnormal fuel usage: glucose is overproduced by the liver and under- 
utilized by other organs. The incidence of diabetes mellitus (usually referred 
to simply as diabetes) is about 5% of the population. Indeed, diabetes is the 
most common serious metabolic disease in the world; it affects hundreds of 
millions. Type 1 diabetes is caused by the autoimmune destruction of the 
insulin-secreting β cells in the pancreas and usually begins before age 20. 
Type 1 diabetes is also referred to as insulin-dependent diabetes, meaning 
that the affected person requires the administration of insulin to live. Most 
diabetics, in contrast, have a normal or even higher level of insulin in their 
blood, but they are quite unresponsive to the hormone, a characteristic 
called insulin resistance. This form of the disease, known as type 2 diabetes, 
typically arises later in life than does the insulin-dependent form. Type 2 
diabetes accounts for approximately 90% of the diabetes cases throughout 
the world and is the most common metabolic disease in the world. In the 
United States, it is the leading cause of blindness, kidney failure, and ampu- 
tation. Obesity is a significant predisposing factor for the development of type 
2 diabetes. 


Insulin initiates a complex signal-transduction pathway in muscle 


What is the biochemical basis of insulin resistance? How does insulin resistance 
lead to failure of the β cells of the pancreas that results in type 2 diabetes? 
How does obesity contribute to this progression? To answer these 
questions and begin to unravel the mysteries of metabolic disorders, let us 
examine the mechanism of action of insulin in muscle, the largest tissue 
regulated by insulin. 

Figure 27.5 Insulin signaling. The binding of insulin results in the cross-phosphorylation and 
activation of the insulin receptor. Phosphorylated sites on the receptor act as binding sites for 
insulin-receptor substrates such as IRS-1. The lipid kinase phosphoinositide 3-kinase binds to 
phosphorylated sites on IRS-1 through its regulatory domain and then converts PIP2 into PIP3. 
Binding to PIP3 activates PIP3-dependent protein kinase, which phosphorylates and activates kinases 
such as Akt1. Activated Akt1 can then diffuse throughout the cell and continue the 
signal-transduction pathway. 

In a normal cell, insulin binds to a receptor, which dimerizes and auto- 
phosphorylates on tyrosine residues, with each subunit of the dimer phos- 
phorylating its partner. Phosphorylation of the receptor generates binding 
sites for insulin-receptor substrates (IRSs), such as IRS-1 (Figure 27.5). 
Subsequent phosphorylation of IRS-1 by the tyrosine kinase activity of the 
insulin receptor engages the insulin-signaling pathway. Phosphorylated 
IRS-1 binds to phosphatidylinositol 3-kinase (PI3K) and activates it. PI3K 
catalyzes the conversion of phosphatidylinositol 4,5-bisphosphate (PIP2) 
into phosphatidylinositol 3,4,5-trisphosphate (PIP3), a second messenger. 
PIP3 activates the phosphatidylinositol-dependent protein kinase (PDK), 
which in turn activates several other kinases, most notably protein kinase B 
(PKB), also known as Akt. Protein kinase Akt facilitates the translocation of 
GLUT4-containing vesicles to the cell membrane, which leads to a more 
robust absorption of glucose from the blood. Moreover, Akt phosphorylates 
and inhibits glycogen synthase kinase (GSK3). Recall that GSK3 inhibits 
glycogen synthase (p. 629). Thus, insulin also leads to the activation of gly- 
cogen synthase and enhances glycogen synthesis. 
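
The order of events in the pathway just described can be laid out as a simple data structure. The sketch below is illustrative only: the step labels paraphrase the text, and the function `propagate` and its parameter `blocked_step` are hypothetical names introduced to show what happens downstream when one step fails.

```python
from typing import List, Optional

# Ordered steps of the insulin-signaling pathway in muscle, as described in the text.
INSULIN_CASCADE: List[str] = [
    "insulin receptor autophosphorylation",
    "IRS-1 phosphorylation",
    "PI3K activation",
    "PIP2 converted into PIP3",
    "PDK activation",
    "Akt (PKB) activation",
]

# Consequences of Akt activation named in the text.
AKT_EFFECTS: List[str] = [
    "GLUT4 vesicles move to the cell membrane (glucose uptake rises)",
    "GSK3 is inhibited (glycogen synthase becomes active)",
]


def propagate(blocked_step: Optional[str] = None) -> List[str]:
    """Return the events that occur, stopping just before a blocked step."""
    events: List[str] = []
    for step in INSULIN_CASCADE:
        if step == blocked_step:
            return events  # the signal goes no further
        events.append(step)
    return events + AKT_EFFECTS


if __name__ == "__main__":
    for event in propagate():
        print(event)
```

With no argument the function lists the full pathway; passing, say, "IRS-1 phosphorylation" shows that every later event, including GLUT4 translocation, is lost.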

Like all signal pathways, the insulin-signaling cascade must be capable 
of being turned off. Three different processes contribute to the down- 
regulation of insulin signaling. First, phosphatases deactivate the insulin 
receptor and destroy a key second messenger. Tyrosine phosphatase 1B 
removes phosphoryl groups from the receptor, thus inactivating it. The 
second messenger PIP3 is inactivated by the phosphatase PTEN (phosphatase 
and tensin homolog), which dephosphorylates it, forming PIP2, which itself 
has no second-messenger properties. 


Phosphatidylinositol 3,4,5-trisphosphate (PIP3) → Phosphatidylinositol 4,5-bisphosphate (PIP2)   [catalyzed by PTEN] 


Second, the IRS protein can be inactivated by phosphorylation on serine 
residues by specific Ser/Thr kinases. These kinases are activated by overnu- 
trition and other stress signals and may play a role in the development 
of insulin resistance. Finally, SOCS proteins, the regulatory proteins dis- 
cussed earlier, interact with the insulin receptor and IRS-1 and apparently 
facilitate their proteolytic degradation by the proteasome complex. 


Figure 27.6 The storage capacity of fat tissue can be exceeded 
in obesity. In caloric excess, the storage capacity of adipocytes can be 
exceeded with deleterious results. The excess fat accumulates in other 
tissues, resulting in biochemical malfunction of the tissues. When the 
pancreas, muscle, liver, and cells lining the blood vessels are affected, 
metabolic syndrome, a condition that often precedes type 2 diabetes, 
may result. [After S. Fröjdö, H. Vidal, and L. Pirola. Biochim. Biophys. Acta 
1792:83-92, 2009, Fig. 1.] 
[Figure label: Caloric excess and obesity.] 


Metabolic syndrome often precedes type 2 diabetes 


With our knowledge of the key components of energy 
homeostasis, let us begin our investigation of the bio- 
chemical basis of insulin resistance and type 2 diabetes. 
Obesity is a contributing factor to the development of 
insulin resistance, which is an early development on the 
path to type 2 diabetes. Indeed, a cluster of pathologies— 
including insulin resistance, hyperglycemia, dyslipi- 
demia (high blood levels of triacylglycerols, cholesterol, 
and low-density lipoproteins)—often develop together. 
This clustering, called metabolic syndrome, is thought to 
be a predecessor of type 2 diabetes. 

A consequence of obesity is that the amount of 
triacylglycerides consumed exceeds the adipose tissue’s 
storage capacity. As a result, other tissues begin to accu- 
mulate fat, most notably liver and muscle (Figure 27.6). 
For reasons to be presented later in the chapter, this 
accumulation results in insulin resistance and ultimately 
in pancreatic failure. We will focus on muscle and the 
β cells of the pancreas. 


Excess fatty acids in muscle modify metabolism 


We have seen many times the importance of fats as 
fuels for cells. In regard to obesity, more fats are present 
than can be processed by muscle. Although the 
rate of β oxidation increases in response to the high concentration of 
fats, mitochondria are not capable of processing all of the fatty acids by 
β oxidation; fatty acids accumulate in the mitochondria and eventually spill 
over into the cytoplasm. Indeed, the inability to process all of the fatty acids 
results in their reincorporation into triacylglycerols and the accumulation of 
fat in the cytoplasm. In the cytoplasm, levels of diacylglycerol and ceramide 
(a component of sphingolipids) also increase. Diacylglycerol is a second 
messenger that activates protein kinase C (PKC) (p. 409). When active, PKC 
and other Ser/Thr protein kinases are capable of phosphorylating IRS and 
reducing the ability of IRS to propagate the insulin signal. Ceramide or its 
metabolites inhibit glucose uptake and glycogen synthesis, apparently by 
inhibiting PDK and PKB (p. 799). The result is a diet-induced insulin 
resistance (Figure 27.7). 

Figure 27.7 Excess fat in peripheral tissues can result in insulin insensitivity. Excess fat 
accumulation in peripheral tissues, most notably muscle, can disrupt some signal-transduction 
pathways and inappropriately activate others. In particular, diacylglycerides and ceramide activate 
stress-induced pathways that interfere with insulin signaling, resulting in insulin resistance. 
(Abbreviations: DAG, diacylglycerol; TGs, triacylglycerides; ROS, reactive oxygen species; CT1, 
carnitine acyltransferase 1; GLUT4, glucose transporter; ETC, electron-transport chain.) 


Insulin resistance in muscle facilitates pancreatic failure 


What is the effect of overnutrition on the pancreas? This question is important 
because a primary function of the pancreas is to respond to the presence 
of glucose in the blood by secreting insulin, a process referred to as 
glucose-stimulated insulin secretion (GSIS). Indeed, the β cell is a virtual insulin 
factory. Proinsulin mRNA constitutes 20% of the total mRNA in the pancreas, 
whereas 50% of the total protein synthesized in the pancreas is proinsulin, a 
precursor of insulin. 

Glucose enters the β cells of the pancreas through the glucose transporter 
GLUT2. Recall that GLUT2 will allow glucose transport only when 
blood glucose is plentiful, ensuring that insulin is secreted only when 
glucose is abundant, such as after a meal. The β cell metabolizes glucose to 
CO₂ and H₂O in the process of cellular respiration, generating 
ATP (Chapters 16, 17, and 18). The resulting 
increase in the ATP/ADP ratio closes an ATP-sensitive 
K⁺ channel that, when open, allows potassium to flow 
out of the cell (Figure 27.8). The resulting alteration in 
the cellular ionic environment opens a Ca²⁺ channel. The 
influx of Ca²⁺ causes insulin-containing secretory vesicles 
to fuse with the cell membrane and release insulin 
into the blood. Thus, the increase in energy charge 
resulting from the metabolism of glucose has been translated 
by the membrane proteins into a physiological 
response: the secretion of insulin and the subsequent 
removal of glucose from the blood. 

Figure 27.8 Insulin release is regulated by ATP. The metabolism of glucose by 
glycolysis and cellular respiration increases the concentration of ATP, which 
causes an ATP-sensitive potassium channel to close. The closure of this channel 
alters the charge across the membrane (V) and causes a calcium channel to open. 
The influx of calcium causes insulin-containing granules to fuse with the plasma 
membrane, releasing insulin into the blood. 
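
The chain of events in glucose-stimulated insulin secretion is a sequence of simple switches, which the following sketch makes explicit. It is a qualitative illustration, not a physiological model: the function name `insulin_secreted` is hypothetical, and the `closing_threshold` value of 1.0 is an arbitrary placeholder chosen only to separate a "low" from a "high" ATP/ADP ratio.

```python
def insulin_secreted(atp_adp_ratio: float, closing_threshold: float = 1.0) -> bool:
    """Follow the switches from ATP/ADP ratio to vesicle fusion.

    closing_threshold is an arbitrary illustrative value, not a measured one.
    """
    # A high ATP/ADP ratio closes the ATP-sensitive K+ channel.
    k_channel_open = atp_adp_ratio < closing_threshold
    # Closure of the K+ channel alters the ionic environment and opens the Ca2+ channel.
    ca_channel_open = not k_channel_open
    # Ca2+ influx causes insulin-containing vesicles to fuse with the membrane.
    vesicles_fuse = ca_channel_open
    return vesicles_fuse


if __name__ == "__main__":
    print(insulin_secreted(2.0))   # glucose abundant: ratio high
    print(insulin_secreted(0.5))   # glucose scarce: ratio low
```

The point of the sketch is that secretion depends on the energy state of the cell rather than on glucose directly, which is why the text describes the response as a translation of energy charge into a physiological signal.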

What aspect of β-cell function ultimately fails as a 
result of overnutrition, causing the transition from insulin 
resistance to full-fledged type 2 diabetes? Recall that, 
under normal circumstances, the β cells of the pancreas 
synthesize large amounts of proinsulin. The proinsulin 
folds in the endoplasmic reticulum, is processed to insulin, 
and is subsequently packaged into vesicles for secretion. As insulin 
resistance develops in the muscle, the β cells respond by synthesizing yet 
more insulin in a futile attempt to drive insulin action. The ability of the 
endoplasmic reticulum to process all of the proinsulin and insulin becomes 
compromised, a condition known as endoplasmic reticulum (ER) stress, and 
unfolded or misfolded proteins accumulate. ER stress initiates a signal 
pathway called the unfolded protein response (UPR), a pathway intended to 
save the cell. UPR consists of several steps. First, general protein synthesis 
is inhibited so as to prevent more proteins from entering the ER. Second, 
chaperone synthesis is stimulated. Recall that chaperones are proteins that 
assist the folding of other proteins. Third, misfolded proteins are removed 
from the ER and are subsequently delivered to the proteasome for destruc- 
tion. Finally, if the described response fails to alleviate the ER stress, 
apoptosis is triggered, which ultimately leads to cell death and full-fledged 
type 2 diabetes. 

What are the treatments for type 2 diabetes? Most are behavioral in nature. 
Diabetics are advised to count calories, making sure that energy intake does 
not exceed energy output; to consume a diet rich in vegetables, fruits, and 
grains; and to get plenty of aerobic exercise. Note that these guidelines are 
the same as those for healthy living, even for those not suffering from type 
2 diabetes. Treatments specific for type 2 diabetes include the monitoring 
of blood-glucose levels so that these levels are within the target range (nor- 
mal is 3.6 to 6.1 mM). For those who are not able to maintain proper glucose 
levels with the behaviors described herein, drug treatments are required. 
The administration of insulin may be necessary upon pancreatic failure, and 
treatment with the use of metformin (Glucophage), which activates AMPK, 
may be effective. 
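
Blood-glucose meters in some countries report values in mg/dL rather than mM, so a conversion is often needed when applying the range quoted above. The sketch below is illustrative: the function names are hypothetical, and the molar mass of glucose (180.16 g/mol) is a standard value that the text does not state.

```python
GLUCOSE_MOLAR_MASS = 180.16    # g/mol, standard value (assumption: not given in the text)
NORMAL_RANGE_MM = (3.6, 6.1)   # normal range quoted in the text, in mM


def mm_to_mg_per_dl(conc_mm: float) -> float:
    """Convert a glucose concentration from mmol/L to mg/dL."""
    # mmol/L x g/mol = mg/L; dividing by 10 gives mg/dL.
    return conc_mm * GLUCOSE_MOLAR_MASS / 10.0


def in_normal_range(conc_mm: float) -> bool:
    """True if the concentration lies within the range quoted in the text."""
    low, high = NORMAL_RANGE_MM
    return low <= conc_mm <= high


if __name__ == "__main__":
    for value in NORMAL_RANGE_MM:
        print(f"{value} mM = {mm_to_mg_per_dl(value):.1f} mg/dL")
```

Under these assumptions, the quoted range of 3.6 to 6.1 mM corresponds to roughly 65 to 110 mg/dL.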


Metabolic derangements in type 1 diabetes result from insulin 
insufficiency and glucagon excess 


We now turn to the more-straightforward type 1 diabetes. In type 1 diabe- 
tes, insulin production is insufficient because of autoimmune destruction of 
the β cells of the pancreas. Consequently, the glucagon/insulin ratio is at 
higher-than-normal levels. In essence, the diabetic person is in biochemical 
fasting mode despite a high concentration of blood glucose. Because insulin 
is deficient, the entry of glucose into adipose and muscle cells is impaired. The 
liver becomes stuck in a gluconeogenic and ketogenic state. The gluconeogenic 
state is characterized by excessive production of glucose. The excessive 
level of glucagon relative to that of insulin leads to a decrease in the 
amount of fructose 2,6-bisphosphate (F-2,6-BP), which stimulates glycolysis 
and inhibits gluconeogenesis in the liver. Hence, glycolysis is inhibited 
and gluconeogenesis is stimulated because of the opposite effects of 
F-2,6-BP on phosphofructokinase and fructose-1,6-bisphosphatase (Section 
16.4; Figure 27.9). Essentially, the cells’ response to a lack of insulin amplifies 
the amount of glucose in the blood. The high glucagon/insulin ratio in 
diabetes also promotes glycogen breakdown. Hence, an excessive amount of 
glucose is produced by the liver and released into the blood. Glucose is excreted 
in the urine (hence the name mellitus) when its concentration in the blood 
exceeds the reabsorptive capacity of the renal tubules. Water accompanies 
the excreted glucose, and so an untreated diabetic in the acute phase of the 
disease is hungry and thirsty. 

Figure 27.9 Regulation of glycolysis and gluconeogenesis. Phosphofructokinase is the key 
enzyme in the regulation of glycolysis, whereas fructose 1,6-bisphosphatase is the principal enzyme 
controlling the rate of gluconeogenesis. Note the reciprocal relation between the pathways and the 
signal molecules. 
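
The reciprocal regulation described in this paragraph reduces to two sign rules, sketched below. This is a qualitative illustration only: the function name `liver_glucose_pathways` and the "high"/"low" labels are hypothetical, and no concentrations or rates are implied.

```python
def liver_glucose_pathways(glucagon_insulin_ratio: str) -> dict:
    """Qualitative state of liver pathways for a 'high' or 'low' glucagon/insulin ratio."""
    if glucagon_insulin_ratio not in ("high", "low"):
        raise ValueError("glucagon_insulin_ratio must be 'high' or 'low'")

    # A high glucagon/insulin ratio lowers fructose 2,6-bisphosphate (F-2,6-BP).
    f26bp = "low" if glucagon_insulin_ratio == "high" else "high"

    # F-2,6-BP stimulates phosphofructokinase (glycolysis) and
    # inhibits fructose 1,6-bisphosphatase (gluconeogenesis).
    return {
        "F-2,6-BP": f26bp,
        "glycolysis": "stimulated" if f26bp == "high" else "inhibited",
        "gluconeogenesis": "inhibited" if f26bp == "high" else "stimulated",
    }


if __name__ == "__main__":
    # Untreated type 1 diabetes: the ratio is high despite abundant blood glucose.
    print(liver_glucose_pathways("high"))
```

With a "high" ratio, the sketch reproduces the state described in the text: F-2,6-BP falls, glycolysis is inhibited, and gluconeogenesis is stimulated.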

Because carbohydrate utilization is impaired, a lack of insulin leads to 
the uncontrolled breakdown of lipids and proteins, resulting in the ketogenic 
state. Large amounts of acetyl CoA are then produced by β oxidation. 
However, much of the acetyl CoA cannot enter the citric acid cycle, because 
there is insufficient oxaloacetate for the condensation step. Recall that 
mammals can synthesize oxaloacetate from pyruvate, a product of glycoly- 
sis, but not from acetyl CoA; instead, they generate ketone bodies. A strik- 
ing feature of diabetes is the shift in fuel usage from carbohydrates to fats; 
glucose, more abundant than ever, is spurned. In high concentrations, ketone 
bodies overwhelm the kidney’s capacity to maintain acid-base balance. The 
untreated diabetic can go into a coma because of a lowered blood-pH level 
and dehydration. Interestingly, diabetic ketosis is rarely a problem in type 2 
diabetes because insulin is active enough to prevent excessive lipolysis in 
liver and adipose tissue. 

What is the treatment for type 1 diabetes? Many of the behaviors applied 
to type 2 diabetes apply to type 1: watching calories, exercising, and eating 
a healthy diet. Likewise, blood-glucose levels must be monitored. Insulin 
treatments are required for survival. 


27.4 Exercise Beneficially Alters the 
Biochemistry of Cells 


Exercise, coupled with a healthy diet, is one of the most effective treat- 
ments for diabetes as well as a host of other pathological conditions 
including coronary disease, hypertension, depression, and a variety of 
cancers. With regard to diabetes, exercise increases the insulin sensitivity 
of people who are insulin resistant or type 2 diabetics. What is the basis of 
this beneficial effect? 


Mitochondrial biogenesis is stimulated by muscular activity 


When muscle is stimulated to contract during exercise by receiving nerve 
impulses from motor neurons, calcium is released from the sarcoplasmic 
reticulum. Calcium induces muscle contraction, as will be discussed 
in Chapter 35. Recall that calcium is also a potent second messenger and 
frequently works in association with the calcium-binding protein calm- 
odulin (p. 410). In its capacity as a second messenger, calcium stimulates 
various calcium-dependent enzymes, such as calmodulin-dependent pro- 
tein kinase. The calcium-dependent enzymes, as well as AMPK, subse- 
quently activate particular transcription-factor complexes. As we will see 
in Chapters 29 and 31, transcription factors are proteins that control gene 
expression. Two patterns of gene expression, in particular, change in
response to regular exercise (Figure 27.10). Regular exercise enhances the 
production of proteins required for fatty acid metabolism, such as the 
enzymes of β oxidation. Interestingly, fatty acids themselves function as
signal molecules to activate the transcription of enzymes of fatty acid 
metabolism. Additionally, another set of transcription factors activated 
by the calcium signal cascade institutes metabolic reprogramming that 
leads to increased mitochondrial biogenesis. In concert, the increase in 
fatty acid oxidizing capability and additional mitochondria allow for the 
efficient metabolism of fatty acids. Because an excess of fatty acids results 
in insulin resistance, as already discussed, efficient metabolism of fatty 
acids results in an increase in insulin sensitivity. Indeed, muscles of well- 
trained athletes may contain high concentrations of triacylglycerides and 
still maintain exquisite sensitivity to insulin. 


[Figure 27.10 diagram labels: action potential; fatty acid oxidation]


Figure 27.10 Exercise results in mitochondrial biogenesis and enhanced fat metabolism. 
An action potential causes Ca²⁺ release from the sarcoplasmic reticulum (SR), the muscle-cell
equivalent of the endoplasmic reticulum. The Ca²⁺, in addition to instigating muscle contraction,
activates nuclear transcription factors that activate genes that, in conjunction with mitochondrial
genes, are responsible for mitochondrial biogenesis. Fatty acids activate a different set of genes that
increase the fatty acid oxidation capability of mitochondria. [After D. A. Hood. J. Appl. Physiol.
90:1137-1157, 2001, Fig. 2.]


Fuel choice during exercise is determined by the intensity and 
duration of activity 


In keeping with our theme of energy use under different physiological 
conditions, we now examine how fuels are used in different types of exer- 
cise. The fuels used in anaerobic exercises—sprinting, for example—differ 
from those used in aerobic exercises—such as distance running. The 
selection of fuels during these different forms of exercise illustrates many 
important facets of energy transduction and metabolic integration. ATP 
directly powers myosin, the protein immediately responsible for convert- 
ing chemical energy into movement (Chapter 35). However, the amount 
of ATP in muscle is small. Hence, the power output and, in turn, the 
velocity of running depend on the rate of ATP production from other 
fuels. As shown in Table 27.3, creatine phosphate (phosphocreatine) can 
swiftly transfer its high-potential phosphoryl group to ADP to generate 
ATP. However, the amount of creatine phosphate, like that of ATP itself, 
is limited. Creatine phosphate and ATP can power intense muscle con- 
traction for 5 to 6 s. Maximum speed in a sprint can thus be maintained 
for only 5 to 6 s (see Figure 15.7). Thus, the winner in a 100-meter sprint 
is the runner who both achieves the highest initial velocity and then slows 
down the least. 
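
The 5-to-6-second limit can be checked against the figures in Table 27.3. The sketch below divides each store by its own maximal rate of ATP production. It assumes, unrealistically, that a store could run at its maximal rate until empty, so the results are rough orders of magnitude rather than physiological predictions. The dictionary and function names are illustrative only. For creatine phosphate the quotient is about 6 s, in line with the text.

```python
# Values transcribed from Table 27.3 (70-kg person, 28 kg of muscle).
# Each entry: (maximal rate of ATP production in mmol/s, total ~P in mmol).
FUELS = {
    "creatine phosphate": (73.3, 446),
    "muscle glycogen -> lactate": (39.1, 6_700),
    "muscle glycogen -> CO2": (16.7, 84_000),
    "liver glycogen -> CO2": (6.2, 19_000),
    "adipose fatty acids -> CO2": (6.7, 4_000_000),
}


def seconds_at_max_rate(rate_mmol_per_s, total_mmol):
    """Idealized time a store lasts if drained at its own maximal rate."""
    return total_mmol / rate_mmol_per_s


durations = {name: seconds_at_max_rate(*values) for name, values in FUELS.items()}

for name, seconds in durations.items():
    print(f"{name:30s} {seconds:12.0f} s")
```

The ordering of the output mirrors the trade-off described in the text: the fastest sources are the smallest.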

During a ~10-second sprint, the ATP level in muscle drops from 5.2 
to 3.7 mM, and that of creatine phosphate decreases from 9.1 to 2.6 mM. 
Anaerobic glycolysis provides fuel to make up for the loss of ATP and 
creatine phosphate. A 100-meter sprint is powered by stored ATP, creatine 
phosphate, and the anaerobic glycolysis of muscle glycogen. The conver- 
sion of muscle glycogen into lactate can generate a good deal more ATP, 
but the rate is slower than that of phosphoryl-group transfer from cre- 
atine phosphate. Because of anaerobic glycolysis, the blood-lactate level 
is elevated from 1.6 to 8.3 mM. The release of H⁺ from the intensely
active muscle concomitantly lowers the blood pH from 7.42 to 7.24. 
This pace cannot be sustained in a 1000-meter run (~132 s) for two rea- 
sons. First, creatine phosphate is consumed within a few seconds. 
Second, the lactate produced would cause acidosis. Thus, alternative 
fuel sources are needed. 

The complete oxidation of muscle glycogen to CO₂ by aerobic respiration
substantially increases the energy yield, but this aerobic process is
a good deal slower than anaerobic glycolysis. However, as the distance of 
a run increases, aerobic respiration, or oxidative phosphorylation, 
becomes increasingly important. For instance, part of the ATP consumed 
in a 1000-meter run must come from oxidative phosphorylation. Because 
ATP is produced more slowly by oxidative phosphorylation than by 
glycolysis (see Table 27.3), the runner’s pace is necessarily slower than in
a 100-meter sprint. The championship velocity for the 1000-meter run is
about 7.6 m s⁻¹ compared with approximately 10.4 m s⁻¹ for the 100-meter
event (Figure 27.11).


Table 27.3 Fuel sources for muscle contraction

                                                   Maximal rate of    Total
                                                   ATP production     ~P available
Fuel source                                        (mmol s⁻¹)         (mmol)
Muscle ATP                                                            223
Creatine phosphate                                 73.3               446
Conversion of muscle glycogen into lactate         39.1               6,700
Conversion of muscle glycogen into CO₂             16.7               84,000
Conversion of liver glycogen into CO₂              6.2                19,000
Conversion of adipose-tissue fatty acids into CO₂  6.7                4,000,000

Note: Fuels stored are estimated for a 70-kg person having a muscle mass of 28 kg.
Source: After E. Hultman and R. C. Harris. In Principles of Exercise Biochemistry, edited by J. R. Poortmans
(Karger, 2004), pp. 78-119.


[Figure 27.11 plot: velocity (m s⁻¹) against race duration, with points labeled 100-meter sprint, 1000-meter run (131.96 s), and marathon run (42,000 meters; 2 hr 3 min 59 s).]

Figure 27.11 Dependency of the velocity of running on the duration of
the race. The values shown are world track records. [Data from trackandfieldnews.com.]


The running of a marathon (26 miles 385 yards, or 42,200 meters) requires
a different selection of fuels and is characterized by cooperation between
muscle, liver, and adipose tissue. Liver glycogen complements muscle
glycogen as an energy store that can be tapped. However, the total body
glycogen stores (103 mol of ATP at best) are insufficient to provide the
150 mol of ATP needed for this grueling ~2-hour event. Much
larger quantities of ATP can be obtained by the 
oxidation of fatty acids derived from the breakdown of fat in adipose tis- 
sue, but the maximal rate of ATP generation is slower yet than that of 
glycogen oxidation and is more than 10-fold slower than that with cre- 
atine phosphate. Thus, ATP is generated much more slowly from high- 
capacity stores than from limited ones, accounting for the different veloci- 
ties of anaerobic and aerobic events. ATP generation from fatty acids is 
essential for distance running. Fats are rapidly consumed in activities 
such as distance running, explaining why extended aerobic exercise is 
beneficial for people who are insulin resistant. However, for an elite 
marathoner, fats cannot be the sole source of fuel. Studies have shown 
that, when muscle glycogen has been depleted, the power output of the 
muscle falls to approximately 50% of maximum. Power output decreases 
despite the fact that ample supplies of fat are available, suggesting that 
fats can supply only about 50% of maximal aerobic effort. A marathon 
would take about 6 hours to run if all the ATP came from fatty acid 
oxidation, because it is much slower than glycogen oxidation. Elite run- 
ners consume about equal amounts of glycogen and fatty acids during a 
marathon to achieve a mean velocity of 5.5 m s⁻¹, about half the velocity
of a 100-meter sprint. How is an optimal mix of these fuels achieved? 
A low blood-sugar level leads to a high glucagon/insulin ratio, which in 
turn mobilizes fatty acids from adipose tissue. Fatty acids readily enter 
muscle, where they are degraded by β oxidation to acetyl CoA and then
to CO₂. The elevated acetyl CoA level decreases the activity of the pyruvate
dehydrogenase complex to block the conversion of pyruvate into
acetyl CoA. Hence, fatty acid oxidation decreases the funneling of glu- 
cose into the citric acid cycle and oxidative phosphorylation. Glucose is 
spared so that just enough remains available at the end of the marathon. 
The simultaneous use of both fuels gives a higher mean velocity than 
would be attained if glycogen were totally consumed before the start of 
fatty acid oxidation. 
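
Two figures quoted in this passage, the 103 mol of ATP from total body glycogen and the roughly 6-hour marathon on fat alone, follow from Table 27.3 and the stated need of 150 mol of ATP. The sketch below reproduces that arithmetic. The variable names are illustrative, and the fat-only estimate assumes fatty acid oxidation runs at its maximal tabulated rate for the whole race.

```python
ATP_NEEDED_MOL = 150.0          # ATP required for a marathon, from the text
MUSCLE_GLYCOGEN_MMOL = 84_000   # Table 27.3, muscle glycogen oxidized to CO2
LIVER_GLYCOGEN_MMOL = 19_000    # Table 27.3, liver glycogen oxidized to CO2
FATTY_ACID_RATE_MMOL_S = 6.7    # Table 27.3, maximal rate from fatty acids

# Total ATP obtainable from glycogen, converted from mmol to mol.
glycogen_mol = (MUSCLE_GLYCOGEN_MMOL + LIVER_GLYCOGEN_MMOL) / 1000

# ATP that must come from another fuel, chiefly fatty acids.
shortfall_mol = ATP_NEEDED_MOL - glycogen_mol

# Race duration if every mole of ATP came from fatty acid oxidation.
fat_only_hours = ATP_NEEDED_MOL * 1000 / FATTY_ACID_RATE_MMOL_S / 3600

print(f"ATP from glycogen: {glycogen_mol:.0f} mol")
print(f"Shortfall:         {shortfall_mol:.0f} mol")
print(f"Fat-only marathon: {fat_only_hours:.1f} h")
```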

If carbohydrate-rich meals are consumed after glycogen depletion, gly- 
cogen stores are rapidly restored. In addition, glycogen synthesis continues 
during the consumption of carbohydrate-rich meals, increasing glycogen 
stores far above normal. This phenomenon is called “super compensation”
or, more commonly, carbo-loading. 


27.5 Food Intake and Starvation Induce 
Metabolic Changes 


Thus far, we have been considering metabolism in the context of excess con- 
sumption of calories, as in obesity, or extreme caloric needs, as in exercise. We 
now look at the opposite physiological condition—lack of calories. 


The starved-fed cycle is the physiological response to a fast 


We begin with a physiological condition called the starved-fed cycle, which
we all experience in the hours after an evening meal and through the night’s
fast. This nightly starved-fed cycle has three stages: the well-fed state after
a meal, the early fasting during the night, and the refed state after breakfast. 
A major goal of the many biochemical alterations in this period is to main- 
tain glucose homeostasis—that is, a constant blood-glucose level. Maintaining 
glucose homeostasis is especially important because glucose is normally the 
only fuel source for the brain. As discussed earlier, the major defect in dia- 
betes is the inability to perform this vital task. The two primary signals 
regulating the starved-fed cycle are insulin and glucagon.


1. The Well-Fed, or Postprandial, State. After we consume and digest an 
evening meal, glucose and amino acids are transported from the intestine to 
the blood. The dietary lipids are packaged into chylomicrons and trans- 
ported to the blood by the lymphatic system. This fed condition leads to the 
secretion of insulin, which, in cooperation with glucagon, maintains glucose
homeostasis. In essence, insulin signals the fed state; it stimulates the stor- 
age of fuels and the synthesis of proteins in a variety of ways. Insulin stimu- 
lates glycogen synthesis in both muscle and the liver and suppresses gluco- 
neogenesis by the liver. Insulin also accelerates glycolysis in the liver, which 
in turn increases the synthesis of fatty acids. 

The liver helps to limit the amount of glucose in the blood during times 
of plenty by storing it as glycogen so as to be able to release glucose in times 
of scarcity. How is the excess blood glucose present after a meal removed? 
The liver is able to trap large quantities of glucose because it possesses an 
isozyme of hexokinase called glucokinase, which converts glucose into glu- 
cose 6-phosphate, which cannot be transported out of the cell. Recall that 
glucokinase has a high K_M value and is thus active only when blood-glucose
levels are high. Furthermore, glucokinase is not inhibited by glucose 
6-phosphate as hexokinase is. Consequently, the liver forms glucose 6-phosphate 
more rapidly as the blood-glucose level rises. The increase in glucose 6-phosphate 
coupled with insulin action leads to a buildup of glycogen stores. The hor- 
monal effects on glycogen synthesis and storage are reinforced by a direct 
action of glucose itself. Phosphorylase a is a glucose sensor in addition to being 
the enzyme that cleaves glycogen. When the glucose level is high, the binding 
of glucose to phosphorylase a renders the enzyme susceptible to the action 
of a phosphatase that converts it into phosphorylase b, which does not read- 
ily degrade glycogen (Section 21.2). Thus, glucose allosterically shifts the 
glycogen system from a degradative to a synthetic mode. 
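
The consequence of glucokinase's high K_M can be illustrated with the Michaelis-Menten expression v/Vmax = [S]/(K_M + [S]). The sketch below is a simplification under stated assumptions: both K_M values are illustrative numbers chosen only to contrast a low-K_M enzyme with a high-K_M one, simple hyperbolic kinetics are assumed for both enzymes, and inhibition of hexokinase by glucose 6-phosphate is ignored.

```python
def fractional_rate(glucose_mM, km_mM):
    """Michaelis-Menten fraction of Vmax at a given substrate concentration."""
    return glucose_mM / (km_mM + glucose_mM)


# Assumed, illustrative K_M values (mM); not taken from the text.
HEXOKINASE_KM = 0.1
GLUCOKINASE_KM = 10.0

for glucose in (4.0, 5.0, 10.0, 15.0):
    hexokinase = fractional_rate(glucose, HEXOKINASE_KM)
    glucokinase = fractional_rate(glucose, GLUCOKINASE_KM)
    print(f"[glucose] = {glucose:4.1f} mM  "
          f"hexokinase {hexokinase:.2f} Vmax  glucokinase {glucokinase:.2f} Vmax")
```

Under these assumptions the low-K_M enzyme is nearly saturated at every concentration shown, whereas the high-K_M enzyme keeps responding as the glucose level rises, which is the behavior the text attributes to the liver.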

The high insulin level in the fed state also promotes the entry of glucose 
into muscle and adipose tissue. Insulin stimulates the synthesis of glycogen 
by muscle as well as by the liver. The entry of glucose into adipose tissue 
provides glycerol 3-phosphate for the synthesis of triacylglycerols. The 
action of insulin also extends to amino acid and protein metabolism. Insulin 
promotes the uptake of branched-chain amino acids (valine, leucine, and 
isoleucine) by muscle. Indeed, insulin has a general stimulating effect on 
protein synthesis, which favors a building up of muscle protein. In addition, 
it inhibits the intracellular degradation of proteins. 


2. The Early Fasting, or Postabsorptive, State. The blood-glucose level 
begins to drop several hours after a meal, leading to a decrease in insulin 
secretion and a rise in glucagon secretion; glucagon is secreted by the α cells
of the pancreas in response to a low blood-sugar level in the fasting state. 
Just as insulin signals the fed state, glucagon signals the starved state. It 
serves to mobilize glycogen stores when there is no dietary intake of
glucose. Glucagon stimulates glycogen breakdown and inhibits glycogen synthesis by triggering the
cyclic AMP cascade leading to the phosphorylation and activation of
phosphorylase and the inhibition of glycogen synthase (Section 21.5). 
Glucagon also inhibits fatty acid synthesis by diminishing the production 
of pyruvate and by lowering the activity of acetyl CoA carboxylase by 
maintaining it in a phosphorylated state. In addition, glucagon stimulates 
gluconeogenesis in the liver and blocks glycolysis by lowering the level of 
F-2,6-BP (see Figure 27.9). 

All known actions of glucagon are mediated by protein kinases that are 
activated by cyclic AMP. The activation of the cyclic AMP cascade results 
in a higher level of phosphorylase a activity and a lower level of glycogen 
synthase a activity. Glucagon’s effect on this cascade is reinforced by the 
low concentration of glucose in the blood. The diminished binding of glu- 
cose to phosphorylase a makes the enzyme less susceptible to the hydro- 
lytic action of the phosphatase. Instead, the phosphatase remains bound to 
phosphorylase a, and so the synthase stays in the inactive phosphorylated 
form. Consequently, there is a rapid mobilization of glycogen. 

The large amount of glucose formed by the hydrolysis of glucose 
6-phosphate derived from glycogen is then released from the liver into the 
blood. The entry of glucose into muscle and adipose tissue decreases in 
response to a low insulin level. The diminished utilization of glucose by 
muscle and adipose tissue also contributes to the maintenance of the blood- 
glucose level. The net result of these actions of glucagon is to markedly 
increase the release of glucose by the liver. Both muscle and the liver use fatty 
acids as fuel when the blood-glucose level drops, saving the glucose for use 
by the brain and red blood cells. Thus, the blood-glucose level is kept at or 
above 4.4 mM (80 mg dl⁻¹) by three major factors: (1) the mobilization of glycogen
and the release of glucose by the liver, (2) the release of fatty acids by
adipose tissue, and (3) the shift in the fuel used from glucose to fatty acids by 
muscle and the liver. 
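
The two units quoted for the blood-glucose level are related through the molar mass of glucose. The sketch below performs the conversion. The molar mass of 180.16 g/mol for C6H12O6 is supplied here rather than taken from the text, and the function name is illustrative.

```python
GLUCOSE_G_PER_MOL = 180.16  # molar mass of glucose (C6H12O6), supplied here


def mM_to_mg_per_dl(conc_mM, molar_mass=GLUCOSE_G_PER_MOL):
    """Convert a millimolar concentration to milligrams per deciliter."""
    mg_per_liter = conc_mM * molar_mass  # mmol/L x mg/mmol = mg/L
    return mg_per_liter / 10             # 1 L = 10 dL


print(f"4.4 mM glucose = {mM_to_mg_per_dl(4.4):.1f} mg/dl")
```

The result is just under 80 mg/dl, which the text rounds to 80.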

What is the result of the depletion of the liver’s glycogen stores? 
Gluconeogenesis from lactate and alanine continues, but this process mere- 
ly replaces glucose that had already been converted into lactate and alanine 
by tissues such as muscle and red blood cells. Moreover, the brain oxidizes 
glucose completely to CO₂ and H₂O. Thus, for the net synthesis of glucose
to take place, another source of carbon is required. Glycerol released from 
adipose tissue on lipolysis provides some of the carbon atoms, with the 
remaining carbon atoms coming from the hydrolysis of muscle proteins. 


3. The Refed State. What are the biochemical responses to a hearty break- 
fast? Fat is processed exactly as it is processed in the normal fed state. 
However, this is not the case for glucose. The liver does not initially absorb
glucose from the blood, but, instead, leaves it for the other tissues. Moreover, 
the liver remains in a gluconeogenic mode. Now, however, the newly syn- 
thesized glucose is used to replenish the liver’s glycogen stores. As the 
blood-glucose levels continue to rise, the liver completes the replenishment 
of its glycogen stores and begins to process the remaining excess glucose for 
fatty acid synthesis. 


Metabolic adaptations in prolonged starvation minimize 
protein degradation 


Earlier, we considered the metabolic results of overnutrition, a condition 
becoming all too common in prosperous nations. Let us now examine the 
opposite extreme. What are the adaptations if fasting is prolonged to the 
point of starvation, a circumstance affecting nearly a billion people
worldwide? A typical well-nourished 70-kg man has fuel reserves totaling about
670,000 kJ (161,000 kcal; see Table 27.4). The energy need for a 24-hour
period ranges from about 6700 kJ (1600 kcal) to 25,000 kJ (6000 kcal),
depending on the extent of activity. Thus, stored fuels suffice to meet
caloric needs in starvation for 1 to 3 months. However, the carbohydrate
reserves are exhausted in only a day.


Table 27.4 Fuel reserves in a typical 70-kg man

                  Available energy in kilojoules (kcal)
Organ             Glucose or glycogen   Triacylglycerols      Mobilizable proteins
Blood             250 (60)              200 (45)              0 (0)
Liver             1700 (400)            2000 (450)            1700 (400)
Brain             30 (8)                0 (0)                 0 (0)
Muscle            5000 (1200)           2000 (450)            100,000 (24,000)
Adipose tissue    330 (80)              560,000 (135,000)     170 (40)

Source: After G. F. Cahill, Jr. Clin. Endocrinol. Metab. 5(1976):398.
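
The survival estimate and the one-day limit on carbohydrate can both be checked with simple division. The sketch below uses the total reserve and daily needs quoted in the text, together with the glucose-or-glycogen column of Table 27.4. The variable names are illustrative, and the calculation assumes that every kilojoule of reserve is usable at a constant daily rate, which the rest of this section shows is not strictly true.

```python
TOTAL_RESERVES_KJ = 670_000     # total fuel reserves, from the text
DAILY_NEED_LOW_KJ = 6_700       # 24-hour need at low activity
DAILY_NEED_HIGH_KJ = 25_000     # 24-hour need at high activity

# Glucose or glycogen column of Table 27.4 (kJ).
CARBOHYDRATE_KJ_BY_ORGAN = {
    "blood": 250,
    "liver": 1_700,
    "brain": 30,
    "muscle": 5_000,
    "adipose tissue": 330,
}

days_low = TOTAL_RESERVES_KJ / DAILY_NEED_LOW_KJ    # longest survival estimate
days_high = TOTAL_RESERVES_KJ / DAILY_NEED_HIGH_KJ  # shortest survival estimate

carbohydrate_kj = sum(CARBOHYDRATE_KJ_BY_ORGAN.values())
carbohydrate_days = carbohydrate_kj / DAILY_NEED_LOW_KJ

print(f"All reserves:      {days_high:.0f} to {days_low:.0f} days")
print(f"Carbohydrate only: {carbohydrate_days:.1f} days at the lowest daily need")
```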

Even under starvation conditions, the blood-glucose level must be 
maintained above 2.2 mM (40 mg dl⁻¹). The first priority of metabolism in
starvation is to provide sufficient glucose to the brain and other tissues (such as 
red blood cells) that are absolutely dependent on this fuel. However, precursors 
of glucose are not abundant. Most energy is stored in the fatty acyl moieties 
of triacylglycerols. However, recall that fatty acids cannot be converted into 
glucose, because acetyl CoA resulting from fatty acid breakdown cannot be 
transformed into pyruvate (p. 656). The glycerol moiety of triacylglycerol 
can be converted into glucose, but only a limited amount is available. The 
only other potential source of glucose is the carbon skeletons of amino acids 
derived from the breakdown of proteins. However, proteins are not stored, 
and so any breakdown will necessitate a loss of function. Thus, the second 
priority of metabolism in starvation is to preserve protein, which is accom- 
plished by shifting the fuel being used from glucose to fatty acids and ketone 
bodies (Figure 27.12). 

The metabolic changes on the first day of starvation are like those after 
an overnight fast. The low blood-sugar level leads to decreased secretion of 
insulin and increased secretion of glucagon. The dominant metabolic pro- 
cesses are the mobilization of triacylglycerols in adipose tissue and gluconeogen- 
esis by the liver. The liver obtains energy for its own needs by oxidizing fatty 
acids released from adipose tissue. The concentrations of acetyl CoA and 
citrate consequently increase, which switches off glycolysis. The uptake of 
glucose by muscle is markedly diminished because of the low insulin level, 
whereas fatty acids enter freely. Consequently, muscle uses no glucose and 
relies exclusively on fatty acids for fuel. The β oxidation of fatty acids by
muscle halts the conversion of pyruvate into acetyl CoA, because acetyl 
CoA stimulates the phosphorylation of the pyruvate dehydrogenase com- 
plex, which renders it inactive (Section 17.3). Hence, pyruvate, lactate, and 
alanine are exported to the liver for conversion into glucose. Glycerol derived 
from the cleavage of triacylglycerols is another raw material for the synthe- 
sis of glucose by the liver. 

Proteolysis also provides carbon skeletons for gluconeogenesis. During 
starvation, degraded proteins are not replenished and serve as carbon 
sources for glucose synthesis. Initial sources of protein are those that turn 
over rapidly, such as proteins of the intestinal epithelium and the secretions 
of the pancreas. Proteolysis of muscle protein provides some of the three- 
carbon precursors of glucose. However, survival for most animals depends 
on being able to move rapidly, which requires a large muscle mass, and so 
muscle loss must be minimized. 
Table 27.5 Fuel metabolism in starvation

                                    Amount formed or consumed
                                    in 24 hours (grams)
Fuel exchanges and consumption      3rd day       40th day
Fuel use by the brain
  Glucose                           100           40
  Ketone bodies                     50            100
All other use of glucose            50            40
Fuel mobilization
  Adipose-tissue lipolysis          180           180
  Muscle-protein degradation        75            20
Fuel output of the liver
  Glucose                           150           80
  Ketone bodies                     150           150


How is the loss of muscle curtailed? After about 3 days of starvation, the 
liver forms large amounts of acetoacetate and D-3-hydroxybutyrate (ketone 
bodies; Figure 27.13). Their synthesis from acetyl CoA increases markedly 
because the citric acid cycle is unable to oxidize all the acetyl units gener- 
ated by the degradation of fatty acids. Gluconeogenesis depletes the supply 
of oxaloacetate, which is essential for the entry of acetyl CoA into the citric 
acid cycle. Consequently, the liver produces large quantities of ketone bod- 
ies, which are released into the blood. At this time, the brain begins to con- 
sume significant amounts of acetoacetate in place of glucose. After 3 days of 
starvation, about a quarter of the energy needs of the brain are met by ketone 
bodies (Table 27.5). The heart also uses ketone bodies as fuel. 

After several weeks of starvation, ketone bodies become the major fuel of 
the brain. Acetoacetate is activated by the transfer of CoA from succinyl 
CoA to give acetoacetyl CoA (Figure 27.14). Cleavage by thiolase then 
yields two molecules of acetyl CoA, which enter the citric acid cycle. In 
essence, ketone bodies are equivalents of fatty acids that are an accessible fuel 
source for the brain. Only 40 g of glucose is then needed per day for the 
brain, compared with about 120 g in the first day of starvation. The effective 
conversion of fatty acids into ketone bodies by the liver and their use by the 
brain markedly diminishes the need for glucose. Hence, less muscle is degraded 
than in the first days of starvation. The breakdown of 20 g of muscle daily 
compared with 75 g early in starvation is most important for survival. A 
person’s survival time is mainly determined by the size of the triacylglyc- 
erol depot. 
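
The figures in Table 27.5 can be read as a daily glucose budget: what the liver releases should equal what the brain and other tissues consume. The sketch below checks that balance on both days and computes how much muscle-protein degradation falls between them. The dictionary keys and function name are illustrative, and the values are grams per 24 hours as tabulated.

```python
# Selected rows of Table 27.5 (grams formed or consumed in 24 hours).
TABLE_27_5 = {
    "3rd day": {
        "brain_glucose": 100,
        "other_glucose": 50,
        "liver_glucose_output": 150,
        "muscle_protein_degraded": 75,
    },
    "40th day": {
        "brain_glucose": 40,
        "other_glucose": 40,
        "liver_glucose_output": 80,
        "muscle_protein_degraded": 20,
    },
}


def glucose_balance(day):
    """Liver glucose output minus total glucose consumption (g per day)."""
    row = TABLE_27_5[day]
    return row["liver_glucose_output"] - (row["brain_glucose"] + row["other_glucose"])


early = TABLE_27_5["3rd day"]["muscle_protein_degraded"]
late = TABLE_27_5["40th day"]["muscle_protein_degraded"]
protein_sparing = 1 - late / early  # fractional drop in muscle breakdown

for day in TABLE_27_5:
    print(f"{day}: glucose balance {glucose_balance(day)} g")
print(f"Muscle-protein degradation falls by {protein_sparing:.0%}")
```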

What happens after depletion of the triacylglycerol stores? The only 
source of fuel that remains is protein. Protein degradation accelerates, and 
death inevitably results from a loss of heart, liver, or kidney function. 


27.6 Ethanol Alters Energy Metabolism in the Liver 


Ethanol has been a part of the human diet for centuries. However, its consumption
in excess can result in a number of health problems, most notably
liver damage. What is the biochemical basis of these health problems? 


Ethanol metabolism leads to an excess of NADH 


Ethanol cannot be excreted and must be metabolized, primarily by the liver. 
This metabolism is accomplished by two pathways. The first pathway com- 
prises two steps. The first step, catalyzed by the enzyme alcohol dehydroge- 
nase, takes place in the cytoplasm: 


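$$\underset{\text{Ethanol}}{\mathrm{CH_3CH_2OH}} + \mathrm{NAD^+} \xrightarrow{\text{Alcohol dehydrogenase}} \underset{\text{Acetaldehyde}}{\mathrm{CH_3CHO}} + \mathrm{NADH} + \mathrm{H^+}$$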


The second step, catalyzed by aldehyde dehydrogenase, takes place in 
mitochondria: 


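$$\underset{\text{Acetaldehyde}}{\mathrm{CH_3CHO}} + \mathrm{NAD^+} + \mathrm{H_2O} \xrightarrow{\text{Aldehyde dehydrogenase}} \underset{\text{Acetate}}{\mathrm{CH_3COO^-}} + \mathrm{NADH} + \mathrm{H^+}$$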


Note that ethanol consumption leads to an accumulation of NADH. This high 
concentration of NADH inhibits gluconeogenesis by preventing the oxida- 
tion of lactate to pyruvate. In fact, the high concentration of NADH will 
cause the reverse reaction to predominate, and lactate will accumulate. The 
consequences may be hypoglycemia and lactic acidosis. 

The overabundance of NADH also inhibits fatty acid oxidation. The 
metabolic purpose of fatty acid oxidation is to generate NADH for ATP 
generation by oxidative phosphorylation, but an alcohol consumer’s NADH 
needs are met by ethanol metabolism. In fact, the excess NADH signals 
that conditions are right for fatty acid synthesis. Hence, triacylglycerols 
accumulate in the liver, leading to a condition known as “fatty liver” that is
exacerbated in obese persons. The biochemical effects of ethanol consump- 
tion can be quite rapid. For instance, fat accumulates in the liver within a 
few days of moderate alcohol consumption. This accumulation is reversible 
with a decrease in alcohol intake. 

The second pathway for ethanol metabolism is called the ethanol-in- 
ducible microsomal ethanol-oxidizing system (MEOS). This cytochrome 
P450-dependent pathway (Section 26.4) generates acetaldehyde and subse- 
quently acetate while oxidizing biosynthetic reducing power, NADPH, to 
NADP⁺. Because it uses oxygen, this pathway generates free radicals that
damage tissues. Moreover, because the system consumes NADPH, the 
antioxidant glutathione cannot be regenerated (Section 20.5), exacerbat- 
ing the oxidative stress. 

What are the effects of the other metabolites of ethanol? Liver mitochon- 
dria can convert acetate into acetyl CoA in a reaction requiring ATP. The 
enzyme is the thiokinase that normally activates short-chain fatty acids. 


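$$\text{Acetate} + \text{coenzyme A} + \text{ATP} \longrightarrow \text{acetyl CoA} + \text{AMP} + \mathrm{PP_i}$$
$$\mathrm{PP_i} \longrightarrow 2\,\mathrm{P_i}$$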


However, further processing of the acetyl CoA by the citric acid cycle is 
blocked, because NADH inhibits two important citric acid cycle regulatory
enzymes: isocitrate dehydrogenase and α-ketoglutarate dehydrogenase.
The accumulation of acetyl CoA has several consequences.
First, ketone bodies will form and be released into the blood, aggravat- 
ing the acidic condition already resulting from the high lactate concen- 
tration. The processing of the acetate in the liver becomes inefficient, 
leading to a buildup of acetaldehyde. This very reactive compound 
forms covalent bonds with many important functional groups in pro- 
teins, impairing protein function. If ethanol is consistently consumed at 
high levels, the acetaldehyde can significantly damage the liver, eventu- 
ally leading to cell death. 

Liver damage from excessive ethanol consumption occurs in three 
stages. The first stage is the aforementioned development of fatty liver. In 
the second stage—alcoholic hepatitis—groups of cells die and inflammation 
results. This stage can itself be fatal. In stage three—cirrhosis—fibrous 
structure and scar tissue are produced around the dead cells. Cirrhosis 
impairs many of the liver’s biochemical functions. The cirrhotic liver is 
unable to convert ammonia into urea, and blood levels of ammonia rise. 
Ammonia is toxic to the nervous system and can cause coma and death. 
Cirrhosis of the liver arises in about 25% of alcoholics, and about 75% of all 
cases of liver cirrhosis are the result of alcoholism. Viral hepatitis is a non- 
alcoholic cause of liver cirrhosis. 


Excess ethanol consumption disrupts vitamin metabolism 


The adverse effects of ethanol are not limited to the metabolism of ethanol 
itself. Vitamin A (retinol) is converted into retinoic acid, an important 
signal molecule for growth and development in vertebrates, by the same 
dehydrogenases that metabolize ethanol. Consequently, this activation does 
not take place in the presence of ethanol, which acts as a competitive inhib- 
itor. Moreover, the MEOS system induced by ethanol inactivates retinoic 
acid. These disruptions in the retinoic acid signaling pathway are believed 
to be responsible, at least in part, for fetal alcohol syndrome as well as the 
development of a variety of cancers. 

The disruption of vitamin A metabolism is a direct result of the bio- 
chemical changes induced by excess ethanol consumption. Other dis- 
ruptions in metabolism result from another common characteristic of 
alcoholics—malnutrition. Alcoholics will frequently drink instead of 
eating. A dramatic neurological disorder, referred to as Wernicke-Korsakoff
syndrome, results from insufficient intake of the vitamin thiamine.
Symptoms include mental confusion, unsteady gait, and lack of
fine motor skills. The symptoms of Wernicke-Korsakoff syndrome are
similar to those of beriberi (Section 17.4) because both conditions result 
from a lack of thiamine. Thiamine is converted into the coenzyme thiamine 
pyrophosphate, a key constituent of the pyruvate dehydrogenase com- 
plex. Recall that this complex links glycolysis with the citric acid cycle. 
Disruptions in the pyruvate dehydrogenase complex are most evident as 
neurological disorders because the brain is normally dependent on glu- 
cose for energy generation. 

Alcoholic scurvy is occasionally observed because of an insufficient 
ingestion of vitamin C. Vitamin C is required for the formation of 
stable collagen fibers. The symptoms of scurvy include skin lesions 
and blood-vessel fragility. Most notable are bleeding gums, the loss of 
teeth, and periodontal infections. Gums are especially sensitive to a 
lack of vitamin C because the collagen in gums turns over rapidly. 
What is the biochemical basis for scurvy? Vitamin C is required for the 
continued activity of prolyl hydroxylase. This enzyme synthesizes
4-hydroxyproline, an amino acid that is required in collagen. To form
this unusual amino acid, proline residues on the amino side of glycine
residues in nascent collagen chains become hydroxylated. One oxygen
atom from O₂ becomes attached to C-4 of proline while the other oxygen
atom is taken up by α-ketoglutarate, which is converted into succinate
(Figure 27.15). This reaction is catalyzed by prolyl hydroxylase,
a dioxygenase, which requires an Fe²⁺ ion to activate O₂. The enzyme
also converts α-ketoglutarate into succinate without hydroxylating
proline. In this partial reaction, an oxidized iron complex is formed,
which inactivates the enzyme. How is the active enzyme regenerated?
Ascorbate (vitamin C) comes to the rescue by reducing the ferric ion of
the inactivated enzyme. In the recovery process, ascorbate is oxidized
to dehydroascorbic acid (Figure 27.16). Thus, ascorbate serves here as
a specific antioxidant. Why does impaired hydroxylation have such
devastating consequences? Collagen synthesized in the absence of ascorbate
is less stable than the normal protein. Hydroxyproline stabilizes the
collagen triple helix by forming interstrand hydrogen bonds. The
abnormal fibers formed by insufficiently hydroxylated collagen account
for the symptoms of scurvy.


Figure 27.15 Formation of 4-hydroxyproline. Proline is hydroxylated at C-4 by the action of
prolyl hydroxylase, an enzyme that activates molecular oxygen. [Scheme labels: prolyl residue,
α-ketoglutarate, prolyl hydroxylase + ascorbate, 4-hydroxyprolyl residue, succinate.]


Figure 27.16 Forms of ascorbic acid (vitamin C). Ascorbate is the ionized form of vitamin C,
and dehydroascorbic acid is the oxidized form of ascorbate. [Structures shown: ascorbic acid,
ascorbate, dehydroascorbic acid.]


Summary 


27.1 Caloric Homeostasis Is a Means of Regulating Body Weight 

Many people are able to maintain a near-constant body weight 
throughout adult life. This ability is a demonstration of caloric homeo- 
stasis, a physiological condition in which energy needs match energy 
intake. When energy intake is greater than energy needs, weight gain 
results. In the developed world, obesity is at epidemic proportions 
and is implicated as a contributing factor in a host of pathological 
conditions. 


27.2 The Brain Plays a Key Role in Caloric Homeostasis 
Various signal molecules act on the brain to control appetite. Short- 
term signals such as CCK and GLP-1 relay satiety signals to the brain 
while eating is in progress. Long-term signals include leptin and insu- 
lin. Leptin, secreted by adipose tissue in direct proportion to adipose- 
tissue mass, is an indication of fat stores. Leptin inhibits eating. 
Insulin also works in the brain, signaling carbohydrate availability. 
Leptin acts by binding to a receptor in brain neurons, which 
initiates signal-transduction pathways that reduce appetite. Obesity 
can develop in individuals with normal amounts of leptin and the 
leptin receptor, suggesting that such individuals are leptin resistant. 
Suppressors of cytokine signaling may inhibit leptin signaling, leading 
to leptin resistance and obesity. 
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27.3 


Diabetes Is a Common Metabolic Disease Often Resulting from Obesity 

Diabetes is the most common metabolic disease in the world. Type 1 
diabetes results when insulin is absent due to autoimmune destruction 
of the β cells of the pancreas. Type 2 diabetes is characterized by 
normal or higher levels of insulin, but the target tissues of insulin, 
notably muscle, do not respond to the hormone, a condition called 
insulin resistance. Obesity is a significant predisposing factor for type 
2 diabetes. 

In muscle, excess fats accumulate in an obese individual. These fats 
are processed to second messengers that activate signal-transduction 
pathways that inhibit insulin signaling, leading to insulin resistance. 
Insulin resistance in target tissues ultimately leads to pancreatic β-cell 
failure. The pancreas tries to compensate for a lack of insulin action 
by synthesizing more insulin, resulting in ER stress and subsequent 
activation of apoptotic pathways that lead to β-cell death. 

Type 1 diabetes is due to metabolic derangements resulting in an 
insufficiency of insulin and an excess of glucagon relative to a person’s 
needs. The result is an elevated blood-glucose level, the mobilization 
of triacylglycerols, and excessive ketone-body formation. Accelerated 
ketone-body formation can lead to acidosis, coma, and death in 
untreated insulin-dependent diabetics. 


27.4 Exercise Beneficially Alters the Biochemistry of Cells 

Exercise is a useful prescription for insulin resistance and type 2 diabetes. 
Muscle activity stimulates mitochondrial biogenesis in a calcium-dependent 
manner. The increase in the number of mitochondria 
facilitates fatty acid oxidation in the muscle, resulting in increased 
insulin sensitivity. 

Fuel choice in exercise is determined by the intensity and duration 
of the bout of exercise. Sprinting and marathon running are powered 
by different fuels to maximize power output. The 100-meter sprint is 
powered by stored ATP, creatine phosphate, and anaerobic glycoly- 
sis. In contrast, the oxidation of both muscle glycogen and fatty acids 
derived from adipose tissue is essential in the running of a marathon, 
a highly aerobic process. 


27.5 Food Intake and Starvation Induce Metabolic Changes 

Insulin signals the fed state; it stimulates the formation of glycogen 
and triacylglycerols and the synthesis of proteins. In contrast, gluca- 
gon signals a low blood-glucose level; it stimulates glycogen break- 
down and gluconeogenesis by the liver and triacylglycerol hydrolysis 
by adipose tissue. After a meal, the rise in the blood-glucose level leads 
to an increased secretion of insulin and a decreased secretion of gluca- 
gon. Consequently, glycogen is synthesized in muscle and the liver. 
When the blood-glucose level drops several hours later, glucose is 
then formed by the degradation of glycogen and by the gluconeogenic 
pathway, and fatty acids are released by the hydrolysis of triacylglyc- 
erols. The liver and muscle then increasingly use fatty acids instead of 
glucose to meet their own energy needs so that glucose is conserved for 
use by the brain and the red blood cells. 

The metabolic adaptations in starvation serve to minimize protein 
degradation. Large amounts of ketone bodies are formed by the liver 
from fatty acids and released into the blood within a few days after the 
onset of starvation. After several weeks of starvation, ketone bodies 
become the major fuel of the brain. The diminished need for glucose 
decreases the rate of muscle breakdown, and so the likelihood of survival 
is enhanced. 


27.6 Ethanol Alters Energy Metabolism in the Liver 


The oxidation of ethanol results in an unregulated overproduction of 
NADH, which has several consequences. A rise in the blood levels of 
lactic acid and ketone bodies causes a fall in blood pH, or acidosis. The 
liver is damaged because the excess NADH causes excessive fat formation 
as well as the generation of acetaldehyde, a reactive molecule. 
Severe liver damage can result. 


Key Terms 


caloric homeostasis (energy homeostasis) (p. 791) 


cholecystokinin (CCK) (p. 794) 
glucagon-like peptide 1 (GLP-1) (p. 795) 
leptin (p. 795) 

insulin (p. 795) 


Problems 


1. Depot fat. Adipose tissue was once only considered a 
storage site for fat. Why is this view no longer considered 
correct? 


2. Balancing act. What is meant by caloric homeostasis? 


3. Dynamic duo. What are the key hormones responsible 
for maintaining caloric homeostasis? 


4. Dual roles. What two biochemical roles does CCK play? 
GLP-1? 


5. Failure to communicate. Leptin inhibits eating and is 
secreted in amounts in direct proportion to body fat. 
Moreover, obese people have normal amounts of leptin and 
leptin receptor. Why, then, do people become obese? 


6. Many signals. Match the characteristic (1-9) with the 
appropriate hormone (a-f). 


1. Secreted by adipose tissue                 (a) leptin 
2. Stimulates liver gluconeogenesis           (b) adiponectin 
3. GPCR pathway                               (c) GLP-1 
4. Satiety signal                             (d) CCK 
5. Enhances insulin secretion                 (e) insulin 
6. Secreted by the pancreas during a fast     (f) glucagon 
7. Secreted after a meal 
8. Stimulates glycogen synthesis 
9. Missing in type 1 diabetes 


7. A key chemical. What are the sources of glucose 
6-phosphate in liver cells? 


8. Neither option is good. Differentiate between type 1 and 
type 2 diabetes. 


leptin resistance (p. 797) 
type 1 diabetes (p. 798) 
insulin resistance (p. 798) 
type 2 diabetes (p. 798) 
metabolic syndrome (p. 800) 


endoplasmic reticulum (ER) stress (p. 802) 
unfolded protein response (UPR) (p. 802) 
starved-fed cycle (p. 807) 


glucose homeostasis (p. 807) 


9. Fighting diabetes. Leptin is considered an “anti- 
diabetogenic” hormone. Explain. 


10. Metabolic energy and power. The rate of energy expendi- 
ture of a typical 70-kg person at rest is about 70 watts (W), 
like that of a light bulb. 


(a) Express this rate in kilojoules per second and in kilo- 
calories per second. 


(b) How many electrons flow through the mitochondrial 
electron-transport chain per second under these conditions? 


(c) Estimate the corresponding rate of ATP production. 


(d) The total ATP content of the body is about 50 g. 
Estimate how often an ATP molecule turns over in a person 
at rest. 


11. Respiratory quotient (RQ). This classic metabolic index 
is defined as the volume of CO₂ released divided by the 
volume of O₂ consumed. 

(a) Calculate the RQ values for the complete oxidation of 
glucose and of tripalmitoylglycerol. 

(b) What do RQ measurements reveal about the contributions 
of different energy sources during intense exercise? 
(Assume that protein degradation is negligible.) 
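
The definition of RQ in problem 11 can be turned into a short calculation. The sketch below is illustrative only: it assumes complete combustion of a fuel with formula CcHhOo to CO₂ and H₂O, and it treats gas volumes as proportional to moles. The function name `respiratory_quotient` and the molecular formulas used (C6H12O6 for glucose, C51H98O6 for tripalmitoylglycerol) are supplied here and do not come from the text.

```python
from fractions import Fraction


def respiratory_quotient(c: int, h: int, o: int) -> Fraction:
    """RQ for complete oxidation of a fuel CcHhOo to CO2 and H2O.

    Balanced reaction (assumed):
        CcHhOo + (c + h/4 - o/2) O2 -> c CO2 + (h/2) H2O
    """
    o2_consumed = Fraction(c) + Fraction(h, 4) - Fraction(o, 2)
    co2_released = Fraction(c)
    return co2_released / o2_consumed


# Formulas are assumptions made for this illustration.
glucose_rq = respiratory_quotient(6, 12, 6)        # C6H12O6
tripalmitin_rq = respiratory_quotient(51, 98, 6)   # C51H98O6

print(float(glucose_rq), round(float(tripalmitin_rq), 3))
```

Because fats are more reduced than carbohydrates, they need more O₂ per CO₂ released, so a lower RQ points to a larger contribution from fatty acid oxidation.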


12. Camel’s hump. Compare the H2O yield from the 
complete oxidation of 1 g of glucose with that of 1 g of 
tripalmitoylglycerol. Relate these values to the evolution- 
ary selection of the contents of a camel’s hump. 


13. Hungry-nourished. What is meant by the starved-fed 
cycle? 
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14. Of course, too much is bad for you. What are the pri- 
mary means of processing ethanol? 


15. Started out with burgundy, but soon hit the harder stuff. 
Describe the three stages of ethanol consumption that lead 
to liver damage and possibly death. 


16. The wages of sin. How long does a person have to jog to 
offset the calories obtained from eating 10 macadamia nuts 
(75 kJ, or 18 kcal, per nut)? (Assume an incremental power 
consumption of 400 W.) 


17. Sweet hazard. Ingesting large amounts of glucose 
before a marathon might seem to be a good way of increas- 
ing the fuel stores. However, experienced runners do not 
ingest glucose before a race. What is the biochemical reason 
for their avoidance of this potential fuel? (Hint: Consider 
the effect of glucose ingestion on the level of insulin.) 


18. Lipodystrophy. Lipodystrophy is a condition in which 
an individual lacks adipose tissue. The muscles and liver 
from such individuals are insulin resistant, and both tissues 
accumulate large amounts of triacylglycerides (hyperlipi- 
demia). The administration of leptin partly ameliorates this 
condition. What does it indicate about the relation of adi- 
pose tissue to insulin action? 


19. Therapeutic target. What would be the effect of a mutation 
in the gene for PTP1B (protein tyrosine phosphatase 1B) that 
inactivated the enzyme in a person who has type 2 diabetes? 


20. An effect of diabetes. Insulin-dependent diabetes is 
often accompanied by hypertriglyceridemia, which is an 
excess blood level of triacylglycerols in the form of very low 
density lipoproteins. Suggest a biochemical explanation. 


21. Sharing the wealth. The hormone glucagon signifies the 
starved state, yet it inhibits glycolysis in the liver. How does 
this inhibition of an energy-production pathway benefit the 
organism? 


22. Compartmentation. Glycolysis takes place in the cyto- 
plasm, whereas fatty acid degradation takes place in mito- 
chondria. What metabolic pathways depend on the inter- 
play of reactions that take place in both compartments? 


23. Kwashiorkor. The most common form of malnutrition 
in children in the world, kwashiorkor, is caused by a diet 
having ample calories but little protein. The high levels of 
carbohydrate result in high levels of insulin. What is the 
effect of high levels of insulin on 


(a) lipid utilization? 
(b) protein metabolism? 


(c) Children suffering from kwashiorkor often have large 
distended bellies caused by water from the blood leaking 
into extracellular spaces. Suggest a biochemical basis for 
this condition. 


24. One for all, all for one. How is the metabolism of the 
liver coordinated with that of skeletal muscle during strenu- 
ous exercise? 


25. A little help, please? What is the advantage of convert- 
ing pyruvate into lactate in skeletal muscle? 


26. Fuel choice. What is the major fuel for resting muscle? 
What is the major fuel for muscle under strenuous work 
conditions? 


27. Hefty reimbursement. Endurance athletes sometimes 
follow the exercise-and-diet plan described here: 7 days 
before an event, do exhaustive exercises so as to all but 
deplete glycogen stores. For the next 2 to 3 days, consume 
few carbohydrates and do moderate- to low-intensity exer- 
cises. Finally, 3 to 4 days before the event, consume a diet 
rich in carbohydrates. Explain the benefits of this regime. 


28. Oxygen deficit. After light exercise, the oxygen con- 
sumed in recovery is approximately equal to the oxygen 
deficit, which is the amount of additional oxygen that 
would have been consumed had oxygen consumption 
reached steady state immediately. How is the oxygen con- 
sumed in recovery used? 


29. Excess postexercise oxygen consumption. The oxygen 
consumed after strenuous exercise stops is significantly 
greater than the oxygen deficit and is termed excess post- 
exercise oxygen consumption (EPOC). Why is so much more 
oxygen required after intense exercise? 


30. Psychotropic effects. Ethanol is unusual in that it is 
freely soluble in both water and lipids. Thus, it has access to 
all regions of the highly vascularized brain. Although the 
molecular basis of ethanol action in the brain is not clear, 
ethanol evidently influences a number of neurotransmitter 
receptors and ion channels. Suggest a biochemical explana- 
tion for the diverse effects of ethanol. 


31. Fiber type. Skeletal muscle has several distinct fiber 
types. Type I is used primarily for aerobic activity, where- 
as type II is specialized for short, intense bursts of activity. 
How could you distinguish between these types of muscle 
fiber if you viewed them with an electron microscope? 


32. Tour de France. Cyclists in the Tour de France (more 
than 2000 miles in 3 weeks) require about 836,000 kJ 
(200,000 kcal) of energy, or 41,840 kJ (10,000 kcal) day⁻¹ 
(a resting male requires ~8368 kJ, or 2000 kcal, day⁻¹). 


(a) With the assumptions that the energy yield of ATP is 
about 50.2 kJ (12 kcal) mol⁻¹ and that ATP has a molecular 
weight of 503 g mol⁻¹, how much ATP would be expended 
by a Tour de France cyclist? 


(b) Pure ATP can be purchased at a cost of approximately 
$150 per gram. How much would it cost to power a cyclist 
through the Tour de France if the ATP had to be purchased? 


33. Spare tire. Suppose that our test subject from the beginning 
of the chapter gained 55 pounds between the ages 
of 25 and 65, and that her weight at 65 years of age is 
175 pounds. Calculate how many excess calories she consumed 
per day to gain the 55 pounds over 40 years. Assume 
that our test subject is 5 feet 6 inches tall. What is her BMI? 
Would she be considered obese at 175 lbs? 


34. Responding to stress. Why does it make good physiolog- 
ical sense that regular bouts of prolonged exercise will 
result in mitochondrial biogenesis? 


35. Too much of a good thing. What is the relation between 
fatty acid oxidation and insulin resistance in the muscle? 


36. Aneurin? Really? Why are the symptoms of beriberi 
similar to those of Wernicke-Korsakoff syndrome? 


Data Interpretation Problem 


37. Lactate threshold. The graph shows the relation between 
blood-lactate levels, oxygen consumption, and heart rate 
during exercise of increasing intensity. The values for oxy- 
gen consumption and heart rate are indicators of the degree 
of exertion. 
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(a) Why is some lactate produced even when exercise is 
moderate? 


(b) Biochemically, what is taking place when the lactate 
concentration begins to rise rapidly, a point called the lac- 
tate threshold? 


(c) Endurance athletes will sometimes measure blood- 
lactate levels during training so that they know their lactate 
threshold. Then, during events, they will race just at or 
below their lactate threshold until the late stages of the race. 
Biochemically, why is this practice wise? 


(d) Training can increase the lactate threshold. Explain. 
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CHAPTER 


[Chapter-opening figure; labels: newly synthesized DNA, template DNA.] 

Faithful copying is essential to the storage of genetic 
information. With the precision of a diligent monk copying an 
illuminated manuscript, a DNA polymerase (above) copies 
DNA strands, preserving the precise sequence of bases with 
very few errors. [(Left) The Pierpont Morgan Library/Art 
Resource.] 


Perhaps the most exciting aspect of the structure of DNA deduced by 
Watson and Crick was, as expressed in their words, that the “specific 
pairing we have postulated immediately suggests a possible copying mechanism 
for the genetic material.” A double helix separated into two single 
strands can be replicated because each strand serves as a template on which 
its complementary strand can be assembled (Figure 28.1). To preserve the 
information encoded in DNA through many cell divisions, copying of the 
genetic information must be extremely faithful. To replicate the human 
genome without mistakes, an error rate of less than 1 bp per 3 × 10⁹ bp must 
be achieved. Such remarkable accuracy is achieved through a multilayered 
system of accurate DNA synthesis (which has an error rate of 1 per 10³ to 10⁴ 
bases inserted), proofreading during DNA synthesis (which reduces that 
error rate to approximately 1 per 10⁶ to 10⁷ bp), and postreplication mismatch 
repair (which reduces the error rate to approximately 1 per 10⁹ to 10¹⁰ bp). 
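
To see what these layered error rates mean in practice, the following sketch multiplies each rate by a genome size of 3 × 10⁹ bp, the figure used above. It is a back-of-the-envelope illustration, not a model of replication: the rates entered are the ranges quoted in the paragraph (1 per 10³ to 10⁴, 1 per 10⁶ to 10⁷, and 1 per 10⁹ to 10¹⁰), and the names `GENOME_BP`, `STAGES`, and `expected_errors` are invented for this example.

```python
GENOME_BP = 3e9  # genome size used in the text (assumed haploid)

# Error-rate ranges (errors per base pair) for each fidelity layer,
# entered as (least accurate, most accurate).
STAGES = {
    "polymerase selectivity alone": (1e-3, 1e-4),
    "with proofreading": (1e-6, 1e-7),
    "with mismatch repair": (1e-9, 1e-10),
}


def expected_errors(rate_per_bp: float, genome_bp: float = GENOME_BP) -> float:
    """Expected number of errors in one pass over the genome."""
    return rate_per_bp * genome_bp


for stage, (worst, best) in STAGES.items():
    low, high = expected_errors(best), expected_errors(worst)
    print(f"{stage}: about {low:g} to {high:g} errors per genome copy")
```

Only the last layer brings the expected number of errors per genome copy down to the order of one or fewer, which is why all three mechanisms are needed together.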

Even after DNA has been initially replicated, the genome is still not safe. 
Although DNA is remarkably robust, ultraviolet light as well as a range of 
chemical species can damage DNA, introducing changes in the DNA 
sequence (mutations) or lesions that can block further DNA replication 
(Figure 28.2). All organisms contain DNA-repair systems that detect DNA 
damage and act to preserve the original sequence. Mutations in genes that 
encode components of DNA-repair systems are key factors in the development 
of cancer. Among the most potentially devastating types of DNA 
damage are double-stranded breaks in DNA. With both strands of the 
double helix broken in a local region, neither strand is intact to act as a template 
for future DNA synthesis. A mechanism used to repair such lesions 
relies on DNA recombination, that is, the reassortment of DNA sequences 
present on two different double helices. In addition to its role in DNA 
repair, recombination is crucial for the generation of genetic diversity in 
meiosis. Recombination is also the key to generating a highly diverse repertoire 
of genes for key molecules in the immune system (Chapter 34). 


[Figure 28.1 labels: original parent molecule; first-generation daughter molecules.] 

Figure 28.1 DNA replication. Each strand 
of one double helix (shown in blue) acts as a 
template for the synthesis of a new 
complementary strand (shown in red). 


Figure 28.2 DNA replication, damage, and repair. Some errors (shown as a black dot) may 
arise in the replication processes. Additional defects (shown in yellow) including modified bases, 
cross-links, and single- and double-strand breaks are introduced into DNA by subsequent 
DNA-damaging reactions. Many of the errors are detected and subsequently repaired. 


28.1 DNA Replication Proceeds by the Polymerization of 
Deoxyribonucleoside Triphosphates Along a Template 


The base sequences of newly synthesized DNA must faithfully match the 
sequences of parent DNA. To achieve faithful replication, each strand within 
the parent double helix acts as a template for the synthesis of a new DNA 
strand with a complementary sequence. The building blocks for the synthe- 
sis of the new strands are deoxyribonucleoside triphosphates. They are 
added, one at a time, to the 3’ end of an existing strand of DNA. 

Although this reaction is in principle quite simple, it is significantly 
complicated by specific features of the DNA double helix. First, the two 
strands of the double helix run in opposite directions. Because DNA strand 
synthesis always proceeds in the 5’-to-3’ direction, the DNA replication 
process must have special mechanisms to accommodate the oppositely 
directed strands. Second, the two strands of the double helix interact with 
one another in such a way that the bases, key templates for replication, are 
on the inside of the helix. Thus, the two strands must be separated from 
each other so as to generate appropriate templates. Finally, the two strands 
of the double helix wrap around each other. Thus, strand separation also 
entails the unwinding of the double helix. This unwinding creates super- 
coils that must themselves be resolved as replication continues. We begin 
with a consideration of the chemistry that underlies the formation of the 
phosphodiester backbone of newly synthesized DNA. 


DNA polymerases require a template and a primer 


DNA polymerases catalyze the formation of polynucleotide chains. Each 
incoming nucleoside triphosphate first forms an appropriate base pair with 
a base in the template. Only then does the DNA polymerase link the incoming 
base with the predecessor in the chain. Thus, DNA polymerases are 
template-directed enzymes. 

DNA polymerases add nucleotides to the 3′ end of a polynucleotide 
chain. The polymerase catalyzes the nucleophilic attack by the 3′-hydroxyl-group 
terminus of the polynucleotide chain on the α phosphoryl group of 
the nucleoside triphosphate to be added (see Figure 4.25). To initiate this 
reaction, DNA polymerases require a primer with a free 3′-hydroxyl group 
already base-paired to the template. They cannot start from scratch by adding 
nucleotides to a free single-stranded DNA template. RNA polymerase, 
in contrast, can initiate RNA synthesis without a primer, as we shall see in 
Chapter 29. 
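
The two requirements just described, a template and a base-paired primer, can be illustrated with a toy model. The sketch below is a deliberately simplified assumption-laden illustration and not a description of any real enzyme: sequences are plain strings, the template is written 3′ to 5′ so that it lines up left to right with the growing strand written 5′ to 3′, and the primer is written with DNA letters for simplicity. The names `COMPLEMENT` and `extend_primer` are invented for this example.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def extend_primer(template_3to5: str, primer_5to3: str) -> str:
    """Extend a primer along a template, one nucleotide at a time.

    template_3to5 is written 3'->5'; the returned strand is written 5'->3'.
    """
    if not primer_5to3:
        # Toy version of "cannot start from scratch".
        raise ValueError("a primer with a free 3'-hydroxyl group is required")
    if len(primer_5to3) > len(template_3to5):
        raise ValueError("primer is longer than the template")
    # The primer must already be base-paired to the template.
    for template_base, primer_base in zip(template_3to5, primer_5to3):
        if COMPLEMENT[template_base] != primer_base:
            raise ValueError("primer is not base-paired to the template")

    product = list(primer_5to3)
    for template_base in template_3to5[len(primer_5to3):]:
        # Each new nucleotide is chosen by the template and added to the 3' end.
        product.append(COMPLEMENT[template_base])
    return "".join(product)


print(extend_primer("TACGGATC", "ATG"))
```

Calling `extend_primer` with an empty primer raises an error, mirroring the statement that DNA polymerases cannot begin on a bare single-stranded template.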


All DNA polymerases have structural features in common 


The three-dimensional structures of a number of DNA polymerase enzymes 
are known. The first such structure was elucidated by Tom Steitz and 
coworkers, who determined the structure of the so-called Klenow fragment 
of DNA polymerase I from E. coli (Figure 28.3). This fragment comprises 
two main parts of the full enzyme, including the polymerase unit. This unit 
approximates the shape of a right hand with domains that are referred to as 
the fingers, the thumb, and the palm. In addition to the polymerase, the 
Klenow fragment includes a domain with 3′ → 5′ exonuclease activity that 
participates in proofreading and correcting the polynucleotide product. 


DNA polymerases are remarkably similar in overall shape, although 
they differ substantially in detail. At least five structural classes have 
been identified; some of them are clearly homologous, whereas others 
appear to be the products of convergent evolution. In all cases, the finger 
and thumb domains wrap around DNA and hold it across the enzyme’s 
active site, which comprises residues primarily from the palm domain. 
Furthermore, all DNA polymerases use similar strategies to catalyze the 
polymerase reaction, making use of a mechanism in which two metal ions 
take part. 


Two bound metal ions participate in the polymerase reaction 


Like all enzymes with nucleoside triphosphate substrates, DNA polymerases 
require metal ions for activity. Examination of the structures of DNA polymerases 
with bound substrates and substrate analogs reveals the presence of 
two metal ions in the active site. One metal ion binds both the deoxynucleoside 
triphosphate (dNTP) and the 3′-hydroxyl group of the primer, whereas 
the other interacts only with the dNTP (Figure 28.4). The two metal ions 
are bridged by the carboxylate groups of two aspartate residues in the palm 
domain of the polymerase. These side chains hold the metal ions in the 
proper positions and orientations. The metal ion bound to the primer 
activates the 3′-hydroxyl group of the primer, facilitating its attack on the α 
phosphoryl group of the dNTP substrate in the active site. The two metal 
ions together help stabilize the negative charge that accumulates on the 
pentacoordinate transition state. The metal ion initially bound to dNTP 
stabilizes the negative charge on the pyrophosphate product. 


Figure 28.3 DNA polymerase 
structure. The first DNA polymerase structure 
determined was that of a fragment of E. coli 
DNA polymerase I called the Klenow 
fragment. Notice that, like other DNA 
polymerases, the polymerase unit resembles 
a right hand with fingers (blue), palm 
(yellow), and thumb (red). The Klenow 
fragment also includes an exonuclease 
domain that removes incorrect nucleotide 
bases. [Drawn from 1DPI.pdb.] 


Figure 28.4 DNA polymerase mechanism. 
Two metal ions (typically, Mg²⁺) participate in 
the DNA polymerase reaction. One metal ion 
coordinates the 3′-hydroxyl group of the 
primer, whereas the other metal ion interacts 
only with the dNTP. The phosphoryl group of 
the nucleoside triphosphate bridges between 
the two metal ions. The hydroxyl group of the 
primer attacks the phosphoryl group to form a 
new O-P bond. 


Primer 
The initial segment of a polymer that is to be 
extended on which elongation depends. 
Template 
A sequence of DNA or RNA that directs the 
synthesis of a complementary sequence. 


[Figure 28.5 labels: adenosine; analog lacking the ability to form base-pairing hydrogen bonds.] 

Figure 28.5 Shape complementarity. The 
base analog on the right has the same shape 
as adenosine, but groups that form hydrogen 
bonds between base pairs have been replaced 
by groups (shown in red) not capable of 
hydrogen bonding. Nonetheless, studies reveal 
that, when incorporated into the template 
strand, this analog directs the insertion of 
thymidine in DNA replication. 


Figure 28.6 Minor-groove interactions. 
DNA polymerases donate two hydrogen 
bonds to base pairs in the minor groove. 
Hydrogen-bond acceptors are present in these 
two positions for all Watson-Crick base pairs, 
including the A-T base pair shown. 


The specificity of replication is dictated by complementarity of shape 
between bases 


DNA must be replicated with high fidelity. Each base added to the growing 
chain should, with high probability, be the Watson-Crick complement of 
the base in the corresponding position in the template strand. The binding 
of the dNTP containing the proper base is favored by the formation of 
a base pair with its partner on the template strand. Although hydrogen 
bonding contributes to the formation of this base pair, overall shape com- 
plementarity is crucial. Studies show that a nucleotide with a base that is 
very similar in shape to adenine but lacks the ability to form base-pairing 
hydrogen bonds can still direct the incorporation of thymidine, both in vitro 
and in vivo (Figure 28.5). 

An examination of the crystal structures of various DNA polymerases 
reveals why shape complementarity is so important. First, residues of the 
enzyme form hydrogen bonds with the minor-groove side of the base pair in 
the active site (Figure 28.6). In the minor groove, hydrogen-bond acceptors 
are present in the same positions for all Watson-Crick base pairs. These 
interactions act as a “ruler” that measures whether a properly spaced base 
pair has formed in the active site. 

Second, DNA polymerases close down around the incoming dNTP 
(Figure 28.7). The binding of a deoxyribonucleoside triphosphate into the 
active site of a DNA polymerase triggers a conformational change: the 
finger domain rotates to form a tight pocket into which only a properly 
shaped base pair will readily fit. Many of the residues lining this pocket are 
important to ensure the efficiency and fidelity of DNA synthesis. For 
example, mutation of a conserved tyrosine residue that forms part of the 
pocket results in a polymerase that is approximately 40 times as error prone 
as the parent polymerase. 


[Figure 28.7 label: dNTP inside tight pocket.] 

Figure 28.7 Shape selectivity. The binding of a deoxyribonucleoside triphosphate (dNTP) to 
DNA polymerase induces a conformational change, generating a tight pocket for the base pair 
consisting of the dNTP and its partner on the template strand. Such a conformational change is 
possible only when the dNTP corresponds to the Watson-Crick partner of the template base. 
[Drawn from 2BDP.pdb and 1T7P.pdb.] 


An RNA primer synthesized by primase enables DNA synthesis to 
begin 

DNA polymerases cannot initiate DNA synthesis without a primer, 
a section of nucleic acid having a free 3’ end that forms a double helix 
with the template. How is this primer formed? An important clue 
came from the observation that RNA synthesis is essential for the 
initiation of DNA synthesis. In fact, RNA primes the synthesis of 
DNA. An RNA polymerase called primase synthesizes a short stretch 
of RNA (about five nucleotides) that is complementary to one of the 
template DNA strands (Figure 28.8). Primase, like other RNA 
polymerases, can initiate synthesis without a primer. After DNA 
synthesis has been initiated, the short stretch of RNA is removed by 
hydrolysis and replaced by DNA. 


One strand of DNA is made continuously, whereas the 
other strand is synthesized in fragments 


Both strands of parental DNA serve as templates for the synthesis 
of new DNA. The site of DNA synthesis is called the replication 
fork because the complex formed by the newly synthesized daugh- 
ter helices arising from the parental duplex resembles a two- 
pronged fork. Recall that the two strands are antiparallel; that is, 
they run in opposite directions. During DNA replication, both 
daughter strands appear on cursory examination to grow in the 
same direction. However, all known DNA polymerases synthesize 
DNA in the 5′ → 3′ direction but not in the 3′ → 5′ direction. How 
then does one of the daughter DNA strands appear to grow in the 
3′ → 5′ direction? 

[Figure 28.8 labels: DNA template, primase, RNA primer, DNA polymerase, newly synthesized DNA.] 

Figure 28.8 Priming. DNA replication is primed by a 
short stretch of RNA that is synthesized by primase, an 
RNA polymerase. The RNA primer is removed at a later 
stage of replication. 


Figure 28.9 Okazaki fragments. At a replication fork, 
both strands are synthesized in the 5′ → 3′ direction. The 
leading strand is synthesized continuously, whereas the 
lagging strand is synthesized in short pieces termed 
Okazaki fragments. 


Figure 28.10 DNA ligase reaction. DNA 
ligase catalyzes the joining of one DNA strand 
with a free 3′-hydroxyl group to another with 
a free 5′-phosphoryl group. In eukaryotes 
and archaea, ATP is cleaved to AMP and PPi 
to drive this reaction. In bacteria, NAD⁺ 
is cleaved to AMP and nicotinamide 
mononucleotide (NMN). 


Figure 28.11 Helicase structure. The structure of 
the hexameric helicase from bacteriophage T7. One of 
the six subunits is shown in yellow with the P-loop 
NTPase shown in purple. The loops that participate in 
DNA binding are highlighted by a yellow oval. Notice that 
each subunit interacts closely with its neighbors and that 
the DNA-binding loops line the hole in the center of the 
structure. [Drawn from 1E0K.pdb.] 


This dilemma was resolved by Reiji Okazaki, who found that a 
significant proportion of newly synthesized DNA exists as small fragments. 
These units of about a thousand nucleotides (called Okazaki 
fragments) are present briefly in the vicinity of the replication fork 
(Figure 28.9). As replication proceeds, these fragments become covalently joined 
through the action of the enzyme DNA ligase to form a continuous daughter 
strand. The other new strand is synthesized continuously. The strand 
formed from Okazaki fragments is termed the lagging strand, whereas the 
one synthesized without interruption is the leading strand. The discontinuous 
assembly of the lagging strand enables 5′ → 3′ polymerization at the 
nucleotide level to give rise to overall growth in the 3′ → 5′ direction. 
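
The logic of discontinuous synthesis can be checked with a small string model. This is a sketch under simplifying assumptions: RNA primers and their replacement are ignored, fragments have a fixed length, and the lagging-strand template is written 5′ to 3′ in the order in which the advancing fork exposes it. The function names `reverse_complement`, `okazaki_fragments`, and `ligate`, and the example sequence, are invented for this illustration.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq_5to3: str) -> str:
    """Complementary strand, also written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq_5to3))


def okazaki_fragments(template_5to3: str, fragment_len: int) -> list[str]:
    """Fragments in the order they are made, each written 5'->3'.

    Each newly exposed stretch of template is copied on its own,
    always by 5'->3' polymerization.
    """
    return [
        reverse_complement(template_5to3[i:i + fragment_len])
        for i in range(0, len(template_5to3), fragment_len)
    ]


def ligate(fragments_in_order_made: list[str]) -> str:
    """Join fragments; each later fragment lies on the 5' side of the previous one."""
    return "".join(reversed(fragments_in_order_made))


template = "ATGGCCATTGTAATGGGCCGC"
fragments = okazaki_fragments(template, 5)
lagging_strand = ligate(fragments)

# The joined fragments equal the strand a single continuous copy would give.
print(lagging_strand == reverse_complement(template))
```

Each fragment is made 5′ to 3′, yet every new fragment is added on the 5′ side of the one before it, so the strand as a whole lengthens in the 3′ → 5′ direction.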


DNA ligase joins ends of DNA in duplex regions 


The joining of Okazaki fragments requires an enzyme that catalyzes the 
joining of the ends of two DNA chains. The existence of circular DNA 
molecules also points to the existence of such an enzyme. In 1967, scientists 
in several laboratories simultaneously discovered DNA ligase. This enzyme 
catalyzes the formation of a phosphodiester bond between the 3'-hydroxyl group 
at the end of one DNA chain and the 5'-phosphoryl group at the end of the other 
(Figure 28.10). An energy source is required to drive this thermodynamically 
uphill reaction. In eukaryotes and archaea, ATP is the energy source. In 
bacteria, NAD+ typically plays this role.

DNA ligase cannot link two molecules of single-stranded DNA or circu- 
larize single-stranded DNA. Rather, ligase seals breaks in double-stranded 
DNA molecules. The enzyme from E. coli ordinarily forms a phosphodiester 
bridge only if there are at least a few bases of single-stranded DNA on the 
end of a double-stranded fragment that can come together with those 
on another fragment to form base pairs. Ligase encoded by T4 bacte- 
riophage can link two blunt-ended double-helical fragments, a capa- 
bility that is exploited in recombinant DNA technology. 


The separation of DNA strands requires specific helicases 
and ATP hydrolysis 


For a double-stranded DNA molecule to replicate, the two strands of the 
double helix must be separated from each other, at least locally. This 
separation allows each strand to act as a template on which a new poly- 
nucleotide chain can be assembled. Specific enzymes, termed helicases, 
utilize the energy of ATP hydrolysis to power strand separation. 
Helicases are a large and diverse family of enzymes taking part in 
many biological processes. The helicases in DNA replication are 
typically oligomers containing six subunits that form a ring structure. 
The structure of one such helicase, that from bacteriophage T7, has 
been determined and has been a source of considerable insight into the 
helicase mechanism (Figure 28.11). Each of the subunits within this 
hexameric structure has a core structure that includes a P-loop NTPase 
domain (see Figure 9.51). In addition to the P-loop, each subunit has 
two loops that extend toward the center of the ring structure and inter- 
act with DNA. Each subunit interacts closely with its two neighbors 
within the ring structure. Closer examination of this structure 
reveals that the ring deviates significantly from six-fold symmetry. 
This deviation is even more apparent in structures determined in the
presence of the nonhydrolyzable ATP analog AMP-PNP.
The AMP-PNP binds to only four of the six subunits within the ring
(Figure 28.12). Furthermore, the four nucleotide-binding sites are not iden-
tical but fall into two classes. One class appears to be well positioned to bind
ATP but not catalyze its hydrolysis, whereas the other class is better
suited to catalyze the hydrolysis but not release the hydrolysis products.
The classes are analogous to myosin's two different conformations, one for
binding ATP and one for hydrolyzing it (Section 9.4). Finally, the six sub-
units fall into three classes with regard to their orientation with respect to
the overall ring structure, with differences in rotation around an axis in the
plane of the ring of approximately 30°. These differences in orientation
affect the position of the two DNA-binding loops in each subunit.

These observations are consistent with the following mechanism for the
helicase (Figure 28.13). Only a single strand of DNA can fit through the
center of the ring. This single strand binds to loops on two adjacent
subunits, one of which has bound ATP and the other of which has bound
ADP + Pi. The binding of ATP to the domains that initially had no bound
nucleotides leads to a conformational change within the entire hexamer,
leading to the release of ADP + Pi from two subunits and the binding of the
single-stranded DNA by one of the domains that just bound ATP. This
conformational change pulls the DNA through the center of the hexamer.
The protein acts as a wedge, forcing the two strands of the double helix
apart. This cycle then repeats itself, moving two bases along the DNA
strand with each cycle.


28.2 DNA Unwinding and Supercoiling Are 
Controlled by Topoisomerases 


As a helicase moves along unwinding DNA, the DNA in front of the heli-
case will become overwound in the absence of other changes. As discussed
in Chapter 4, DNA double helices that are torsionally stressed tend to fold

Figure 28.12 Helicase asymmetry. The structure of the T7 helicase complexed
with the ATP analog AMP-PNP is shown. The three classes of helicase subunits
are shown in blue, red, and yellow. The rotation relative to the plane of the
hexamer is shown for each subunit. Notice that only four of the subunits,
those shown in blue and yellow, bind AMP-PNP. [Drawn from 1E0K.pdb.]

Figure 28.13 Helicase mechanism. One of the strands of the double helix
passes through the hole in the center of the helicase, bound to the loops of
two adjacent subunits. Two of the subunits do not contain bound nucleotides.
On the binding of ATP to these two subunits and the release of ADP + Pi from
two other subunits, the helicase hexamer undergoes a conformational change,
pulling the DNA through the helicase. The helicase acts as a wedge to force
separation of the two strands of DNA.

first consider the supercoiling of DNA in quantitative terms and then turn
to topoisomerases, enzymes that can directly modulate DNA winding and
supercoiling. Supercoiling is most readily understood by considering circu-
lar DNA molecules, but it also applies to linear DNA molecules constrained
to be in loops by other means. Most DNA molecules inside cells are subject
to supercoiling.

Consider a linear 260-bp DNA duplex in the B-DNA form (Fig- 
ure 28.14A). Because the number of base pairs per turn in an unstressed 
DNA molecule averages 10.4, this linear DNA molecule has 25 (260/10.4) 
turns. The ends of this helix can be joined to produce a relaxed circular 
DNA (Figure 28.14B). A different circular DNA can be formed by unwind- 
ing the linear duplex by two turns before joining its ends (Figure 28.14C). 
What is the structural consequence of unwinding before ligation? Two 
limiting conformations are possible. The DNA can fold into a structure 
containing 23 turns of B helix and an unwound loop (Figure 28.14D). 
Alternatively, the double helix can fold up to cross itself. Such crossings 
are called supercoils. In particular, a supercoiled structure with 25 turns 
of B helix and 2 turns of right-handed (termed negative) superhelix can be 
formed (Figure 28.14E). 
Supercoiling markedly alters the overall form of DNA. A supercoiled
DNA molecule is more compact than a relaxed DNA molecule of the same
length. Hence, supercoiled DNA moves faster than relaxed DNA when
analyzed by centrifugation or electrophoresis. Unwinding will cause super-
coiling in circular DNA molecules, whether covalently closed or constrained
in closed configurations by other means.


The linking number of DNA, a topological property, determines the 
degree of supercoiling 


Our understanding of the conformation of DNA is enriched by concepts 
drawn from topology, a branch of mathematics dealing with structural 
properties that are unchanged by deformations such as stretching and bend- 
ing. A key topological property of a circular DNA molecule is its linking 
number (Lk), which is equal to the number of times that a strand of DNA 
winds in the right-handed direction around the helix axis when the axis 
lies in a plane, as in Figure 28.14A. For the relaxed DNA shown in 
Figure 28.14B, Lk = 25. For the partly unwound molecule shown in part D 
and the supercoiled one shown in part E, Lk = 23 because the linear duplex 
was unwound two complete turns before closure. Molecules differing only 
in linking number are topological isomers, or topoisomers, of one another. 
Topoisomers of DNA can be interconverted only by cutting one or both DNA 
strands and then rejoining them. 
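
The linking-number bookkeeping for the 260-bp example can be checked with a short calculation. This is only an illustrative sketch: the function names are hypothetical, and rounding the relaxed turn count to the nearest integer is an assumption made so that Lk comes out as a whole number, as it must for a closed circle.

```python
def relaxed_turns(base_pairs, bp_per_turn=10.4):
    """Helical turns in an unstressed B-DNA duplex (10.4 bp per turn, from the text)."""
    return base_pairs / bp_per_turn


def linking_number(base_pairs, turns_unwound=0, bp_per_turn=10.4):
    """Lk of a circle closed after removing `turns_unwound` turns from the linear duplex.

    Assumption: the relaxed turn count is rounded to the nearest integer.
    """
    lk_relaxed = round(relaxed_turns(base_pairs, bp_per_turn))
    return lk_relaxed - turns_unwound


lk_relaxed = linking_number(260)                   # relaxed circle (Figure 28.14B)
lk_unwound = linking_number(260, turns_unwound=2)  # circles in parts D and E
print(lk_relaxed, lk_unwound)
```

Both the unwound-loop form and the supercoiled form share the second value, which is why they count as the same topoisomer.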

The unwound DNA and supercoiled DNA shown in Figure 28.14D and 
E are topologically identical but geometrically different. They have the 
same value of Lk but differ in twist (Tw) and writhe (Wr). Although the 
rigorous definitions of twist and writhe are complex, twist is a measure 
of the helical winding of the DNA strands around each other, whereas 
writhe is a measure of the coiling of the axis of the double helix—that 
is, supercoiling. A right-handed coil is assigned a negative number (nega- 
tive supercoiling) and a left-handed coil is assigned a positive number 
(positive supercoiling). 

Is there a relation between Tw and Wr? Indeed, there is. Topology tells
us that the sum of Tw and Wr is equal to Lk.


Lk = Tw + Wr

Figure 28.14 Linking number. The relations between the linking number (Lk), twisting
number (Tw), and writhing number (Wr) of a circular DNA molecule revealed schematically.
[After W. Saenger, Principles of Nucleic Acid Structure (Springer Verlag, 1984), p. 452.]


In Figure 28.14, the partly unwound circular DNA has Tw ≈ 23, meaning
the helix has 23 turns, and Wr ≈ 0, meaning the helix has not crossed itself
to create a supercoil. The supercoiled DNA, however, has Tw ≈ 25 and Wr
≈ −2. These forms can be interconverted without cleaving the DNA chain
because they have the same value of Lk, namely 23. The partitioning of Lk
(which must be an integer) between Tw and Wr (which need not be integers)
is determined by energetics. The free energy is minimized when about 70%
of the change in Lk is expressed in Wr and 30% is expressed in Tw. Hence,
the most stable form would be one with Tw = 24.4 and Wr = −1.4. Thus,
a lowering of Lk causes both right-handed (negative) supercoiling of the DNA
axis and unwinding of the duplex. Topoisomers differing by just 1 in Lk, and

Figure 28.16 Structure of topoisomerase I. The structure of a complex
between a fragment of human topoisomerase I and DNA is shown. Notice
that DNA lies in a central cavity within the enzyme. [Drawn from 1EJ9.pdb.]


Figure 28.15 Topoisomers. An electron micrograph showing negatively supercoiled and relaxed 
DNA. [Courtesy of Dr. Jack Griffith.] 


consequently by 0.7 in Wr, can be readily separated by agarose gel electro-
phoresis because their hydrodynamic volumes are quite different; supercoiling
condenses DNA (Figure 28.15).
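
The 70%/30% partitioning can be worked through numerically. The sketch below assumes the split applies linearly to the change in Lk measured from the relaxed value; the function name and the `writhe_fraction` parameter are hypothetical labels chosen for illustration.

```python
def partition_linking_change(lk_relaxed, lk, writhe_fraction=0.7):
    """Split a change in Lk between writhe and twist.

    Assumption: `writhe_fraction` of the change (about 70% in the text) appears
    as Wr, and the remainder shifts Tw away from its relaxed value.
    """
    delta_lk = lk - lk_relaxed
    wr = writhe_fraction * delta_lk
    tw = lk_relaxed + (1.0 - writhe_fraction) * delta_lk
    return tw, wr


tw, wr = partition_linking_change(lk_relaxed=25, lk=23)
print(f"Tw = {tw:.1f}, Wr = {wr:.1f}, Tw + Wr = {tw + wr:.1f}")
```

For the 260-bp circle with Lk = 23, this gives the Tw = 24.4 and Wr = −1.4 quoted above, and the two values still sum to Lk.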


Topoisomerases prepare the double helix for unwinding 


Most naturally occurring DNA molecules are negatively supercoiled. What 
is the basis for this prevalence? As already stated, negative supercoiling 
arises from the unwinding or underwinding of the DNA. In essence, nega- 
tive supercoiling prepares DNA for processes requiring separation of the 
DNA strands, such as replication. Positive supercoiling condenses DNA as 
effectively, but it makes strand separation more difficult. 

The presence of supercoils in the immediate area of unwinding would, 
however, make unwinding difficult. Therefore, negative supercoils must be 
continuously removed, and the DNA relaxed, as the double helix unwinds. 
Specific enzymes called topoisomerases that introduce or eliminate super- 
coils were discovered by James Wang and Martin Gellert. Type I topoiso- 
merases catalyze the relaxation of supercoiled DNA, a thermodynamically 
favorable process. Type II topoisomerases utilize free energy from ATP 
hydrolysis to add negative supercoils to DNA. Both type I and type II 
topoisomerases play important roles in DNA replication as well as in tran- 
scription and recombination. 

These enzymes alter the linking number of DNA by catalyzing a three- 
step process: (1) the cleavage of one or both strands of DNA, (2) the passage 
of a segment of DNA through this break, and (3) the resealing of the DNA 
break. Type I topoisomerases cleave just one strand of DNA, whereas type II 
enzymes cleave both strands. The two types of enzymes have several com- 
mon features, including the use of key tyrosine residues to form covalent 
links to the polynucleotide backbone that is transiently broken. 


Type I topoisomerases relax supercoiled structures


The three-dimensional structures of several type I topoisomerases have 
been determined (Figure 28.16). These structures reveal many features of 
the reaction mechanism. Human type I topoisomerase comprises four 
domains, which are arranged around a central cavity having a diameter of 
20 Å, just the correct size to accommodate a double-stranded DNA mole-
cule. This cavity also includes a tyrosine residue (Tyr 723), which acts as a
nucleophile to cleave the DNA backbone in the course of catalysis.

Figure 28.17 Topoisomerase I mechanism. On binding to DNA, topoisomerase I cleaves one
strand of the DNA by means of a tyrosine (Y) residue attacking a phosphoryl group. When the
strand has been cleaved, it rotates in a controlled manner around the other strand. The reaction is
completed by religation of the cleaved strand. This process results in partial or complete relaxation
of a supercoiled plasmid.


From analyses of these structures and the results of other studies, the 
relaxation of negatively supercoiled DNA molecules is known to proceed in 
the following manner (Figure 28.17). First, the DNA molecule binds inside 
the cavity of the topoisomerase. The hydroxyl group of tyrosine 723 attacks 
a phosphoryl group on one strand of the DNA backbone to form a phospho- 
diester linkage between the enzyme and the DNA, cleaving the DNA and 
releasing a free 5'-hydroxyl group.
With the backbone of one strand cleaved, the DNA can now rotate
around the remaining strand, its movement driven by the release of energy 
stored because of the supercoiling. The rotation of the DNA unwinds the 
supercoils. The enzyme controls the rotation so that the unwinding is not 
rapid. The free hydroxyl group of the DNA attacks the phosphotyrosine 
residue to reseal the backbone and release tyrosine. The DNA is then free to 
dissociate from the enzyme. Thus, reversible cleavage of one strand of 
supercoiled DNA allows controlled rotation to partly relax the supercoils. 


Type II topoisomerases can introduce negative supercoils 
through coupling to ATP hydrolysis 


Supercoiling requires an input of energy because a supercoiled molecule, in
contrast with its relaxed counterpart, is torsionally stressed. The introduc-
tion of an additional supercoil into a 3000-bp plasmid typically requires
about 30 kJ mol⁻¹ (7 kcal mol⁻¹).

Figure 28.18 Structure of topoisomerase II. The dimeric
structure of a typical topoisomerase II, that from the archaeon
Sulfolobus shibatae. Notice that each half of the enzyme has one
domain (shown in yellow) that contains a region for binding a DNA
double helix and another domain (shown in green) that contains ATP
binding sites. [Drawn from 2ZBK.pdb.]




Supercoiling can be catalyzed by type II topoisomer- 
ases. These elegant molecular machines couple the bind- 
ing and hydrolysis of ATP to the directed passage of one 
DNA double helix through another, temporarily cleaved 
DNA double helix. These enzymes have several mecha- 
nistic features in common with the type I topoisomerases. 

Topoisomerase II molecules are dimeric with a large 
internal cavity (Figure 28.18). The large cavity has gates at 
both the top and the bottom that are crucial to topo- 
isomerase action. The reaction begins with the binding of 
one double helix (hereafter referred to as the G, for gate, 
segment) to the enzyme (Figure 28.19). Each strand is 
positioned next to a tyrosine residue, one from each mono- 
mer, capable of forming a covalent linkage with the DNA 
backbone. This complex then loosely binds a second DNA 
double helix (hereafter referred to as the T, for transported, 
segment). Each monomer of the enzyme has a domain that 
binds ATP; this ATP binding leads to a conformational 
change that strongly favors the coming together of the two 
domains. As these domains come closer together, they trap 
the bound T segment. This conformational change also 
forces the separation and cleavage of the two strands of the 
G segment. Each strand is linked to the enzyme by a 
tyrosine—phosphodiester bond. Unlike the type I enzymes, 




Figure 28.19 Topoisomerase II mechanism. Topoisomerase II first binds one DNA duplex 
termed the G (for gate) segment. The binding of ATP to the two N-terminal domains brings these 
two domains together. This conformational change leads to the cleavage of both strands of the G 
segment and the binding of an additional DNA duplex, the T segment. This T segment then moves 
through the break in the G segment and out the bottom of the enzyme. The hydrolysis of ATP 
resets the enzyme with the G segment still bound.

the type II topoisomerases hold the DNA tightly so that it cannot rotate.
The T segment then passes through the cleaved G segment and into the 
large central cavity. The ligation of the G segment leads to the release of the 
T segment through the gate at the bottom of the enzyme. The hydrolysis of 
ATP and the release of ADP and orthophosphate allow the ATP-binding 
domains to separate, preparing the enzyme to bind another T segment. The 
overall process leads to a decrease in the linking number by two. 
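
Because each catalytic cycle lowers the linking number by two, the number of cycles needed to reach a given degree of underwinding follows directly. The sketch below is illustrative only: the function names are hypothetical, and the starting value Lk = 288 for a relaxed 3000-bp plasmid (3000/10.4, rounded down) is an assumption.

```python
def cycles_to_reach(lk_start, lk_target, step=2):
    """Cycles needed to lower Lk from lk_start to lk_target or below.

    Each cycle changes Lk by `step` (two, per the text). Only lowering of Lk
    is modelled in this sketch.
    """
    if lk_target > lk_start:
        raise ValueError("this sketch only models a decrease in Lk")
    deficit = lk_start - lk_target
    return -(-deficit // step)  # ceiling division


def lk_after(lk_start, cycles, step=2):
    """Linking number after a given number of cycles."""
    return lk_start - step * cycles


cycles = cycles_to_reach(288, 274)
print(cycles, lk_after(288, cycles))
```

Note that an odd target difference cannot be reached exactly in steps of two; the function then reports the number of cycles that first passes the target.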


The bacterial topoisomerase II (often called DNA gyrase) is the
target of several antibiotics that inhibit the prokaryotic enzyme much 
more than the eukaryotic one. Novobiocin blocks the binding of ATP to 
gyrase. Nalidixic acid and ciprofloxacin, in contrast, interfere with the 
breakage and rejoining of DNA chains. These two gyrase inhibitors are 
widely used to treat urinary-tract and other infections including those due 
to Bacillus anthracis (anthrax). Camptothecin, an antitumor agent, inhibits 
human topoisomerase I by stabilizing the form of the enzyme covalently 
linked to DNA. 




28.3 DNA Replication Is Highly Coordinated 


DNA replication must be very rapid, given the sizes of the genomes and the 
rates of cell division. The E. coli genome contains 4.6 million base pairs and 
is copied in less than 40 minutes. Thus, 2000 bases are incorporated per 
second. Enzyme activities must be highly coordinated to replicate entire 
genomes precisely and rapidly. 
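
The rate quoted above follows from simple arithmetic, sketched here. The 40-minute figure is treated as the full copying time (the text says "less than 40 minutes", so the result is a lower bound), and dividing the work between two replication forks is an assumption about bidirectional replication that is not stated in this passage.

```python
GENOME_BP = 4.6e6             # E. coli genome size, from the text
REPLICATION_TIME_S = 40 * 60  # upper bound on copying time, in seconds
FORKS = 2                     # assumption: two forks moving in opposite directions

overall_rate = GENOME_BP / REPLICATION_TIME_S  # base pairs copied per second
rate_per_fork = overall_rate / FORKS

print(f"overall: {overall_rate:.0f} bp/s, per fork: {rate_per_fork:.0f} bp/s")
```

The overall value is close to the roughly 2000 bases per second stated in the text.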

We begin our consideration of the coordination of DNA replication by 
looking at E. coli, which has been extensively studied. For this organism 
with a relatively small genome, replication begins at a single site and contin- 
ues around the circular chromosome. The coordination of eukaryotic DNA 
replication is more complex because there are many initiation sites through- 
out the genome and an additional enzyme is needed to replicate the ends of 
linear chromosomes. 


DNA replication requires highly processive polymerases 


Replicative polymerases are characterized by their very high catalytic potency, 
fidelity, and processivity. Processivity refers to the ability of an enzyme to 
catalyze many consecutive reactions without releasing its substrate. These 
polymerases are assemblies of many subunits that have evolved to grasp 
their templates and not let go until many nucleotides have been added. The 
source of the processivity was revealed by the determination of the three-
dimensional structure of the β2 subunit of the E. coli replicative polymerase
called DNA polymerase III (Figure 28.20). This unit keeps the polymerase
associated with the DNA double helix. It has the form of a star-shaped ring.
A 35-Å-diameter hole in its center can readily accommodate a duplex DNA
molecule, yet leaves enough space between the DNA and the protein to
allow rapid sliding during replication. To achieve a catalytic rate of 1000

Figure 28.20 Structure of a sliding DNA clamp. The dimeric β subunit of DNA
polymerase III forms a ring that surrounds the DNA duplex. Notice the central
cavity through which the DNA template slides. Clasping the DNA molecule in
the ring, the polymerase enzyme is able to move without falling off the DNA
substrate. [Drawn from 2POL.pdb.]

Processive enzyme
From the Latin procedere, “to go forward.” 
An enzyme that catalyzes multiple 
rounds of the elongation or digestion of a 
polymer while the polymer stays bound. A 
distributive enzyme, in contrast, releases its 
polymeric substrate between successive 
catalytic steps. 


Figure 28.21 Replication fork. A schematic
view of the arrangement of DNA polymerase
III and associated enzymes and proteins
present in the replication of DNA. The helicase 
separates the two strands of the parent 
double helix, allowing DNA polymerases to 
use each strand as a template for DNA 
synthesis. Abbreviation: SSB, single-stranded- 
binding protein. 


nucleotides polymerized per second requires that 100 turns of duplex DNA
(a length of 3400 Å, or 0.34 μm) slide through the central hole of β2 per
second. Thus, β2 plays a key role in replication by serving as a sliding DNA
clamp.
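
The sliding distance can be reproduced with a few lines of arithmetic. This sketch assumes the standard B-DNA rise of 3.4 Å per base pair and 10.4 base pairs per turn; with those values the turn count comes out slightly below the rounded figure of 100 used in the text.

```python
RATE_NT_PER_S = 1000        # polymerization rate, from the text
RISE_PER_BP_ANGSTROM = 3.4  # assumption: axial rise per base pair in B-DNA
BP_PER_TURN = 10.4          # base pairs per helical turn

turns_per_s = RATE_NT_PER_S / BP_PER_TURN
length_angstrom = RATE_NT_PER_S * RISE_PER_BP_ANGSTROM
length_micrometer = length_angstrom * 1e-4  # 1 Å = 1e-4 μm

print(f"{turns_per_s:.0f} turns/s, {length_angstrom:.0f} Å = {length_micrometer:.2f} μm")
```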

How does DNA become entrapped inside the sliding clamp? Replicative 
polymerases also include assemblies of subunits that function as clamp loaders. 
These enzymes grasp the sliding clamp and, utilizing the energy of ATP 
binding, pull apart one of the interfaces between the two subunits of the 
sliding clamp. DNA can move through the gap, inserting itself through the 
central hole. ATP hydrolysis then releases the clamp, which closes around 
the DNA. 


The leading and lagging strands are synthesized 
in a coordinated fashion 


Replicative polymerases such as DNA polymerase III synthesize the lead- 
ing and lagging strands simultaneously at the replication fork (Figure 28.21). 
DNA polymerase III begins the synthesis of the leading strand starting 
from the RNA primer formed by primase. The duplex DNA ahead of the 
polymerase is unwound by a hexameric helicase called DnaB. Copies of 
single-stranded-binding protein (SSB) bind to the unwound strands, keep- 
ing the strands separated so that both strands can serve as templates. The 
leading strand is synthesized continuously by polymerase III. Topoisomerase 
II concurrently introduces right-handed (negative) supercoils to avert a 
topological crisis. 

The mode of synthesis of the lagging strand is necessarily more complex.
As mentioned earlier, the lagging strand is synthesized in fragments so that
5' → 3' polymerization leads to overall growth in the 3' → 5' direction. Yet
the synthesis of the lagging strand is coordinated with the synthesis of the
leading strand. How is this coordination accomplished? Examination of the
subunit composition of the DNA polymerase III holoenzyme reveals an
elegant solution (Figure 28.22). The holoenzyme includes two copies of the
polymerase core enzyme, which consists of the DNA polymerase itself (the
α subunit); the ε subunit, a 3'-to-5' proofreading exonuclease; another sub-
unit called θ; and two copies of the dimeric β-subunit sliding clamp. The
core enzymes are linked to a central structure having the subunit composi-
tion γτ2δδ′χψ. The γτ2δδ′ complex is the clamp loader, and the χ and ψ
subunits interact with the single-stranded-DNA-binding protein. The
entire apparatus interacts with the hexameric helicase DnaB. Eukaryotic
replicative polymerases have similar, albeit slightly more complicated, sub-
unit compositions and structures.

The lagging-strand template is looped out so that it passes through the
polymerase site in one subunit of a dimeric polymerase III in the same
direction as that of the leading-strand template in the other subunit, 5' → 3'.
DNA polymerase III lets go of the lagging-strand template after adding 
about 1000 nucleotides by releasing the sliding clamp. A new loop is then 
formed, a sliding clamp is added, and primase again synthesizes a short 
stretch of RNA primer to initiate the formation of another Okazaki frag- 
ment. This mode of replication has been termed the trombone model because 
the size of the loop lengthens and shortens like the slide on a trombone 
(Figure 28.23).

Figure 28.22 DNA polymerase
holoenzyme. Each holoenzyme consists of
two copies of the polymerase core enzyme,
which comprises the α, ε, and θ subunits and
two copies of the β subunit, linked to a
central structure. The central structure includes
the clamp-loader complex and the hexameric
helicase DnaB.


Figure 28.23 Trombone model. The 
replication of the leading and lagging strands is 
coordinated by the looping out of the lagging 
strand to form a structure that acts somewhat as 
a trombone slide, growing as the replication fork 
moves forward. When the polymerase on the 
lagging strand reaches a region that has been 
replicated, the sliding clamp is released and a 
new loop is formed.


Figure 28.25 Assembly of DnaA. 
Monomers of DnaA bind to their binding sites 
(shown in yellow) in oriC and come together 
to form a complex structure, possibly the 
cyclic hexamer shown here. This structure 
marks the origin of replication and favors DNA 
strand separation in the AT-rich sites (green). 


Figure 28.26 Prepriming complex. The 
AT-rich regions are unwound and trapped by 
the single-stranded-binding protein (SSB). The 
hexameric DNA helicase DnaB is loaded on 
each strand. At this stage, the complex is 
ready for the synthesis of the RNA primers
and the assembly of the DNA polymerase III
holoenzyme.


Figure 28.24 Origin of replication in E. coli. The oriC locus has a length of 245 bp. It contains
a tandem array of three nearly identical 13-nucleotide sequences (green) and five binding sites
(yellow) for the DnaA protein. [Figure labels: tandem array of 13-mer sequences (AT rich);
binding sites for DnaA protein; consensus sequence ...CTNTTNTTTT-3' / ...GANAANAAAA-5'.]


The gaps between fragments of the nascent lagging strand are filled by
DNA polymerase I. This essential enzyme also uses its 5' → 3' exonuclease
activity to remove the RNA primer lying ahead of the polymerase site. The
primer cannot be erased by DNA polymerase III, because the enzyme lacks
5' → 3' editing capability. Finally, DNA ligase connects the fragments.


DNA replication in Escherichia coli begins at a unique site 


In E. coli, DNA replication starts at a unique site within the entire 4.6 ×
10⁶ bp genome. This origin of replication, called the oriC locus, is a 245-
bp region that has several unusual features (Figure 28.24). The oriC locus
contains five copies of a sequence that are preferred binding sites for the 
origin-recognition protein DnaA. In addition, the locus contains a tandem 
array of 13-bp sequences that are rich in AT base pairs. Several steps are 
required to prepare for the start of replication: 


1. The binding of DnaA proteins to DNA is the first step in the preparation 
for replication. DnaA is a member of the P-loop NTPase family related to 
the hexameric helicases. Each DnaA monomer comprises an ATPase 
domain linked to a DNA-binding domain at its C-terminus. DnaA mole- 
cules are able to bind to each other through their ATPase domains; a group 
of bound DnaA molecules will break apart on the binding and hydrolysis of 
ATP. The binding of DnaA molecules to one another signals the start of the 
preparatory phase, and their breaking apart signals the end of that phase. 
The DnaA proteins bind to the five high-affinity sites in oriC and then
come together with DnaA molecules bound to lower-affinity sites to form 
an oligomer, possibly a cyclic hexamer. The DNA is wrapped around the 
outside of the DnaA hexamer (Figure 28.25). 


2. Single DNA strands are exposed in the prepriming complex. With DNA 
wrapped around a DnaA hexamer, additional proteins are brought into 
play. The hexameric helicase DnaB is loaded around the DNA with the help 
of the helicase loader protein DnaC. Local regions of oriC, including the AT 
regions, are unwound and trapped by the single-stranded-DNA-binding
protein. The result of this process is the generation of a structure called the 
prepriming complex, which makes single-stranded DNA accessible to other 
proteins (Figure 28.26). Significantly, the primase, DnaG, is now able to 
insert the RNA primer. 


3. The polymerase holoenzyme assembles. The DNA polymerase III holoen- 
zyme assembles on the prepriming complex, initiated by interactions 
between DnaB and the sliding-clamp subunit of DNA polymerase III. 
These interactions also trigger ATP hydrolysis within the DnaA subunits, 
signaling the initiation of DNA replication. The breakup of the DnaA 
assembly prevents additional rounds of replication from beginning at the 
replication origin. 


DNA synthesis in eukaryotes is initiated at multiple sites 


Replication in eukaryotes is mechanistically similar to replication in 
prokaryotes but is more challenging for a number of reasons. One of them 
is sheer size: E. coli must replicate 4.6 million base pairs, whereas a human 
diploid cell must replicate more than 6 billion base pairs. Second, the genetic 
information for E. coli is contained on 1 chromosome, whereas, in human 
beings, 23 pairs of chromosomes must be replicated. Finally, whereas the E. 
coli chromosome is circular, human chromosomes are linear. Unless coun- 
termeasures are taken, linear chromosomes are subject to shortening with 
each round of replication. 

The first two challenges are met by the use of multiple origins of replica- 
tion. In human beings, replication requires about 30,000 origins of replica- 
tion, with each chromosome containing several hundred. Each origin of 
replication is the starting site for a replication unit, or replicon. In contrast 
with E. coli, the origins of replication in human beings do not contain
regions of sharply defined sequence. Instead, more broadly defined AT-rich 
sequences are the sites around which the origin of replication complexes 
(ORCs) are assembled. 
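
The figures quoted above imply an average replicon size, which can be estimated as follows. This is a rough sketch: it takes 6 billion base pairs as a round lower bound for the diploid genome, assumes origins are spread evenly, and uses 46 chromosomes (23 pairs) for the per-chromosome count.

```python
HUMAN_DIPLOID_BP = 6e9  # "more than 6 billion" base pairs, taken as a lower bound
ORIGINS = 30_000        # approximate number of origins, from the text
CHROMOSOMES = 46        # 23 pairs

avg_replicon_bp = HUMAN_DIPLOID_BP / ORIGINS
origins_per_chromosome = ORIGINS / CHROMOSOMES

print(f"average replicon: {avg_replicon_bp / 1e3:.0f} kb")
print(f"origins per chromosome: {origins_per_chromosome:.0f}")
```

The per-chromosome value agrees with the "several hundred" origins mentioned in the text, and each replicon averages about 200 kb under these assumptions.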


1. The assembly of the ORC is the first step in the preparation for replication. 
In human beings, the ORC is composed of six different proteins, each 
homologous to DnaA. These proteins likely come together to form a hexa- 
meric structure analogous to the assembly formed by DnaA. 


2. Licensing factors recruit a helicase that exposes single strands of DNA. 
After the ORC has been assembled, additional proteins are recruited, 
including Cdc6, a homolog of the ORC subunits, and Cdt1. These proteins, 
in turn, recruit a hexameric helicase with six distinct subunits called Mcm2-7. 
These proteins, including the helicase, are sometimes called licensing factors 
because they permit the formation of the initiation complex. After the 
initiation complex has formed, Mcm2-7 separates the parental DNA 
strands, and the single strands are stabilized by the binding of replication 
protein A, a single-stranded-DNA-binding protein.


3. Two distinct polymerases are needed to copy a eukaryotic replicon. An ini-
tiator polymerase called polymerase α begins replication but is soon replaced
by a more processive enzyme. This process is called polymerase switching
because one polymerase has replaced another. This second enzyme, called
DNA polymerase δ, is the principal replicative polymerase in eukaryotes
(Table 28.1). 


Table 28.1 Some types of DNA polymerases 

Name                                          Function 

Prokaryotic polymerases 
DNA polymerase I                              Erases primer and fills in gaps on lagging strand 
DNA polymerase II (error-prone polymerase)    DNA repair 
DNA polymerase III                            Primary enzyme of DNA synthesis 

Eukaryotic polymerases 
DNA polymerase α                              Initiator polymerase 
  Primase subunit                             Synthesizes the RNA primer 
  DNA polymerase unit                         Adds stretch of about 20 nucleotides to the primer 
DNA polymerase β (error-prone polymerase)     DNA repair 
DNA polymerase δ                              Primary enzyme of DNA synthesis 


Figure 28.27 Eukaryotic cell cycle. DNA 
replication and cell division must take place in 
a highly coordinated fashion in eukaryotes. 
Mitosis (M) takes place only after DNA 
synthesis (S). Two gaps (G1 and G2) in time 
separate the two processes. 


Figure 28.28 Proposed model for 
telomeres. A single-stranded segment of the 
G-rich strand extends from the end of the 
telomere. In one model for telomeres, this 
single-stranded region invades the duplex to 
form a large duplex loop. 


Replication begins with the binding of DNA polymerase α. This enzyme 
includes a primase subunit, used to synthesize the RNA primer, as well as 
an active DNA polymerase. After this polymerase has added a stretch of 
about 20 deoxynucleotides to the primer, another replication protein, called 
replication factor C (RFC), displaces DNA polymerase α. Replication factor 
C attracts a sliding clamp called proliferating cell nuclear antigen (PCNA), 
which is homologous to the β2 subunit of E. coli polymerase III. The 
binding of PCNA to DNA polymerase δ renders the enzyme highly processive 
and suitable for long stretches of replication. Replication continues in both 
directions from the origin of replication until adjacent replicons meet and 
fuse. RNA primers are removed and the DNA fragments are ligated by 
DNA ligase. 

The use of multiple origins of replication requires mechanisms for 
ensuring that each sequence is replicated once and only once. The events 
of eukaryotic DNA replication are linked to the eukaryotic cell cycle 
(Figure 28.27). The processes of DNA synthesis and cell division are coor- 
dinated in the cell cycle so that the replication of all DNA sequences is 
complete before the cell progresses into the next phase of the cycle. This 
coordination requires several checkpoints that control the progression along 
the cycle. A family of small proteins termed cyclins is synthesized and 
degraded by proteasomal digestion in the course of the cell cycle. Cyclins 
act by binding to specific cyclin-dependent protein kinases and activating 
them. One such kinase, cyclin-dependent kinase 2 (cdk2), binds to 
assemblies at origins of replication and regulates replication through a number of 
interlocking mechanisms. 


Telomeres are unique structures at the ends of linear chromosomes 


Whereas the genomes of essentially all prokaryotes are circular, the chro- 
mosomes of human beings and other eukaryotes are linear. The free ends of 
linear DNA molecules introduce several complications that must be 
resolved by special enzymes. In particular, complete replication of DNA 
ends is difficult because polymerases act only in the 5’ → 3’ direction. The 
lagging strand would have an incomplete 5’ end after the removal of 
the RNA primer. Each round of replication would further shorten the 
chromosome. 

The first clue to how this problem is resolved came from sequence 
analyses of the ends of chromosomes, which are called telomeres (from the 
Greek telos, “an end”). Telomeric DNA contains hundreds of tandem 
repeats of a six-nucleotide sequence. One of the strands is G rich at the 3’ 
end, and it is slightly longer than the other strand. In human beings, the 
repeating G-rich sequence is AGGGTT. 

The structure adopted by telomeres has been extensively investigated. 
Recent evidence suggests that they may form large duplex loops 
(Figure 28.28). The single-stranded region at the very end of the structure 
has been proposed to loop back to form a DNA duplex with another part of 
the repeated sequence, displacing a part of the original telomeric duplex. 
This looplike structure is formed and stabilized by specific telomere-binding 
proteins. Such structures would nicely mask and protect the end of the 
chromosome. 




Telomeres are replicated by telomerase, a specialized polymerase 
that carries its own RNA template 


How are the repeated sequences generated? An enzyme, termed telomerase, 
that executes this function has been purified and characterized. When a 
primer ending in GGTT is added to human telomerase in the presence of 
deoxynucleoside triphosphates, the sequences GGTTAGGGTT and 
GGTTAGGGTTAGGGTT, as well as longer products, are generated. 
Elizabeth Blackburn and Carol Greider discovered that the enzyme adding 
the repeats contains an RNA molecule that serves as the template for the 
elongation of the G-rich strand (Figure 28.29). Thus, telomerase carries the 
information necessary to generate the telomere sequences. The exact num- 
ber of repeated sequences is not crucial. 
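
The products listed above follow directly from repeated copying of a short RNA template. The sketch below is a minimal model, not the enzyme's actual mechanism: the six-nucleotide template `AACCCU` is a hypothetical sequence chosen only because it is complementary to the AGGGTT repeat, and the function names are illustrative.

```python
# Toy model of telomerase: copy an internal RNA template onto a DNA primer.
RNA_TO_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}

# ASSUMPTION: illustrative template (5'->3'), complementary to the AGGGTT repeat.
TEMPLATE_RNA = "AACCCU"


def copy_rna_template(rna_5to3):
    """Return the DNA strand (5'->3') made by reverse-transcribing the RNA."""
    return "".join(RNA_TO_DNA[base] for base in reversed(rna_5to3))


def telomerase_products(primer, rna_5to3, rounds):
    """List the primer extended by 1, 2, ..., `rounds` copies of the repeat."""
    repeat = copy_rna_template(rna_5to3)
    return [primer + repeat * n for n in range(1, rounds + 1)]


if __name__ == "__main__":
    for product in telomerase_products("GGTT", TEMPLATE_RNA, 2):
        print(product)
```

Starting from the GGTT primer, two rounds of copying give the two products named in the text; further rounds give the longer products.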

Subsequently, a protein component of telomerases also was identified. 
This component is related to reverse transcriptases, enzymes first discov- 
ered in retroviruses that copy RNA into DNA. Thus, telomerase is a special- 
ized reverse transcriptase that carries its own template. Telomerase is generally 
expressed at high levels only in rapidly growing cells. Thus, telomeres and 
telomerase can play important roles in cancer-cell biology and in cell 
aging. 


Because cancer cells express high levels of telomerase, whereas most 
normal cells do not, telomerase is a potential target for anticancer 
therapy. A variety of approaches for blocking telomerase expression or 
blocking its activity are under investigation for cancer treatment and 
prevention. 


28.4 Many Types of DNA Damage Can Be Repaired 


We have examined how even very large and complex genomes can, in prin- 
ciple, be replicated with considerable fidelity. However, DNA does become 
damaged, both in the course of replication and through other processes. 
Damage to DNA can be as simple as the misincorporation of a single base 
or it can take more-complex forms such as the chemical modification of 
bases, chemical cross-links between the two strands of the double helix, or 
breaks in one or both of the phosphodiester backbones. The results may be 
cell death or cell transformation, changes in the DNA sequence that can be 
inherited by future generations, or blockage of the DNA replication process 
itself. A variety of DNA-repair systems have evolved that can recognize 
these defects and, in many cases, restore the DNA molecule to its undam- 
aged form. We begin with some of the sources of DNA damage. 


Errors can arise in DNA replication 


Errors introduced in the replication process are the simplest source of damage 
in the double helix. With the addition of each base, there is the possibility that 
an incorrect base might be incorporated, forming a non-Watson-Crick base 
pair. These non-Watson-Crick base pairs can locally distort the DNA 
double helix. Furthermore, such mismatches can be mutagenic; that is, they 
can result in permanent changes in the DNA sequence. When a double 
helix containing a non-Watson-Crick base pair is replicated, the two 
daughter double helices will have different sequences because the 
mismatched base is very likely to pair with its Watson-Crick partner. Errors 
other than mismatches include insertions, deletions, and breaks in one or 
both strands. Furthermore, replicative polymerases can stall or even fall off 
a damaged template entirely. As a consequence, replication of the genome 
may halt before it is complete. 
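
The way a single mismatch yields two different daughter helices can be made concrete with a short sketch. It assumes a simplified picture in which both strands are written left to right in register (so position i of one strand pairs with position i of the other) and every template base receives its Watson-Crick partner; the sequences are invented for illustration.

```python
# Replicating a duplex that contains one mismatch.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(strand):
    """Position-by-position Watson-Crick partner of a strand."""
    return "".join(COMPLEMENT[base] for base in strand)


def replicate_duplex(top, bottom):
    """Each parental strand serves as template for one daughter duplex."""
    daughter_from_top = (top, complement(top))
    daughter_from_bottom = (complement(bottom), bottom)
    return daughter_from_top, daughter_from_bottom


if __name__ == "__main__":
    # Hypothetical duplex: the G at position 3 is mispaired with T.
    top, bottom = "ATGC", "TATG"
    for duplex in replicate_duplex(top, bottom):
        print(duplex)
```

One daughter retains the G-C pair at the third position, whereas the other carries A-T there: the mismatch has become a permanent sequence difference.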
Figure 28.29 Telomere formation. 
Mechanism of synthesis of the G-rich strand 
of telomeric DNA. The RNA template of 
telomerase is shown in blue and the 
nucleotides added to the G-rich strand of the 
primer are shown in red. [After E. H. 
Blackburn. Nature 350:569-573, 1991.] 

A variety of mechanisms have evolved to deal with such interruptions, 
including specialized DNA polymerases that can replicate DNA across 
many lesions. A drawback is that such polymerases are substantially more 
error prone than are normal replicative polymerases. Nonetheless, these 
error-prone polymerases allow the completion of a draft sequence of the 
genome that can be at least partly repaired by DNA-repair processes. DNA 
recombination (Section 28.5) provides an additional mechanism for salvag- 
ing interruptions in DNA replication. 


Bases can be damaged by oxidizing agents, alkylating agents, 
and light 


A variety of chemical agents can alter specific bases within DNA after 
replication is complete. Such mutagens include reactive oxygen species 
such as hydroxyl radical. For example, hydroxyl radical reacts 
with guanine to form 8-oxoguanine. 8-Oxoguanine is mutagenic 

8-Oxoguanine Adenine 


Figure 28.30 Oxoguanine-adenine base pair. 
When guanine is oxidized to 8-oxoguanine, the 
damaged base can form a base pair with adenine 
through an edge of the base that does not normally 
participate in base-pair formation. 


Figure 28.31 Adenine deamination. The 
base adenine can be deaminated to form 
hypoxanthine. Hypoxanthine forms base pairs 
with cytosine in a manner similar to that of 
guanine, and so the deamination reaction can 
result in mutation. 

Figure 28.32 Aflatoxin activation. The 
compound, produced by molds that grow on 
peanuts, is activated by cytochrome P450 to 
form a highly reactive species that modifies 
bases such as guanine in DNA, leading to 
mutations. 


because it often pairs with adenine rather than cytosine in DNA rep- 
lication. Its choice of pairing partner differs from that of gua- 
nine because it uses a different edge of the base to form base pairs 
(Figure 28.30). Deamination is another potentially deleterious pro- 
cess. For example, adenine can be deaminated to form hypoxanthine 
(Figure 28.31). This process is mutagenic because hypoxanthine pairs 
with cytosine rather than thymine. Guanine and cytosine also can be 
deaminated to yield bases that pair differently from the parent base. 
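
The mutational outcome of a damaged base can be traced through two rounds of replication. The sketch below encodes only the two pairing preferences described above and, as a simplifying assumption, treats each lesion as always choosing its altered partner (the text says 8-oxoguanine "often" does so). The names are illustrative.

```python
# From damaged base to fixed mutation in two rounds of replication.
WATSON_CRICK = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Parent base of each lesion and the partner the lesion prefers (from the text).
LESION_PARENT = {"8-oxoguanine": "G", "hypoxanthine": "A"}
LESION_PARTNER = {"8-oxoguanine": "A", "hypoxanthine": "C"}


def fixed_mutation(lesion):
    """Return (original base pair, base pair after two replication rounds)."""
    parent = LESION_PARENT[lesion]
    original_pair = (parent, WATSON_CRICK[parent])
    # Round 1: the polymerase inserts the lesion's preferred partner.
    inserted = LESION_PARTNER[lesion]
    # Round 2: the inserted base templates its own Watson-Crick partner.
    mutant_pair = (WATSON_CRICK[inserted], inserted)
    return original_pair, mutant_pair


if __name__ == "__main__":
    for lesion in LESION_PARENT:
        print(lesion, fixed_mutation(lesion))
```

In this model, oxidation of guanine converts a G-C pair into T-A, and deamination of adenine converts an A-T pair into G-C.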


Adenine Hypoxanthine 


In addition to oxidation and deamination, nucleotide bases are subject to 
alkylation. Electrophilic centers can be attacked by nucleophiles such as 
N-7 of guanine and adenine to form alkylated adducts. Some compounds 
are converted into highly active electrophiles through the action of enzymes 
that normally play a role in detoxification. A striking example is aflatoxin 
B1, a compound produced by molds that grow on peanuts and other foods. 
A cytochrome P450 enzyme (p. 781) converts this compound into a highly 
reactive epoxide (Figure 28.32). This agent reacts with the N-7 atom of 
guanosine to form a mutagenic adduct that frequently leads to a G-C-to-T-A 
transversion. 

The ultraviolet component of sunlight is a ubiquitous DNA-damaging 
agent. Its major effect is to covalently link adjacent pyrimidine residues 
along a DNA strand (Figure 28.33). Such a pyrimidine dimer cannot fit into 
a double helix, and so replication and gene expression are blocked until the 
lesion is removed. 

A thymine dimer is an example of an intrastrand cross-link because both 
participating bases are in the same strand of the double helix. Cross-links 
between bases on opposite strands also can be introduced by various agents. 
Psoralens are compounds produced by a Chinese herb that form such inter- 
strand cross-links (Figure 28.34). Interstrand cross-links disrupt replication 
because they prevent strand separation. 


Thymine dimer 


Figure 28.33 Cross-linked dimer of two thymine bases. Ultraviolet light induces cross-links 
between adjacent pyrimidines along one strand of DNA. 


High-energy electromagnetic radiation such as x-rays can damage DNA 
by producing high concentrations of reactive species in solution. X-ray 
exposure can induce several types of DNA damage including single- and 
double-stranded breaks in DNA. This ability to induce such DNA damage 
led Hermann Muller to discover the mutagenic effects of x-rays in Drosophila 
in 1927. This discovery contributed to the development of Drosophila as one 
of the premier organisms for genetic studies. 


DNA damage can be detected and repaired by a variety of systems 


To protect the genetic message, a wide range of DNA-repair systems are 
present in most organisms. Many systems repair DNA by using sequence 
information from the uncompromised strand. Such single-strand replica- 
tion systems follow a similar mechanistic outline: 


1. Recognize the offending base(s). 
2. Remove the offending base(s). 
3. Repair the resulting gap with a DNA polymerase and DNA ligase. 


We will briefly consider examples of several repair pathways. Although 
many of these examples are taken from E. coli, corresponding repair systems 
are present in most other organisms, including humans. 

The replicative DNA polymerases themselves are able to correct many 
DNA mismatches produced in the course of replication. For example, the ε 
subunit of E. coli DNA polymerase III functions as a 3’-to-5’ exonuclease. 
This domain removes mismatched nucleotides from the 3’ end of DNA by 
hydrolysis. How does the enzyme sense whether a newly added base is cor- 
rect? As a new strand of DNA is synthesized, it is proofread. If an incorrect 
base is inserted, then DNA synthesis slows down owing to the difficulty of 
threading a non-Watson-Crick base pair into the polymerase. In addition, 
the mismatched base is weakly bound and therefore able to fluctuate in 
position. The delay from the slowdown allows time for these fluctuations to 
take the newly synthesized strand out of the polymerase active site and into 
the exonuclease active site (Figure 28.35). There, the DNA is degraded, one 

Psoralen 


Figure 28.34 A cross-linking agent. The 
compound psoralen and its derivatives can 
form interstrand cross-links through two 
reactive sites that can form adducts with 
nucleotide bases. 


Figure 28.35 Proofreading. The growing 
polynucleotide chain occasionally leaves the 
polymerase site and migrates to the active site 
of exonuclease. There, one or more 
nucleotides are excised from the newly 
synthesized chain, removing potentially 
incorrect bases. 

Figure 28.36 Mismatch repair. DNA 
mismatch repair in E. coli is initiated by the 
interplay of MutS, MutL, and MutH proteins. A 
G-T mismatch is recognized by MutS. MutH 
cleaves the backbone in the vicinity of the 
mismatch. A segment of the DNA strand 
containing the erroneous T is removed by 
exonuclease I and synthesized anew by DNA 
polymerase III. [After R. F. Service. Science 
263:1559-1560, 1994.] 


Figure 28.37 Structure of DNA-repair 
enzyme. A complex between the DNA-repair 
enzyme AlkA and an analog of a DNA 
molecule missing a purine base (an apurinic 
site) is shown. Notice that the backbone sugar 
in the apurinic site is flipped out of the double 
helix into the active site of the enzyme. 
[Drawn from 1BNK.pdb.] 

nucleotide at a time, until it moves back into the polymerase active site and 
synthesis continues. 

A second mechanism is present in essentially all cells to correct errors 
made in replication that are not corrected by proofreading (Figure 28.36). 
Mismatch-repair systems consist of at least two proteins, one for detecting 
the mismatch and the other for recruiting an endonuclease that cleaves the 
newly synthesized DNA strand close to the lesion to facilitate repair. In 
E. coli, these proteins are called MutS and MutL and the endonuclease is 
called MutH. 
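
The logic of mismatch repair (detect the mismatch, remove a stretch of the newly synthesized strand, and resynthesize it from the parental template) can be sketched as follows. This is a schematic only: the strands are written in register, the `flank` parameter that sets how much of the new strand is replaced is hypothetical, and the sketch simply takes for granted that the new strand is known.

```python
# Schematic mismatch repair on the newly synthesized strand.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def find_mismatches(template, new_strand):
    """Positions at which the new strand is not the Watson-Crick partner."""
    return [i for i, (t, n) in enumerate(zip(template, new_strand))
            if COMPLEMENT[t] != n]


def mismatch_repair(template, new_strand, flank=2):
    """Replace a window around each mismatch using the template strand."""
    repaired = list(new_strand)
    for i in find_mismatches(template, new_strand):
        start = max(0, i - flank)
        stop = min(len(new_strand), i + flank + 1)
        for j in range(start, stop):
            # Resynthesis: insert the base dictated by the template.
            repaired[j] = COMPLEMENT[template[j]]
    return "".join(repaired)


if __name__ == "__main__":
    template = "ATGCGT"      # parental strand (invented sequence)
    new_strand = "TACTCA"    # T misincorporated opposite G at position 4
    print(find_mismatches(template, new_strand))
    print(mismatch_repair(template, new_strand))
```

The G-T mismatch in this example is the same kind of error shown in Figure 28.36; after repair, the new strand is again fully complementary to the template.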

Another mechanism of DNA repair is direct repair, one example of which 
is the photochemical cleavage of pyrimidine dimers. Nearly all cells contain 
a photoreactivating enzyme called DNA photolyase. The E. coli enzyme, a 
35-kd protein that contains bound N5,N10-methenyltetrahydrofolate and 
flavin adenine dinucleotide (FAD) cofactors, binds to the distorted region 
of DNA. The enzyme uses light energy (specifically, the absorption of a 
photon by the N5,N10-methenyltetrahydrofolate coenzyme) to form an 
excited state that cleaves the dimer into its component bases. 

The excision of modified bases such as 3-methyladenine by the E. coli 
enzyme AlkA is an example of base-excision repair. The binding of this 
enzyme to damaged DNA flips the affected base out of the DNA double 
helix and into the active site of the enzyme (Figure 28.37). The enzyme 
then acts as a glycosylase, cleaving the glycosidic bond to release the dam- 
aged base. At this stage, the DNA backbone is intact, but a base is missing. 
This hole is called an AP site because it is apurinic (devoid of A or G) or 
apyrimidinic (devoid of C or T). An AP endonuclease recognizes this defect 
and nicks the backbone adjacent to the missing base. Deoxyribose phospho- 
diesterase excises the residual deoxyribose phosphate unit, and DNA 
polymerase I inserts an undamaged nucleotide, as dictated by the base on 
the undamaged complementary strand. Finally, the repaired strand is sealed 
by DNA ligase. 

One of the best-understood examples of nucleotide-excision repair is 
utilized for the excision of a pyrimidine dimer. Three enzymatic activities 
are essential for this repair process in E. coli (Figure 28.38). First, an enzyme 
complex consisting of the proteins encoded by the uvrABC genes detects the 
distortion produced by the DNA damage. The UvrABC enzyme then cuts 


Figure 28.38 Nucleotide-excision repair. Repair of a region of DNA containing a thymine dimer 
by the sequential action of a specific excinuclease, a DNA polymerase, and a DNA ligase. The 
thymine dimer is shown in blue and the new region of DNA is in red. [After P. C. Hanawalt. 
Endeavour 31:83, 1982.] 


the damaged DNA strand at two sites, 8 nucleotides away from the damaged 
site on the 5’ side and 4 nucleotides away on the 3’ side. The 12-residue 
oligonucleotide excised by this highly specific excinuclease (from the Latin 
exci, “to cut out”) then diffuses away. DNA polymerase I enters the gap to 
carry out repair synthesis. The 3’ end of the nicked strand is the primer, and 
the intact complementary strand is the template. Finally, the 3’ end of the 
newly synthesized stretch of DNA and the original part of the DNA chain 
are joined by DNA ligase. 

DNA ligase is able to seal simple breaks in one strand of the DNA back- 
bone. However, alternative mechanisms are required to repair breaks on 
both strands that are close enough together to separate the DNA into two 
double helices. Several distinct mechanisms are able to repair such damage. 
One mechanism, nonhomologous end joining (NHEJ), does not depend on 
other DNA molecules in the cell. In NHEJ, the free double-stranded ends 
are bound by a heterodimer of two proteins, Ku70 and Ku80. These pro- 
teins stabilize the ends and mark them for subsequent manipulations. 
Through mechanisms that are not yet well understood, the Ku70/80 het- 
erodimers act as handles used by other proteins to draw the two double- 
stranded ends close together so that enzymes can seal the break. 

Alternative mechanisms of double-stranded-break repair can operate if 
an intact stretch of double-stranded DNA with an identical or very similar 
sequence is present in the cell. These repair processes use homologous 
recombination, presented in Section 28.5. 


The presence of thymine instead of uracil in DNA permits the 
repair of deaminated cytosine 


The presence in DNA of thymine rather than uracil, as in RNA, was 
an enigma for many years. Both bases pair with adenine. The only 
difference between them is a methyl group in thymine in place of the C-5 
hydrogen atom in uracil. Why is a methylated base employed in DNA and 
not in RNA? The existence of an active repair system to correct the deami- 
nation of cytosine provides a convincing solution to this puzzle. 

Cytosine in DNA spontaneously deaminates at a perceptible rate to 
form uracil. The deamination of cytosine is potentially mutagenic because 
uracil pairs with adenine, and so one of the daughter strands will contain a 
U-A base pair rather than the original C-G base pair. This mutation is 
prevented by a repair system that recognizes uracil to be foreign to DNA 
(Figure 28.39). The repair enzyme, uracil DNA glycosylase, is homologous 
to AlkA. The enzyme hydrolyzes the glycosidic bond between the uracil 
and deoxyribose moieties but does not attack thymine-containing nucle- 
otides. The AP site generated is repaired to reinsert cytosine. Thus, the 
methyl group on thymine is a tag that distinguishes thymine from deaminated 
cytosine. If thymine were not used in DNA, uracil correctly in place would 
be indistinguishable from uracil formed by deamination. The defect would 
persist unnoticed, and so a C-G base pair would necessarily be mutated to 
U-A in one of the daughter DNA molecules. This mutation is prevented by 
a repair system that searches for uracil and leaves thymine alone. Thymine is 
used instead of uracil in DNA to enhance the fidelity of the genetic message. 
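
The repair logic described above can be summarized in a few lines. The sketch is a simplification: strands are written in register, an AP site is represented by the placeholder character `_`, and the function names are illustrative rather than taken from any library.

```python
# Uracil repair: excise U, then refill the AP site from the partner strand.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
AP_SITE = "_"   # placeholder for a sugar that has lost its base


def uracil_glycosylase(strand):
    """Remove every uracil, leaving an AP site; thymine is left alone."""
    return [AP_SITE if base == "U" else base for base in strand]


def fill_ap_sites(strand_with_gaps, partner):
    """Insert at each AP site the base dictated by the partner strand."""
    return "".join(COMPLEMENT[p] if b == AP_SITE else b
                   for b, p in zip(strand_with_gaps, partner))


def repair_uracil(strand, partner):
    """Glycosylase step followed by gap filling."""
    return fill_ap_sites(uracil_glycosylase(strand), partner)


if __name__ == "__main__":
    # Invented example: the C opposite G has been deaminated to U.
    print(repair_uracil("ATUGT", "TAGCA"))
    # A strand containing only thymine passes through unchanged.
    print(repair_uracil("ATTGT", "TAACA"))
```

Because the guanine on the partner strand is intact, the AP site is refilled with cytosine and the original C-G pair is restored. If uracil were a normal DNA base, the first step could not tell a legitimate U from a deaminated C.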


Figure 28.39 Uracil repair. Uridine bases in DNA, formed by the deamination of cytidine, are 
excised and replaced by cytidine. 


Some genetic diseases are caused by the expansion of 
repeats of three nucleotides 


Some genetic diseases are caused by the presence of DNA sequences 
that are inherently prone to errors in the course of repair and 
replication. A particularly important class of such diseases is characterized 
by the presence of long tandem arrays of repeats of three nucleotides. An 
example is Huntington disease, an autosomal dominant neurological disorder 
with a variable age of onset. The mutated gene in this disease expresses a 
protein in the brain called huntingtin, which contains a stretch of consecu- 
tive glutamine residues. These glutamine residues are encoded by a tandem 
array of CAG sequences within the gene. In unaffected persons, this array 
is between 6 and 31 repeats, whereas, in those with the disease, the array is 
between 36 and 82 repeats or longer. Moreover, the array tends to become 
longer from one generation to the next. The consequence is a phenomenon 
called anticipation: the children of an affected parent tend to show symp- 
toms of the disease at an earlier age than did the parent. 
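
The repeat ranges quoted above lend themselves to a simple check. The sketch below counts the longest uninterrupted run of CAG codons in a sequence and compares it with the two ranges given in the text; array lengths outside those ranges are reported as unclassified, because the text does not assign them. The test sequences are invented and the sketch is in no sense a diagnostic tool.

```python
# Count tandem CAG repeats and compare with the ranges quoted in the text.
import re


def longest_cag_run(sequence):
    """Number of repeats in the longest uninterrupted run of CAG."""
    runs = re.findall(r"(?:CAG)+", sequence.upper())
    return max((len(run) // 3 for run in runs), default=0)


def classify_repeat_count(n):
    """Compare a repeat count with the ranges given in the text."""
    if 6 <= n <= 31:
        return "unaffected range"
    if n >= 36:
        return "disease range"
    return "not classified in the text"


if __name__ == "__main__":
    # Invented sequences: a CAG array flanked by arbitrary bases.
    for repeats in (20, 40):
        seq = "ATG" + "CAG" * repeats + "CCG"
        n = longest_cag_run(seq)
        print(n, classify_repeat_count(n))
```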

The tendency of these trinucleotide repeats to expand is explained by the 
formation of alternative structures in the course of DNA repair. On cleav- 
age of the DNA backbone, part of the array can loop out without disrupting 
base-pairing outside this region. Then, in replication, DNA polymerase 
extends this strand through the remainder of the array, leading to an increase 
in the number of copies of the trinucleotide sequence. 

A number of other neurological diseases are characterized by expanding 
arrays of trinucleotide repeats. How do these long stretches of repeated amino 
acids cause disease? For huntingtin, it appears that the polyglutamine stretches 
become increasingly prone to aggregate as their length increases; the addi- 
tional consequences of such aggregation are still under investigation. 


Many cancers are caused by the defective repair of DNA 


As described in Chapter 14, cancers are caused by mutations in genes 
associated with growth control. Defects in DNA-repair systems 
increase the overall frequency of mutations and, hence, the likelihood of 
cancer-causing mutations. Indeed, the synergy between studies of muta- 
tions that predispose people to cancer and studies of DNA repair in model 
organisms has been tremendous in revealing the biochemistry of DNA- 
repair pathways. Genes for DNA-repair proteins are often tumor-suppressor 
genes; that is, they suppress tumor development when at least one copy of 
the gene is free of a deleterious mutation. When both copies of a gene are 
mutated, however, tumors develop at rates greater than those for the popu- 
lation at large. People who inherit defects in a single tumor-suppressor allele 
do not necessarily develop cancer but are susceptible to developing the dis- 
ease because only the one remaining normal copy of the gene must develop 
a new defect to further the development of cancer. 

Consider, for example, xeroderma pigmentosum, a rare human skin disease. 
The skin of an affected person is extremely sensitive to sunlight or ultraviolet 
light. In infancy, severe changes in the skin become evident and worsen with 
time. The skin becomes dry, and there is a marked atrophy of the dermis. 
Keratoses appear, the eyelids become scarred, and the cornea ulcerates. Skin 
cancer usually develops at several sites. Many patients die before age 30 from 
metastases of these malignant skin tumors. Studies of xeroderma pigmento- 
sum patients have revealed that mutations occur in genes for a number of 
different proteins. These proteins are components of the human nucleotide- 
excision-repair pathway, including homologs of the UvrABC subunits. 

Defects in other repair systems can increase the frequency of other 
tumors. For example, hereditary nonpolyposis colorectal cancer (HNPCC, or 
Lynch syndrome) results from defective DNA mismatch repair. HNPCC is 
not rare: as many as 1 in 200 people will develop this form of cancer. 
Mutations in two genes, called hMSH2 and hMLH1, account for most 
cases of this hereditary predisposition to cancer. The striking finding is that 
these genes encode the human counterparts of MutS and MutL of E. coli. 
Mutations in hMSH2 and hMLH1 seem likely to allow mutations to accu- 
mulate throughout the genome. In time, genes important in controlling cell 
proliferation become altered, resulting in the onset of cancer. 

Not all tumor-suppressor genes are specific to particular types of cancer. 
The gene for a protein called p53 is mutated in more than half of all tumors. 
The p53 protein helps control the fate of damaged cells. First, it plays a 
central role in sensing DNA damage, especially double-stranded breaks. 
Then, after sensing damage, the protein either promotes a DNA-repair 
pathway or activates the apoptosis pathway, leading to cell death. Most 
mutations in the p53 gene are sporadic; that is, they occur in somatic cells 
rather than being inherited. People who inherit a deleterious mutation in 
one copy of the p53 gene suffer from Li-Fraumeni syndrome and have a high 
probability of developing several types of cancer. 

Cancer cells often have two characteristics that make them especially 
vulnerable to agents that damage DNA molecules. First, they divide fre- 
quently, and so their DNA replication pathways are more active than they 
are in most cells. Second, as already noted, cancer cells often have defects in 
DNA-repair pathways. Several agents widely used in cancer chemotherapy, 
including cyclophosphamide and cisplatin, act by damaging DNA. Cancer 
cells are less able to avoid the effect of the induced damage than are normal 
cells, providing a therapeutic window for specifically killing cancer cells. 


Many potential carcinogens can be detected by their 
mutagenic action on bacteria 


Many human cancers are caused by exposure to chemicals that cause muta- 
tions. It is important to identify such compounds and to ascertain their 
potency so that human exposure to them can be minimized. Bruce Ames 
devised a simple and sensitive test for detecting chemical mutagens. In the 
Ames test, a thin layer of agar containing about 10^9 bacteria of a specially 
constructed tester strain of Salmonella is placed on a petri plate. These 
bacteria are unable to grow in the absence of histidine, because a mutation 
is present in one of the genes for the biosynthesis of this amino acid. The 
addition of a chemical mutagen to the center of the plate results in many 
new mutations. A small proportion of them reverse the original mutation, 
and histidine can be synthesized. These revertants multiply in the absence of 
an external source of histidine and appear as discrete colonies after the plate 
has been incubated at 37°C for 2 days (Figure 28.40). For example, 0.5 μg 
of 2-aminoanthracene gives 11,000 revertant colonies, compared with only 
30 spontaneous revertants in its absence. A series of concentrations of a 
chemical can be readily tested to generate a dose-response curve. These 
curves are usually linear, which suggests that there is no threshold 
concentration for mutagenesis. 
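
The 2-aminoanthracene figures above can be turned into a few summary numbers. The sketch below uses only the colony counts and dose quoted in the text, takes the number of plated bacteria as about 10^9, and expresses the dose in whatever unit the text uses; the per-dose figure is meaningful only if the dose-response curve is linear, as the text suggests it usually is.

```python
# Summary arithmetic for one Ames-test plate.
def ames_summary(induced, spontaneous, dose, plated=1e9):
    """Summarize revertant counts for one mutagen dose.

    induced      colonies on the plate with mutagen
    spontaneous  colonies on the control plate
    dose         amount of mutagen applied (unit as quoted in the text)
    plated       bacteria per plate (about 10^9 in the Ames test)
    """
    net = induced - spontaneous
    return {
        "net_revertants": net,
        "fold_over_background": induced / spontaneous,
        "net_per_unit_dose": net / dose,
        "fraction_of_plated": induced / plated,
    }


if __name__ == "__main__":
    summary = ames_summary(induced=11_000, spontaneous=30, dose=0.5)
    for key, value in summary.items():
        print(key, value)
```

The mutagen raises the revertant count several hundredfold over background, yet the revertants remain a tiny fraction of the bacteria plated, which is why so many cells are needed for a sensitive test.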

Some of the tester strains are responsive to base-pair substitutions, whereas 
others detect deletions or additions of base pairs (frameshifts). The sensitivity 
of these specially designed strains has been enhanced by the genetic deletion 
of their excision-repair systems. Potential mutagens enter the tester strains 
easily because the lipopolysaccharide barrier that normally coats the surface 
of Salmonella is incomplete in these strains. A key feature of this detection 
system is the inclusion of a mammalian liver homogenate. Recall that some 
potential carcinogens such as aflatoxin are converted into their active forms 
by enzyme systems in the liver or other mammalian tissues. Bacteria lack 
these enzymes, and so the test plate requires a few milligrams of a liver 
homogenate to activate this group of mutagens. 

Figure 28.40 Ames test. (A) A petri plate 
containing about 10^9 Salmonella bacteria that 
cannot synthesize histidine and (B) a petri 
plate containing a filter-paper disc with a 
mutagen, which produces a large number of 
revertants that can synthesize histidine. After 
2 days, the revertants appear as rings of 
colonies around the disc. The small number 
of visible colonies in plate A are spontaneous 
revertants. [From B. N. Ames, J. McCann, 
and E. Yamasaki. Mutat. Res. 31:347-364, 1975.] 

The Salmonella test is extensively used to help evaluate the mutagenic 
and carcinogenic risks of a large number of chemicals. This rapid and 
inexpensive bacterial assay for mutagenicity complements epidemiological 
surveys and animal tests that are necessarily slower, more laborious, and far 
more expensive. The Salmonella test for mutagenicity is an outgrowth of 
studies of gene-protein relations in bacteria. It is a striking example of how 
fundamental research in molecular biology can lead directly to important 
advances in public health. 


28.5 DNA Recombination Plays Important Roles in 
Replication, Repair, and Other Processes 


Most processes associated with DNA replication function to copy the 
genetic message as faithfully as possible. However, several biochemical 
processes require the recombination of genetic material between two DNA 
molecules. In genetic recombination, two daughter molecules are formed 
by the exchange of genetic material between two parent molecules 
(Figure 28.41). Recombination is essential in the following processes. 


1. When replication stalls, recombination processes can reset the replica- 
tion machinery so that replication can continue. 


2. Some double-stranded breaks in DNA are repaired by recombination. 


3. In meiosis, the limited exchange of genetic material between paired 
chromosomes provides a simple mechanism for generating genetic diversity 
in a population. 


4. As we shall see in Chapter 34, recombination plays a crucial role in gen- 
erating molecular diversity for antibodies and some other molecules in the 
immune system. 


5. Some viruses employ recombination pathways to integrate their genetic 
material into the DNA of a host cell. 


6. Recombination is used to manipulate genes in, for example, the genera- 
tion of “gene knockout” mice (Section 5.4). 


Recombination is most efficient between DNA sequences that are similar in 
sequence. In homologous recombination, parent DNA duplexes align at 
regions of sequence similarity, and new DNA molecules are formed by the 
breakage and joining of homologous segments. 
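
The outcome of homologous recombination can be illustrated with a toy model. The following Python sketch rests on simplifying assumptions: each parent is treated as a single string of bases rather than a duplex, the function name `recombine` and the example sequences are hypothetical, and the breakage and joining carried out by many proteins in the cell is reduced to one cut just after the shared region.

```python
def recombine(parent_a: str, parent_b: str, homology: str) -> tuple[str, str]:
    """Swap the segments that follow a shared homologous region.

    Toy model only: real recombination acts on duplex DNA and needs
    many proteins; here each molecule is a plain string of bases.
    """
    i = parent_a.find(homology)
    j = parent_b.find(homology)
    if i < 0 or j < 0:
        raise ValueError("parents do not share the homologous region")
    # Break both molecules just after the aligned region ...
    cut_a = i + len(homology)
    cut_b = j + len(homology)
    # ... and join each head to the tail of the other parent.
    daughter_1 = parent_a[:cut_a] + parent_b[cut_b:]
    daughter_2 = parent_b[:cut_b] + parent_a[cut_a:]
    return daughter_1, daughter_2


HOMOLOGY = "GGCTTAGC"                   # hypothetical shared sequence
parent_a = "AAAA" + HOMOLOGY + "TTTT"   # hypothetical parent molecules
parent_b = "CCCC" + HOMOLOGY + "GGGG"

d1, d2 = recombine(parent_a, parent_b, HOMOLOGY)
print(d1)
print(d2)
```

Each daughter carries the left-hand segment of one parent and the right-hand segment of the other, which is the sense in which the new molecules contain segments from both parents.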


Figure 28.41 Recombination. Two DNA molecules can recombine with each other to form new
DNA molecules that have segments from both parent molecules.


RecA can initiate recombination by promoting strand invasion 


In many recombination pathways, a DNA molecule with a free end recombines
with a DNA molecule having no free ends available for interaction.
DNA molecules with free ends are the common result of double-stranded
DNA breaks, but they may also be generated in DNA replication if the
replication complex stalls. This type of recombination has been studied
extensively in E. coli, but it also takes place in other organisms through the
action of proteins homologous to those of E. coli. Often dozens of proteins
participate in the complete recombination process. However, the key protein
is RecA. To accomplish the exchange, the single-stranded DNA displaces one
of the strands of the double helix (Figure 28.42). The resulting three-stranded
structure is called a displacement loop or D-loop. This process is often referred
to as strand invasion. Because a free 3’ end is now base-paired to a contiguous
strand of DNA, the 3’ end can act as a primer to initiate new DNA synthesis.
Strand invasion can initiate many processes, including the repair of
double-stranded breaks and the reinitiation of replication after the replication
apparatus has come off its template. In the repair of a break, the recombination
partner is an intact DNA molecule with an overlapping sequence.

Figure 28.42 Strand invasion. This process, promoted by proteins such as RecA, can initiate
recombination.


Some recombination reactions proceed through 
Holliday-junction intermediates 


In recombination pathways for meiosis and some other processes, interme- 
diates form that are composed of four polynucleotide chains in a crosslike 
structure. Intermediates with these crosslike structures are often referred to 
as Holliday junctions, after Robin Holliday, who proposed their role in 
recombination in 1964. Such intermediates have been characterized by a 
wide range of techniques, including x-ray crystallography. 

Specific enzymes, termed recombinases, bind to these structures and 
resolve them into separated DNA duplexes. The Cre recombinase from 
bacteriophage P1 has been extensively studied. The mechanism begins with 
the recombinase binding to the DNA substrates (Figure 28.43). 


Figure 28.43 Recombination mechanism.
Recombination begins as two DNA molecules
come together to form a recombination
synapse. One strand from each duplex is
cleaved by the recombinase enzyme; the 3’
end of each of the cleaved strands is linked to
a tyrosine (Y) residue on the recombinase
enzyme. New phosphodiester bonds are
formed when a 5’ end of the other cleaved
strand in the complex attacks these
tyrosine-DNA adducts. After isomerization, these steps
are repeated to form the recombined
products. [Figure labels: Recombination synapse; Cleavage;
Bond formation; Holliday junction.]


Four molecules of the enzyme and two DNA molecules come together to
form a recombination synapse. The reaction begins with the cleavage of one 
strand from each duplex. The 5’-hydroxyl group of each cleaved strand 
remains free, whereas the 3’-phosphoryl group becomes linked to a specific 
tyrosine residue in the recombinase. The free 5’ ends invade the other 
duplex in the synapse and attack the DNA-tyrosine units to form new 
phosphodiester bonds and free the tyrosine residues. These reactions result 
in the formation of a Holliday junction. This junction can then isomerize to 
form a structure in which the polynucleotide chains in the center of the 
structure are reoriented. From this junction, the processes of strand cleav- 
age and phosphodiester-bond formation repeat. The result is a synapse 
containing the two recombined duplexes. Dissociation of this complex gen- 
erates the final recombined products. 

Cre catalyzes the formation of Holliday junctions as well as their resolu- 
tion. In contrast, other proteins bind to Holliday junctions that have already 
been formed by other processes and resolve them into separate duplexes. In 
many cases, these proteins also promote the process of branch migration 
whereby a Holliday junction is moved along the two component double 
helices. Branch migration can affect which segments of DNA are exchanged 
in a recombination process. 


Summary 


28.1 DNA Replication Proceeds by the Polymerization of
Deoxyribonucleoside Triphosphates Along a Template

DNA polymerases are template-directed enzymes that catalyze the 
formation of phosphodiester bonds by nucleophilic attack by the 
3'-hydroxyl group on the innermost phosphorus atom of a deoxyribo- 
nucleoside 5’-triphosphate. The complementarity of shape between 
correctly matched nucleotide bases is crucial to ensuring the fidelity of 
base incorporation. DNA polymerases cannot start chains de novo; a 
primer with a free 3’-hydroxyl group is required. Thus DNA synthesis
is initiated by the synthesis of an RNA primer, the task of a specialized
primase enzyme. After serving as a primer, the RNA is degraded
and replaced by DNA. DNA polymerases always synthesize a DNA 
strand in the 5’-to-3’ direction. So that both strands of the double 
helix can be synthesized in the same direction simultaneously, one 
strand is synthesized continuously while the other is synthesized in 
fragments called Okazaki fragments. Gaps between the fragments are 
sealed by DNA ligases. ATP-driven helicases prepare the way for 
DNA replication by separating the strands of the double helix. 


28.2 DNA Unwinding and Supercoiling Are Controlled by Topoisomerases 
A key topological property of DNA is its linking number (Lk), which 
is defined as the number of times one strand of DNA winds around the 
other in the right-hand direction when the DNA axis is constrained to 
lie in a plane. Molecules differing in linking number are topoisomers 
of one another and can be interconverted only by cutting one or both 
DNA strands; these reactions are catalyzed by topoisomerases. 
Changes in linking number generally lead to changes in both the num- 
ber of turns of double helix and the number of turns of superhelix. 
Topoisomerase II catalyzes the ATP-driven introduction of negative 
supercoils, which leads to the compaction of DNA and renders it more 
susceptible to unwinding. Supercoiled DNA can be relaxed by topoisomerase
I or topoisomerase II. Topoisomerase I acts by transiently
cleaving one strand of DNA in a double helix, whereas topoisomerase
II transiently cleaves both strands simultaneously.
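
The topological quantities summarized here are tied together by the standard relation Lk = Tw + Wr, where Tw (twist) counts the turns of the double helix and Wr (writhe) counts the turns of superhelix. The short Python sketch below illustrates the bookkeeping only; the function names and the numerical values are hypothetical, chosen for illustration.

```python
def linking_number(twist: float, writhe: float) -> float:
    """Lk = Tw + Wr for a closed circular DNA molecule."""
    return twist + writhe


def writhe_for(lk: float, twist: float) -> float:
    """Writhe that a molecule must adopt for a given Lk and twist."""
    return lk - twist


# Hypothetical relaxed circle: 25 helical turns, no supercoils.
lk_relaxed = linking_number(twist=25, writhe=0)

# Suppose a topoisomerase lowers Lk by 2. If the twist stayed at 25
# (an extreme case; in practice both twist and writhe change), the
# whole change would appear as two negative supercoils.
wr_after = writhe_for(lk=lk_relaxed - 2, twist=25)

print(lk_relaxed, wr_after)
```

Because Lk can change only when a strand is cut, any redistribution between twist and writhe at fixed Lk needs no enzyme, whereas moving from one value of Lk to another requires a topoisomerase.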


28.3 DNA Replication Is Highly Coordinated

Replicative DNA polymerases are processive; that is, they catalyze the 
addition of many nucleotides without dissociating from the template. 
A major contributor to processivity is the DNA sliding clamp, such as 
the dimeric β subunit of the E. coli replicative polymerase. The sliding
clamp has a ring structure that encircles the DNA double helix and 
keeps the enzyme and DNA associated. The DNA polymerase 
holoenzyme is a large DNA-copying machine formed by two DNA 
polymerase enzymes, one to act on each template strand, associated 
with other subunits including a sliding clamp and a clamp loader. 

The synthesis of the leading and lagging strands of a double- 
stranded DNA template is coordinated. As a replicative polymerase 
moves along a DNA template, the leading strand is copied smoothly 
while the lagging strand forms loops that change length in the course 
of the synthesis of each Okazaki fragment. The mode of action is 
referred to as the trombone model. 

DNA replication is initiated at a single site within the E. coli 
genome. A set of specific proteins recognize this origin of replication 
and assemble the enzymes needed for DNA synthesis, including a 
helicase that promotes strand separation. The initiation of replica- 
tion in eukaryotes is more complex. DNA synthesis is initiated at 
thousands of sites throughout the genome. Assemblies homologous to 
those in E. coli, but more complicated, are assembled at each eukary- 
otic origin of replication. A special polymerase called telomerase that 
relies on an RNA template synthesizes specialized structures called 
telomeres at the ends of linear chromosomes. 


28.4 Many Types of DNA Damage Can Be Repaired

A wide variety of DNA damage can occur. For example, mismatched 
bases may be incorporated in the course of DNA replication or indi- 
vidual bases may be damaged by oxidation or alkylation after DNA 
replication. Other forms of damage are the formation of cross-links 
and the introduction of single- or double-stranded breaks in the DNA 
backbone. Several different repair systems detect and repair DNA 
damage. Repair begins with the process of proofreading in DNA rep- 
lication: mismatched bases that were incorporated in the course of 
synthesis are excised by exonuclease activity present in replicative 
polymerases. Some DNA lesions such as thymine dimers can be 
directly reversed through the action of specific enzymes. Other DNA- 
repair pathways act through the excision of single damaged bases 
(base-excision repair) or short segments of nucleotides (nucleotide- 
excision repair). Double-stranded breaks in DNA can be repaired by 
homologous or nonhomologous end-joining processes. Defects in 
DNA-repair components are associated with susceptibility to many 
different sorts of cancer. Such defects are a common target of cancer 
treatments. Many potential carcinogens can be detected by their 
mutagenic action on bacteria (the Ames test). 


28.5 DNA Recombination Plays Important Roles in Replication, Repair, and
Other Processes

Recombination is the exchange of segments between two DNA 
molecules. Recombination is important in some types of DNA repair 
as well as other processes such as meiosis, the generation of antibody 
diversity, and the life cycles of some viruses. Some recombination
pathways are initiated by strand invasion, in which a single strand at
the end of a DNA double helix forms base pairs with one strand
of DNA in another double helix and displaces the other strand. A
common intermediate formed in other recombination pathways is the
Holliday junction, which consists of four strands of DNA that come 
together to form a crosslike structure. Recombinases promote recom- 
bination reactions through the introduction of specific DNA breaks 
and the formation and resolution of Holliday-junction intermediates.


Problems


1. Activated intermediates. DNA polymerase I, DNA ligase, 
and topoisomerase I catalyze the formation of phosphodi- 
ester bonds. What is the activated intermediate in the link- 
age reaction catalyzed by each of these enzymes? What is 
the leaving group? 


2. Life in a hot tub. An archaeon (Sulfolobus acidocaldarius) 
found in acidic hot springs contains a topoisomerase that 
catalyzes the ATP-driven introduction of positive super- 
coils into DNA. How might this enzyme be advantageous 
to this unusual organism? 


3. Which way? Provide a chemical explanation of why 
DNA synthesis proceeds in a 5’-to-3' direction. 


4. Nucleotide requirement. DNA replication does not take
place in the absence of the ribonucleotides ATP, CTP, 
GTP, and UTP. Propose an explanation. 


5. Close contact. Examination of the structure of DNA 
polymerases bound to nucleotide analogs reveals that con- 
served residues come within van der Waals contact of C-2’ 
of the bound nucleotide. What is the potential significance 
of this interaction? 


6. Molecular motors in replication. (a) How fast does tem- 
plate DNA spin (expressed in revolutions per second) at an 
E. coli replication fork? (b) What is the velocity of movement
(in micrometers per second) of DNA polymerase III
holoenzyme relative to the template? 


7. Wound tighter than a drum. Why would replication come 
to a halt in the absence of topoisomerase II? 


8. The missing link. One form of a plasmid shows a twist of 
Tw = 48 and a writhe of Wr = 3. What is the linking number?
What would the value of writhe be for a form with
twist Tw = 50 if the linking number is the same as that for
the preceding form?


9. Telomeres and cancer. Telomerase is not active in most 
human cells. Some cancer biologists have suggested that 
activation of the telomerase gene would be a requirement 
for a cell to become cancerous. Explain why it might be the 
case. 


10. Backward? Bacteriophage T7 helicase moves along 
DNA in the 5’-to-3’ direction. Other helicases have been 
reported to move in the 3’-to-5’ direction. Is there any fundamental
reason why you would expect helicases to move
in one direction or the other?


11. Nick translation. Suppose that you wish to make a 
sample of DNA duplex highly radioactive to use as a DNA
probe. You have a DNA endonuclease that cleaves the DNA 
internally to generate 3’-OH and 5’-phosphoryl groups, 
intact DNA polymerase I, and radioactive dNTPs. Suggest 
a means for making the DNA radioactive. 


12. Revealing tracks. Suppose that replication is initiated 
in a medium containing moderately radioactive tritiated 
thymine. After a few minutes of incubation, the bacteria are 
transferred to a medium containing highly radioactive triti- 
ated thymidine. Sketch the autoradiographic pattern that 
would be seen for (a) unidirectional replication and (b) bidirectional
replication, each from a single origin.


13. Mutagenic trail. Suppose that the single-stranded RNA 
from tobacco mosaic virus was treated with a chemical 
mutagen, that mutants were obtained having serine or leu- 
cine instead of proline at a specific position, and that further 
treatment of these mutants with the same mutagen yielded 
phenylalanine at this position. 


Pro → Ser → Phe
Pro → Leu → Phe


(a) What are the plausible codon assignments for these four 
amino acids? 


(b) Was the mutagen 5-bromouracil, nitrous acid, or an 
acridine dye? 


14. Induced spectrum. DNA photolyases convert the energy 
of light in the near-ultraviolet or visible region (300-500 nm) 
into chemical energy to break the cyclobutane ring of 
pyrimidine dimers. In the absence of substrate, these pho- 
toreactivating enzymes do not absorb light of wavelengths 
longer than 300 nm. Why is the substrate-induced absorp- 
tion band advantageous? 


15. Missing telomerase. Cells lacking telomerase can grow 
for several cell divisions without obvious defects. However, 
after more cell divisions, such cells tend to show chromo- 
somes that have fused together. Propose an explanation for 
the formation of the chromosomes. 


16. I need to unwind. With the assumption that the energy 
required to break an average base pair in DNA is 10 kJ 
mol^-1 (2.4 kcal mol^-1), estimate the maximum number of
base pairs that could be broken per ATP hydrolyzed by a
helicase operating under standard conditions.


17. Triplet oxidation. The oxidation of guanine bases in the
context of triplet repeats such as CAGCAGCAG can lead 
to the expansion of the repeat. Explain. 


Mechanism Problem 


18. A revealing analog. AMP-PNP, the β,γ-imido analog
of ATP (p. 825), is hydrolyzed very slowly by most ATPases.
The addition of AMP-PNP to topoisomerase II and circular 
DNA leads to the negative supercoiling of a single molecule 
of DNA per enzyme. DNA remains bound to the enzyme 
in the presence of this analog. What does this finding reveal 
about the catalytic mechanism? 


Data Interpretation and Chapter Integration Problems 


19. Like a ladder. Circular DNA from SV40 virus was isolated
and subjected to gel electrophoresis. The results are
shown in lane A (the control) of the adjoining gel patterns. 


A B C 


[From W. Keller. PNAS 72: 2553, 1975.] 


(a) Why does the DNA separate in agarose gel electropho- 
resis? How does the DNA in each band differ? 


The DNA was then incubated with topoisomerase I for 
5 minutes and again analyzed by gel electrophoresis with 
the results shown in lane B. 


(b) What types of DNA do the various bands represent? 


Another sample of DNA was incubated with topoisomerase I
for 30 minutes and again analyzed as shown in lane C.


(c) What is the significance of the fact that more of the 
DNA is in slower-moving forms? 


20. Ames test. The illustration on the next page shows four 
petri plates used for the Ames test. A piece of filter paper 
(white circle in the center of each plate) was soaked in one of
four preparations and then placed on a petri plate. The four
preparations contained (A) purified water (control), (B) a 
known mutagen, (C) a chemical whose mutagenicity is under 
investigation, and (D) the same chemical after treatment 
with liver homogenate. The number of revertants, visible as 
colonies on the petri plates, was determined in each case. 


(a) What was the purpose of the control plate, which was 
exposed only to water? 


(b) Why was it wise to use a known mutagen in the experimental
system?


(c) How would you interpret the results obtained with the
experimental compound?


(d) What liver components would you think are responsible 
for the effects observed in preparation D? 


(A) Control: No mutagen


(B) + Known mutagen


(C) + Experimental sample


(D) + Experimental sample
after treatment with
liver homogenate


RNA synthesis is a key step in the expression of genetic
information. For eukaryotic cells, the initial RNA transcript
(the mRNA precursor) is often spliced, removing introns
that do not encode protein sequences. Often, the same
pre-mRNA is spliced differently in different cell types or
at different developmental stages. In the image at the left,
proteins associated with RNA splicing (stained with a
fluorescent antibody) highlight regions of the newt
genome that are being actively transcribed. [(Left)
Courtesy of Dr. Mark B. Roth and Dr. Joseph G. Gall.]


OUTLINE


29.1 RNA Polymerases Catalyze
Transcription


29.2 Transcription in Eukaryotes Is
Highly Regulated


29.3 The Transcription Products of
Eukaryotic Polymerases Are
Processed


29.4 The Discovery of Catalytic RNA
Was Revealing in Regard to
Both Mechanism and Evolution


Chapter 29 includes material from the sixth edition by Susan J. Baserga and Erica A. Champion.


DNA stores genetic information in a stable form that can be readily
replicated. The expression of this genetic information requires its flow
from DNA to RNA and, usually, to protein, as was introduced in Chapter 4.
This chapter examines RNA synthesis, or transcription, which is the process
of synthesizing an RNA transcript with the transfer of sequence information
from a DNA template. We begin with a discussion of RNA polymerases,
the large and complex enzymes that carry out the synthetic process.
We then turn to transcription in bacteria and focus on the three stages of
transcription: promoter binding and initiation, elongation of the nascent
RNA transcript, and termination. We then examine transcription in
eukaryotes, focusing on the distinctions between bacterial and eukaryotic
transcription.

RNA transcripts in eukaryotes are extensively modified, as exemplified
by the capping of the 5’ end of an mRNA precursor and the addition of a
long poly(A) tail to its 3’ end. One of the most striking examples of RNA
modification is the splicing of mRNA precursors, which is catalyzed by
spliceosomes, protein complexes consisting of small nuclear ribonucleoprotein
particles (snRNPs). Remarkably, some RNA molecules can splice
themselves in the absence of protein. This landmark discovery by Thomas
Cech and Sidney Altman revealed that RNA molecules can serve as catalysts
and greatly influenced our view of molecular evolution.


Figure 29.1 RNA polymerase
structures. The three-dimensional structures
of RNA polymerases from a bacterium
(Thermus aquaticus) and a eukaryote
(Saccharomyces cerevisiae). The two largest
subunits for each structure are shown in dark
red and dark blue. Notice that both structures
contain a central metal ion (green) in their
active sites, near a large cleft on the right. The
similarity of these structures reveals that these
enzymes have the same evolutionary origin
and have many mechanistic features in
common. [Drawn from 1I6V.pdb and
1I6H.pdb.]

RNA splicing is not merely a curiosity. At least 15% of all genetic 
diseases have been associated with mutations that affect RNA splicing. 
Moreover, the same pre-mRNA can be spliced differently in various cell 
types, at different stages of development, or in response to other biological 
signals. In addition, individual bases in some pre-mRNA molecules are 
changed in a process called RNA editing. One of the biggest surprises of 
the sequencing of the human genome was that only about 23,000 genes were 
identified compared with previous estimates of 100,000 or more. The ability 
of one gene to encode more than one distinct mRNA by alternative splicing 
and, hence, more than one protein may play a key role in expanding the 
repertoire of our genomes. 


RNA synthesis comprises three stages: Initiation, elongation, 
and termination 


RNA synthesis is catalyzed by large enzymes called RNA polymerases. The 
basic biochemistry of RNA synthesis is common to all organisms, a commonality
that has been beautifully illustrated by the three-dimensional
structures of representative RNA polymerases from bacteria and eukary- 
otes (Figure 29.1). Despite substantial differences in size and number of 
polypeptide subunits, the overall structures of these enzymes are quite 
similar, revealing a common evolutionary origin. 

RNA synthesis, like all biological polymerization reactions, takes place 
in three stages: initiation, elongation, and termination. RNA polymerases 
perform multiple functions in this process: 


1. They search DNA for initiation sites, also called promoter sites or simply 
promoters. For instance, E. coli DNA has about 2000 promoter sites in its
4.8 × 10^6 bp genome.


2. They unwind a short stretch of double-helical DNA to produce single- 
stranded DNA templates from which the sequence of bases can be easily 
read out. 


[Figure 29.1 panel labels: Prokaryotic RNA polymerase; Eukaryotic RNA polymerase]


3. They select the correct ribonucleoside triphosphate and catalyze the 
formation of a phosphodiester bond. This process is repeated many times as 
the enzyme moves along the DNA template. RNA polymerase is com- 
pletely processive—a transcript is synthesized from start to end by a single 
RNA polymerase molecule. 


4. They detect termination signals that specify where a transcript ends. 


5. They interact with activator and repressor proteins that modulate the 
rate of transcription initiation over a wide range. Gene expression is con- 
trolled substantially at the level of transcription, as will be discussed in 
detail in Chapters 31 and 32. 
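
The scale of the promoter search in the first task can be made concrete with a line of arithmetic. The Python sketch below uses the approximate E. coli figures quoted above (about 2000 promoters in a genome of 4.8 × 10^6 bp); the function name is hypothetical, and the assumption of evenly spaced promoters is a simplification made only for illustration.

```python
GENOME_SIZE_BP = 4.8e6   # approximate E. coli genome size quoted in the text
PROMOTER_COUNT = 2000    # approximate number of promoter sites quoted in the text


def average_promoter_spacing(genome_bp: float, promoters: int) -> float:
    """Mean distance between promoters, assuming they are evenly spaced."""
    return genome_bp / promoters


spacing = average_promoter_spacing(GENOME_SIZE_BP, PROMOTER_COUNT)
print(f"about one promoter every {spacing:.0f} bp")
```

On this rough estimate, a polymerase scanning the genome meets a promoter only once in every few thousand base pairs, which gives a sense of how selective promoter recognition must be.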


The chemistry of RNA synthesis is identical for all forms of RNA, 
including messenger RNAs, transfer RNAs, ribosomal RNAs, and small 
regulatory RNAs. The basic steps just outlined apply to all forms. Their 
synthetic processes differ mainly in regulation, posttranscriptional process- 
ing, and the specific RNA polymerase that participates. 


29.1 RNA Polymerases Catalyze Transcription 


The fundamental reaction of RNA synthesis is the formation of a phospho- 
diester bond. The 3’-hydroxyl group of the last nucleotide in the chain 
nucleophilically attacks the α-phosphoryl group of the incoming nucleoside
triphosphate with the concomitant release of a pyrophosphate.

(RNA)_n residues + ribonucleoside triphosphate ⇌ (RNA)_(n+1) residues + PPi


This reaction is thermodynamically favorable, and the subsequent degra- 
dation of the pyrophosphate to orthophosphate locks the reaction in the 
direction of RNA synthesis. The catalytic sites of RNA polymerases include 
two metal ions, normally magnesium ions (Figure 29.2). One ion remains 
tightly bound to the enzyme, whereas the other ion comes in with the 
nucleoside triphosphate and leaves with the pyrophosphate. Three con- 
served aspartate residues participate in binding these metal ions. 

RNA polymerases are very large, complex enzymes. For example, the
RNA polymerase of E. coli consists of 5 kinds of subunits with the composition
α2ββ′ω (Table 29.1). A typical eukaryotic RNA polymerase
is larger and more complex, having 12 subunits
and a total molecular mass of more than 0.5 million daltons.
Despite this complexity, the structures of RNA polymerases
have been determined in detail by x-ray crystallography.

The polymerization reactions catalyzed by RNA polymerases
take place within a complex in DNA termed a
transcription bubble (Figure 29.3). This complex consists of
double-stranded DNA that has been locally unwound in a region of approximately
17 base pairs. The edges of the bases that normally take part in
Watson-Crick base pairs are exposed in the unwound region. We will begin
with a detailed examination of the elongation process, including the role of
the DNA template read by RNA polymerase and the reactions
catalyzed by the polymerase, before returning to the
more-complex processes of initiation and termination.


Table 29.1 Subunits of RNA polymerase from E. coli
Columns: Subunit, Gene, Number, Mass (kd)


Figure 29.2 RNA polymerase active site.
A model of the transition state for
phosphodiester-bond formation in the active
site of RNA polymerase. The 3’-hydroxyl
group of the growing RNA chain attacks the
α-phosphoryl group of the incoming
nucleoside triphosphate, resulting in the
release of pyrophosphate. This transition state
is structurally similar to that in the active site
of DNA polymerase (see Figure 28.4).
[Figure label: Ribonucleoside triphosphate.]


Figure 29.3 Transcription bubble. RNA
polymerase separates a region of the double
helix to form a structure called the
“transcription bubble.” The red (template strand)
and blue (nontemplate) strands of DNA are
shown along with the RNA molecule being
synthesized (shown in green). The position of
the active site magnesium is indicated.


RNA chains are formed de novo and grow in the 5’-to-3’ 
direction 


Let us begin our examination of transcription by considering 
the DNA template. The first nucleotide (the start site) of a 
DNA sequence to be transcribed is denoted as +1 and the 
second one as +2; the nucleotide preceding the start site is 
denoted as -1. These designations refer to the coding strand
of DNA. Recall that the sequence of the template strand of 
DNA is the complement of that of the RNA transcript (Fig- 
ure 29.4). In contrast, the coding strand of DNA has the same 
sequence as that of the RNA transcript except for thymine (T) 
in place of uracil (U). The coding strand is also known as the 
sense (+) strand, and the template strand as the antisense (-)
strand. 
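
The relation among the template strand, the coding strand, and the transcript can be checked with a few lines of Python. This is a minimal sketch: the function names are hypothetical, the short sequences are chosen only for illustration, and the template is assumed to be written 3’ to 5’ so that the transcript reads 5’ to 3’ from left to right.

```python
# Template DNA base -> RNA base that pairs with it
TEMPLATE_TO_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe_template(template_3_to_5: str) -> str:
    """RNA (5'->3') complementary to a template strand written 3'->5'."""
    return "".join(TEMPLATE_TO_RNA[base] for base in template_3_to_5)


def transcribe_coding(coding_5_to_3: str) -> str:
    """RNA with the same sequence as the coding strand, with U in place of T."""
    return coding_5_to_3.replace("T", "U")


template = "TTTGCATCCCAGTGTAG"  # antisense (-) strand, written 3'->5'
coding = "AAACGTAGGGTCACATC"    # sense (+) strand, written 5'->3'

rna_from_template = transcribe_template(template)
rna_from_coding = transcribe_coding(coding)
print(rna_from_template)
print(rna_from_coding)
```

Both routes give the same transcript, which is why the start site and the positions upstream and downstream of it can be numbered on the coding strand even though the polymerase reads the template strand.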

In contrast with DNA synthesis, RNA synthesis can start 
de novo, without the requirement for a primer. Most newly syn- 
thesized RNA chains carry a highly distinctive tag on the 5’ 
end: the first base at that end is either pppG or pppA.


              5’-AAACGUAGGGUCACAUC...  RNA transcript
3’-GCATACAACACACCTTTGCATCCCAGTGTAG...  Template, or antisense (-), strand
5’-CGTATGTTGTGTGGAAACGTAGGGTCACATC...  Coding, or sense (+), strand
               -1 +1 +2


Figure 29.4 Template and coding strands. The template, or antisense (-), strand is
complementary in sequence to the RNA transcript.


The presence of the triphosphate moiety confirms that RNA synthesis 
starts at the 5’ end. 

The dinucleotide shown on page 854 is synthesized by RNA polymerase 
as part of the complex process of initiation, which will be discussed later in 
the chapter. After initiation takes place, RNA polymerase elongates the 
nucleic acid chain in the following manner (Figure 29.5). A ribonucleoside 
triphosphate binds in the active site of the RNA polymerase, directly adja- 
cent to the growing RNA chain. The incoming ribonucleoside triphosphate 
forms a Watson-Crick base pair with the template strand. The 3’-hydroxyl
group of the growing RNA chain, oriented and activated by the tightly
bound metal ion, attacks the α-phosphoryl group to form a new phosphodiester
bond, displacing pyrophosphate.

Figure 29.5 Elongation mechanism. A ribonucleoside triphosphate binds adjacent to the
growing RNA chain and forms a Watson-Crick base pair with a base on the DNA template strand.
The 3’-hydroxyl group at the end of the RNA chain attacks the newly bound nucleotide and forms
a new phosphodiester bond, releasing pyrophosphate.


To proceed to the next step, the RNA-DNA hybrid must move relative 
to the polymerase to bring the 3’ end of the newly added nucleotide into 
proper position for the next nucleotide to be added (Figure 29.6). This 
translocation step does not include breaking any bonds between base pairs 
and is reversible but, once it has taken place, the addition of the next nucle- 
otide, favored by the triphosphate cleavage and pyrophosphate release and 
cleavage, drives the polymerization reaction forward.

Figure 29.6 Translocation. After nucleotide addition, the RNA-DNA hybrid can translocate
through the RNA polymerase, bringing a new DNA base into position to base-pair with an incoming
nucleoside triphosphate.


The lengths of the RNA-DNA hybrid and of the unwound region of 
DNA stay rather constant as RNA polymerase moves along the DNA 
template. The length of the RNA-DNA hybrid is determined by a structure
within the enzyme that forces the RNA-DNA hybrid to separate,
allowing the RNA chain to exit from the enzyme and the DNA chain to
rejoin its DNA partner (Figure 29.7).

Figure 29.7 RNA-DNA hybrid separation. A
structure within RNA polymerase forces the
separation of the RNA-DNA hybrid. Notice that the
DNA strand exits in one direction and the RNA
product exits in another. [Drawn from 1I6H.pdb.]


Figure 29.8 Backtracking. The RNA-DNA
hybrid can occasionally backtrack within the
RNA polymerase. In the backtracked position,
hydrolysis can take place, producing a
configuration equivalent to that after
translocation. Backtracking is more likely if a
mismatched base is added, facilitating
proofreading.


RNA polymerases backtrack and correct errors 


The RNA-DNA hybrid can also move in the direction opposite that of 
elongation (Figure 29.8). This backtracking is less favorable energetically 
than moving forward because it breaks the bonds between a base pair. 
However, backtracking is very important for proofreading. The incorporation
of an incorrect nucleotide introduces a non-Watson-Crick base pair. In
this case, breaking the bonds between this base pair and backtracking is
less costly energetically. After the polymerase has backtracked, the
phosphodiester bond one base pair before the one that has just formed is adjacent
to the metal ion in the active site. In this position, a hydrolysis reaction in 
which a water molecule attacks the phosphate can result in the cleavage of 
the phosphodiester bond and the release of a dinucleotide that includes 
the incorrect nucleotide. 
Studies of single molecules of RNA polymerase have confirmed that the
enzymes hesitate and backtrack to correct errors. Furthermore, these
proofreading activities are often enhanced by accessory proteins. The final error
rate of the order of one mistake per 10⁴ or 10⁵ nucleotides is higher than that
for DNA replication, including all error-correcting mechanisms. The lower
fidelity of RNA synthesis can be tolerated because mistakes are not
transmitted to progeny. For most genes, many RNA transcripts are synthesized;
a few defective transcripts are unlikely to be harmful.
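
To see why an error rate of this size is tolerable, it helps to estimate how many transcripts carry no mistake at all. The sketch below is only an illustration: it assumes that errors are independent and uniformly distributed along the transcript, and the 1000-nucleotide transcript length is a hypothetical value chosen for the example, not a figure from the text.

```python
def fraction_error_free(length_nt: int, error_rate: float) -> float:
    """Probability that a transcript of length_nt nucleotides contains no
    misincorporated nucleotide, assuming independent errors that occur at a
    uniform per-nucleotide rate (a simplifying assumption)."""
    return (1.0 - error_rate) ** length_nt


if __name__ == "__main__":
    transcript_length = 1000  # hypothetical transcript length, in nucleotides
    for rate in (1e-4, 1e-5):  # the two error rates quoted in the text
        p = fraction_error_free(transcript_length, rate)
        print(f"error rate {rate:g}: fraction of error-free transcripts = {p:.3f}")
```

Under these assumptions, the large majority of transcripts of this length are error-free at either rate, which is consistent with the statement that a few defective transcripts are unlikely to be harmful.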


RNA polymerase binds to promoter sites on the DNA template to 
initiate transcription 


The elongation process is common to all organisms. In contrast, the
processes of initiation and termination differ substantially in bacteria and
eukaryotes. We begin with a discussion of these processes in bacteria,
starting with initiation of transcription. The bacterial RNA polymerase
discussed earlier with the composition α2ββ′ω is referred to as the core enzyme.
The inclusion of an additional subunit produces the holoenzyme with
composition α2ββ′ωσ. The σ subunit helps find sites on DNA where
transcription begins, referred to as promoter sites or, simply, promoters. At these sites,
the σ subunit participates in the initiation of RNA synthesis and then
dissociates from the rest of the enzyme.

Sequences upstream of the promoter site are important in determining
where transcription begins. A striking pattern was evident when the
sequences of bacterial promoters were compared. Two common motifs are
present on the upstream side of the transcription start site. They are known as
the −10 sequence and the −35 sequence because they are centered at about
10 and 35 nucleotides upstream of the start site. The region containing these
sequences is called the core promoter. The −10 and −35 sequences are each
6 bp long. Their consensus sequences, deduced from analyses of many
promoters (Figure 29.9), are

         −35           −10          +1
5′ ... TTGACA ...... TATAAT ... Start site

    Transcription starts here (figure label)
(A) GATATAATGAGCAAA
(B) CGTATAATTTIGACCA
(C) GTTAACTAGTACGCA
(D) GTGATACTGAGCACA
(E) GTTTTCATGCCTCCA
    TATAAT

Figure 29.9 Bacterial promoter sequences. A comparison of five sequences from
prokaryotic promoters reveals a recurring sequence of TATAAT centered on position −10.
The −10 consensus sequence (in red) was deduced from a large number of promoter
sequences. The sequences are from the (A) uvrD, (B) uncI, and (C) trp operons of
E. coli; from (D) λ phage; and from (E) φX174 phage.


Promoters differ markedly in their efficacy. Some genes are transcribed
frequently, as often as every 2 seconds in E. coli. The promoters for these
genes are referred to as strong promoters. In contrast, other genes are
transcribed much less frequently, about once in 10 minutes; the promoters for
these genes are weak promoters. The −10 and −35 regions of most strong
promoters have sequences that correspond closely to the consensus
sequences, whereas weak promoters tend to have multiple substitutions at
these sites. Indeed, mutation of a single base in either the −10 sequence
or the −35 sequence can diminish promoter activity. The distance between
these conserved sequences also is important; a separation of 17 nucleotides
is optimal. Thus, the efficiency or strength of a promoter sequence serves to
regulate transcription. Regulatory proteins that bind to
specific sequences near promoter sites and interact with
RNA polymerase (Chapter 31) also markedly influence
the frequency of transcription of many genes.
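
The relation between consensus match, spacing, and promoter strength can be made concrete with a toy score. The sketch below is not a published scoring scheme: the function names and the rule of one point per matching base, minus one point per nucleotide of deviation from the 17-nucleotide spacing, are assumptions made purely for illustration. Only the consensus sequences and the optimal spacing come from the text.

```python
CONSENSUS_35 = "TTGACA"   # -35 consensus sequence from the text
CONSENSUS_10 = "TATAAT"   # -10 consensus sequence from the text
OPTIMAL_SPACING = 17      # optimal separation (nucleotides) from the text


def consensus_matches(site: str, consensus: str) -> int:
    """Count the positions at which a 6-bp site agrees with the consensus."""
    return sum(a == b for a, b in zip(site.upper(), consensus))


def toy_promoter_score(minus35: str, minus10: str, spacing: int) -> int:
    """Hypothetical score: matching bases (maximum 12) minus a penalty of 1
    for each nucleotide by which the spacing differs from 17."""
    matches = consensus_matches(minus35, CONSENSUS_35) + consensus_matches(minus10, CONSENSUS_10)
    return matches - abs(spacing - OPTIMAL_SPACING)


if __name__ == "__main__":
    print(toy_promoter_score("TTGACA", "TATAAT", 17))  # perfect consensus, optimal spacing
    print(toy_promoter_score("TTTACA", "GATACT", 18))  # several substitutions, spacing off by one
```

A promoter that matches both consensus sequences at the optimal spacing receives the highest score, and each substitution or spacing change lowers it, mirroring the qualitative trend described above.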

Outside the core promoter in a subset of highly
expressed genes is the upstream element (also called the
UP element). This sequence is
present from 40 to 60 nucleotides upstream of the
transcription start site. The UP element is bound by
the α subunit of RNA polymerase and serves to
increase the efficiency of transcription by creating an
additional interaction site for the polymerase.


Sigma subunits of RNA polymerase recognize 
promoter sites 


To initiate transcription, the α2ββ′ω core of RNA
polymerase must bind the promoter. However, it is the
σ subunit that makes this binding possible by enabling
RNA polymerase to recognize promoter sites. In the presence
of the σ subunit, the RNA polymerase binds
weakly to the DNA and slides along the double helix
until it dissociates or encounters a promoter. The σ
subunit recognizes the promoter through several interactions
with the nucleotide bases of the promoter DNA.
The structure of a bacterial RNA polymerase holoenzyme
bound to a promoter site shows the σ subunit
interacting with DNA at the −10 and −35 regions
essential to promoter recognition (Figure 29.10).

Figure 29.10 RNA polymerase holoenzyme complex. Notice
that the σ subunit (blue) of the bacterial RNA polymerase holoenzyme
makes sequence-specific contacts with the −10 (green) and −35
promoter elements (yellow). The active site of the polymerase is revealed
by bound metal ion (red). [Drawn from 1L9Z.pdb].
                                     −35                 −10
Standard promoter             5′ ... TTGACA ............ TATAAT ... 3′
Heat-shock promoter           5′ ... TNNCNCNCTTGAA ..... CCCATNT ... 3′
Nitrogen-starvation promoter  5′ ... CTGGNA ............ TTGCA ... 3′

Figure 29.11 Alternative promoter sequences. A comparison of the consensus sequences of
standard, heat-shock, and nitrogen-starvation promoters of E. coli. These promoters are recognized
by σ⁷⁰, σ³², and σ⁵⁴, respectively.


Therefore, the σ subunit is responsible for the specific binding of the RNA
polymerase to a promoter site on the template DNA. The σ subunit is
generally released when the nascent RNA chain reaches 9 or 10 nucleotides in
length. After its release, it can assist initiation by another core enzyme.
Thus, the σ subunit acts catalytically.

E. coli has seven distinct σ factors for recognizing several types of
promoter sequences in E. coli DNA. The type that recognizes the consensus
sequences described earlier is called σ⁷⁰ because it has a mass of 70 kd. A
different σ factor comes into play when the temperature is raised abruptly.
E. coli responds by synthesizing σ³², which recognizes the promoters of
heat-shock genes. These promoters exhibit −10 sequences that are somewhat
different from the −10 sequence for standard promoters (Figure 29.11).
The increased transcription of heat-shock genes leads to the coordinated
synthesis of a series of protective proteins. Other σ factors respond to
environmental conditions, such as nitrogen starvation. These findings
demonstrate that σ plays a key role in determining where RNA polymerase initiates
transcription.


Some other bacteria contain a much larger number of σ factors. For
example, the genome of the soil bacterium Streptomyces coelicolor
encodes more than 60 σ factors recognized on the basis of their amino acid
sequences. This repertoire allows these cells to adjust their gene-expression
programs to the wide range of conditions, in regard to nutrients and
competing organisms, that they may experience.


RNA polymerases must unwind the template double helix for 
transcription to take place 


Although RNA polymerases can search for promoter sites when bound to 
double-helical DNA, a segment of the DNA double helix must be unwound 
before synthesis can begin. The transition from the closed promoter complex (in 
which DNA is double helical) to the open promoter complex (in which a DNA 
segment is unwound) is an essential event in transcription (Figure 29.12). The 
free energy necessary to break the bonds between approximately 17 base pairs
in the double helix is derived from additional interactions
that are possible when the DNA distorts to wrap around the
RNA polymerase and from interactions between the single-stranded
DNA regions and other parts of the enzyme. These
interactions stabilize the open promoter complex and help
pull the template strand into the active site. The −35 element
remains in a double-helical state, whereas the −10 element is
unwound. The stage is now set for the formation of the first
phosphodiester bond of the new RNA chain.

Figure 29.12 DNA unwinding. The transition from the closed
promoter complex to the open promoter complex requires the
unwinding of approximately 17 base pairs of DNA.
[Panel labels: Closed complex; Open complex.]

Elongation takes place at transcription bubbles that move along 
the DNA template 


The elongation phase of RNA synthesis begins with the formation of the
first phosphodiester bond. Repeated cycles of nucleotide addition can take
place at this point. However, until about 10 nucleotides have been added,
RNA polymerase sometimes releases the short RNA, which dissociates
from the DNA. Once RNA polymerase passes this point, the enzyme
stays bound to its template until a termination signal is reached. The region
containing RNA polymerase, DNA, and nascent RNA corresponds to the
transcription bubble (Figure 29.13). The newly synthesized RNA forms a
hybrid helix with the template DNA strand. This RNA-DNA helix is
about 8 bp long, which corresponds to nearly one turn of a double helix. The
3′-hydroxyl group of the RNA in this hybrid helix is positioned so that it
can attack the α-phosphorus atom of an incoming ribonucleoside triphosphate.
The core enzyme also contains a binding site for the coding strand of DNA.
About 17 bp of DNA are unwound throughout the elongation phase, as
in the initiation phase. The transcription bubble moves a distance of 170 Å
(17 nm) in a second, which corresponds to a rate of elongation of about
50 nucleotides per second. Although rapid, it is much slower than the rate
of DNA synthesis, which is 800 nucleotides per second.

[Figure 29.13 labels: RNA polymerase; rewinding; 5′ ppp; movement of polymerase.]
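
The conversion from bubble speed to elongation rate is a single division, sketched below. The rise of 3.4 Å per base pair is the standard value for B-form DNA and is an assumption brought in for this illustration (the text does not state it here); the 3000-nucleotide gene length is likewise hypothetical.

```python
RISE_PER_BP_ANGSTROM = 3.4  # axial rise per base pair in B-form DNA (assumed value)


def elongation_rate_nt_per_s(bubble_speed_angstrom_per_s: float) -> float:
    """Convert the speed of the transcription bubble along the DNA (Å per
    second) into nucleotides added per second."""
    return bubble_speed_angstrom_per_s / RISE_PER_BP_ANGSTROM


def seconds_to_synthesize(length_nt: int, rate_nt_per_s: float) -> float:
    """Time needed to polymerize a chain of length_nt at a constant rate."""
    return length_nt / rate_nt_per_s


if __name__ == "__main__":
    rna_rate = elongation_rate_nt_per_s(170.0)  # bubble speed quoted in the text
    print(f"RNA polymerase: about {rna_rate:.0f} nucleotides per second")
    gene_length = 3000  # hypothetical gene length, in nucleotides
    print(f"transcription: {seconds_to_synthesize(gene_length, rna_rate):.0f} s")
    print(f"DNA synthesis at 800 nt/s: {seconds_to_synthesize(gene_length, 800.0):.2f} s")
```

The comparison in the last two lines assumes constant rates with no pausing, which is a simplification of the behavior of the real enzymes.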


Sequences within the newly transcribed RNA signal termination 


In bacteria, the termination of transcription is as precisely controlled as its
initiation. In the termination phase of transcription, the formation of
phosphodiester bonds ceases, the RNA-DNA hybrid dissociates, the unwound
region of DNA rewinds, and RNA polymerase releases the DNA. What
determines where transcription is terminated? The transcribed regions of
DNA templates contain stop signals. The simplest one is a palindromic
GC-rich region followed by an AT-rich region. The RNA transcript of this
DNA palindrome is self-complementary (Figure 29.14). Hence, its bases
can pair to form a hairpin structure with a stem and loop, a structure favored
by its high content of G and C residues. Guanine-cytosine base pairs are
more stable than adenine-thymine pairs because of the extra hydrogen bond
in the base pair. This stable hairpin is followed by a sequence of four or
more uracil residues, which also are crucial for termination. The RNA
transcript ends within or just after them.
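
The two features of this stop signal, a self-complementary stem and a following run of U residues, can be searched for with a short script. The sketch below is a deliberately simple illustration, not a validated terminator-prediction method: it requires perfect Watson-Crick pairing in the stem, and the default stem length, loop range, function names, and example sequence are all assumptions made for the example.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def find_terminator(rna, stem_len=6, min_loop=3, max_loop=8, min_u=4):
    """Scan for a perfectly paired stem of stem_len bases, separated by a loop
    of min_loop..max_loop bases, and followed directly by at least min_u U
    residues. Returns (stem_start, loop_length) for the first hit, else None."""
    rna = rna.upper()
    n = len(rna)
    for start in range(n - 2 * stem_len - min_loop - min_u + 1):
        left_arm = rna[start:start + stem_len]
        for loop in range(min_loop, max_loop + 1):
            right_start = start + stem_len + loop
            right_arm = rna[right_start:right_start + stem_len]
            tail = rna[right_start + stem_len:right_start + stem_len + min_u]
            if len(right_arm) < stem_len or len(tail) < min_u:
                break  # longer loops would run off the end of the sequence
            if right_arm == reverse_complement(left_arm) and tail == "U" * min_u:
                return start, loop
    return None


if __name__ == "__main__":
    # Hypothetical transcript: GC-rich stem, 4-base loop, then a run of U residues.
    example = "CC" + "GCCGCC" + "UUCG" + "GGCGGC" + "UUUUUU"
    print(find_terminator(example))
```

Real terminators tolerate mismatches and vary in stem and loop length, so a practical search would need a scoring scheme rather than the exact matching used here.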

How does this combination hairpin-oligo(U) structure terminate
transcription? First, RNA polymerase likely pauses immediately
after it has synthesized a stretch of RNA that
folds into a hairpin. Furthermore, the RNA-DNA
hybrid helix produced after the hairpin is unstable
because its rU-dA base pairs are the weakest of the
four kinds. Hence, the pause in transcription caused
by the hairpin permits the weakly bound nascent
RNA to dissociate from the DNA template and then
from the enzyme. The solitary DNA template strand rejoins its partner to
re-form the DNA duplex, and the transcription bubble closes.

Figure 29.13 Transcription bubble. (A) A schematic representation of a transcription
bubble in the elongation of an RNA transcript. Duplex DNA is unwound at the forward end
of RNA polymerase and rewound at its rear end. The RNA-DNA hybrid rotates during
elongation.

Figure 29.14 Termination signal. A termination signal found at the 3′
end of an mRNA transcript consists of a series of bases that form a stable
stem-loop structure and a series of U residues.

Figure 29.15 Riboswitch. (A) The 5′-end of an mRNA that encodes proteins
engaged in the production of flavin mononucleotide (FMN) folds to form a
structure that is stabilized by binding FMN. This structure includes a terminator
that leads to premature termination of the mRNA. At lower concentrations of FMN,
an alternative structure that lacks the terminator is formed, leading to the
production of full-length mRNA. (B) The three-dimensional structure of a
related FMN-binding riboswitch bound to FMN. The blue and yellow stretches
correspond to regions highlighted in the same colors in part A. Notice how the
yellow strand contacts the bound FMN, stabilizing the structure.
[Drawn from 3F2Q.pdb]. [Panel label: High [FMN], terminator formed,
full mRNA not produced.]


Some messenger RNAs directly sense metabolite concentrations 


As we shall explore in Chapters 31 and 32, the expression of many genes is 
controlled in response to the concentrations of metabolites and signaling 
molecules within cells. One set of control mechanisms depends on the 
remarkable ability of some mRNA molecules to form special secondary 
structures, some of which are capable of directly binding small molecules. 
These structures are termed riboswitches. Consider a riboswitch that 
controls the synthesis of genes that participate in the biosynthesis of ribofla- 
vin (p. 441) in Bacillus subtilis (Figure 29.15). When flavin mononucleotide 
(FMN), a key intermediate in riboflavin biosynthesis, is present at high 
concentration, it binds to a conformation of the RNA transcript with an 
FMN-binding pocket that also includes a hairpin structure that favors pre- 
mature termination. By trapping the RNA transcript in this termination- 
favoring conformation, FMN prevents the production of functional mRNA. 
However, when FMN is present at low concentration, it does not readily 
bind to RNA, and an alternative conformation forms without the termina- 
tor, allowing the production of the full-length mRNA. The occurrence of 
riboswitches serves as a vivid illustration of how RNAs are capable of form- 
ing elaborate, functional structures, though we tend to depict them as sim- 
ple lines in the absence of specific information. 


The rho protein helps to terminate the transcription of some genes 


RNA polymerase needs no help to terminate transcription at a hairpin fol- 
lowed by several U residues. At other sites, however, termination requires 
the participation of an additional factor. This discovery was prompted by 
the observation that some RNA molecules synthesized in vitro by RNA 
polymerase acting alone are longer than those made in vivo. The missing 
factor, a protein that caused the correct termination, was isolated and named 
rho (ρ). Additional information about the action of ρ was obtained by adding
this termination factor to an incubation mixture at various times after the
initiation of RNA synthesis (Figure 29.16). RNAs with sedimentation
coefficients of 10S, 13S, and 17S were obtained when ρ was added at
initiation, a few seconds after initiation, and 2 minutes after initiation,
respectively. If no ρ was added, transcription yielded a 23S RNA product. It is
evident that the template contains at least three termination sites that
respond to ρ (yielding 10S, 13S, and 17S RNA) and one termination site that
does not (yielding 23S RNA). Thus, specific termination at a site producing
23S RNA can take place in the absence of ρ. However, ρ detects additional
termination signals that are not recognized by RNA polymerase alone.


How does ρ provoke the termination of RNA synthesis?
A key clue is the finding that ρ hydrolyzes ATP in the presence
of single-stranded RNA but not in the presence of DNA or
duplex RNA. Hexameric ρ is a helicase, homologous to the
helicases that we encountered in our consideration of DNA
replication (Section 28.1). A stretch of nucleotides is bound
in such a way that the RNA passes through the center of the
structure (Figure 29.17). The ρ protein is brought into action
by sequences located in the nascent RNA that are rich in
cytosine and poor in guanine. The helicase activity of
ρ enables the protein to pull the nascent RNA while pursuing
RNA polymerase. When ρ catches RNA polymerase at
the transcription bubble, it breaks the RNA-DNA hybrid
helix by functioning as an RNA-DNA helicase.

Proteins in addition to ρ may provoke termination. For
example, the nusA protein enables RNA polymerase in E. coli 
to recognize a characteristic class of termination sites. A com- 
mon feature of protein-independent and protein-dependent ter- 
mination is that the functioning signals lie in newly synthesized 
RNA rather than in the DNA template. 


Some antibiotics inhibit transcription 


Figure 29.17 Mechanism for the termination of transcription
by ρ protein. This protein is an ATP-dependent helicase that binds
the nascent RNA chain and pulls it away from RNA polymerase and
the DNA template. [Figure labels: RNA polymerase; ρ protein; 5′ ppp.]


Many antibiotics are highly specific inhibitors of biological processes
in bacteria. Rifampicin and actinomycin are two antibiotics
that inhibit bacterial transcription, although in quite different ways.
Rifampicin is a semisynthetic derivative of rifamycins, which are compounds
derived from a strain of Streptomyces.
Figure 29.19 Actinomycin-DNA complex structure. (A) The structure of a
complex between a DNA duplex (shown as a space-filling model) and actinomycin B
(shown as a ball-and-stick model). Two actinomycin B molecules are bound in the
complex. (B) The structure of actinomycin B showing the phenoxazone ring. Notice
how the phenoxazone (yellow) slides between base pairs of the DNA.
Abbreviation: Me, methyl. [Drawn from 1I3W.pdb]


Rifampicin 


Figure 29.18 Antibiotic action. Rifampicin binds to a pocket in the channel that is normally 
occupied by the newly formed RNA-DNA hybrid. Thus, the antibiotic blocks elongation after only 
two or three nucleotides have been added. 


This antibiotic specifically inhibits the initiation of RNA synthesis. 
Rifampicin interferes with the formation of the first few phosphodiester 
bonds in the RNA chain. The structure of a complex between a prokaryotic
RNA polymerase and rifampicin reveals that the antibiotic blocks the channel
into which the RNA-DNA hybrid generated by the enzyme must pass
(Figure 29.18). The binding site is 12 Å from the active site itself. Rifampicin
can inhibit only the initiation of transcription, not elongation, because the
RNA-DNA hybrid present in the enzyme during elongation prevents the
antibiotic from binding. The pocket in which rifampicin binds is conserved 
among bacterial RNA polymerases, but not eukaryotic polymerases, and so 
rifampicin can be used as an antibiotic in antituberculosis therapy. 

Actinomycin D, a peptide-containing antibiotic from a different strain of 
Streptomyces, inhibits transcription by an entirely different mechanism. 
Actinomycin D binds tightly and specifically to double-helical DNA and 
thereby prevents it from being an effective template for RNA synthesis. The 
results of spectroscopic, hydrodynamic, and structural studies of complexes 
of actinomycin D and DNA reveal that the phenoxazone ring of actinomy- 
cin slips in between base pairs in DNA (Figure 29.19). This mode of bind- 
ing is called intercalation. At low concentrations, actinomycin D inhibits 
transcription without significantly affecting DNA replication or protein 
synthesis. Hence, actinomycin D is extensively used as a highly specific inhib- 
itor of the formation of new RNA in both prokaryotic and eukaryotic cells. Its 
ability to inhibit the growth of rapidly dividing cells makes it an effective 
therapeutic agent in the treatment of some cancers. 
Precursors of transfer and ribosomal RNA are cleaved and chemically
modified after transcription in prokaryotes


In prokaryotes, messenger RNA molecules undergo little or no modification 
after synthesis by RNA polymerase. Indeed, many mRNA molecules are 
translated while they are being transcribed. In contrast, transfer RNA and 
ribosomal RNA molecules are generated by cleavage and other modifications 
of nascent RNA chains. For example, in E. coli, the three rRNAs and a 
tRNA are excised from a single primary RNA transcript that also contains 
spacer regions (Figure 29.20). Other transcripts contain arrays of several 
kinds of tRNA or of several copies of the same tRNA. The nucleases that 
cleave and trim these precursors of rRNA and tRNA are highly precise. 
Ribonuclease P (RNase P), for example, generates the correct 5’ terminus of 
all tRNA molecules in E. coli. Sidney Altman and his coworkers showed 
that this interesting enzyme contains a catalytically active RNA molecule. 
Ribonuclease III (RNase III) excises 5S, 16S, and 23S rRNA precursors
from the primary transcript by cleaving double-helical hairpin regions at 
specific sites. 


Figure 29.20 Primary transcript. Cleavage of
this transcript produces 5S, 16S, and 23S rRNA
[Figure labels, in order along the transcript: 16S rRNA; tRNA; 23S rRNA; 5S rRNA.]

A second type of processing is the addition of nucleotides to the termini of
some RNA chains. For example, CCA, a terminal sequence required for the 
function of all tRNAs, is added to the 3’ ends of tRNA molecules for which 
this terminal sequence is not encoded in the DNA. The enzyme that cata- 
lyzes the addition of CCA is atypical for an RNA polymerase in that it does 
not use a DNA template. A third type of processing is the modification of 
bases and ribose units of ribosomal RNAs. In prokaryotes, some bases of 
rRNA are methylated. Unusual bases are found in all tRNA molecules. 
They are formed by the enzymatic modification of a standard ribonucle- 
otide in a tRNA precursor. For example, uridylate residues are modified 
after transcription to form ribothymidylate and pseudouridylate. These 
modifications generate diversity, allowing greater structural and functional 
versatility. 
Figure 29.21 Transcription and translation. These two processes are
closely coupled in prokaryotes whereas they are spatially and temporally
separate in eukaryotes. (A) In prokaryotes the primary transcript serves as
mRNA and is used immediately as the template for protein synthesis.
(B) In eukaryotes, mRNA precursors are processed and spliced in the
nucleus before being transported to the cytoplasm for translation into protein.
[After J. Darnell, H. Lodish, and D. Baltimore. Molecular Cell Biology, 2d ed.
(Scientific American Books, 1990), p. 230.]


29.2 Transcription in Eukaryotes Is Highly Regulated 


We turn now to transcription in eukaryotes, a much more complex process 
than in bacteria. Eukaryotic cells have a remarkable ability to regulate pre- 
cisely the time at which each gene is transcribed and how much RNA is 
produced. This ability has allowed some eukaryotes to evolve into multi- 
cellular organisms, with distinct tissues. That is, multicellular eukaryotes 
use differential transcriptional regulation to create different cell types. Gene 
expression is influenced by three important characteristics unique to 
eukaryotes: the nuclear membrane, complex transcriptional regulation, and 
RNA processing. 


1. The Nuclear Membrane. In eukaryotes, transcription and translation 
take place in different cellular compartments: transcription takes place in the 
membrane-bounded nucleus, whereas translation takes place outside the 
nucleus in the cytoplasm. In bacteria, the two processes are closely coupled 
(Figure 29.21). Indeed, the translation of bacterial mRNA begins while the 
transcript is still being synthesized. The spatial and temporal separation of 
transcription and translation enables eukaryotes to regulate gene expression in 
much more intricate ways, contributing to the richness of eukaryotic form and 
function. 


2. Complex Transcriptional Regulation. Like bacteria, eukaryotes rely on 
conserved sequences in DNA to regulate the initiation of transcription. But 
bacteria have only three promoter elements (the −10, −35, and UP elements),
whereas eukaryotes use a variety of types of promoter elements, each 
identified by its own conserved sequence. Not all possible types will be 
present together in the same promoter. In eukaryotes, elements that regulate 
transcription can be found at a variety of locations in DNA, upstream or 
downstream of the start site and sometimes at distances much farther from 
the start site than in prokaryotes. For example, enhancer elements located 
on DNA far from the start site increase the promoter activity of specific 
genes. 


3. RNA Processing. Although both bacteria and eukaryotes modify RNA, 
eukaryotes very extensively process nascent RNA destined to become mRNA. 
This processing includes modifications to both ends and, most significantly, 
splicing out segments of the primary transcript. RNA processing is 
described in Section 29.3. 
Type   Location      Cellular transcripts         Effects of α-amanitin
I      Nucleolus     18S, 5.8S, and 28S rRNA      Insensitive
II     Nucleoplasm   mRNA precursors and snRNA    Strongly inhibited
III    Nucleoplasm   tRNA and 5S rRNA             Inhibited by high concentrations


Three types of RNA polymerase synthesize RNA in eukaryotic cells 


In bacteria, RNA is synthesized by a single kind of polymerase. In contrast, 
the nucleus of a typical eukaryotic cell contains three types of RNA 
polymerase differing in template specificity and location in the nucleus 
(Table 29.2). All these polymerases are large proteins, containing from 8 to 
14 subunits and having total molecular masses greater than 500 kd. RNA 
polymerase I is located in specialized structures within the nucleus called 
nucleoli, where it transcribes the tandem array of genes for 18S, 5.8S, and
28S ribosomal RNA. The other ribosomal RNA molecule (5S rRNA) and
all the transfer RNA molecules are synthesized by RNA polymerase III, 
which is located in the nucleoplasm rather than in nucleoli. RNA polymerase II, 
which also is located in the nucleoplasm, synthesizes the precursors of 
messenger RNA as well as several small RNA molecules, such as those of 
the splicing apparatus and many of the precursors to small regulatory 
RNAs. 

Although all eukaryotic RNA polymerases are homologous to one 
another and to prokaryotic RNA polymerases, RNA polymerase II contains 
a unique carboxyl-terminal domain on the 220-kd subunit called the CTD; 
this domain is unusual because it contains multiple repeats of a YSPTSPS 
consensus sequence. The activity of RNA polymerase II is regulated by 
phosphorylation mainly on the serine residues of the CTD. 

Another major distinction among the polymerases lies in their responses 
to the toxin α-amanitin, a cyclic octapeptide that contains several modified
amino acids.
α-Amanitin is produced by the poisonous mushroom Amanita phalloides,
which is also called the death cap or the destroying angel. More than a
hundred deaths result worldwide each year from the ingestion of poisonous
mushrooms. α-Amanitin binds very tightly (Kd = 10 nM) to RNA
polymerase II and thereby blocks the elongation phase of RNA synthesis.
Higher concentrations of α-amanitin (1 μM) inhibit polymerase III, whereas
polymerase I is insensitive to this toxin. This pattern of
sensitivity is highly conserved throughout the animal and
plant kingdoms.

[Photograph: Amanita phalloides, also called the death cap. Archenzo/WikimediaCommons]
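
The meaning of a 10 nM dissociation constant can be illustrated with the standard expression for one-site binding, in which the fraction of enzyme occupied is [L] / (Kd + [L]). The sketch below assumes a single independent binding site and treats the free toxin concentration as equal to the total concentration; both are simplifying assumptions, and the concentrations listed are chosen only for illustration.

```python
KD_POL_II_NM = 10.0  # dissociation constant for RNA polymerase II, from the text


def fraction_bound(ligand_nM: float, kd_nM: float) -> float:
    """Fractional occupancy for simple one-site binding: [L] / (Kd + [L]).
    Assumes that free ligand is approximately equal to total ligand."""
    return ligand_nM / (kd_nM + ligand_nM)


if __name__ == "__main__":
    for toxin_nM in (1.0, 10.0, 100.0, 1000.0):  # illustrative concentrations
        occupancy = fraction_bound(toxin_nM, KD_POL_II_NM)
        print(f"{toxin_nM:7.1f} nM alpha-amanitin: {occupancy:.1%} of polymerase II bound")
```

In this model, half of the polymerase II molecules are occupied when the toxin concentration equals Kd, and occupancy approaches saturation at the 1 μM concentration at which polymerase III begins to be inhibited.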

Eukaryotic polymerases also differ from each other in 
the promoters to which they bind. Eukaryotic genes, like 
prokaryotic genes, require promoters for transcription ini- 
tiation. Like prokaryotic promoters, eukaryotic promot- 
ers consist of conserved sequences that serve to attract the 
polymerase to the start site. However, eukaryotic promot- 
ers differ distinctly in sequence and position, depending 
on the type of RNA polymerase to which they bind 
(Figure 29.22). 
1. RNA Polymerase I. The ribosomal DNA (rDNA)
transcribed by polymerase I is arranged in several hun- 
dred tandem repeats, each containing a copy of each of 
three rRNA genes. The promoter sequences are located in 
stretches of DNA separating the genes. At the transcrip- 
tional start site lies a TATA-like sequence called the ribo- 
somal initiator element (rInr). Farther upstream, 150 to 
200 bp from the start site, is the upstream promoter element 
(UPE). Both elements aid transcription by binding pro- 
Figure 29.22 Common eukaryotic promoter elements. Each eukaryotic RNA
polymerase recognizes a set of promoter elements, sequences in DNA that promote
transcription. The RNA polymerase I promoter consists of a ribosomal initiator
(rInr) and an upstream promoter element (UPE). The RNA polymerase II promoter
likewise includes an initiator element (Inr) and may also include either a TATA
box or a downstream promoter element (DPE). Separate from the promoter
region, enhancer elements bind specific transcription factors. RNA polymerase III
promoters consist of conserved sequences that lie within the transcribed genes.
2. RNA Polymerase II. Promoters for RNA polymerase
II, like prokaryotic promoters, include a set of consensus sequences that 
define the start site and recruit the polymerase. However, the promoter can 
contain any combination of a number of possible consensus sequences. 
Unique to eukaryotes, they also include enhancer elements that can be very 
distant (more than 1 kb) from the start site. 


3. RNA Polymerase III. Promoters for RNA polymerase III are within the 
transcribed sequence, downstream of the start site. There are two types of 
intragenic promoters for RNA polymerase III. Type I promoters, found in
the 5S rRNA gene, contain two short conserved sequences known as the 
A block and the C block. Type II promoters, found in tRNA genes, consist 
of two 11-bp sequences, the A block and the B block, situated about 15 bp 
from either end of the gene. 


Three common elements can be found in the RNA 
polymerase II promoter region 


RNA polymerase II transcribes all of the protein-coding genes in eukary- 
otic cells. Promoters for RNA polymerase II, like those for bacterial poly- 
merases, are generally located on the 5’ side of the start site for transcrip- 
tion. Because these sequences are on the same molecule of DNA as the genes 
being transcribed, they are called cis-acting elements. The most commonly 
recognized cis-acting element for genes transcribed by RNA polymerase II 
is called the TATA box on the basis of its consensus sequence (Figure 29.23). 
The TATA box is usually found between positions -30 and -100. Note
that the eukaryotic TATA box closely resembles the prokaryotic -10
sequence (TATAAT) but is farther from the start site. The mutation of a 
single base in the TATA box markedly impairs promoter activity. Thus, the 
precise sequence, not just a high content of AT pairs, is essential. 

The TATA box is often paired with an initiator element (Inr), a sequence 
found at the transcriptional start site, between positions -3 and +5. This
sequence defines the start site because the other promoter elements are 


5’ T82 A97 T93 A85 A63 A88 A50 3’
TATA box

5’ GGNCAATCT 3’
CAAT box

5’ GGGCGG 3’
GC box


Figure 29.23 TATA box. Comparisons of 
the sequences of more than 100 eukaryotic 
promoters led to the consensus sequence 
shown. The subscripts denote the frequency 
(%) of the base at that position. 


Figure 29.24 CAAT box and GC box. 
Consensus sequences for the CAAT and GC 
boxes of eukaryotic promoters for mRNA 
precursors. 


at variable distances from that site. Its presence increases transcriptional 
activity. 

A third element, the downstream core promoter element (DPE), is com- 
monly found in conjunction with the Inr in transcripts that lack the TATA 
box. In contrast with the TATA box, the DPE is found downstream of the 
start site, between positions +28 and +32. 

Additional regulatory sequences are located between -40 and -150.
Many promoters contain a CAAT box, and some contain a GC box
(Figure 29.24). Constitutive genes (genes that are continuously expressed
rather than regulated) tend to have GC boxes in their promoters. The
positions of these upstream sequences vary from one promoter to another, in
contrast with the quite constant location of the -35 region in prokaryotes.
Another difference is that the CAAT box and the GC box can be effective
when present on the template (antisense) strand, unlike the -35 region,
which must be present on the coding (sense) strand. These differences
between prokaryotes and eukaryotes correspond to fundamentally different
mechanisms for the recognition of cis-acting elements. The -10 and -35
sequences in prokaryotic promoters are binding sites for RNA polymerase
and its associated σ factor. In contrast, the TATA, CAAT, and GC boxes
and other cis-acting elements in eukaryotic promoters are recognized by
proteins other than RNA polymerase itself.
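
The strand independence of the CAAT and GC boxes can be made concrete with a short computational sketch. The Python below is an illustration only, not a method described in this chapter: it searches one strand of a DNA sequence for the two consensus sequences of Figure 29.24 and for their reverse complements, treating N as any base. The function names and the test sequence are hypothetical.

```python
import re

# Consensus sequences quoted in Figure 29.24; N stands for any base.
ELEMENTS = {
    "CAAT box": "GGNCAATCT",
    "GC box": "GGGCGG",
}

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the sequence of the opposite strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def consensus_to_regex(consensus):
    """Turn a consensus with N wildcards into a regular-expression body."""
    return consensus.replace("N", "[ACGT]")


def find_elements(coding_strand):
    """Return (name, strand, 0-based start) for every match on either strand."""
    seq = coding_strand.upper()
    hits = []
    for name, consensus in ELEMENTS.items():
        for strand, motif in (("coding", consensus),
                              ("template", reverse_complement(consensus))):
            # The lookahead lets overlapping matches be reported as well.
            pattern = re.compile("(?=" + consensus_to_regex(motif) + ")")
            for match in pattern.finditer(seq):
                hits.append((name, strand, match.start()))
    return sorted(hits, key=lambda hit: hit[2])


# Hypothetical sequence: a GC box on the coding strand and a CAAT box
# whose consensus reads 5' to 3' on the template strand.
print(find_elements("TTGGGCGGAAAGATTGACCTT"))
```

A hit labeled "template" means that the consensus reads 5’ to 3’ on the strand opposite the one supplied, which is the situation the text describes for elements that remain effective on the antisense strand.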


The TFIID protein complex initiates the assembly of the 
active transcription complex 


Cis-acting elements constitute only part of the puzzle of eukaryotic gene 
expression. Transcription factors that bind to these elements also are 
required. For example, RNA polymerase II is guided to the start site by a 
set of transcription factors known collectively as TFII (TF stands for tran- 
scription factor, and II refers to RNA polymerase II). Individual TFII 
factors are called TFIIA, TFIIB, and so on. 

In TATA-box promoters, the key initial event is the recognition of the 
TATA box by the TATA-box-binding protein (TBP), a 30-kd component 
of the 700-kd TFIID complex (Figure 29.25). In TATA-less promoters, 
other proteins in the TFIID complex bind the core promoter elements but, 
because less is known about these interactions, we will consider only the 
TATA-box-TBP binding interaction. TBP binds 10⁵ times as tightly to
the TATA box as to nonconsensus sequences; the dissociation constant of the 
TBP-TATA-box complex is approximately 1 nM. TBP is a saddle-shaped 
protein consisting of two similar domains (Figure 29.26). The TATA box of 
DNA binds to the concave surface of TBP. This binding induces large con- 
formational changes in the bound DNA. The double helix is substantially 
unwound to widen its minor groove, enabling it to make extensive contact 
with the antiparallel β strands on the concave side of TBP. Hydrophobic
interactions are prominent at this interface. Four phenylalanine residues, 
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Figure 29.25 Transcription initiation.
Transcription factors TFIIA, B, D, E, F, and H
are essential in initiating transcription by RNA
polymerase II. The step-by-step assembly of
these general transcription factors begins with 
the binding of TFIID (purple) to the TATA box. 
[The TATA-box-binding protein (TBP), a 
component of TFIID, recognizes the TATA box.] 
After assembly, TFIIH opens the DNA double 
helix and phosphorylates the carboxyl-terminal 
domain (CTD), allowing the polymerase to 
leave the promoter and begin transcription. 
The red arrow marks the transcription start 
site.


Figure 29.26 Complex formed by
TATA-box-binding protein and DNA. The 
saddlelike structure of the protein sits atop a 
DNA fragment. Notice that the DNA is 
significantly unwound and bent. [Drawn from 
1CDW.pdb] 
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for example, are intercalated between base pairs of the TATA box. The 
flexibility of AT-rich sequences is generally exploited here in bending 
the DNA. Immediately outside the TATA box, classical B-DNA resumes.
The TBP-TATA-box complex is distinctly asymmetric. The asymmetry is 
crucial for specifying a unique start site and ensuring that transcription 
proceeds unidirectionally. 
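
The binding figures quoted for TBP translate into simple numbers. The following Python sketch is illustrative only. It assumes a simple one-site binding equilibrium, a temperature of 298 K, and that binding "10⁵ times as tightly" corresponds to a 10⁵-fold ratio of dissociation constants; only the 1 nM dissociation constant comes from the text, and the free TBP concentration used in the example is hypothetical.

```python
import math

R = 8.314e-3  # gas constant in kJ mol^-1 K^-1
T = 298.0     # assumed temperature in kelvins

KD_TATA_NM = 1.0         # dissociation constant quoted in the text (nM)
SPECIFICITY_RATIO = 1e5  # assumed ratio Kd(nonconsensus) / Kd(TATA box)


def fraction_bound(free_tbp_nm, kd_nm):
    """Fraction of DNA sites occupied for simple one-site binding."""
    return free_tbp_nm / (free_tbp_nm + kd_nm)


def specificity_free_energy(ratio):
    """Free-energy difference (kJ/mol) corresponding to a ratio of Kd values."""
    return R * T * math.log(ratio)


free_tbp = 1.0  # hypothetical free TBP concentration in nM
print(fraction_bound(free_tbp, KD_TATA_NM))                      # TATA box
print(fraction_bound(free_tbp, KD_TATA_NM * SPECIFICITY_RATIO))  # nonconsensus DNA
print(round(specificity_free_energy(SPECIFICITY_RATIO), 1))      # kJ/mol
```

Under these assumptions, half of the TATA boxes would be occupied at 1 nM free TBP, whereas only about one nonconsensus site in 10⁵ would be, and the preference corresponds to roughly 28.5 kJ mol⁻¹ of binding free energy.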

TBP bound to the TATA box is the heart of the initiation complex (see 
Figure 29.25). The surface of the TBP saddle provides docking sites for the 
binding of other components. Additional transcription factors assemble on 
this nucleus in a defined sequence. TFIIA is recruited, followed by TFIIB; 
then TFIIF, RNA polymerase II, TFIIE, and TFIIH join the other factors
to form a complex called the basal transcription apparatus. During the for- 
mation of the basal transcription apparatus, the carboxyl-terminal domain 
(CTD) is unphosphorylated and plays a role in transcription regulation 
through its binding to an enhancer-associated complex called mediator (see 
Section 32.2). The phosphorylated CTD stabilizes transcription elongation 
by RNA polymerase II and recruits RNA-processing enzymes that act in 
the course of elongation. Phosphorylation of the CTD by TFIIH marks the 
transition from initiation to elongation. The importance of the carboxyl- 
terminal domain is highlighted by the finding that yeast containing mutant 
polymerase II with fewer than 10 repeats in the CTD is not viable. Most of 
the factors are released before the polymerase leaves the promoter and can 
then participate in another round of initiation. 


Multiple transcription factors interact with eukaryotic promoters 


The basal transcription complex described in the preceding section initiates 
transcription at a low frequency. Additional transcription factors that bind to 
other sites are required to achieve a high rate of mRNA synthesis. Their role 
is to selectively stimulate specific genes. Upstream stimulatory sites in 
eukaryotic genes are diverse in sequence and variable in position. Their 
variety suggests that they are recognized by many different specific pro- 
teins. Indeed, many transcription factors have been isolated, and their bind- 
ing sites have been identified by footprinting experiments. For example, 
heat-shock transcription factor (HSTF) is expressed in Drosophila after an 
abrupt increase in temperature. This 93-kd DNA-binding protein binds to 
the following consensus sequence: 


5’-CNNGAANNTCCNNG-3’


Several copies of this sequence, known as the heat-shock response element, 
are present starting at a site 15 bp upstream of the TATA box. 
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HSTF differs from σ³², a heat-shock protein of E. coli (p. 858), in binding
directly to response elements in heat-shock promoters rather than first 
becoming associated with RNA polymerase. 
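
A consensus sequence with N positions, such as the heat-shock response element, maps naturally onto a pattern search. The sketch below is a minimal illustration in Python, not a description of how the binding sites were identified (the text cites footprinting experiments). It scans only the strand supplied, and the function names and test sequences are hypothetical.

```python
import re

HSE_CONSENSUS = "CNNGAANNTCCNNG"  # from the text; N is any base


def consensus_pattern(consensus):
    """Compile a consensus into a regex; the lookahead permits overlapping hits."""
    body = consensus.replace("N", "[ACGT]")
    return re.compile("(?=" + body + ")")


def find_response_elements(dna):
    """Return the 0-based start of every consensus match on the given strand."""
    pattern = consensus_pattern(HSE_CONSENSUS)
    return [match.start() for match in pattern.finditer(dna.upper())]


# Two tandem copies of a hypothetical sequence that fits the consensus.
print(find_response_elements("CTAGAAGCTCCATG" * 2))
```

Changing any of the fixed positions (for example, GAA to GTA) removes the match, whereas changes at the N positions do not.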


Enhancer sequences can stimulate transcription at start sites 
thousands of bases away 


The activities of many promoters in higher eukaryotes are greatly increased 
by another type of cis-acting element called an enhancer. Enhancer sequences 
have no promoter activity of their own yet can exert their stimulatory actions 


over distances of several thousand base pairs. They can be upstream, 
downstream, or even in the midst of a transcribed gene. Moreover, enhancers 
are effective when present on either DNA strand (equivalently, in either 
orientation). 


A particular enhancer is effective only in certain cells. For example, the
immunoglobulin enhancer functions in B lymphocytes but not 
elsewhere. Cancer can result if the relation between genes and enhancers is 
disrupted. In Burkitt lymphoma and B-cell leukemia, a chromosomal trans- 
location brings the proto-oncogene myc (a transcription factor itself) under the 
control of a powerful immunoglobulin enhancer. The consequent dysregulation
of the myc gene is believed to play a role in the progression of the cancer. 

Transcription factors and other proteins that bind to regulatory sites on 
DNA can be regarded as passwords that cooperatively open multiple locks, giv- 
ing RNA polymerase access to specific genes. The discovery of promoters and 
enhancers has allowed us to gain a better understanding of how genes are 
selectively expressed in eukaryotic cells. The regulation of gene transcrip- 
tion, discussed in Chapter 32, is the fundamental means of controlling gene 
expression. 


Although bacteria lack TBP, archaea utilize a TBP molecule that is
structurally quite similar to the eukaryotic protein. In fact, transcrip- 
tional control processes in archaea are, in general, much more similar to those 
in eukaryotes than are the processes in bacteria. Many components of the 
eukaryotic transcriptional machinery evolved from an ancestor of archaea. 


29.3 The Transcription Products of Eukaryotic 
Polymerases Are Processed 


Virtually all the initial products of transcription are further processed in 
eukaryotes. For example, primary transcripts (pre-mRNA molecules), the 
products of RNA polymerase II action, acquire a cap at their 5’ ends and a
poly(A) tail at their 3’ ends. Most importantly, nearly all mRNA precursors
in higher eukaryotes are spliced. Introns are precisely excised from primary 
transcripts, and exons are joined to form mature mRNAs with continuous 
messages. Some mature mRNAs are only a tenth the size of their precursors, 
which can be as large as 30 kb or more. The pattern of splicing can be regu- 
lated in the course of development to generate variations on a theme, such 
as membrane-bound or secreted forms of antibody molecules. Alternative 
splicing enlarges the repertoire of proteins in eukaryotes and is one clear 
illustration of why the proteome is more complex than the genome. The 
particular processing steps and the factors taking part vary according to the 
type of RNA polymerase. 


RNA polymerase I produces three ribosomal RNAs


Several RNA molecules are key components of ribosomes. RNA polymerase 
I transcription produces a single precursor (45S in mammals) that encodes
three RNA components of the ribosome: the 18S rRNA, the 28S rRNA,
and the 5.8S rRNA (Figure 29.27). The 18S rRNA is the RNA component
of the small ribosomal subunit (40S), and the 28S and 5.8S rRNAs are two
RNA components of the large ribosomal subunit (60S). The other RNA
component of the large ribosomal subunit, the 5S rRNA, is transcribed by
RNA polymerase III as a separate transcript. 

The cleavage of the precursor into three separate rRNAs is actually the 
final step in its processing. First, the nucleotides of the pre-rRNA sequences
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Figure 29.27 Processing of eukaryotic 
pre-rRNA. The mammalian pre-rRNA 
transcript contains the RNA sequences 
destined to become the 18S, 5.8S, and 28S 
rRNAs of the small and large ribosomal
subunits. First, nucleotides are modified: small 
nucleolar ribonucleoproteins methylate 
specific ribose groups and convert selected 
uridines into pseudouridines (indicated by red 
lines). Next, the pre-rRNA is cleaved and 
packaged to form mature ribosomes, in a 
highly regulated process in which more than 
200 proteins take part. 
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Pre-rRNA 


SSU 
processome 


Figure 29.28 Visualization of rRNA 
transcription and processing in 
eukaryotes. Transcription of rRNA and its 
assembly into precursor ribosomes can be 
visualized by electron microscopy. The 
structures resemble Christmas trees: the trunk 
is the rDNA and each branch is a pre-rRNA 
transcript. Transcription starts at the top of the 
tree, where the shortest transcripts can be 
seen, and progresses down the rDNA to the 
end of the gene. The terminal knobs visible at 
the end of some pre-rRNA transcripts likely 
correspond to the SSU processome, a large 
ribonucleoprotein required for processing the 
pre-rRNA. [From F. Dragon et al. Nature 
417:967-970, 2002.] 
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destined for the ribosome undergo extensive modification, on both ribose 
and base components, directed by many small nucleolar ribonucleoproteins 
(snoRNPs), each of which consists of one snoRNA and several proteins. 
The pre-rRNA is assembled with ribosomal proteins, as guided by process- 
ing factors, in a large ribonucleoprotein. For instance, the small-subunit 
(SSU) processome is required for 18S rRNA synthesis and can be visualized
in electron micrographs as a terminal knob at the 5’ ends of the nascent 
rRNAs (Figure 29.28). Finally, rRNA cleavage (sometimes coupled with
additional processing steps) releases the mature rRNAs assembled with 
ribosomal proteins as ribosomes. Like those of RNA polymerase I 
transcription itself, most of these processing steps take place in the cell 
nucleolus, a nuclear subcompartment. 


RNA polymerase III produces transfer RNA 


Eukaryotic tRNA transcripts are among the most processed of all RNA 
polymerase III transcripts. Like those of prokaryotic tRNAs, the 5’ leader 
is cleaved by RNase P, the 3’ trailer is removed, and CCA is added by the 
CCA-adding enzyme (Figure 29.29). Eukaryotic tRNAs are also heavily 
modified on base and ribose moieties; these modifications are important 
for function. In contrast with prokaryotic tRNAs, many eukaryotic pre- 
tRNAs are also spliced by an endonuclease and a ligase to remove an 
intron. 


The product of RNA polymerase II, the pre-mRNA transcript, 
acquires a 5’ cap and a 3’ poly(A) tail 


Perhaps the most extensively studied transcription product is the product of 
RNA polymerase II: most of this RNA will be processed to mRNA. The 
immediate product of RNA polymerase II is sometimes referred to as 
precursor-to-messenger RNA, or pre-mRNA. Most pre-mRNA molecules 
are spliced to remove the introns. Moreover, both the 5’ and the 3’ ends are 
modified, and both modifications are retained as the pre-mRNA is con- 
verted into mRNA. 

As in prokaryotes, eukaryotic transcription usually begins with A or G. 
However, the 5’ triphosphate end of the nascent RNA chain is immediately 
modified. First, a phosphoryl group is released by hydrolysis. The
diphosphate 5’ end then attacks the α-phosphorus atom of GTP to form a very
unusual 5’-5’ triphosphate linkage. This distinctive terminus is called a cap
(Figure 29.30). The N-7 nitrogen of the terminal guanine is then methylated 
by S-adenosylmethionine to form cap 0. The adjacent riboses may be 
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Figure 29.29 Transfer RNA precursor processing. The conversion of a yeast tRNA precursor 
into a mature tRNA requires the removal of a 14-nucleotide intron (yellow), the cleavage of a 5’ 
leader (green), and the removal of UU and the attachment of CCA at the 3’ end (red). In addition, 
several bases are modified. 


methylated to form cap 1 or cap 2. Transfer RNA and ribosomal RNA 
molecules, in contrast with messenger RNAs and with small RNAs that 
participate in splicing, do not have caps. Caps contribute to the stability of 
mRNAs by protecting their 5’ ends from phosphatases and nucleases. In 
addition, caps enhance the translation of mRNA by eukaryotic protein- 
synthesizing systems. 

As mentioned earlier, pre-mRNA is also modified at the 3’ end. Most 
eukaryotic mRNAs contain a polyadenylate, poly(A), tail at that end, added 
after transcription has ended. The DNA template does not encode this 
poly(A) tail. Indeed, the nucleotide preceding poly(A) is not the last nucle- 
otide to be transcribed. Some primary transcripts contain hundreds of 
nucleotides beyond the 3’ end of the mature mRNA. 

How is the 3’ end of the pre-mRNA given its final form? Eukaryotic 
primary transcripts are cleaved by a specific endonuclease that recognizes the 
sequence AAUAAA (Figure 29.31). Cleavage does not take place if this 
sequence or a segment of some 20 nucleotides on its 3’ side is deleted. The 
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Figure 29.30 Capping the 5’ end. Caps at 
the 5’ end of eukaryotic mRNA include 
7-methylguanylate (red) attached by a 
triphosphate linkage to the ribose at the 
5' end. None of the riboses are methylated in 
cap 0, one is methylated in cap 1, and both
are methylated in cap 2. 


Figure 29.31 Polyadenylation of a 
primary transcript. A specific endonuclease 
cleaves the RNA downstream of AAUAAA. 
Poly(A) polymerase then adds about 250 
adenylate residues. 
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Figure 29.32 Small regulatory RNA 
production. A pathway from a transcription 
product including a microRNA to the mature 
microRNA bound to an Argonaute protein. 
The initial transcription product, a pri- 
microRNA, is first cleaved to a small double- 
stranded RNA called a pre-microRNA. One of 
the strands of the pre-microRNA, the mature 
microRNA, is then bound by
an Argonaute protein.


presence of internal AAUAAA sequences in some mature mRNAs indi- 
cates that AAUAAA is only part of the cleavage signal; its context also is 
important. After cleavage of the pre-RNA by the endonuclease, a poly(A) 
polymerase adds about 250 adenylate residues to the 3’ end of the transcript; 
ATP is the donor in this reaction. 
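
The two steps of 3’-end formation, cleavage and then tail addition, can be sketched as a string operation. The Python below is a deliberately simplified illustration. It uses the first AAUAAA it finds, although the text notes that sequence context also matters; it requires the 20 nucleotides on the 3’ side of the signal to be present; and it cuts at a hypothetical fixed offset downstream of the signal, because the text states only that cleavage is downstream. The function name and parameters are assumptions.

```python
POLY_A_SIGNAL = "AAUAAA"
DOWNSTREAM_CONTEXT = 20  # nucleotides 3' of the signal required for cleavage


def polyadenylate(pre_mrna, cleavage_offset=15, tail_length=250):
    """Cleave downstream of AAUAAA and append a poly(A) tail.

    cleavage_offset is a hypothetical distance (in nucleotides) from the end
    of the signal to the cut. Returns None when cleavage would not occur:
    no signal, or too little sequence on its 3' side.
    """
    if not 0 <= cleavage_offset <= DOWNSTREAM_CONTEXT:
        raise ValueError("cleavage_offset must lie within the downstream context")
    start = pre_mrna.find(POLY_A_SIGNAL)
    if start == -1:
        return None
    signal_end = start + len(POLY_A_SIGNAL)
    if len(pre_mrna) - signal_end < DOWNSTREAM_CONTEXT:
        return None
    # Sequence beyond the cut is discarded; the tail is not template encoded.
    return pre_mrna[:signal_end + cleavage_offset] + "A" * tail_length


transcript = "GGCC" + POLY_A_SIGNAL + "C" * 30  # hypothetical primary transcript
print(polyadenylate(transcript))
```

The discarded 3’ portion of the transcript in this sketch corresponds to the nucleotides that the text says are transcribed beyond the end of the mature mRNA.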

The role of the poly(A) tail is still not firmly established despite much 
effort. However, evidence is accumulating that it enhances translation effi- 
ciency and the stability of mRNA. Blocking the synthesis of the poly(A) 
tail by exposure to 3’-deoxyadenosine (cordycepin) does not interfere with the 
synthesis of the primary transcript. Messenger RNA devoid of a poly(A) 
tail can be transported out of the nucleus. However, an mRNA molecule 
devoid of a poly(A) tail is usually a much less effective template for protein 
synthesis than is one with a poly(A) tail. Indeed, some mRNAs are stored 
in an unadenylated form and receive the poly(A) tail only when translation 
is imminent. The half-life of an mRNA molecule may be determined in part
by the rate of degradation of its poly(A) tail. 


Small regulatory RNAs are cleaved from larger precursors 


Cleavage plays a role in the processing of small single-stranded RNAs 
(approximately 20-23 nucleotides) called microRNAs. MicroRNAs play 
key roles in gene regulation in eukaryotes, as we shall see in Chapter 32. 
They are generated from initial transcripts produced by RNA polymerase II 
and, in some cases, RNA polymerase III. These transcripts fold into hair- 
pin structures that are cleaved by specific nucleases at various stages 
(Figure 29.32). The final single-stranded RNAs are bound by members of 
the Argonaute family of proteins to exert their roles on controlling gene 
expression. 
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RNA editing changes the proteins encoded by mRNA 


Remarkably, the amino acid sequence information encoded by some 
mRNAs is altered after transcription. RNA editing is the term for a change 
in the nucleotide sequence of RNA after transcription by processes other 
than RNA splicing. RNA editing is prominent in some systems already 
discussed. Apolipoprotein B (apo B) plays an important role in the transport 
of triacylglycerols and cholesterol by forming an amphipathic spherical 
shell around the lipids carried in lipoprotein particles (Section 26.3). Apo B 
exists in two forms, a 512-kd apo B-100 and a 240-kd apo B-48. The larger 
form, synthesized by the liver, participates in the transport of lipids synthe- 
sized in the cell. The smaller form, synthesized by the small intestine, 
carries dietary fat in the form of chylomicrons. Apo B-48 contains the 2152 
N-terminal residues of the 4536-residue apo B-100. This truncated molecule 
can form lipoprotein particles but cannot bind to the low-density-lipoprotein 
receptor on cell surfaces. What is the relationship between these two forms 
of apo B? Experiments revealed that a totally unexpected mechanism for 
generating diversity is at work: the changing of the nucleotide sequence of 
mRNA after its synthesis (Figure 29.33). A specific cytidine residue of mRNA 


is deaminated to uridine, which changes the codon at residue 2153 from CAA 
(Gln) to UAA (stop). The deaminase that catalyzes this reaction is present in 
the small intestine, but not in the liver, and is expressed only at certain 
developmental stages. 

RNA editing is not confined to apolipoprotein B. Glutamate opens 
cation-specific channels in the vertebrate central nervous system by binding 
to receptors in postsynaptic membranes. RNA editing changes a single 
glutamine codon (CAG) in the mRNA for the glutamate receptor to 
the codon for arginine (read as CGG). The substitution of Arg for Gln in the 
receptor prevents Ca²⁺, but not Na⁺, from flowing through the channel.
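
Both edits described above can be checked against the standard genetic code. The Python sketch below builds the codon table, translates a short hypothetical mRNA (not the apo B sequence), and shows how deaminating a single cytidine converts a glutamine codon into a stop codon. The function names and the toy sequence are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, listed with the bases in the order U, C, A, G.
BASES = "UCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(mrna):
    """Translate from the first base, stopping at the first stop codon (*)."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def deaminate_cytidine(mrna, index):
    """Return the mRNA with the C at the 0-based index converted into U."""
    if mrna[index] != "C":
        raise ValueError("only cytidine can be deaminated to uridine")
    return mrna[:index] + "U" + mrna[index + 1:]


toy_mrna = "AUGGCUCAAGGUUAA"               # hypothetical: Met-Ala-Gln-Gly-stop
edited = deaminate_cytidine(toy_mrna, 6)   # CAA (Gln) becomes UAA (stop)
print(translate(toy_mrna), translate(edited))
print(CODON_TABLE["CAG"], CODON_TABLE["CGG"])  # glutamate-receptor edit
```

The unedited toy message yields a four-residue peptide and the edited one a two-residue peptide, mirroring on a small scale the truncation of apo B-100 to apo B-48.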

RNA editing is likely much more common than was formerly thought. 
The chemical reactivity of nucleotide bases, including the susceptibility to 
deamination that necessitates complex DNA-repair mechanisms, has been 
harnessed as an engine for generating molecular diversity at the RNA and, 
hence, protein levels. 

In trypanosomes (parasitic protozoans), a different kind of RNA editing 
markedly changes several mitochondrial mRNAs. Nearly half the uridine 
residues in these mRNAs are inserted by RNA editing. A guide RNA 
molecule identifies the sequences to be modified, and a poly(U) tail on the 
guide donates uridine residues to the mRNAs undergoing editing. DNA 
sequences evidently do not always faithfully disclose the sequence of 
encoded proteins: functionally crucial changes to mRNA can take place. 


Sequences at the ends of introns specify splice sites in 
mRNA precursors 


Most genes in higher eukaryotes are composed of exons and introns 
(Section 4.7). The introns must be excised and the exons must be linked to 
form the final mRNA in a process called RNA splicing. This splicing must be
exquisitely sensitive: splicing just one nucleotide upstream or downstream 
of the intended site would create a one-nucleotide shift, which would alter 
the reading frame on the 3’ side of the splice to give an entirely different 
amino acid sequence, likely including a premature stop codon. Thus, the 
correct splice site must be clearly marked. Does a particular sequence 
denote the splice site? The sequences of thousands of intron-exon junctions
within RNA transcripts are known. In eukaryotes from yeast to mammals, 
these sequences have a common structural motif: the intron begins with GU 
and ends with AG. The consensus sequence at the 5’ splice site in vertebrates is
AGGUAAGU, where the GU is invariant (Figure 29.34). At the 3’ end of 
an intron, the consensus sequence is a stretch of 10 pyrimidines (U or C; 
termed the polypyrimidine tract), followed by any base and then by C, and 
ending with the invariant AG. Introns also have an important internal site 
located between 20 and 50 nucleotides upstream of the 3’ splice site; it is 
called the branch site for reasons that will be evident shortly. In yeast, the 
branch-site sequence is nearly always UACUAAC, whereas, in mammals, 
a variety of sequences are found. 
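
The consequence of missing a splice site by a single nucleotide is easy to demonstrate. The following Python sketch is illustrative only: the exons and intron are short hypothetical sequences built to fit the consensus features described above (GU at the 5’ end of the intron; 10 pyrimidines, any base, C, and AG at the 3’ end), and real introns are far longer. The function names are assumptions.

```python
import re

# Simplified patterns based on the consensus features described in the text.
FIVE_PRIME = re.compile(r"^GU")                  # invariant GU starts the intron
THREE_PRIME = re.compile(r"[CU]{10}[ACGU]CAG$")  # Py tract, any base, C, invariant AG


def looks_like_intron(intron):
    """Check the two ends of a candidate intron against the patterns."""
    return bool(FIVE_PRIME.search(intron)) and bool(THREE_PRIME.search(intron))


def splice(pre_mrna, intron_start, intron_end):
    """Remove pre_mrna[intron_start:intron_end] (0-based, end-exclusive)."""
    return pre_mrna[:intron_start] + pre_mrna[intron_end:]


def codons(mrna):
    """Split a message into complete codons, reading from the first base."""
    usable = len(mrna) - len(mrna) % 3
    return [mrna[i:i + 3] for i in range(0, usable, 3)]


EXON1 = "AUGGCC"
INTRON = "GUAAGU" + "UACUAAC" + "UCUUUUCUUC" + "A" + "C" + "AG"
EXON2 = "GAACGUUAA"
PRE_MRNA = EXON1 + INTRON + EXON2

start = len(EXON1)
end = start + len(INTRON)
print(looks_like_intron(INTRON))
print(codons(splice(PRE_MRNA, start, end)))      # correct 3' splice site
print(codons(splice(PRE_MRNA, start, end + 1)))  # one nucleotide too far
```

With the correct site, the downstream codons are GAA, CGU, and the stop codon UAA; with the site displaced by one nucleotide, they become AAC and GUU, and the stop codon is no longer in frame.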

The 5’ and 3’ splice sites and the branch site are essential for determin- 
ing where splicing takes place. Mutations in each of these three critical 
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Figure 29.34 Splice sites. Consensus sequences for the 5’ splice site and the 3’ splice site are 
shown. Py stands for pyrimidine. 
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Figure 29.33 RNA editing. Enzyme- 
catalyzed deamination of a specific cytidine 
residue in the mRNA for apolipoprotein B-100 
changes a codon for glutamine (CAA) to a 
stop codon (UAA). Apolipoprotein B-48, a 
truncated version of the protein lacking the 
LDL receptor-binding domain, is generated by 
this posttranscriptional change in the mRNA 
sequence. [After P. Hodges and J. Scott. 
Trends Biochem. Sci. 17:77, 1992.] 
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Figure 29.35 Splicing mechanism used 
for mRNA precursors. The upstream (5’) 
exon is shown in blue, the downstream (3’) 
exon in green, and the branch site in yellow. 
Y stands for a pyrimidine nucleotide, R for a 
purine nucleotide, and N for any nucleotide. 
The 5’ splice site is attacked by the 2’-OH 
group of the branch-site adenosine residue. 
The 3’ splice site is attacked by the newly 
formed 3'-OH group of the upstream exon. 
The exons are joined, and the intron is 
released in the form of a lariat. [After P. A. 
Sharp. Cell 42:397-408, 1985] 
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regions lead to aberrant splicing. Introns vary in length from 50 to 10,000 
nucleotides, and so the splicing machinery may have to find the 3’ site 
several thousand nucleotides away. Specific sequences near the splice 
sites (in both the introns and the exons) play an important role in splicing 
regulation, particularly in designating splice sites when there are many 
alternatives (p. 878). Researchers are currently attempting to determine the 
factors that contribute to splice-site selection for individual mRNAs. 
Despite our knowledge of splice-site sequences, predicting pre-mRNAs 
and their protein products from genomic DNA sequence information 
remains a challenge. 


Splicing consists of two sequential transesterification reactions 


The splicing of nascent mRNA molecules is a complicated process. It 
requires the cooperation of several small RNAs and proteins that form a 
large complex called a spliceosome. However, the chemistry of the splicing 
process is simple. Splicing begins with the cleavage of the phosphodiester 
bond between the upstream exon (exon 1) and the 5’ end of the intron 
(Figure 29.35). The attacking group in this reaction is the 2’-OH group of 
an adenylate residue in the branch site. A 2’-5’ phosphodiester bond is
formed between this A residue and the 5’ terminal phosphate of the intron. 
This reaction is a transesterification. 
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Note that this adenylate residue is also joined to two other nucleotides by 
normal 3’-5’ phosphodiester bonds (Figure 29.36). Hence a branch is
generated at this site, and a lariat intermediate is formed.
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Figure 29.36 Splicing branch point. The structure 
of the branch point in the lariat intermediate in which
the adenylate residue is joined to three nucleotides
by phosphodiester bonds. The new 2’-to-5’ linkage is
shown in red, and the usual 3’-to-5’ linkages are
shown in blue.


The 3’-OH terminus of exon 1 then attacks the phosphodiester bond 
between the intron and exon 2. Exons 1 and 2 become joined, and the intron 
is released in lariat form. Again, this reaction is a transesterification. Splicing 
is thus accomplished by two transesterification reactions rather than by 
hydrolysis followed by ligation. The first reaction generates a free 3’-OH 
group at the 3’ end of exon 1, and the second reaction links this group to the 
5’-phosphate of exon 2. The number of phosphodiester bonds stays the same 
during these steps, which is crucial because it allows the splicing reaction 
itself to proceed without an energy source such as ATP or GTP. 
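
The bookkeeping behind this statement can be checked with a few lines of arithmetic. The Python sketch below is illustrative. It assumes that each RNA species is otherwise linear, so that a chain of n nucleotides contains n - 1 phosphodiester bonds, and that the only additional bond is the 2’-5’ linkage at the branch site. The function names and segment lengths are hypothetical.

```python
def linear_bonds(n_nucleotides):
    """Phosphodiester bonds in a linear chain of n nucleotides."""
    return max(n_nucleotides - 1, 0)


def splicing_bond_counts(exon1, intron, exon2):
    """Total phosphodiester bonds before splicing and after each step.

    Arguments are segment lengths in nucleotides (each at least 1).
    """
    if min(exon1, intron, exon2) < 1:
        raise ValueError("each segment must contain at least one nucleotide")

    # Before splicing: one linear pre-mRNA.
    before = linear_bonds(exon1 + intron + exon2)

    # After step 1: free exon 1, plus the intron-exon 2 lariat intermediate,
    # which carries one extra 2'-5' bond at the branch site.
    after_step1 = linear_bonds(exon1) + linear_bonds(intron + exon2) + 1

    # After step 2: joined exons, plus the released lariat intron, which
    # keeps the 2'-5' branch bond.
    after_step2 = linear_bonds(exon1 + exon2) + linear_bonds(intron) + 1

    return before, after_step1, after_step2


print(splicing_bond_counts(100, 500, 150))
```

The three totals are equal for any choice of lengths: each transesterification breaks one phosphodiester bond and forms another, which is why the chemistry itself needs no input of ATP or GTP.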


Small nuclear RNAs in spliceosomes catalyze the 
splicing of mRNA precursors 


The nucleus contains many types of small RNA molecules with fewer than 
300 nucleotides, referred to as snRNAs (small nuclear RNAs). A few of 
them—designated U1, U2, U4, U5, and U6—are essential for splicing 
mRNA precursors. The secondary structures of these RNAs are highly 
conserved in organisms ranging from yeast to human beings. These RNA 
molecules are associated with specific proteins to form complexes termed 
snRNPs (small nuclear ribonucleoprotein particles); investigators often 
speak of them as “snurps.” Spliceosomes are large (60S) dynamic assemblies 
composed of snRNPs, hundreds of other proteins called splicing factors, and 
the mRNA precursors being processed (Table 29.3).

In mammalian cells, splicing begins with the recognition of the 5’ splice 
site by the U1 snRNP (Figure 29.37). U1 snRNA contains a highly con- 
served six-nucleotide sequence, not covered by protein in the snRNP, that 


Figure 29.37 Spliceosome assembly and action. U1 binds the 5’ splice site and U2 binds
to the branch point. A preformed U4-U5-U6 complex then joins the assembly to form the
complete spliceosome. The U6 snRNA re-folds and binds the 5’ splice site, displacing U1. 
Extensive interactions between U6 and U2 displace U4. Then, in the first transesterification step, 
the branch-site adenosine attacks the 5’ splice site, making a lariat intermediate. U5 holds the two 
exons in close proximity, and the second transesterification takes place, with the 5’ splice-site 
hydroxyl group attacking the 3’ splice site. These reactions result in the mature spliced mRNA and 
a lariat form of the intron bound by U2, U5, and U6. [After T. Villa, J. A. Pleiss, and C. Guthrie, Cell
109:149-152, 2002.] 
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Table 29.3 Small nuclear ribonucleoprotein particles (snRNPs) in the splicing of
mRNA precursors


snRNP   Size of snRNA (nucleotides)   Role
U1      165                           Binds the 5’ splice site
U2      185                           Binds the branch site and forms part of the catalytic center
U5      116                           Binds the 5’ splice site and then the 3’ splice site
U4      145                           Masks the catalytic activity of U6
U6      106                           Catalyzes splicing


base-pairs to the 5’ splice site of the pre-mRNA. This binding initiates 
spliceosome assembly on the pre-mRNA molecule. 


[Diagram: U1 snRNA base-paired with the 5’ splice site of the mRNA precursor]


U2 snRNP then binds the branch site in the intron by base-pairing 
between a highly conserved sequence in U2 snRNA and the pre-mRNA. 
U2 snRNP binding requires ATP hydrolysis. A preassembled U4-U5-U6 
tri-snRNP joins this complex of U1, U2, and the mRNA precursor to form 
the spliceosome. This association also requires ATP hydrolysis. 

A revealing view of the interplay of RNA molecules in this assembly 
came from examining the pattern of cross-links formed by psoralen, a 
reagent that joins neighboring pyrimidines in base-paired regions on 
treatment with light. These cross-links suggest that splicing takes place in 
the following way. First, U5 interacts with exon sequences in the 5’ splice 
site and subsequently with the 3’ exon. Next, U6 disengages from U4 and 
undergoes an intramolecular rearrangement that permits
base-pairing with U2 as well as interaction with
the 5’ end of the intron, displacing U1 from the spliceosome.
The U2-U6 helix is indispensable for splicing,
suggesting that U2 and U6 snRNAs probably form
the catalytic center of the spliceosome (Figure 29.38). U4
serves as an inhibitor that masks U6 until the specific
splice sites are aligned. These rearrangements result in
the first transesterification reaction, cleaving the 5’ exon
and generating the lariat intermediate.
Figure 29.38 Splicing catalytic center.
The catalytic center of the spliceosome is 
formed by U2 snRNA (red) and U6 snRNA 
(green), which are base paired. U2 is also 
base paired to the branch site of the mRNA 
precursor. [After H. D. Madhani and C. 
Guthrie. Cell 71:803-817, 1992.] 


generate the spliced product. U2, U5, and U6 bound to 
the excised lariat intron are released to complete the 
splicing reaction. 

Many of the steps in the splicing process require ATP hydrolysis. How 
is the free energy associated with ATP hydrolysis used to power splicing? 
To achieve the well-ordered rearrangements necessary for splicing, ATP- 
powered RNA helicases must unwind RNA helices and allow alternative 
base-pairing arrangements to form. Thus, two features of the splicing process
are noteworthy. First, RNA molecules play key roles in directing the alignment
of splice sites and in carrying out catalysis. Second, ATP-powered helicases
unwind RNA duplex intermediates that facilitate catalysis and induce the
release of snRNPs from the mRNA.


Transcription and processing of mRNA are coupled 


Although the transcription and processing of mRNAs have been described 
herein as separate events in gene expression, experimental evidence suggests 
that the two steps are coordinated by the carboxyl-terminal domain of RNA 
polymerase II. We have seen that the CTD consists of a unique repeated 
seven-amino-acid sequence, YSPTSPS. Either S₂ or S₅ or both may be phosphorylated
in the various repeats. The phosphorylation state
of the CTD is controlled by a number of kinases and phosphatases
and leads the CTD to bind many of the proteins
having roles in RNA transcription and processing. The
CTD contributes to efficient transcription by recruiting
these proteins to the pre-mRNA (Figure 29.39), including:


1. capping enzymes, which methylate the 5’ guanine on 
the pre-mRNA immediately after transcription begins; 


2. components of the splicing machinery, which initiate 
the excision of each intron as it is synthesized; and 


3. an endonuclease that cleaves the transcript at the 
poly(A) addition site, creating a free 3'-OH group that is 
the target for 3’ adenylation. 


These events take place sequentially, directed by the 
phosphorylation state of the CTD. 


Mutations that affect pre-mRNA splicing cause disease 


Mutations in either the pre-mRNA (cis-acting) or
the splicing factors (trans-acting) can cause defective
pre-mRNA splicing. Mutations in the pre-mRNA
cause some forms of thalassemia, a group of hereditary
anemias characterized by the defective synthesis of hemoglobin
(Section 7.4). Cis-acting mutations that cause aberrant
splicing can occur at the 5’ or 3’ splice sites in either of
the two introns of the hemoglobin β chain or in its exons.
The mutations usually result in an incorrectly spliced pre- 
mRNA that, because of a premature stop codon, cannot 
encode a full-length protein. The defective mRNA is nor- 
mally degraded rather than translated. Mutations in the 5’ 
splice site may alter that site such that the splicing machin- 
ery cannot recognize it, forcing the machinery to find 
another 5’ splice site in the intron and introducing the 
potential for a premature stop codon. Mutations in the intron itself may cre- 
ate a new 5’ splice site; in this case, either one of the two splice sites may be 
recognized (Figure 29.40). Consequently, some normal protein can be 
made, and so the disease is less severe. Mutations affecting splicing have been 
estimated to cause at least 15% of all genetic diseases. 
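
The effect of retaining intron sequence on the reading frame can be illustrated with a minimal sketch. The sequences below are hypothetical toy strings chosen for illustration, not the real β-globin gene, and the function name `first_stop_codon` is an assumption of this example. The sketch scans each mRNA in frame from its first nucleotide and reports the index of the first stop codon.

```python
from typing import Optional

STOP_CODONS = {"UAA", "UAG", "UGA"}


def first_stop_codon(mrna: str) -> Optional[int]:
    """Return the 0-based index of the first in-frame stop codon, or None."""
    for start in range(0, len(mrna) - 2, 3):
        if mrna[start:start + 3] in STOP_CODONS:
            return start // 3
    return None


# Hypothetical toy sequences, not the real beta-globin gene.
EXON_1 = "AUGGUGCAC"        # three codons; the reading frame starts at AUG
RETAINED_INTRON = "GUAUGA"  # intron segment kept when a new 5' splice site is used
EXON_2 = "CUGACUCCUUAA"     # ends with the normal stop codon UAA

normal_mrna = EXON_1 + EXON_2
aberrant_mrna = EXON_1 + RETAINED_INTRON + EXON_2

print(first_stop_codon(normal_mrna))    # codon index of the normal stop
print(first_stop_codon(aberrant_mrna))  # smaller index: a premature stop
```

In this toy case the retained segment happens to contain an in-frame UGA, so the aberrant message terminates before exon 2 is read. Real retained sequences may instead shift the reading frame, which usually exposes a stop codon further downstream.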

Disease-causing mutations may also appear in splicing factors. Retinitis 
pigmentosa is a disease of acquired blindness, first described in 1857, with 
an incidence of 1/3500. About 5% of the autosomal dominant form of retinitis 
pigmentosa is likely due to mutations in the hPrp8 protein, a pre-mRNA 
Figure 29.39 The CTD: Coupling
transcription to pre-mRNA processing.
The transcription factor TFIIH phosphorylates
the carboxyl-terminal domain (CTD) of RNA
polymerase II, signaling the transition from
transcription initiation to elongation. The 
phosphorylated CTD binds factors required for 
pre-mRNA capping, splicing, and 
polyadenylation. These proteins are brought in 
close proximity to their sites of action on the 
nascent pre-mRNA as it is transcribed during 
elongation. [After P. A. Sharp. TIBS 30:279-281, 2005.]


Figure 29.40 A splicing mutation that 
causes thalassemia. An A-to-G mutation 
within the first intron of the gene for the 
human hemoglobin β chain creates a new 5’
splice site (GU). Both 5’ splice sites are 
recognized by the U1 snRNP; so splicing may 
sometimes create a normal mature mRNA 
and an abnormal mature mRNA that contains 
intron sequences. The normal mature mRNA
is translated into a hemoglobin β chain.
Because it includes intron sequences, the 
abnormal mature mRNA now has a premature 
stop codon and is degraded. 


Figure 29.41 An example of alternative 
splicing. In human beings, two very different 
hormones are produced from a single 
calcitonin/CGRP pre-mRNA. Alternative 
splicing produces the mature mRNA for either 
calcitonin or CGRP (calcitonin-gene-related
peptide), depending on the cell type in which
the gene is expressed. Each alternative 
transcript incorporates one of two alternative 
polyadenylation signals (A) present in the pre- 
mRNA. 
splicing factor that is a component of the U4-U5-U6 tri-snRNP. How a
mutation in a splicing factor that is present in all cells causes disease only in 
the retina is not clear; nevertheless, retinitis pigmentosa is a good example 
of how mutations that disrupt spliceosome function can cause disease. 


Most human pre-mRNAs can be spliced in alternative 
ways to yield different proteins 


Alternative splicing is a widespread mechanism for generating protein diversity. 
Different combinations of exons from the same gene may be spliced into a 
mature RNA, producing distinct forms of a protein for specific tissues, 
developmental stages, or signaling pathways. What controls which splicing 
sites are selected? The selection is determined by the binding of trans-acting 
splicing factors to cis-acting sequences in the pre-mRNA. Most alternative 
splicing leads to changes in the coding sequence, resulting in proteins with 
different functions. Alternative splicing provides a powerful mechanism for 
expanding the versatility of genomic sequences through combinatorial control. 
Consider a gene with five positions at which splicing can take place. With 
the assumption that these alternative splicing pathways can be regulated 
independently, a total of $2^5 = 32$ different mRNAs can be generated.
Sequencing of the human genome has revealed that most pre-mRNAs
are alternatively spliced, leading to a much greater number of proteins than 
would be predicted from the number of genes. An example of alternative 
splicing leading to the expression of two different proteins, each in a different 
tissue, is provided by the gene encoding both calcitonin and calcitonin- 
gene-related peptide (CGRP; Figure 29.41). In the thyroid gland, the inclu- 
sion of exon 4 in one splicing pathway produces calcitonin, a peptide 
hormone that regulates calcium and phosphorus metabolism. In neuronal 
Table 29.4 Selected human diseases attributed to defects in alternative splicing

Disorder                                   Gene or its product
Acute intermittent porphyria               Porphobilinogen deaminase
Breast and ovarian cancer                  BRCA1
Cystic fibrosis                            CFTR
Frontotemporal dementia                    τ protein
Hemophilia A                               Factor VIII
HGPRT deficiency (Lesch-Nyhan syndrome)    Hypoxanthine-guanine phosphoribosyltransferase
Leigh encephalomyelopathy                  Pyruvate dehydrogenase E1α
Severe combined immunodeficiency           Adenosine deaminase
Spinal muscle atrophy                      SMN1 or SMN2


cells, the exclusion of exon 4 in another splicing pathway produces CGRP, 
a peptide hormone that acts as a vasodilator. A single pre-mRNA thus 
yields two very different peptide hormones, depending on cell type. In this 
case, only two proteins result from alternative splicing; however, in other 
cases, many more can be produced. An extreme example is the Drosophila 
pre-mRNA that encodes DSCAM, a neuronal protein affecting axon 
connectivity. Alternative splicing of this pre-mRNA has the potential to 
produce 38,016 different combinations of exons, a greater number than the 
total number of genes in the Drosophila genome. However, only a fraction of 
these potential mRNAs appear to be produced owing to regulatory 
mechanisms that are not yet well understood. Several human diseases that 
can be attributed to defects in alternative splicing are listed in Table 29.4. 
Further understanding of alternative splicing and the mechanisms of splice- 
site selection will be crucial to understanding how the proteome is repre- 
sented by the human genome. 
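
The counting arguments in this section can be checked with a short sketch. Two simplified models are assumed here: independent two-way choices at each splicing position, as in the five-position example, and the selection of exactly one variant from each cluster of mutually exclusive exons, as in DSCAM. The function names are hypothetical. The DSCAM cluster sizes (12, 48, 33, and 2 alternative versions of exons 4, 6, 9, and 17) are taken from the published description of the gene and are not stated in the text above.

```python
from math import prod
from typing import Sequence


def independent_splice_products(n_positions: int) -> int:
    """mRNAs possible when each position is an independent two-way choice."""
    return 2 ** n_positions


def mutually_exclusive_products(variants_per_cluster: Sequence[int]) -> int:
    """mRNAs possible when exactly one variant is chosen from each cluster."""
    return prod(variants_per_cluster)


# Five independently regulated positions, as in the example in the text.
print(independent_splice_products(5))

# DSCAM: cluster sizes are an outside assumption (exons 4, 6, 9, and 17).
DSCAM_CLUSTERS = [12, 48, 33, 2]
print(mutually_exclusive_products(DSCAM_CLUSTERS))
```

Both numbers are upper bounds on diversity. As noted above, only a fraction of the potential DSCAM mRNAs appear to be produced.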


29.4 The Discovery of Catalytic RNA Was Revealing in 
Regard to Both Mechanism and Evolution 


RNAs form a surprisingly versatile class of molecules. As we have seen, 
splicing is catalyzed largely by RNA molecules, with proteins playing a 
secondary role. Another enzyme that contains a key RNA component is 
ribonuclease P, which catalyzes the maturation of tRNA by endonucleolytic 
cleavage of nucleotides from the 5’ end of the precursor molecule. Finally, 
as we shall see in Chapter 30, the RNA component of ribosomes is the 
catalyst that carries out protein synthesis. 

The versatility of RNA first became clear from observations of the 
processing of ribosomal RNA in a single-cell eukaryote. In Tetrahymena 
(a ciliated protozoan), a 414-nucleotide intron is removed from a 6.4-kb 
precursor to yield the mature 26S rRNA molecule (Figure 29.42). In an 
elegant series of studies of this splicing reaction, Thomas Cech and his 
coworkers established that the RNA spliced itself to precisely excise the 
intron. These remarkable experiments demonstrated that an RNA molecule 
can splice itself in the absence of protein. Indeed, the RNA alone is catalytic 
and, under certain conditions, is thus a ribozyme. More than 1500 similar 
introns have since been found in species as widely dispersed as bacteria and 
eukaryotes, though not in vertebrates. Collectively, they are referred to as 
group I introns. 

The self-splicing reaction in the group I intron requires an added guanosine
nucleotide. Nucleotides were originally included in the reaction mixture
because it was thought that ATP or GTP might be needed as an energy 
Figure 29.42 Self-splicing. A ribosomal RNA precursor from Tetrahymena, representative of the group I
introns, splices itself in the presence of a guanosine cofactor (G, shown in green). A 414-nucleotide intron
(red) is released in the first splicing reaction. This intron then splices itself twice again to produce a linear
RNA that has lost a total of 19 nucleotides. This L19 RNA is catalytically active. [After T. Cech. RNA as an
enzyme. Copyright © 1986 by Scientific American, Inc. All rights reserved.]


source. Instead, the nucleotides were found to be necessary as cofactors. The 
required cofactor proved to be a guanosine unit, in the form of guanosine, 
GMP, GDP, or GTP. G (denoting any one of these species) serves not as an 
energy source but as an attacking group that becomes transiently incorpo- 
rated into the RNA (see Figure 29.42). G binds to the RNA and then 
attacks the 5’ splice site to form a phosphodiester bond with the 5’ end of 
the intron. This transesterification reaction generates a 3’-OH group at the 
end of the upstream exon. This newly attached 3’-OH group then attacks 
the 3’ splice site. This second transesterification reaction joins the two exons 
and leads to the release of the 414-nucleotide intron. 

Self-splicing depends on the structural integrity of the RNA precursor. 
Much of the group I intron is needed for self-splicing. This molecule, like 
many RNAs, has a folded structure formed by many double-helical stems 
and loops (Figure 29.43), with a well-defined pocket for binding the 
guanosine. Examination of the three-dimensional structure of a catalytically 
active group I intron determined by x-ray crystallography reveals the 


[Figure label: guanosine-binding site]


Figure 29.43 Structure of a
self-splicing intron. The structure of a large 
fragment of the self-splicing intron from 
Tetrahymena reveals a complex folding pattern 
of helices and loops. Bases are shown in green, 
A; yellow, C; purple, G; and orange, U. [Drawn 
from 1GRZ.pdb]. 
coordination of magnesium ions in the active site analogous to that observed
in protein enzymes such as DNA polymerase. 

Analysis of the base sequence of the rRNA precursor suggested that the 
splice sites are aligned with the catalytic residues by base-pairing between 
the internal guide sequence (IGS) in the intron and the 5’ and 3’ exons 
(Figure 29.44). The IGS first brings together the guanosine cofactor and the 
5’ splice site so that the 3’-OH group of G can nucleophilically attack 
the phosphorus atom at this splice site. The IGS then holds the downstream 
exon in position for attack by the newly formed 3’-OH group of the 
upstream exon. A phosphodiester bond is formed between the two exons, 
and the intron is released as a linear molecule. Like catalysis by protein 
enzymes, self-catalysis of bond formation and breakage in this rRNA 
precursor is highly specific. 

The finding of enzymatic activity in the self-splicing intron and in the 
RNA component of RNase P has opened new areas of inquiry and changed 
the way in which we think about molecular evolution. As mentioned in an 
earlier chapter, the discovery that RNA can be a catalyst as well as an 
information carrier suggests that an RNA world may have existed early 
in the evolution of life, before the appearance of DNA and protein. 

Messenger RNA precursors in the mitochondria of yeast and fungi also 
undergo self-splicing, as do some RNA precursors in the chloroplasts of 
unicellular organisms such as Chlamydomonas. Self-splicing reactions can 
be classified according to the nature of the unit that attacks the upstream 
splice site. Group I self-splicing is mediated by a guanosine cofactor, as in 
Tetrahymena. The attacking moiety in group II splicing is the 2’-OH group 
of a specific adenylate of the intron (Figure 29.45). 

Group I and group II self-splicing resembles spliceosome-catalyzed 
splicing in two respects. First, in the initial step, a ribose hydroxyl group 
attacks the 5’ splice site. The newly formed 3’-OH terminus of the upstream 
exon then attacks the 3’ splice site to form a phosphodiester bond with the 
downstream exon. Second, both reactions are transesterifications in which 
the phosphate moieties at each splice site are retained in the products. The 
number of phosphodiester bonds stays constant. Group II splicing is like 
the spliceosome-catalyzed splicing of mRNA precursors in several addi- 
tional ways. The attack at the 5’ splice site is carried out by a part of the 
intron itself (the 2'-OH group of adenosine) rather than by an external 
cofactor (G). In both cases, the intron is released in the form of a lariat. 
Moreover, in some instances, the group II intron is transcribed in pieces 
that assemble through hydrogen bonding to the catalytic intron, in a manner 
analogous to the assembly of the snRNAs in the spliceosome. 
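
The statement that the number of phosphodiester bonds stays constant can be verified by simple bookkeeping. The sketch below is an illustration under stated assumptions: a linear RNA of n nucleotides has n - 1 phosphodiester bonds, a lariat has one additional 2’-5’ bond at the branch point, and the guanosine cofactor in group I splicing becomes the new 5’-terminal residue of the excised intron. The function names and the exon lengths are hypothetical. Only the 414-nucleotide intron length comes from the text.

```python
def linear_bonds(n_nucleotides: int) -> int:
    """Phosphodiester bonds in a linear RNA of the given length."""
    return max(n_nucleotides - 1, 0)


def precursor_bonds(exon1: int, intron: int, exon2: int) -> int:
    """Bonds in the unspliced precursor."""
    return linear_bonds(exon1 + intron + exon2)


def lariat_pathway_bonds(exon1: int, intron: int, exon2: int) -> int:
    """Bonds after group II or spliceosomal splicing (lariat intron)."""
    spliced_exons = linear_bonds(exon1 + exon2)
    lariat_intron = linear_bonds(intron) + 1  # extra 2'-5' bond at the branch
    return spliced_exons + lariat_intron


def group1_pathway_bonds(exon1: int, intron: int, exon2: int) -> int:
    """Bonds after group I splicing (G joined to the 5' end of the intron)."""
    spliced_exons = linear_bonds(exon1 + exon2)
    g_plus_intron = linear_bonds(intron + 1)  # G adds one residue, one bond
    return spliced_exons + g_plus_intron


# Hypothetical exon lengths; 414 is the Tetrahymena intron length.
lengths = (100, 414, 200)
print(precursor_bonds(*lengths))
print(lariat_pathway_bonds(*lengths))
print(group1_pathway_bonds(*lengths))
```

All three totals agree, which is the expected outcome for two successive transesterifications: each step breaks one phosphodiester bond and forms another.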


These similarities have led to the suggestion that the spliceosome-catalyzed
splicing of mRNA precursors evolved from RNA-catalyzed
self-splicing. Group II splicing may well be an intermediate between group 
I splicing and the splicing in the nuclei of higher eukaryotes. A major step in 
this transition was the transfer of catalytic power from the intron itself to other 
molecules. The formation of spliceosomes gave genes a new freedom because 
introns were no longer constrained to provide the catalytic center for 
splicing. Another advantage of external catalysts for splicing is that they can 
be more readily regulated. However, it is important to note that similarities 
do not establish ancestry. The similarities between group II introns and 
mRNA splicing may be a result of convergent evolution. Perhaps there are 
only a limited number of ways to carry out efficient, specific intron excision. 
The determination of whether these similarities stem from ancestry or 
from chemistry will require expanding our understanding of RNA 
biochemistry. 
Figure 29.44 Self-splicing mechanism.
The catalytic mechanism of the group I intron
includes a series of transesterification 
reactions. [After T. Cech. RNA as an enzyme. 
Copyright © 1986 by Scientific American, Inc. 
All rights reserved.] 
Figure 29.45 Comparison of splicing pathways. The exons being joined are shown in blue
and yellow and the attacking unit is shown in green. The catalytic site is formed by the intron itself
(red) in group I and group II splicing. In contrast, the splicing of nuclear mRNA precursors is
catalyzed by snRNAs and their associated proteins in the spliceosome. [After P. A. Sharp. Science 
235:766-771, 1987.] 


Summary 


29.1 RNA Polymerases Catalyze Transcription 
All cellular RNA molecules are synthesized by RNA polymerases 
according to instructions given by DNA templates. The activated 
monomer substrates are ribonucleoside triphosphates. The direction of 
RNA synthesis is 5’ → 3’, as in DNA synthesis. RNA polymerases,
unlike DNA polymerases, do not need a primer. 

RNA polymerase in E. coli is a multisubunit enzyme. The subunit
composition of the ~500-kd holoenzyme is α₂ββ’ωσ and that
of the core enzyme is α₂ββ’ω. Transcription is initiated at promoter
sites consisting of two sequences, one centered near -10 and the other
near -35; that is, 10 and 35 nucleotides away from the start site in the
5’ (upstream) direction. The consensus sequence of the -10 region is
TATAAT. The σ subunit enables the holoenzyme to recognize promoter
sites. When the growth temperature is raised, E. coli expresses
a special σ subunit that selectively binds the distinctive promoter of
heat-shock genes. RNA polymerase must unwind the template double
helix for transcription to take place. Unwinding exposes some 17 bases
on the template strand and sets the stage for the formation of the first
phosphodiester bond. Newly synthesized RNA chains usually start
with pppG or pppA. The σ subunit usually dissociates from the
holoenzyme after the initiation of the new chain. Elongation takes
place at transcription bubbles that move along the DNA template at
a rate of about 50 nucleotides per second. RNA polymerase occasionally
backtracks, a process that can facilitate proofreading of the RNA
transcript. The nascent RNA chain contains stop signals that end
transcription. One stop signal is an RNA hairpin, which is followed by
several U residues. A different stop signal is read by the rho protein,
an ATPase. Some genes are regulated by riboswitches, structures
that form in RNA transcripts and bind specific metabolites. In E. coli,
precursors of transfer RNA and ribosomal RNA are cleaved and
chemically modified after transcription, whereas messenger RNA is
used unchanged as a template for protein synthesis.


29.2 Transcription in Eukaryotes Is Highly Regulated
RNA synthesis in eukaryotes takes place in the nucleus, whereas protein
synthesis takes place in the cytoplasm. There are three types of
RNA polymerase in the nucleus: RNA polymerase I makes ribosomal
RNA precursors, II makes messenger RNA precursors, and III makes
transfer RNA precursors. Eukaryotic promoters are complex, being
composed of several different elements. Promoters for RNA polymerase
II may be located on the 5’ side or the 3’ side of the start site
for transcription. A common type of eukaryotic promoter consists of a
TATA box centered between -30 and -100 and paired with an initiator
element. Eukaryotic promoter elements are recognized by proteins
called transcription factors rather than by RNA polymerase II.
The saddle-shaped TATA-box-binding protein unwinds and sharply
bends DNA at TATA-box sequences and serves as a focal point for
the assembly of transcription complexes. The TATA-box-binding
protein initiates the assembly of the active transcription complex. The
activity of many promoters is greatly increased by enhancer sequences
that have no promoter activity of their own. Enhancer sequences can
act over distances of several kilobases, and they can be located either
upstream or downstream of a gene.


29.3 The Transcription Products of Eukaryotic Polymerases Are Processed
The 5’ ends of mRNA precursors become capped and methylated in 
the course of transcription. A 3’ poly(A) tail is added to most mRNA 
precursors after the nascent chain has been cleaved by an endonu- 
clease. RNA editing processes alter the nucleotide sequence of some 
mRNAs, such as the one for apolipoprotein B. 

The splicing of mRNA precursors is carried out by spliceosomes, 
which consist of small nuclear ribonucleoprotein particles. Splice sites 
in mRNA precursors are specified by sequences at ends of introns and
by branch sites near their 3’ ends. The 2’-OH group of an adenosine 
residue in the branch site attacks the 5’ splice site to form a lariat 
intermediate. The newly generated 3’-OH terminus of the upstream 
exon then attacks the 3’ splice site to become joined to the downstream 
exon. Splicing thus consists of two transesterification reactions, with 
the number of phosphodiester bonds remaining constant during reac- 
tions. Small nuclear RNAs in spliceosomes catalyze the splicing of 
mRNA precursors. In particular, U2 and U6 snRNAs form the active 
centers of spliceosomes. 

The events in posttranscriptional processing of mRNA are con- 
trolled by the phosphorylation state of the carboxy-terminal domain, 
part of RNA polymerase II. 


29.4 The Discovery of Catalytic RNA Was Revealing in Regard to Both 
Mechanism and Evolution 
Some RNA molecules, such as those containing the group I intron,
undergo self-splicing in the absence of protein. A self-modified version
of this rRNA intron displays true catalytic activity and is thus a
ribozyme. Spliceosome-catalyzed splicing may have evolved from
self-splicing. The discovery of catalytic RNA has opened new vistas in
our exploration of early stages of molecular evolution and the origins
of life.


Key Terms 


transcription (p. 851) TATA box (p. 866) 
RNA polymerase (p. 852) 
promoter site (p. 852) 
transcription bubble (p. 853) 


consensus sequence (p. 857) 


enhancer (p. 868) 


(snoRNP) (p. 870) 
sigma (σ) subunit (p. 857)

riboswitch (p. 860) 

rho (ρ) protein (p. 860)

carboxy-terminal domain (CTD) (p. 865) 


5’ cap (p. 870) 
poly(A) tail (p. 871) 
microRNA (p. 872) 


Problems 


1. Complements. The sequence of part of an mRNA is 
5'-AUGGGGAACAGCAAGAGUGGGGCCCUGUCCAAGGAG-3' 


What is the sequence of the DNA coding strand? Of the 
DNA template strand? 


2. Checking for errors. Why is RNA synthesis not as 
carefully monitored for errors as is DNA synthesis? 


3. Speed is not of the essence. Why is it advantageous for 
DNA synthesis to be more rapid than RNA synthesis? 


4. Active sites. The overall structures of 
RNA polymerase and DNA polymerase 
are very different, yet their active sites 
show considerable similarities. What do 
the similarities suggest about the evolutionary relationship 
between these two important enzymes? 


5. Potent inhibitor. Heparin inhibits transcription by bind- 
ing to RNA polymerase. What properties of heparin allow 
it to bind so effectively to RNA polymerase? 


6. A loose cannon. Sigma protein by itself does not bind to 
promoter sites. Predict the effect of a mutation enabling σ
to bind to the -10 region in the absence of other subunits of 
RNA polymerase. 


7. Stuck sigma. What would be the likely effect of a mutation
that prevents σ from dissociating from the RNA polymerase
core?


8. Transcription time. What is the minimum length of time 
required for the synthesis by E. coli polymerase of an 
mRNA encoding a 100-kd protein? 


transcription factor (p. 867) 


small nucleolar ribonucleoprotein 


pre-mRNA (p. 870) 


RNA editing (p. 872) 

RNA splicing (p. 873) 

spliceosome (p. 874) 

small nuclear RNA (snRNA) (p. 875) 

small nuclear ribonucleoprotein 
particle (snRNP) (p. 875) 

alternative splicing (p. 878) 

catalytic RNA (p. 879) 


self-splicing (p. 879) 


9. Rapid search. RNA polymerase finds promoter sites 
very rapidly. The observed rate constant for the binding of 
the RNA polymerase holoenzyme to promoter sequences is 
$10^{10}\ \mathrm{M^{-1}\,s^{-1}}$. The rate constant for two macromolecules
encountering each other is typically $10^{8}\ \mathrm{M^{-1}\,s^{-1}}$. Propose
an explanation for the 100-fold larger rate for a protein 
finding a particular site along a DNA molecule. 


10. Where to begin? Identify the likely transcription start 
site in the following DNA sequence: 


5’-GCCGTTGACACCGTTCGGCGATCGATCCGCTATAATGTGTGGATCCGCTT-3' 
3’-CGGCAACTGTGGCAAGCCGCTAGCTAGGCGATATTACACACCTAGGCGAA-5'


11. Between bubbles. How far apart are transcription bubbles 
on E. coli genes that are being transcribed at a maximal 
rate? 


12. A revealing bubble. Consider the synthetic RNA-DNA 
transcription bubble illustrated here. Refer to the coding 
DNA strand, the template strand, and the RNA strand as 
strands 1, 2, and 3, respectively. 


[Figure: synthetic RNA-DNA transcription bubble. Only part of the sequence is legible.]

(1) DNA coding strand   5’-GGATACTTACAGCCAT [unpaired bubble region, includes CACGGC] ATACTCCATT...3’
(2) Template strand     3’-CCTATGAATGTCGGTACCTGTGCCGCTTATGAGGTAA...5’
(3) RNA strand          5’-UU...


(a) Suppose that strand 3 is labeled with ³²P at its 5’ end
and that polyacrylamide gel electrophoresis is carried out
under nondenaturing conditions. Predict the autoradiographic
pattern for (i) strand 3 alone, (ii) strands 1 and 3,
(iii) strands 2 and 3, (iv) strands 1, 2, and 3, and (v) strands 
1, 2, and 3 and core RNA polymerase. 


(b) What is the likely effect of rifampicin on RNA synthe- 
sis in this system? 


(c) Heparin blocks elongation of the RNA primer if it is 
added to core RNA polymerase before the onset of tran- 
scription but not if added after transcription starts. Account 
for this difference. 


(d) Suppose that synthesis is carried out in the presence of 
ATP, CTP, and UTP. Compare the length of the longest 
product obtained with that expected when all four ribonu- 
cleoside triphosphates are present. 


13. Proofreading marks. The major products of proof- 
reading by RNA polymerase are dinucleotides rather than 
mononucleotides. Why? 


14. Abortive cycling. Di- and trinucleotides are occasionally 
released from RNA polymerase at the very start of tran- 
scription, a process called abortive cycling. This process 
requires the restart of transcription. Suggest a plausible 
explanation for abortive cycling. 


15. Polymerase inhibition. Cordycepin inhibits poly(A) 
synthesis at low concentrations and RNA synthesis at 
higher concentrations. 


[Structure: cordycepin (3'-deoxyadenosine)]


(a) What is the basis of inhibition by cordycepin? 
(b) Why is poly(A) synthesis more sensitive to the presence 
of cordycepin? 


(c) Does cordycepin need to be modified to exert its 
effect? 


16. Alternative splicing. A gene contains eight sites where 
alternative splicing is possible. Assuming that the splicing 
pattern at each site is independent of that at all other sites, 
how many splicing products are possible? 


17. Supercoiling. Negative supercoiling of DNA favors the 
transcription of genes because it facilitates unwinding. 
However, not all promoter sites are stimulated by negative 
supercoiling. The promoter site for topoisomerase II itself is 
a noteworthy exception. Negative supercoiling decreases 
the rate of transcription of this gene. Propose a possible 
mechanism for this effect and suggest a reason why it may 
occur. 


18. An extra piece. In one type of mutation leading to a 
form of thalassemia, the mutation of a single base (G to A) 
generates a new 3’ splice site (blue in the illustration below) 
akin to the normal one (yellow) but farther upstream. 


Normal 3’ end of intron


5’ CCTATTGGTCTATTTTCCACCCTTAGGCTGCTG3' 


5’ CCTATTAGTCTATTTTCCACCCTTAGGCTGCTG3' 


What is the amino acid sequence of the extra segment of 
protein synthesized in a thalassemic patient having a muta- 
tion leading to aberrant splicing? The reading frame after 
the splice site begins with TCT. 


19. A long-tailed messenger. Another thalassemic patient 
had a mutation leading to the production of an mRNA for 
the β chain of hemoglobin that was 900 nucleotides longer
than the normal one. The poly(A) tail of this mutant mRNA 
was located a few nucleotides after the only AAUAAA 
sequence in the additional sequence. Propose a mutation 
that would lead to the production of this altered mRNA. 


Mechanism Problem 


20. RNA editing. Many uridine molecules are inserted into 
some mitochondrial mRNAs in trypanosomes. The uridine 
residues come from the poly(U) tail of a donor strand. 
Nucleoside triphosphates do not participate in this reaction. 
Propose a reaction mechanism that accounts for these find- 
ings. (Hint: Relate RNA editing to RNA splicing.) 


Chapter Integration Problems 


21. Proteome complexity. What processes considered in this 
chapter make the proteome more complex than the genome? 
What processes might further enhance this complexity? 


22. Separation technique. Suggest a means by which you 
could separate mRNA from the other types of RNA in a 
eukaryotic cell. 


Data Interpretation Problems 


23. Run-off experiment. Nuclei were isolated from brain, 
liver, and muscle. The nuclei were then incubated with 
α-[³²P]UTP under conditions that allow RNA synthesis,
except that an inhibitor of RNA initiation was present. 
The radioactive RNA was isolated and annealed to various 
DNA sequences that had been attached to a gene chip. In 
the adjoining graphs, the intensity of the shading indicates
roughly how much mRNA was attached to each DNA 
sequence. 


Liver Muscle Brain 


(a) Why does the intensity of hybridization differ between 
genes? 

(b) What is the significance of the fact that some of the 
RNA molecules display different hybridization patterns in 
different tissues? 

(c) Some genes are expressed in all three tissues. What 
would you guess is the nature of these genes? 

(d) Suggest a reason why an initiation inhibitor was included
in the reaction mixture. 


24. Christmas trees. The adjoining autoradiograph depicts 
several bacterial genes undergoing transcription. Identify 
the DNA. What are the strands of increasing length? Where 
is the beginning of transcription? The end of transcription? 
On the page, what is the direction of RNA synthesis? What 
can you conclude about the number of enzymes participat- 
ing in RNA synthesis on a given gene? 


Protein Synthesis 


CHAPTER 30 


The ribosome, shown at the right, is a factory for the manufacture of polypeptides. Amino 
acids are carried into the ribosome, one at a time, connected to transfer RNA molecules. 
Each amino acid is joined to the growing polypeptide chain, which detaches from the 
ribosome only after the polypeptide has been completed. This assembly-line approach allows 
even very long polypeptide chains to be assembled rapidly and with impressive accuracy. 
[(Left) Images of Birmingham Premium/Alamy.] 


Genetic information is most important because of the proteins that it 
encodes, in that proteins play most of the functional roles in cells. In 
Chapters 28 and 29, we examined how DNA is replicated and transcribed 
into RNA. We now turn to the mechanism of protein synthesis, a process 
called translation because the four-letter alphabet of nucleic acids is 
translated into the entirely different twenty-letter alphabet of proteins. 
Translation is a conceptually more complex process than either replication 
or transcription, both of which take place within the framework of a common 
base-pairing language. As befits its position linking the nucleic acid and 
protein languages, the process of protein synthesis critically depends on both 
nucleic acid and protein factors. Protein synthesis takes place on ribosomes— 
enormous complexes containing three large RNA molecules and more 
than 50 proteins. Among the great triumphs in biochemistry in recent years 
has been the determination of the structure of the ribosome and its compo- 
nents so that its function can be examined in atomic detail. Perhaps the 
most significant conclusion from these studies is that the ribosome is a 
ribozyme; that is, the RNA components play the most fundamental roles. 
These observations strongly support the notion that life evolved through an 
RNA world, and the ribosome is a surviving inhabitant of that world. 
Transfer RNA molecules (tRNAs) and messenger RNA (mRNA) also 
are key participants in protein synthesis. The link between amino acids and 
nucleic acids is first made by enzymes called aminoacyl-tRNA synthetases. 


OUTLINE 


30.1 Protein Synthesis Requires the 
Translation of Nucleotide 
Sequences into Amino Acid 
Sequences 


30.2 Aminoacyl Transfer RNA 
Synthetases Read the Genetic Code 


30.3 The Ribosome Is the Site of 
Protein Synthesis 


Eukaryotic Protein Synthesis 
Synthesis Primarily in Translation 
Initiation 


A Variety of Antibiotics and Toxins 
Can Inhibit Protein Synthesis 


Ribosomes Bound to the 


Figure 30.1 Polypeptide-chain growth. 
Proteins are synthesized by the successive 
addition of amino acids to the carboxyl 
terminus. 


By specifically linking a particular amino acid to each tRNA, these enzymes 
translate the genetic code. 

Although RNA is paramount in the process of translation, protein fac- 
tors also are required for the efficient synthesis of a protein. Protein factors 
participate in the initiation, elongation, and termination of protein synthe- 
sis. P-loop NTPases of the G-protein family play particularly important 
roles. Recall that these proteins serve as molecular switches as they cycle 
between a GTP-bound form and a GDP-bound form (Section 14.1). This 
chapter focuses primarily on protein synthesis in prokaryotes because it 
illustrates many general principles and is well understood. Some distinctive 
features of protein synthesis in eukaryotes also are presented. 


30.1 Protein Synthesis Requires the Translation of 
Nucleotide Sequences into Amino Acid Sequences 


The basics of protein synthesis are the same across all kingdoms of life— 
evidence that the protein-synthesis system arose very early in evolution. An 
mRNA is decoded, or read, in the 5’-to-3’ direction, one codon at a time, 
and the corresponding protein is synthesized in the amino-to-carboxyl 
direction by the sequential addition of amino acids to the carboxyl end of 
the growing peptide chain (Figure 30.1). The amino acids arrive at the 
growing chain in activated form as aminoacyl-tRNAs, created by joining 
the carboxyl group of an amino acid to the 3′ end of a tRNA molecule. The 
linking of an amino acid to its corresponding tRNA is catalyzed by an 
aminoacyl-tRNA synthetase. ATP cleavage drives this activation reaction. 


For each amino acid, there is usually one activating enzyme and at least one 
kind of tRNA. 
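
The directionality described above can be illustrated with a minimal Python sketch. It reads an mRNA from 5′ to 3′, one codon at a time, and appends each amino acid to the carboxyl end of the growing chain. The function name `translate`, the one-letter amino acid output, and the compact encoding of the standard genetic code are choices made for this illustration, not part of the text. The sketch ignores initiation, tRNAs, and the ribosome, and simply starts reading at the first base.

```python
# Standard genetic code in one-letter amino acid symbols, with codons
# enumerated in the base order U, C, A, G. '*' marks a stop codon.
# This compact encoding is an illustrative assumption, not from the text.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(mrna: str) -> str:
    """Decode an mRNA written 5'->3', starting at its first base.

    The returned polypeptide is written amino terminus first.
    Decoding stops at the first stop codon or at the end of the message.
    """
    chain = []
    for start in range(0, len(mrna) - 2, 3):
        residue = CODON_TABLE[mrna[start:start + 3]]
        if residue == "*":
            break
        chain.append(residue)  # each new residue joins the carboxyl end
    return "".join(chain)


print(translate("AUGGCCUAA"))  # Met, then Ala, then a stop codon
```

Because residues are only ever appended, the first codon read always supplies the amino-terminal residue, which mirrors the amino-to-carboxyl direction of synthesis.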


The synthesis of long proteins requires a low error frequency 


The process of transcription is analogous to copying, word for word, a page 
of a book. There is no change of alphabet or vocabulary; so the likelihood of 
a change in meaning is small. Translating the base sequence of an mRNA 
molecule into a sequence of amino acids is analogous to translating the page 
of a book into another language. Translation is a complex process, entailing 
many steps and dozens of molecules. The potential for error exists at each step. 
The complexity of translation creates a conflict between two requirements: 
the process must be both accurate and fast enough to meet a cell’s needs. In 
E. coli, translation can take place at a rate of 50 amino acids per second, a 
truly impressive speed, considering the complexity of the process. 


How accurate must protein synthesis be? Let us consider error rates. The 
probability of forming a protein with no errors depends on the number of 
amino acid residues and on the frequency (ε) of insertion of a wrong amino 
acid. As Table 30.1 shows, an error frequency of 10⁻² is intolerable, even 
for quite small proteins. An ε value of 10⁻³ usually leads to the error-free 
synthesis of a 300-residue protein (~33 kd) but not of a 1000-residue 
protein (~110 kd). Thus, the error frequency must not exceed approximately 
10⁻⁴ to produce the larger proteins effectively. Lower error frequencies are 
conceivable; however, except for the largest proteins, they will not 
dramatically increase the percentage of proteins with accurate sequences. 
In addition, such lower error rates are likely to be possible only by a 
reduction in the rate of protein synthesis because additional time for 
proofreading is required. In fact, the observed values of ε are close to 
10⁻⁴. An error frequency of about 10⁻⁴ per amino acid residue was selected 
in the course of evolution to accurately produce proteins consisting of as 
many as 1000 amino acids while maintaining a remarkably rapid rate for 
protein synthesis. 


Table 30.1 Accuracy of protein synthesis 

                            Probability of synthesizing an error-free protein 
Frequency of inserting      Number of amino acid residues 
an incorrect amino acid     100       300       1000 

10⁻²                        0.366     0.049     0.000 
10⁻³                        0.905     0.741     0.368 
10⁻⁴                        0.990     0.970     0.905 
10⁻⁵                        0.999     0.997     0.990 

Note: The probability p of forming a protein with no errors depends on n, the 
number of amino acids, and ε, the frequency of insertion of a wrong amino 
acid: p = (1 − ε)ⁿ. 
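
The entries of Table 30.1 follow directly from the relation p = (1 − ε)ⁿ given in the table note. The short Python sketch below recomputes them; the function and constant names are hypothetical and chosen only for this illustration.

```python
def error_free_probability(error_frequency: float, n_residues: int) -> float:
    """Probability that all n residues are correct: p = (1 - epsilon)^n."""
    return (1.0 - error_frequency) ** n_residues


LENGTHS = (100, 300, 1000)             # number of amino acid residues
FREQUENCIES = (1e-2, 1e-3, 1e-4, 1e-5)  # error frequency per residue


def accuracy_table():
    """Map each error frequency to its row of probabilities, rounded to 3 places."""
    return {
        eps: [round(error_free_probability(eps, n), 3) for n in LENGTHS]
        for eps in FREQUENCIES
    }


for eps, row in accuracy_table().items():
    print(f"{eps:.0e}", "  ".join(f"{p:.3f}" for p in row))
```

Comparing the rows for 10⁻⁴ and 10⁻⁵ shows the point made in the text: a tenfold lower error frequency changes the outcome appreciably only for the 1000-residue protein.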


Transfer RNA molecules have a common design 


The fidelity of protein synthesis requires accurate recognition of three- 
base codons on messenger RNA. Recall that the genetic code relates each 
amino acid to a three-letter codon (Section 4.6). An amino acid cannot 
itself recognize a codon. Consequently, an amino acid is attached to a 
specific tRNA molecule that can recognize the codon by Watson—Crick 
base-pairing. Transfer RNA serves as the adapter molecule that binds to a 
specific codon and brings with it an amino acid for incorporation into the 
polypeptide chain. 

Consider yeast alanyl-tRNA, so called because it will carry the amino 
acid alanine. Yeast alanyl-tRNA was the first nucleic acid sequenced. This 
adapter molecule is a single chain of 76 ribonucleotides (Figure 30.2). The 
5′ terminus is phosphorylated (pG), whereas the 3′ terminus has a free 
hydroxyl group. The amino acid-attachment site is the 3′-hydroxyl group of 
the adenosine residue at the 3′ terminus of the molecule. The sequence 
5′-IGC-3′ in the middle of the molecule is the anticodon, where I is the 
purine base inosine. It is complementary to 5′-GCC-3′, one of the codons 
for alanine. 

Thousands of tRNA sequences are now known. The striking finding 
is that all of them can be arranged in a cloverleaf pattern in which about 
half the residues are base-paired (Figure 30.3). Hence, tRNA molecules 
have many common structural features. This finding is not unexpected, 
because all tRNA molecules must be able to interact in nearly the same 
way with the ribosomes, mRNAs, and protein factors that participate in 
translation. 

All known transfer RNA molecules have the following features: 


1. Each is a single chain containing between 73 and 93 ribonucleotides 


(~25 kd). 


2. They contain many unusual bases, typically between 7 and 15 per molecule. 
Some of these bases are methylated or dimethylated derivatives of 
A, U, C, and G formed by enzymatic modification of a precursor tRNA. 
Some methylations prevent the formation of certain base pairs, thereby 
rendering such bases accessible for interactions with other bases. In 
addition, methylation imparts a hydrophobic character to some regions of 
tRNAs, which may be important for their interaction with synthetases and 
ribosomal proteins. Other modifications alter codon recognition, as will be 
described shortly. 


Figure 30.2 Alanyl-tRNA sequence. The 
base sequence of yeast alanyl-tRNA and the 
deduced cloverleaf secondary structure are 
shown. Modified nucleosides are abbreviated 
as follows: methylinosine (mI), dihydrouridine 
(UH₂), ribothymidine (T), pseudouridine (ψ), 
methylguanosine (mG), and 
dimethylguanosine (m₂G). Inosine (I), 
another modified nucleoside, is part of the 
anticodon. 


Figure 30.3 General structure of tRNA 
molecules. Comparison of the base sequences of 
many tRNAs reveals a number of conserved features. 


3. The molecule is L-shaped (Figure 30.4). 


4. About half the nucleotides in tRNAs are base-paired to form double 
helices. The four helical regions are arranged to form two apparently 
continuous segments of double helix. These segments are like A-form DNA, as 
expected for an RNA helix (p. 119). One helix, containing the 5′ and 3′ ends, 
runs horizontally in the model shown in Figure 30.5. The other helix, which 
contains the anticodon and runs vertically in Figure 30.5, forms the other 
arm of the L. 


Figure 30.4 Transfer RNA structure. Notice the 
L-shaped structure revealed by this skeletal model of yeast 
phenylalanyl-tRNA. The CCA region is at the end of one arm, 
and the anticodon loop is at the end of the other. [Drawn 
from 1EHZ.pdb.] 


Figure 30.5 Helix stacking in tRNA. 
The four double-stranded regions of the tRNA 
(see Figure 30.3) stack to form an L-shaped 
structure. [Drawn from 1EHZ.pdb.] 


Five groups of bases are not base-paired in this way: the 3′ CCA terminal 
region, which is part of a region called the acceptor stem; the TψC loop, which 
acquired its name from the sequence ribothymine-pseudouracil-cytosine; 
the “extra arm,” which contains a variable number of residues; the DHU 
loop, which contains several dihydrouracil residues; and the anticodon loop. 
Most of the bases in the nonhelical regions participate in hydrogen-bonding 
interactions, even if the interactions are not like those in Watson–Crick base 
pairs. The structural diversity generated by this combination of helices and 
loops containing modified bases ensures that the tRNAs can be uniquely 
distinguished, though structurally similar overall. 

5. The 5′ end of a tRNA is phosphorylated. The 5′ terminal residue is 
usually pG. 

6. An activated amino acid is attached to a hydroxyl group of the adenosine 
residue in the amino acid-attachment site, located at the end of the 3′ CCA 
component of the acceptor stem (Figure 30.6). This single-stranded region 
can change conformation in the course of amino acid activation and protein 
synthesis. 

7. The anticodon, which is present in a loop near the center of the 
sequence, is at the other end of the L, making accessible the three bases that 
make up the anticodon. 

Thus, the architecture of the tRNA molecule is well suited to its role as 
adapter: the anticodon is available to interact with an appropriate codon on 
mRNA while the end that is linked to an activated amino acid is well 
positioned to participate in peptide-bond formation. 


Figure 30.6 Aminoacyl-tRNA. Amino acids 
are coupled to tRNAs through ester linkages 
to either the 2′- or the 3′-hydroxyl group of 
the 3′-adenosine residue. A linkage to the 
3′-hydroxyl group is shown. 


Some transfer RNA molecules recognize more than one codon 
because of wobble in base-pairing 


What are the rules that govern the recognition of a codon by the anticodon 
of a tRNA? A simple hypothesis is that each of the bases of the codon forms 
a Watson–Crick type of base pair with a complementary base on the 
anticodon of the tRNA. The codon and anticodon would then be lined up in an 
antiparallel fashion. In the following diagram, the prime denotes the 
complementary base. Thus X and X′ would be either A and U (or U and A) 
or G and C (or C and G). According to this model, a particular anticodon 
can recognize only one codon. 

    Anticodon   3′ -X -Y -Z - 5′ 
                    ·  ·  · 
    Codon       5′ -X′-Y′-Z′- 3′ 

The facts are otherwise. As found experimentally, some tRNA molecules 
can recognize more than one codon. For example, the yeast alanyl-tRNA 
binds to three codons: GCU, GCC, and GCA. The first two bases of these 
codons are the same, whereas the third is different. Could it be that 
recognition of the third base of a codon is sometimes less discriminating than 
recognition of the other two? The pattern of degeneracy of the genetic code 
indicates that it might be so. XYU and XYC always encode the same amino 
acid; XYA and XYG usually do. Francis Crick surmised from these data 
that the steric criteria might be less stringent for pairing of the third base 
than for the other two. Models of various base pairs were built to determine 
which ones are similar to the standard A·U and G·C base pairs with 
regard to the distance and angle between the glycosidic bonds. Inosine was 
included in this study because it appeared in several anticodons. With the 
assumption of some steric freedom (“wobble”) in the pairing of the third 
base of the codon, the combinations shown in Table 30.2 seemed plausible. 


Table 30.2 Allowed pairings at the third base of the codon 
according to the wobble hypothesis 

First base          Third base 
of anticodon        of codon 

C                   G 
A                   U 
U                   A or G 
G                   U or C 
I                   U, C, or A 


Figure 30.7 16S rRNA monitors 
base-pairing between the codon and the 
anticodon. Adenine 1493, one of three 
universally conserved bases in 16S rRNA, 
forms hydrogen bonds with the bases in both 
the codon and the anticodon only if the 
codon and anticodon are correctly paired. 
[From J. M. Ogle and V. Ramakrishnan. Annu. 
Rev. Biochem. 74:129-177, 2005, Fig. 2a.] 


The wobble hypothesis is now firmly established. The anticodons of 
tRNAs of known sequence bind to the codons predicted by this hypothesis. 
For example, the anticodon of yeast alanyl-tRNA is IGC. This tRNA rec- 
ognizes the codons GCU, GCC, and GCA. Recall that, by convention, 
nucleotide sequences are written in the 5′ → 3′ direction unless otherwise 
noted. Hence, I (the 5′ base of this anticodon) pairs with U, C, or A (the 3′ 
base of the codon), as predicted. 




Two generalizations concerning the codon—anticodon interaction can be 
made: 


1. The first two bases of a codon pair in the standard way. Recognition is 
precise. Hence, codons that differ in either of their first two bases must be 
recognized by different tRNAs. For example, both UUA and CUA encode 
leucine but are read by different tRNAs. 


2. The first base of an anticodon determines whether a particular tRNA 
molecule reads one, two, or three kinds of codons: C or A (one codon), U or 
G (two codons), or I (three codons). Thus, part of the degeneracy of the 
genetic code arises from imprecision (wobble) in the pairing of the third base of 
the codon with the first base of the anticodon. We see here a strong reason for 
the frequent appearance of inosine, one of the unusual nucleosides, in 
anticodons. Inosine maximizes the number of codons that can be read by a 
particular tRNA molecule. The inosine bases in tRNA are formed by the 
deamination of adenosine after the synthesis of the primary transcript. 

Why is wobble tolerated in the third position of the codon but not in the 
first two? This question is answered by considering the interaction of the 
tRNA with the ribosome. As we will see, ribosomes are huge RNA–protein 
complexes consisting of two subunits, the 30S and 50S subunits. The 30S 
subunit has an RNA molecule, the 16S rRNA, that has three universally 
conserved bases (adenine 1492, adenine 1493, and guanine 530) that 
form hydrogen bonds on the minor-groove side but only with correctly 
formed base pairs of the codon–anticodon duplex (Figure 30.7). These 
interactions serve to check whether Watson–Crick base pairs are present in 
the first two positions of the codon–anticodon duplex. No such inspection 
device is present for the third position; so more-varied base pairs are 
tolerated. This mechanism for ensuring fidelity is analogous to the 
minor-groove interactions utilized by DNA polymerase for a similar purpose 
(Section 28.1). Thus, the ribosome plays an active role in decoding the 
codon–anticodon interactions. 
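
The pairing rules of Table 30.2, together with the two generalizations above, can be expressed as a small lookup. The following Python sketch is only an illustration under stated assumptions: the dictionary and function names are hypothetical, anticodons and codons are written 5′ → 3′, and the first two codon positions are taken to pair strictly by Watson–Crick rules.

```python
from typing import List

# Strict Watson-Crick partners, used for codon positions 1 and 2.
WATSON_CRICK = {"A": "U", "U": "A", "G": "C", "C": "G"}

# Wobble hypothesis (Table 30.2): first (5') base of the anticodon ->
# bases allowed at the third (3') position of the codon.
WOBBLE = {"C": "G", "A": "U", "U": "AG", "G": "UC", "I": "UCA"}


def codons_read(anticodon: str) -> List[str]:
    """List the codons (5'->3') that an anticodon (5'->3') can recognize."""
    first, second, third = anticodon
    # Pairing is antiparallel: codon base 1 faces anticodon base 3,
    # and codon base 2 faces anticodon base 2.
    codon_1 = WATSON_CRICK[third]
    codon_2 = WATSON_CRICK[second]
    return [codon_1 + codon_2 + codon_3 for codon_3 in WOBBLE[first]]


print(codons_read("IGC"))  # the yeast alanyl-tRNA anticodon from the text
```

For the anticodon IGC the function returns GCU, GCC, and GCA, the three alanine codons named in the text, whereas an anticodon beginning with C or A yields a single codon.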


30.2 Aminoacyl Transfer RNA Synthetases Read 
the Genetic Code 


Before codon and anticodon meet, the amino acids required for protein 
synthesis must first be attached to specific tRNA molecules. The linkage of 
an amino acid to a tRNA is crucial for two reasons. First, the attachment of 
a given amino acid to a particular tRNA establishes the genetic code. When an 
amino acid has been linked to a tRNA, it will be incorporated into a growing 
polypeptide chain at a position dictated by the anticodon of the tRNA. 
Second, because the formation of a peptide bond between free amino acids is not 
thermodynamically favorable, the amino acid must first be activated in order 
for protein synthesis to proceed. The activated intermediates in protein synthesis 
are amino acid esters, in which the carboxyl group of an amino acid is linked to 
either the 2′- or the 3′-hydroxyl group of the ribose unit at the 3′ end of 
tRNA. An amino acid ester of tRNA is called an aminoacyl-tRNA or 
sometimes a charged tRNA (see Figure 30.6). For a specific amino acid attached 
to its cognate tRNA (for instance, threonine), the charged tRNA is 
designated Thr-tRNAᵀʰʳ. 


Amino acids are first activated by adenylation 


The activation reaction is catalyzed by specific aminoacyl-tRNA synthetases, 
which are also called activating enzymes. The first step is the formation of an 
aminoacyl adenylate from an amino acid and ATP. 


Amino acid + ATP ⇌ aminoacyl-AMP + PPᵢ 


This activated species is a mixed anhydride in which the carboxyl group of 
the amino acid is linked to the phosphoryl group of AMP; hence, it is also 
known as aminoacyl-AMP. 

The next step is the transfer of the aminoacyl group of aminoacyl-AMP to 
a particular tRNA molecule to form aminoacyl-tRNA. 

Aminoacyl-AMP + tRNA ⇌ aminoacyl-tRNA + AMP 

The sum of these activation and transfer steps is 

Amino acid + ATP + tRNA ⇌ aminoacyl-tRNA + AMP + PPᵢ 


The ΔG°′ of this reaction is close to 0, because the free energy of 
hydrolysis of the ester bond of aminoacyl-tRNA is similar to that for the 
hydrolysis of ATP to AMP and PPᵢ. As we have seen many times, the reaction is 
driven by the hydrolysis of pyrophosphate. The sum of these three reactions 
is highly exergonic: 


Amino acid + ATP + tRNA + H₂O → 
aminoacyl-tRNA + AMP + 2 Pᵢ 


Thus, the equivalent of two molecules of ATP is consumed in the synthesis 
of each aminoacyl-tRNA. One of them is consumed in forming the ester 
linkage of aminoacyl-tRNA, whereas the other is consumed in driving the 
reaction forward. 
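
The bookkeeping behind the net equation can be checked by summing the three steps (activation, transfer, and pyrophosphate hydrolysis) and cancelling the intermediates. The Python sketch below does only this stoichiometric accounting; it says nothing about free energies, and the species labels and function name are hypothetical choices for the illustration.

```python
from collections import Counter
from typing import Dict, Iterable

# A reaction maps each species to a coefficient:
# negative = consumed, positive = produced.
Reaction = Dict[str, int]

ACTIVATION: Reaction = {"amino acid": -1, "ATP": -1, "aminoacyl-AMP": 1, "PPi": 1}
TRANSFER: Reaction = {"aminoacyl-AMP": -1, "tRNA": -1, "aminoacyl-tRNA": 1, "AMP": 1}
PPI_HYDROLYSIS: Reaction = {"PPi": -1, "H2O": -1, "Pi": 2}


def net_reaction(steps: Iterable[Reaction]) -> Reaction:
    """Add the steps and drop species whose coefficients cancel to zero."""
    totals: Counter = Counter()
    for step in steps:
        for species, coefficient in step.items():
            totals[species] += coefficient
    return {species: c for species, c in totals.items() if c != 0}


net = net_reaction([ACTIVATION, TRANSFER, PPI_HYDROLYSIS])
print(net)
```

The intermediates aminoacyl-AMP and PPᵢ cancel, leaving ATP converted into AMP and 2 Pᵢ, which is the sense in which two high-phosphoryl-transfer-potential bonds are spent per aminoacyl-tRNA.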

The activation and transfer steps for a particular amino acid are cata- 
lyzed by the same aminoacyl-tRNA synthetase. Indeed, the aminoacyl- 
AMP intermediate does not dissociate from the synthetase. Rather, it is tightly 
bound to the active site of the enzyme by noncovalent interactions. 
Aminoacyl-AMP is normally a transient intermediate in the synthesis of 
aminoacyl-tRNA, but it is relatively stable and readily isolated if tRNA is 
absent from the reaction mixture. 

We have already encountered an acyl adenylate intermediate in fatty acid 
activation (Section 22.2). The major difference between these reactions is 
that the acceptor of the acyl group is CoA in fatty acid activation and tRNA 
in amino acid activation. The energetics of these biosyntheses are very simi- 
lar: both are made irreversible by the hydrolysis of pyrophosphate. 


Aminoacyl-tRNA synthetases have highly discriminating 
amino acid activation sites 


Each aminoacyl-tRNA synthetase is highly specific for a given amino acid. 
Indeed, a synthetase will incorporate the incorrect amino acid only once in 
10⁴ or 10⁵ catalytic reactions. How is this level of specificity achieved? Each 
aminoacyl-tRNA synthetase takes advantage of the properties of its amino 
acid substrate. Let us consider the challenge faced by threonyl-tRNA syn- 
thetase. Threonine is particularly similar to two other amino acids—namely, 
valine and serine. Valine has almost exactly the same shape as that of threo- 
nine, except that valine has a methyl group in place of a hydroxyl group. 
Serine has a hydroxyl group, as does threonine, but lacks the methyl group. 
How can the threonyl-tRNA synthetase avoid coupling these incorrect 
amino acids to threonyl-tRNA? 

The structure of the amino acid-binding site of threonyl-tRNA 
synthetase reveals how valine is avoided (Figure 30.8). The synthetase 
contains a zinc ion, bound to the enzyme by two histidine residues and one 
cysteine residue. The remaining coordination sites are available for substrate 
binding. Threonine coordinates to the zinc ion through its amino group and 
its side-chain hydroxyl group. The side-chain hydroxyl group is further 
recognized by an aspartate residue that hydrogen bonds to it. The methyl 
group present in valine in place of this hydroxyl group cannot participate in 
these interactions; valine is excluded from this active site and, hence, does 
not become adenylated and transferred to threonyl-tRNA (abbreviated 
tRNAᵀʰʳ). The use of a zinc ion appears to be unique to threonyl-tRNA 
synthetase; other aminoacyl-tRNA synthetases have different strategies for 
recognizing their cognate amino acids. The carboxylate group of the 
correctly positioned threonine is available to attack the α phosphoryl group of 
ATP to form the aminoacyl adenylate. 


Figure 30.8 Active site of threonyl-tRNA 
synthetase. Notice that the amino 
acid-binding site includes a zinc ion (green 
ball) that coordinates threonine through its 
amino and hydroxyl groups. 


The zinc site is less able to discriminate against serine because this 
amino acid does have a hydroxyl group that can bind to the zinc ion. Indeed, 
with only this mechanism available, threonyl-tRNA synthetase does 
mistakenly couple serine to threonyl-tRNA at a rate 10⁻² to 10⁻³ times that for 
threonine. As noted in the discussion of Table 30.1, this error rate is likely 
to lead to many translation errors. How is a higher level of specificity achieved? 


Proofreading by aminoacyl-tRNA synthetases increases 
the fidelity of protein synthesis 


Threonyl-tRNA synthetase can be incubated with tRNAᵀʰʳ that has 
been covalently linked with serine (Ser-tRNAᵀʰʳ); the tRNA has been 
“mischarged.” The reaction is immediate: a rapid hydrolysis of the 
aminoacyl-tRNA forms serine and free tRNA. In contrast, incubation with 
correctly charged Thr-tRNAᵀʰʳ results in no reaction. Thus, threonyl-tRNA 
synthetase contains an additional functional site that hydrolyzes 
Ser-tRNAᵀʰʳ but not Thr-tRNAᵀʰʳ. This editing site provides an opportunity 
for the synthetase to correct its mistakes and improve its fidelity to less 
than one mistake in 10⁴. The results of structural and mutagenesis studies 
revealed that the editing site is more than 20 Å from the activation site 
(Figure 30.9). This editing site readily accepts and cleaves Ser-tRNAᵀʰʳ but 
does not cleave Thr-tRNAᵀʰʳ. The discrimination of serine from threonine 
is easy because threonine contains an extra methyl group; a site that 
conforms to the structure of serine will sterically exclude threonine. 

Most aminoacyl-tRNA synthetases contain editing sites in addition to 
activation sites. These complementary pairs of sites function as a double 
sieve to ensure very high fidelity. In general, the acylation site rejects amino 
acids that are larger than the correct one because there is insufficient room 
for them, whereas the hydrolytic site cleaves activated species that are 
smaller than the correct one. 
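
The double-sieve logic can be made concrete with a deliberately simplified model of threonyl-tRNA synthetase. In the Python sketch below, each amino acid is reduced to two yes/no features taken from the text (a side-chain hydroxyl group and a methyl group); the class, the function names, and the all-or-nothing treatment of each site are assumptions made for illustration. Real discrimination is a matter of rates, not absolute acceptance or rejection.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AminoAcid:
    name: str
    has_hydroxyl: bool  # side-chain hydroxyl that can coordinate the zinc ion
    has_methyl: bool    # side-chain methyl group


THR = AminoAcid("Thr", has_hydroxyl=True, has_methyl=True)
SER = AminoAcid("Ser", has_hydroxyl=True, has_methyl=False)
VAL = AminoAcid("Val", has_hydroxyl=False, has_methyl=True)


def activation_site_accepts(aa: AminoAcid) -> bool:
    """First sieve: binding requires the hydroxyl group (excludes valine)."""
    return aa.has_hydroxyl


def editing_site_hydrolyzes(aa: AminoAcid) -> bool:
    """Second sieve: the site fits serine; the methyl of threonine is excluded."""
    return not aa.has_methyl


def remains_charged(aa: AminoAcid) -> bool:
    """True if the amino acid passes activation and then escapes editing."""
    return activation_site_accepts(aa) and not editing_site_hydrolyzes(aa)


for aa in (THR, SER, VAL):
    print(aa.name, remains_charged(aa))
```

In this toy model only threonine survives both sieves: valine fails at the activation site, and serine is activated but then removed at the editing site.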

The structure of the complex between threonyl-tRNA synthetase and 
its substrate reveals that the aminoacylated CCA can swing out of the 
activation site and into the editing site (Figure 30.10). Thus, the aminoacyl- 
tRNA can be edited without dissociating from the synthetase. This proof- 
reading, which depends on the conformational flexibility of a short stretch of 
polynucleotide sequence, is entirely analogous to that of DNA polymerase 
(Section 28.1). In both cases, editing without dissociation significantly 
improves fidelity with only modest costs in time and energy. 

A few synthetases achieve high accuracy without editing. For example, 
tyrosyl-tRNA synthetase has no difficulty discriminating between tyrosine 
and phenylalanine; the hydroxyl group on the tyrosine ring enables tyrosine 
to bind to the enzyme 10⁴ times as strongly as phenylalanine. Proofreading 
has been selected in evolution only when fidelity must be enhanced beyond what 
can be obtained through an initial binding interaction. 


Figure 30.9 Editing site. Mutagenesis 
studies revealed the position of the editing 
site (shown in green) in threonyl-tRNA 
synthetase. Only one subunit of the dimeric 
enzyme is shown here and in subsequent 
illustrations. [Drawn from 1QF6.pdb.] 


Figure 30.10 Editing of aminoacyl-tRNA. 
The flexible CCA arm of an aminoacyl-tRNA can 
move the amino acid between the activation 
site and the editing site. If the amino acid fits 
well into the editing site, the amino acid is 
removed by hydrolysis. 


How do synthetases choose their tRNA partners? This enormously 
important step is the point at which “translation” takes place, the point at 
which the correlation between the amino acid and the nucleic acid worlds is 
made. In a sense, aminoacyl-tRNA synthetases are the only molecules in 
biology that “know” the genetic code. Their precise recognition of tRNAs is 
as important for high-fidelity protein synthesis as is the accurate 
selection of amino acids. In general, tRNA recognition by the 
synthetase is different for each synthetase and tRNA pair. 
Consequently, generalities are difficult to make. We will examine 
the interaction of threonyl-tRNA synthetase with its tRNA 
partner. 


Figure 30.11 Threonyl-tRNA synthetase complex. The 
structure shows the complex between threonyl-tRNA synthetase 
and tRNAᵀʰʳ. Notice that the synthetase binds to both the 
acceptor stem and the anticodon loop. [Drawn from 
1QF6.pdb.] 

Some synthetases recognize their tRNA partners primarily on 
the basis of their anticodons, although they may also recognize 
other aspects of tRNA structure that vary among different 
tRNAs. The most direct evidence comes from crystallographic 
studies of complexes formed between synthetases and their 
cognate tRNAs. Consider, for example, the structure of the 
complex between threonyl-tRNA synthetase and tRNAᵀʰʳ 
(Figure 30.11). As expected, the CCA arm extends into the 
zinc-containing activation site, where it is well positioned to accept 
threonine from threonyl adenylate. The enzyme interacts exten- 
sively not only with the acceptor stem of the tRNA, but also 
with the anticodon loop. The interactions with the anticodon 
loop are particularly revealing. Each base within the sequence 
5’-CGU-3' of the anticodon participates in hydrogen bonds 
with the enzyme; those with the second two bases (G and U) 
appear to be more important because the synthetase interacts 
just as efficiently with the anticodons GGU and UGU. Although 
interactions between the enzyme and the anticodon are often 
crucial for correct recognition, Figure 30.12 shows that many 


aspects of tRNA molecules are recognized by synthetases. Note that many 
of the recognition sites are loops rich in unusual bases that can provide 
structural identifiers. 


Figure 30.12 Recognition sites on tRNA. 
Circles represent nucleotides, and the sizes of the 
circles are proportional to the frequency with 
which they are used as recognition sites by 
aminoacyl-tRNA synthetases. The numbers 
indicate the positions of the nucleotides in the 
base sequence, beginning from the 5′ end of 
the tRNA molecule. [After M. Ibba and 
D. Söll, Annu. Rev. Biochem. 69:617–650, 1981, 
p. 636.] 


Aminoacyl-tRNA synthetases can be divided into two classes 


At least one aminoacyl-tRNA synthetase exists for each amino acid. 

The diverse sizes, subunit composition, and sequences of these enzymes 
were bewildering for many years. Could it be that essentially all synthetases 
evolved independently? The determination of the three-dimensional struc- 
tures of several synthetases followed by more-refined sequence comparisons 
revealed that different synthetases are, in fact, related. Specifically, syn- 
thetases fall into two classes, termed class I and class II, each of which includes 
enzymes specific for 10 of the 20 amino acids (Table 30.3). Intriguingly, syn- 
thetases from the two classes bind to different faces of the tRNA molecule 
(Figure 30.13). The CCA arm of tRNA adopts different conformations to 
accommodate these interactions; the arm is in the helical conformation 
observed for free tRNA (see Figures 30.4 and 30.5) for class II enzymes and 
in a hairpin conformation for class I enzymes. These two classes also differ in 
other ways. 




1. Class I enzymes acylate the 2'-hydroxyl group of the terminal adenosine 
of tRNA, whereas class II enzymes (except the enzyme for Phe-tRNA) 
acylate the 3’-hydroxyl group. 


2. The two classes bind ATP in different conformations. 


3. Most class I enzymes are monomeric, whereas most class II enzymes are 
dimeric. 


Why did two distinct classes of aminoacyl-tRNA synthetases evolve? 
The observation that the two classes bind to distinct faces of tRNA suggests 
a possibility. Recognition sites on both faces of tRNA may have been 
required to allow the recognition of 20 different tRNAs. 


30.3 The Ribosome Is the Site of Protein Synthesis 


We turn now to ribosomes, the molecular machines that coordinate the 
interplay of charged tRNAs, mRNA, and proteins that leads to protein 
synthesis. An E. coli ribosome is a ribonucleoprotein assembly with a mass
of about 2700 kd, a diameter of approximately 250 Å, and a sedimentation
coefficient of 70S. The 20,000 ribosomes in a bacterial cell constitute nearly
a fourth of its mass. 

A ribosome can be dissociated into a large subunit (50S) and a small
subunit (30S). These subunits can be further split into their constituent proteins


Table 30.3 Classification and subunit structure of
aminoacyl-tRNA synthetases in E. coli

Class I        Class II

Arg (α)        Ala (α4)
Cys (α)        Asn (α2)
Gln (α)        Asp (α2)
Glu (α)        Gly (α2β2)
Ile (α)        His (α2)
Leu (α)        Lys (α2)
Met (α)        Phe (α2β2)
Trp (α2)       Ser (α2)
Tyr (α2)       Pro (α2)
Val (α)        Thr (α2)


Figure 30.13 Classes of
aminoacyl-tRNA synthetases. Notice that
class I and class II synthetases recognize
different faces of the tRNA molecule. The CCA 
arm of tRNA adopts different conformations in 
complexes with the two classes of synthetase. 
Note that the CCA arm of the tRNA is turned 
toward the viewer (see Figures 30.4 and 
30.5). [Drawn from 1EUY.pdb and 1QF6.pdb.] 
Figure 30.14 The ribosome at high
resolution. Detailed models of the ribosome 
based on the results of x-ray crystallographic 
studies of the 70S ribosome and the 30S and 
50S subunits: (left) view of the part of the 
50S subunit that interacts with the 30S 
subunit; (center) side view of the 70S 
ribosome; (right) view of the part of the 30S 
subunit that interacts with the 50S subunit. 
23S RNA is shown in yellow, 5S RNA in 
orange, 16S RNA in green, proteins of the 
50S subunit in red, and proteins of the 30S 
subunit in blue. Notice that the interface 
between the 50S and the 30S subunits 
consists entirely of RNA. [Drawn from 1GIX.pdb
and 1GIY.pdb.]


and RNAs. The 30S subunit contains 21 different proteins (referred to as S1
through S21) and a 16S RNA molecule. The 50S subunit contains 34 different
proteins (L1 through L34) and two RNA molecules, a 23S and a 5S
species. A ribosome contains one copy of each RNA molecule, two copies
each of the L7 and L12 proteins, and one copy of each of the other proteins.
The L7 protein is identical with L12 except that its amino terminus is
acetylated. Both the 30S and the 50S subunits can be reconstituted in vitro from
their constituent proteins and RNA, as was first achieved by Masayasu
Nomura in 1968. This reconstitution is an outstanding example of the principle
that supramolecular complexes can form spontaneously from their
macromolecular constituents.

Astounding progress on the structure of the ribosome has been made by 
x-ray crystallographic methods, after the pioneering work by Ada Yonath. 
The structures of both the 30S and the 50S subunits as well as the
complete 70S ribosome have been determined at or close to atomic resolution
(Figure 30.14). The determination of the structure of the 70S ribosome
requires the positioning of more than 100,000 atoms. The features of these 
structures are in remarkable agreement with interpretations of less-direct 
experimental probes. These structures provide an invaluable framework for 
examining the mechanism of protein synthesis. 


Ribosomal RNAs (5S, 16S, and 23S rRNA) play a central 
role in protein synthesis 


The prefix ribo in the name ribosome is apt because RNA constitutes nearly 
two-thirds of the mass of these large molecular assemblies. The three RNAs
present (5S, 16S, and 23S) are critical for ribosomal architecture and
function. They are formed by the cleavage of primary 30S transcripts and
further processing. These molecules fold into structures that allow them to 
form internal base pairs. Their base-pairing patterns were deduced by com- 
paring the nucleotide sequences of many species to detect conserved 
sequences as well as conserved base pairings. For instance, the 16S RNA of
one species may have a G-C base pair, whereas another may have an A-U 
base pair, but the location of the base pair is the same in both molecules. 
Chemical modification and digestion experiments supported the structures 
deduced from sequence comparisons (Figure 30.15). The striking finding 
is that across all species ribosomal RNAs (rRNAs) are folded into defined 
structures that have many short duplex regions. This conclusion and essen- 
tially all features of the secondary structure have been confirmed by the 
x-ray crystallographically determined structures. 
Figure 30.15 Ribosomal RNA folding pattern. (A) The secondary structure of 16S
ribosomal RNA deduced from sequence comparison and the results of chemical studies. (B) The
tertiary structure of 16S RNA determined by x-ray crystallography. [(A) Courtesy of Dr. Bryn Weiser
and Dr. Harry Noller; (B) drawn from 1FJG.pdb.]
Figure 30.16 Transfer RNA-binding
sites. (A) Three tRNA-binding sites are 
present on the 70S ribosome. They are called 
the A (for aminoacyl), P (for peptidyl), and E 
(for exit) sites. Each tRNA molecule contacts 
both the 30S and the 50S subunit. (B) The 
tRNA molecules in sites A and P are base-paired
with mRNA. [(B) Drawn from 1JGP.pdb.]
Figure 30.17 An active ribosome. This
schematic representation shows the relations 
among the key components of the translation 
machinery. 
For many years, ribosomal proteins were presumed to orchestrate
protein synthesis and ribosomal RNAs were presumed to serve primarily
as structural scaffolding. The current view is almost the reverse. The
discovery of catalytic RNA made biochemists receptive to the possibility 
that RNA plays a much more active role in ribosomal function. The detailed 
structures make it clear that the key sites in the ribosome, such as those that 
catalyze the formation of the peptide bond and interact with mRNA and 
tRNA, are composed almost entirely of RNA. Contributions from the 
proteins are minor. Many of the proteins have elongated structures that 
“snake” their way into the RNA matrix. The almost inescapable conclusion 
is that the ribosome initially consisted only of RNA and that the proteins 
were added later to fine-tune its functional properties. This conclusion has 
the pleasing consequence of dodging a “chicken and egg” question: How 
can complex proteins be synthesized if complex proteins are required for 
protein synthesis? 


Ribosomes have three tRNA-binding sites that 
bridge the 30S and 50S subunits 


Three tRNA-binding sites in ribosomes are arranged to allow the formation 
of peptide bonds between amino acids encoded by the codons on mRNA 
(Figure 30.16). The mRNA fragment being translated at a given moment is 
bound within the 30S subunit. Each of the tRNA molecules is in contact 
with both the 30S subunit and the 50S subunit. At the 30S end, two of the 
three tRNA molecules are bound to the mRNA through anticodon–codon
base pairs. These binding sites are called the A site (for aminoacyl) and the 
P site (for peptidyl). The third tRNA molecule is bound to an adjacent site 
called the E site (for exit). 

The other end of each tRNA molecule, the end without the anticodon, 
interacts with the 50S subunit. The acceptor stems of the tRNA molecules
occupying the A site and the P site converge at a site where a peptide bond 
is formed. A tunnel connects this site to the back of the ribosome, through 
which the polypeptide chain passes during synthesis (Figure 30.17). 


The start signal is usually AUG preceded by several bases 
that pair with 16S rRNA 


How does protein synthesis start? The simplest possibility would be for the 
first 3 nucleotides of each mRNA to serve as the first codon; no special start 


5’                          3’
AGCACGAGGGGAAAUCUGAUGGAACGCUAC   E. coli trpA
UUUGGAUGGAGUGAAACGAUGGCGAUUGCA   E. coli araB
GGUAACCAGGUAACAACCAUGCGAGUGUUG   E. coli thrA
CAAUUCAGGGUGGUGAAUGUGAAACCAGUA   E. coli lacI
AAUCUUGGAGGCUUUUUUAUGGUUCGUUCU   φX174 phage A protein
UAACUAAGGAUGAAAUGCAUGUCUAAGACA   Qβ phage replicase
UCCUAGGAGGUUUGACCUAUGCGAGCUUUU   R17 phage A protein
AUGUACUAAGGAGGUUGUAUGGAACAACGC   λ phage cro

Labels: the upstream purine-rich region pairs with 16S rRNA; the
initiator codon pairs with initiator tRNA.


signal would then be needed. However, experiments show that translation 
does not begin immediately at the 5’ terminus of mRNA. Indeed, the first 
translated codon is nearly always more than 25 nucleotides away from the 5’ 
end. Furthermore, in prokaryotes, many mRNA molecules are polycistronic, 
or polygenic—that is, they encode two or more polypeptide chains. For 
example, a single mRNA molecule about 7000 nucleotides long specifies 
five enzymes in the biosynthetic pathway for tryptophan in E. coli. Each of 
these five proteins has its own start and stop signals on the mRNA. In fact, 
all known mRNA molecules contain signals that define the beginning and end 
of each encoded polypeptide chain. 

A clue to the mechanism of initiation was the finding that nearly half the 
amino-terminal residues of proteins in E. coli are methionine. In fact, the 
initiating codon in mRNA is AUG (methionine) or, less frequently, GUG 
(valine) or, rarely, UUG (leucine). What additional signals are necessary to
specify a translation start site? The first step toward answering this question
was the isolation of initiator regions from a number of mRNAs. This isolation
was accomplished by using pancreatic ribonuclease to digest mRNA–ribosome
complexes (formed under conditions in which protein synthesis
could begin but elongation could not take place). As expected, each initiator 
region displays an AUG (or GUG or UUG) codon (Figure 30.18). In addi- 
tion, each initiator region contains a purine-rich sequence centered about 
10 nucleotides on the 5’ side of the initiator codon. 

The role of this purine-rich region, called the Shine-Dalgarno sequence,
became evident when the sequence of 16S rRNA was elucidated. The 3’
end of this rRNA component of the 30S subunit contains a sequence of
several bases that is complementary to the purine-rich region in the initiator
sites of mRNA. Mutagenesis of the CCUCC sequence near the 3’ end of
16S rRNA to ACACA markedly interferes with the recognition of start sites
in mRNA. This result and other evidence show that the initiator region of
mRNA binds very near the 3’ end of the 16S rRNA. The number of base
pairs linking mRNA and 16S rRNA ranges from three to nine. Thus, two
kinds of interactions determine where protein synthesis starts: (1) the pairing of
mRNA bases with the 3’ end of 16S rRNA and (2) the pairing of the initiator
codon on mRNA with the anticodon of an initiator tRNA molecule.
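
The two interactions can be illustrated with a short Python sketch. It is only a toy model under stated assumptions: the function names, the 20-nucleotide upstream window, and the use of the five-base CCUCC stretch alone (rather than the full 3’ end of 16S rRNA) are choices made here for illustration, and only exact Watson-Crick matches are counted.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}
START_CODONS = ("AUG", "GUG", "UUG")


def reverse_complement(rna):
    """Sequence that pairs (antiparallel) with rna, written 5' -> 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def longest_sd_match(upstream, anti_sd="CCUCC"):
    """Longest piece of the mRNA-side partner of anti_sd found in upstream."""
    sd = reverse_complement(anti_sd)  # GGAGG for CCUCC
    best = ""
    for i in range(len(sd)):
        for j in range(i + 1, len(sd) + 1):
            piece = sd[i:j]
            if len(piece) > len(best) and piece in upstream:
                best = piece
    return best


def candidate_starts(mrna, window=20, min_pairs=3):
    """Start codons preceded, within `window` nt, by >= min_pairs SD bases.

    window and min_pairs are illustrative defaults, not measured values.
    """
    hits = []
    for i in range(len(mrna) - 2):
        codon = mrna[i:i + 3]
        if codon in START_CODONS:
            match = longest_sd_match(mrna[max(0, i - window):i])
            if len(match) >= min_pairs:
                hits.append((i, codon, match))
    return hits


# E. coli trpA initiation region from Figure 30.18
trpA = "AGCACGAGGGGAAAUCUGAUGGAACGCUAC"
print(candidate_starts(trpA))
```

For the trpA region the sketch reports a single candidate, the AUG at index 18, preceded by a four-base stretch (GAGG) that can pair with the rRNA sequence.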


Bacterial protein synthesis is initiated by formylmethionyl transfer RNA 


As stated earlier, methionine is the first amino acid in many E. coli proteins. 
However, the methionine residue found at the amino-terminal end of E. coli 
proteins is usually modified. In fact, protein synthesis in bacteria starts with 
the modified amino acid N-formylmethionine (fMet). A special tRNA brings
formylmethionine to the ribosome to initiate protein synthesis. This initiator
tRNA (abbreviated as tRNAf) differs from the tRNA that inserts
Figure 30.18 Initiation sites.
Sequences of mRNA initiation sites for protein 
synthesis in some bacterial and viral mRNA 
molecules. Comparison of these sequences 
reveals some recurring features. 
Figure 30.19 Formylation of
methionyl-tRNA. Initiator tRNA (tRNAf) is
first charged with methionine, and then a
formyl group is transferred to the
methionyl-tRNAf from
N¹⁰-formyltetrahydrofolate.


methionine in internal positions (abbreviated as tRNAm). The subscript “f”
indicates that methionine attached to the initiator tRNA can be formylated,
whereas it cannot be formylated when attached to tRNAm. Although
virtually all proteins synthesized in E. coli begin with formylmethionine, in
approximately one-half of the proteins, N-formylmethionine is removed 
when the nascent chain is 10 amino acids long. 

Methionine is linked to these two kinds of tRNAs by the same 
aminoacyl-tRNA synthetase. A specific enzyme then formylates the amino 
group of the methionine molecule that is attached to tRNAf (Figure 30.19).
The activated formyl donor in this reaction is N¹⁰-formyltetrahydrofolate,
a folate derivative that carries activated one-carbon units (Section 24.2).
Free methionine and methionyl-tRNAm are not substrates for this
transformylase. 


Formylmethionyl-tRNAf is placed in the P site of the ribosome in
the formation of the 70S initiation complex 


Messenger RNA and formylmethionyl-tRNAf must be brought to the ribosome
for protein synthesis to begin. How is this task accomplished? Three
protein initiation factors (IF1, IF2, and IF3) are essential. The 30S
ribosomal subunit first forms a complex with IF1 and IF3 (Figure 30.20). The
binding of these factors to the 30S subunit prevents it from prematurely
joining the 50S subunit to form a dead-end 70S complex, devoid of mRNA
and fMet-tRNAf. IF1 binds near the A site and directs the fMet-tRNAf to
the P site. Initiation factor 2, a member of the G-protein family, binds GTP,
and the concomitant conformational change enables IF2 to associate with
fMet-tRNAf. The IF2–GTP–initiator-tRNA complex binds with mRNA
(correctly positioned by the interaction of the Shine-Dalgarno sequence
with the 16S rRNA) and the 30S subunit to form the 30S initiation complex.
Structural changes then lead to the ejection of IF1 and IF3. IF2 stimulates
the association of the 50S subunit with the complex. The GTP bound to IF2
is hydrolyzed, leading to the release of IF2. The result is a 70S initiation
complex. The formation of the 70S initiation complex is the rate-limiting
step in protein synthesis.

When the 70S initiation complex has been formed, the ribosome is ready
for the elongation phase of protein synthesis. The fMet-tRNAf molecule
occupies the P site on the ribosome, positioned so that its anticodon pairs
with the initiating codon on mRNA. The other two sites for tRNA
molecules, the A site and the E site, are empty. The codon–anticodon
interaction in the P site establishes the reading frame for the translation of
the entire mRNA. After the initiator codon has been located, groups of three
nonoverlapping nucleotides are defined.
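
A brief Python sketch shows what fixing the reading frame means in practice. The helper name `codons_from` is hypothetical, and the sequence is the E. coli trpA initiation region from Figure 30.18, in which the initiator AUG begins at index 18 (counting from 0).

```python
def codons_from(mrna, start):
    """Split mrna into nonoverlapping triplets, beginning at index start."""
    return [mrna[i:i + 3] for i in range(start, len(mrna) - 2, 3)]


trpA = "AGCACGAGGGGAAAUCUGAUGGAACGCUAC"

in_frame = codons_from(trpA, 18)   # frame set by the initiator AUG
shifted = codons_from(trpA, 19)    # same mRNA read one nucleotide later

print(in_frame)
print(shifted)
```

Reading from the initiator codon gives AUG, GAA, CGC, UAC; starting one nucleotide later gives an unrelated set of triplets, which is why the position of the initiator tRNA determines every codon that follows.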


Elongation factors deliver aminoacyl-tRNA to the ribosome 


At this point, fMet-tRNAf occupies the P site, and the A site is vacant. The
particular species inserted into the empty A site depends on the mRNA 
codon in the A site. However, the appropriate aminoacyl-tRNA does not 
simply leave the synthetase and diffuse to the A site. Rather, it is delivered 
to the A site in association with a 43-kd protein called elongation factor Tu 
(EF-Tu), another member of the G-protein family, which requires GTP 
for activity. EF-Tu binds aminoacyl-tRNA only in its GTP form 
(Figure 30.21) and releases it to the ribosome in its GDP form. The binding 
of EF-Tu to aminoacyl-tRNA serves two functions. First, EF-Tu protects 
the delicate ester linkage in aminoacyl-tRNA from hydrolysis. Second, the 
GTP in EF-Tu is hydrolyzed to GDP only when an appropriate complex 
between the EF-Tu–aminoacyl-tRNA complex and the ribosome has
formed. If the anticodon is not properly paired with the codon, hydrolysis
does not take place and the aminoacyl-tRNA is not transferred to the ribo- 
some. This mechanism allows the free energy of GTP hydrolysis to contrib- 
ute to the accuracy of protein synthesis. 

EF-Tu is then reset to its GTP form by a second elongation factor, 
elongation factor Ts. EF-Ts induces the dissociation of GDP. GTP binds to
EF-Tu, and EF-Ts is concomitantly released. It is noteworthy that EF-Tu
does not interact with fMet-tRNAf. Hence, this initiator tRNA is not
delivered to the A site. In contrast, Met-tRNAm, like all other
aminoacyl-tRNAs, does bind to EF-Tu. These findings account for the fact that
internal AUG codons are not read by the initiator tRNA. Conversely, IF2
recognizes fMet-tRNAf but no other tRNA. The cycle of elongation
continues until a termination codon is met.


This GTP-GDP cycle of EF-Tu is reminiscent of those of the
heterotrimeric G proteins in signal transduction (Section 14.1) and the
Ras proteins in growth control (Section 14.3). This similarity is due to their 
shared evolutionary heritage, seen in the homology of the amino-terminal 
domain of EF-Tu to the P-loop NTPase domains in the other G proteins. 
The other two domains of the tripartite EF-Tu are distinctive; they mediate 
interactions between aminoacyl-tRNA and the ribosome. In all these related 
enzymes, the change in conformation between the GTP and the GDP 
forms leads to a change in interaction partners. A further similarity is the 
requirement that an additional protein catalyzes the exchange of GTP for 
GDP; EF-Ts catalyzes the exchange for EF-Tu, just as an activated
receptor does for a heterotrimeric G protein.


Peptidyl transferase catalyzes peptide-bond synthesis 


With both the P site and the A site occupied by aminoacyl-tRNA, the stage 
is set for the formation of a peptide bond: the formylmethionine molecule 
linked to the initiator tRNA will be transferred to the amino group of the 
amino acid in the A site. The formation of the peptide bond, one of the most 
important reactions in life, is a thermodynamically spontaneous reaction 
catalyzed by a site on the 23S rRNA of the 50S subunit called the peptidyl
transferase center. This catalytic center is located deep in the 50S subunit 
near the tunnel that allows the nascent peptide to leave the ribosome. 

The ribosome derives much of its catalytic power from what is called 
catalysis by proximity and orientation. The ribosome positions and orients 


Figure 30.21 Structure of elongation factor
Tu. The structure of a complex between elongation
factor Tu (EF-Tu) and an aminoacyl-tRNA. Notice the
P-loop NTPase domain (purple shading) at the
amino-terminal end of EF-Tu. This NTPase domain is
similar to those in other G proteins. [Drawn from 1B23.pdb.]

Figure 30.20 Translation initiation in
prokaryotes. Initiation factors aid the
assembly first of the 30S initiation complex
and then of the 70S initiation complex.

Figure 30.22 Peptide-bond formation. The amino group of the aminoacyl-tRNA attacks the
carbonyl group of the ester linkage of the peptidyl-tRNA to form a tetrahedral intermediate. This 
intermediate collapses to form the peptide bond and release the deacylated tRNA. 


the two substrates so that they are situated to take advantage of the inherent 
reactivity of an amine group (on the aminoacyl-tRNA in the A site) with 
an ester (on the initiator tRNA in the P site). The amino group of the 
aminoacyl-tRNA in the A site is positioned to attack the ester linkage 
between the initiator tRNA and the formylmethionine molecule in the P 
site (Figure 30.22). The peptidyl transferase center includes bases that 
promote this reaction by helping to form an –NH2 group on the A-site
aminoacyl-tRNA and by helping to stabilize the tetrahedral intermediate 
that forms. This reaction is, in many ways, analogous to the reverse of the 
reaction catalyzed by serine proteases such as chymotrypsin (Section 9.1). 
The peptidyl-tRNA is analogous to the acyl-enzyme form of a serine pro- 
tease. In a serine protease, the acyl-enzyme is generated with the use of the 
free energy associated with cleaving an amide bond. In the ribosome, the 
free energy necessary to form the analogous species, an aminoacyl-tRNA, 
comes from the ATP that is cleaved by the aminoacyl-tRNA synthetase 
before the arrival of the tRNA at the ribosome. 


The formation of a peptide bond is followed by the GTP-driven 
translocation of tRNAs and mRNA 


With the formation of the peptide bond, the peptide chain is now attached 
to the tRNA whose anticodon is in the A site on the 30S subunit. The two
subunits rotate with respect to one another, and this structural change 
places the CCA end of the same tRNA and its peptide in the P site of the 
large subunit (Figure 30.23). However, protein synthesis cannot continue 
without the translocation of the mRNA and the tRNAs within the ribosome. 
The mRNA must move by a distance of three nucleotides so that the next 
codon is positioned in the A site for interaction with the incoming aminoacyl- 
tRNA. At the same time, the deacylated tRNA moves out of the P site 
into the E site on the 30S subunit and the peptidyl-tRNA moves out of the 
A site into the P site on the 30S subunit. The movement of the peptidyl- 
tRNA into the P site shifts the mRNA by one codon, exposing the next 
codon to be translated in the A site. 

The three-dimensional structure of the ribosome undergoes significant 
change during translocation, and evidence suggests that translocation may 
result from properties of the ribosome itself. However, protein factors accel- 
erate the process. Translocation is enhanced by elongation factor G (EF-G, 
also called translocase). A possible mechanism for accelerating the translocation 
Figure 30.23 Mechanism of protein
synthesis. The cycle begins with
peptidyl-tRNA in the P site. An
aminoacyl-tRNA binds in the A site. With
both sites occupied, a new peptide bond is
formed. The tRNAs and the mRNA are
translocated through the action of elongation
factor G, which moves the deacylated tRNA
to the E site. Once there, the tRNA is free to
dissociate to complete the cycle.


process is shown in Figure 30.24. First, EF-G in the GTP form binds to
the ribosome near the A site, interacting with the 23S rRNA of the 50S
subunit. The binding of EF-G to the ribosome stimulates the GTPase 
activity of EF-G. On GTP hydrolysis, EF-G undergoes a conformational 
change that displaces the peptidyl-tRNA in the A site to the P site, which 
carries the mRNA and the deacylated tRNA with it. The dissociation of 
EF-G leaves the ribosome ready to accept the next aminoacyl-tRNA into 
the A site. 

Note that the peptide chain remains in the P site on the 50S subunit 
throughout this cycle, growing into the exit tunnel. This cycle is repeated, 
with mRNA translation taking place in the 5’ → 3’ direction, as new
aminoacyl-tRNAs move into the A site, allowing the polypeptide to be 
elongated until a stop signal is found. 
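
The elongation cycle can be traced with a small Python model of the three tRNA-binding sites. This is a bookkeeping sketch, not a simulation of the chemistry: the `elongate` function and its data layout are hypothetical, the codon table is deliberately limited to the codons in the example, and the mRNA is the trpA initiation region from Figure 30.18, which ends before any stop codon.

```python
# Partial codon table: only the codons needed for the example below
# (an assumption made for brevity; the full table has 64 entries).
CODON_TABLE = {"AUG": "Met", "GAA": "Glu", "CGC": "Arg", "UAC": "Tyr"}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def elongate(mrna, start):
    """Track the A, P, and E sites as the ribosome moves along mrna."""
    # After initiation: fMet-tRNAf is in the P site; A and E are empty.
    sites = {"E": None, "P": ["fMet"], "A": None}
    a_pos = start + 3              # codon currently displayed in the A site
    history = []
    while a_pos + 3 <= len(mrna):
        codon = mrna[a_pos:a_pos + 3]
        if codon in STOP_CODONS:
            break                  # stop codons are read by release factors
        # 1. EF-Tu delivers the matching aminoacyl-tRNA to the A site.
        sites["A"] = [CODON_TABLE[codon]]
        # 2. Peptidyl transfer: the chain moves onto the A-site tRNA.
        sites["A"] = sites["P"] + sites["A"]
        sites["P"] = []            # the P-site tRNA is now deacylated
        # 3. Translocation (EF-G): P -> E, A -> P; mRNA advances one codon.
        sites["E"], sites["P"], sites["A"] = sites["P"], sites["A"], None
        a_pos += 3
        history.append((codon, list(sites["P"])))
        # 4. The deacylated tRNA dissociates from the E site.
        sites["E"] = None
    return sites["P"], history


trpA = "AGCACGAGGGGAAAUCUGAUGGAACGCUAC"
peptide, history = elongate(trpA, start=18)
print("-".join(peptide))
```

Each pass through the loop adds one residue at the carboxyl end of the chain, and the chain is always found on the P-site tRNA once translocation is complete.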

The direction of translation has important consequences. Recall that 
transcription also is in the 5’ → 3’ direction (Section 29.1). If the direction
of translation were opposite that of transcription, only fully synthesized 
mRNA could be translated. In contrast, because the directions are the same, 




Figure 30.24 Translocation mechanism. In the GTP form, EF-G binds to the EF-Tu-binding site 
on the 50S subunit. This binding stimulates GTP hydrolysis, inducing a conformational change in 
EF-G that forces the tRNAs and mRNA to move through the ribosome by a distance corresponding 
to one codon. 
Figure 30.25 Polysomes. Transcription of a
segment of DNA from E. coli generates mRNA
molecules that are immediately translated by multiple
ribosomes. [From O. L. Miller, Jr., B. A. Hamkalo, and
C. A. Thomas, Jr. Science 169:392–395, 1970.]


mRNA can be translated while it is being synthesized. In prokaryotes, 
almost no time is lost between transcription and translation. The 5’ end of 
mRNA interacts with ribosomes very soon after it is made, well before the
3’ end of the mRNA molecule is finished. An important feature of prokaryotic
gene expression is that translation and transcription are closely coupled in space
and time. 

Many ribosomes can be translating an mRNA molecule simultaneously. 
This parallel synthesis markedly increases the efficiency of mRNA transla- 
tion. The group of ribosomes bound to an mRNA molecule is called a polyri- 
bosome or a polysome (Figure 30.25). Recent work shows that the ribosomes 
are arranged so as to protect the mRNA and to facilitate easy exchange of 
the substrates and products with the cytoplasm. The ribosomes in the poly- 
some are in a helical array around the mRNA with the tRNA binding sites 
and peptide exit tunnel exposed to the cytoplasm. 


Protein synthesis is terminated by release factors that read stop codons 


The final phase of translation is termination. How does the synthesis of a 
polypeptide chain come to an end when a stop codon is encountered? No 
tRNAs with anticodons complementary to the stop codons (UAA, UGA,
or UAG) exist in normal cells. Instead, these stop codons are recognized by
proteins called release factors (RFs). One of these release factors, RF1,
recognizes UAA or UAG. A second factor, RF2, recognizes UAA or UGA. A
third factor, RF3, another GTPase, mediates interactions between RF1 or 
RF2 and the ribosome. 
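
The division of labor between RF1 and RF2 amounts to a small lookup table, sketched here in Python. The dictionary and function names are hypothetical; the codon assignments are those given in the text.

```python
# Stop codons recognized by each bacterial release factor (from the text).
RELEASE_FACTORS = {
    "RF1": {"UAA", "UAG"},
    "RF2": {"UAA", "UGA"},
}


def factors_recognizing(codon):
    """Names of the release factors that recognize codon (empty if none)."""
    return sorted(
        name for name, codons in RELEASE_FACTORS.items() if codon in codons
    )


for codon in ("UAA", "UAG", "UGA", "AUG"):
    print(codon, factors_recognizing(codon))
```

UAA is covered by both factors, whereas UAG and UGA each depend on a single factor; a sense codon such as AUG is recognized by neither.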

RF1 and RF2 are compact proteins that, in eukaryotes, resemble a 
tRNA molecule. When bound to the ribosome, the proteins unfold to 
bridge the gap between the stop codon on the mRNA and the peptidyl 
transferase center on the 50S subunit (Figure 30.26). The RF interacts with 
the peptidyl transferase center with a loop containing a highly conserved 
glycine-glycine-glutamine (GGQ) sequence, with the glutamine methylated 
on the amide nitrogen atom of the R group. This modified glutamine is 
crucial in promoting, assisted by the peptidyl transferase, a water molecule’s 
attack on the ester linkage between the tRNA and the polypeptide chain, 
freeing the polypeptide chain. The detached polypeptide leaves the ribo- 
some. Transfer RNA and messenger RNA remain briefly attached to the 
70S ribosome until the entire complex is dissociated through the hydrolysis 
of GTP in response to the binding of EF-G and another factor, called the 
ribosome release factor (RRF). 




Figure 30.26 Termination of protein synthesis. A release factor recognizes a stop codon in the 
A site and stimulates the release of the completed protein from the tRNA in the P site. 


30.4 Eukaryotic Protein Synthesis Differs from Prokaryotic 
Protein Synthesis Primarily in Translation Initiation 


The basic plan of protein synthesis in eukaryotes and archaea is similar to 
that in bacteria. The major structural and mechanistic themes recur in all 
domains of life. However, eukaryotic protein synthesis entails more protein 
components than does prokaryotic protein synthesis, and some steps are 
more intricate. Some noteworthy similarities and differences are as follows: 


1. Ribosomes. Eukaryotic ribosomes are larger. They consist of a 60S large
subunit and a 40S small subunit, which come together to form an 80S
particle having a mass of 4200 kd, compared with 2700 kd for the prokaryotic
70S ribosome. The 40S subunit contains an 18S RNA that is homologous to
the prokaryotic 16S RNA. The 60S subunit contains three RNAs: the 5S
RNA, which is homologous to the prokaryotic 5S rRNA; the 28S RNA,
which is homologous to the prokaryotic 23S molecule; and the 5.8S RNA,
which is homologous to the 5’ end of the 23S RNA of prokaryotes.


2. Initiator tRNA. In eukaryotes, the initiating amino acid is methionine 
rather than N-formylmethionine. However, as in prokaryotes, a special 
tRNA participates in initiation. This aminoacyl-tRNA is called Met-tRNAi
or Met-tRNAf (the subscript “i” stands for initiation, and “f” indicates that
it can be formylated in vitro). 


3. Initiation. The initiating codon in eukaryotes is always AUG. Eukaryotes, 
in contrast with prokaryotes, do not have a specific purine-rich sequence on 
the 5’ side to distinguish initiator AUGs from internal ones. Instead, the 
AUG nearest the 5’ end of mRNA is usually selected as the start site. A 40S
ribosome, with a bound Met-tRNAi, attaches to the cap at the 5’ end of
eukaryotic mRNA (Section 29.3) and searches for an AUG codon by 
moving step-by-step in the 3’ direction (Figure 30.27). This scanning 
process is catalyzed by helicases that move along the mRNA powered by 
ATP hydrolysis. Pairing of the anticodon of Met-tRNAi with the AUG
codon of mRNA signals that the target has been found. In almost all cases, 
eukaryotic mRNA has only one start site and hence is the template for only 
a single protein. In contrast, a prokaryotic mRNA can have multiple
Shine-Dalgarno sequences and, hence, start sites, and it can serve as a template for
the synthesis of several proteins. 


Figure 30.27 Eukaryotic translation initiation. In eukaryotes, translation initiation starts with 
the assembly of a complex on the 5’ cap that includes the 40S subunit and Met-tRNAi. Driven by
ATP hydrolysis, this complex scans the mRNA until the first AUG is reached. The 60S subunit is then 
added to form the 80S initiation complex. 
Figure 30.28 Protein interactions circularize eukaryotic mRNA.


Eukaryotes utilize many more initiation factors than do prokaryotes, and 
their interplay is much more intricate. The prefix eIF denotes a eukaryotic
initiation factor. For example, eIF-4E is a protein that binds directly to the
7-methylguanosine cap (Section 29.3), whereas eIF-2, in association with
GTP, delivers the Met-tRNAi to the ribosome. The difference in initiation
mechanism between prokaryotes and eukaryotes is, in part, a consequence 
of the difference in RNA processing. The 5’ end of mRNA is readily 
available to ribosomes immediately after transcription in prokaryotes. In 
contrast, in eukaryotes pre-mRNA must be processed and transported to 
the cytoplasm before translation is initiated. The 5’ cap provides an easily 
recognizable starting point. In addition, the complexity of eukaryotic trans- 
lation initiation provides another mechanism for regulation of gene expres- 
sion that we shall explore further in Chapter 31. 

Although most eukaryotic mRNA molecules rely on the 5’ cap to initiate 
protein synthesis, recent work has established that some mRNA molecules can 
recruit ribosomes for initiation without the use of a 5’-cap and cap-binding 
proteins. In these mRNAs, highly structured RNA sequences called internal 
ribosome entry sites (IRES) facilitate 40S ribosome binding to the mRNA. 
IRES were first discovered in the genomes of RNA viruses and have since been 
found in other viruses, as well as in a subset of cellular mRNA that appears to 
take part in development and cell stress. The molecular mechanism by which 
IRES function to initiate protein synthesis remains to be determined. 


4. The Structure of mRNA. The 5’-cap-bearing eukaryotic
mRNA is circular. The eIF-4E protein that binds to the
mRNA cap structure also binds to the poly(A) tail through
two protein intermediaries. The eIF-4E that is already
bound to the cap then binds to the eIF-4G protein, which
in turn binds to a protein associated with the poly(A) tail,
the poly(A)-binding protein (PABPI; Figure 30.28). Cap
and tail are thus brought together to form a circle of
mRNA. The circular structure may facilitate the rebinding
of the ribosomes following protein-synthesis termination.
Regulation of the activity of eIF-4G is a key control point
under normal and pathological conditions. Altered function
of eIF-4G has been implicated in fragile-X syndrome,
the most common form of inherited mental impairment, as
well as prostate and other cancers.

[Figure 30.28 credit: After H. Lodish et al., Molecular Cell Biology, 5th ed.
(W. H. Freeman and Company, 2004), Fig. 4.31.]


5. Elongation and Termination. Eukaryotic elongation factors EF1α and 
EF1βγ are the counterparts of prokaryotic EF-Tu and EF-Ts. The GTP 
form of EF1α delivers aminoacyl-tRNA to the A site of the ribosome, and 
EF1βγ catalyzes the exchange of GTP for bound GDP. Eukaryotic EF2 
mediates GTP-driven translocation in much the same way as does 
prokaryotic EF-G. Termination in eukaryotes is carried out by a single 
release factor, eRF1, compared with two in prokaryotes. Finally, eIF-3, 
like its prokaryotic counterpart IF3, prevents the reassociation of 
ribosomal subunits in the absence of an initiation complex. 


6. Organization. The components of the translation machinery in higher 
eukaryotes are organized into large complexes associated with the cytoskeleton. 
This association is believed to facilitate the efficiency of protein synthesis. 
Recall that the organization of elaborate biochemical processes into physical 
complexes is a recurring theme in biochemistry (Section 18.5 and Section 25.2). 


Mutations in initiation factor 2 cause a curious pathological condition 


eIF-4G is not the only initiation factor whose defects result in 
pathological conditions. Mutations in eukaryotic initiation factor 2 




Figure 30.29 The effects of vanishing white matter disease. (A) In the normal brain, magnetic 
resonance imaging (MRI) visualizes the white matter as dark gray. (B) In the diseased brain, this MRI 
reveals that white matter is replaced by cerebrospinal fluid, seen as white. [(A) Courtesy of Marjo S. van 
der Knaap, M.D., Ph.D., VU University Medical Center, The Netherlands. (B) M. S. van der Knaap et al., 
Lancet Neurology 5: 413-423.] 


result in a mysterious disease, called vanishing white matter (VWM) 
disease, in which nerve cells in the brain disappear and are replaced by 
cerebrospinal fluid (Figure 30.29). The white matter of the brain consists 
predominantly of nerve axons that connect the gray matter of the brain to 
the rest of the body. Death, resulting from fever or extended coma, can occur 
anywhere from a few years to decades after the onset of the disease, which 
usually begins in young children but can begin shortly after birth or even in 
adulthood. An especially puzzling aspect of the disease is its tissue specificity. 
A mutation in a biochemical process as fundamental to life as protein-synthesis 
initiation would be predicted to be lethal or at least to affect all tissues of 
the body. Diseases such as VWM graphically show that, although much 
progress has been made in biochemistry, much more research will be 
required to understand the complexities of health and disease. 


30.5 A Variety of Antibiotics and Toxins Can 
Inhibit Protein Synthesis 


Many chemicals that inhibit various aspects of protein synthesis have been 
identified. These chemicals are powerful experimental tools and clinically 
useful drugs. 


Some antibiotics inhibit protein synthesis 


The differences between eukaryotic and prokaryotic ribosomes can 
be exploited for the development of antibiotics (Table 30.4). For 
example, the antibiotic streptomycin, a highly basic trisaccharide, interferes 
with the binding of fMet-tRNA to ribosomes in prokaryotes and thereby 
prevents the correct initiation of protein synthesis. Other aminoglycoside 


Table 30.4 Antibiotic inhibitors of protein synthesis 

Antibiotic                              Action 
Streptomycin and other aminoglycosides  Inhibit initiation and cause the misreading of mRNA (prokaryotes) 
Tetracycline                            Binds to the 30S subunit and inhibits the binding of aminoacyl-tRNAs (prokaryotes) 
Chloramphenicol                         Inhibits the peptidyl transferase activity of the 50S ribosomal subunit (prokaryotes) 
Cycloheximide                           Inhibits translocation (eukaryotes) 
Erythromycin                            Binds to the 50S subunit and inhibits translocation (prokaryotes) 
Puromycin                               Causes premature chain termination by acting as an analog of aminoacyl-tRNA (prokaryotes and eukaryotes) 

Figure 30.30 Antibiotic action of 
puromycin. Puromycin resembles the 
aminoacyl terminus of an aminoacyl-tRNA. Its 
amino group joins the carboxyl group of the 
growing polypeptide chain to form an adduct 
that dissociates from the ribosome. This 
adduct is stable because puromycin has an 
amide (shown in red) rather than an ester 
linkage. 

Figure 30.31 Blocking of translocation by diphtheria 
toxin. Diphtheria toxin blocks protein synthesis in eukaryotes 
by catalyzing the transfer of an ADP-ribose unit from NAD+ 
to diphthamide, a modified amino acid residue in elongation 
factor 2 (translocase). Diphthamide is formed by the 
posttranslational modification (blue) of a histidine residue. 

antibiotics such as neomycin, kanamycin, and gentamicin interfere with the 
interaction between tRNA and the 16S rRNA of the 30S subunit (p. 892) of 
prokaryotic ribosomes. Chloramphenicol acts by inhibiting peptidyl transferase 
activity. Erythromycin binds to the 50S subunit and blocks translocation. 

The antibiotic puromycin inhibits protein synthesis in both prokaryotes 
and eukaryotes by causing nascent polypeptide chains to be released before 
their synthesis is completed. Puromycin is an analog of the terminal part of 
aminoacyl-tRNA (Figure 30.30). It binds to the A site on the ribosome and 
inhibits the entry of aminoacyl-tRNA. Furthermore, puromycin contains 
an α-amino group. This amino group, like the one on aminoacyl-tRNA, 
forms a peptide bond with the carboxyl group of the growing peptide chain. 
The product, a peptide having a covalently attached puromycin residue at 
its carboxyl end, dissociates from the ribosome. No longer used medicinally, 
puromycin is utilized mainly as an experimental tool for the investigation of 
protein synthesis. Cycloheximide, another antibiotic, blocks translocation in 
eukaryotic ribosomes, making it a useful laboratory tool for blocking protein 
synthesis in eukaryotic cells. 


Diphtheria toxin blocks protein synthesis in eukaryotes 
by inhibiting translocation 


Many antibiotics, harvested from prokaryotes for medicinal 
purposes, are inhibitors of prokaryotic protein synthesis. 
However, some prokaryotes produce protein-synthesis inhibitors 
that inhibit eukaryotic protein synthesis, leading to diseases such as 
diphtheria, which was a major cause of death in childhood before 
the advent of effective immunization. Symptoms include painful 
sore throat, hoarseness, fever, and difficulty breathing. The lethal 
effects of this disease are due mainly to a protein toxin produced by 
Corynebacterium diphtheriae, a bacterium that grows in the upper 
respiratory tract of an infected person. A few micrograms of 
diphtheria toxin is usually lethal in an unimmunized person because 
it inhibits protein synthesis. Shortly after entering a target cell, the 
toxin is cleaved into a 21-kd A fragment and a 40-kd B fragment. 
The A fragment of the toxin catalyzes the covalent modification of 
elongation factor 2, the elongation factor catalyzing translocation in 
eukaryotic protein synthesis, whereas the B fragment enables the A 
fragment to enter the cytoplasm of its target cell. 

A single A fragment of the toxin in the cytoplasm can kill a 
cell. Why is it so lethal? EF2 contains diphthamide, an unusual 
amino acid residue of unknown function that is formed by 
the posttranslational modification of histidine. The A fragment of 
the diphtheria toxin catalyzes the transfer of the ADP-ribose unit 
of NAD+ to the diphthamide ring (Figure 30.31). This ADP-ribosylation 
of a single side chain of EF2 blocks EF2’s capacity to carry out the 
translocation of the growing polypeptide chain. Protein synthesis ceases, 
accounting for the remarkable toxicity of diphtheria toxin. 


Ricin fatally modifies 28S ribosomal RNA 


Ricin is a biomolecule frequently in the news because of its potential 
use as a bioterrorism agent. Ricin is a small protein (65 kd) found in 
the seeds of the castor oil plant, Ricinus communis (Figure 30.32). It is indeed 
a deadly molecule because as little as 500 μg is lethal for an adult human 
being, and a single molecule can inhibit all protein synthesis in a cell, 
resulting in cell death. 

Ricin is a heterodimeric protein composed of a catalytic A chain joined 
by a single disulfide bond to a B chain. The B chain allows the toxin to bind 
to the target cell, and this binding leads to an endocytotic uptake of the 
dimer and the eventual release of the A chain into the cytoplasm. The A 
chain cleaves adenine from a particular adenosine nucleotide on the 28S 
rRNA. Removal of the adenine base completely inactivates the ribosome by 
preventing the binding of elongation factors. Thus, ricin and diphtheria 
toxin both act by inhibiting protein-synthesis elongation; ricin does so by 
covalently modifying rRNA, and diphtheria toxin does so by covalently 
modifying the elongation factor. 


30.6 Ribosomes Bound to the Endoplasmic Reticulum 
Manufacture Secretory and Membrane Proteins 


A newly synthesized protein in E. coli can stay in the cytoplasm or it can be 
sent to the plasma membrane, the outer membrane, the space between them, 
or the extracellular medium. Eukaryotic cells can direct proteins to internal 
sites such as lysosomes, mitochondria, chloroplasts, and the nucleus. How is 
sorting accomplished? In eukaryotes, a key choice is made soon after the syn- 
thesis of a protein begins. The ultimate destination of a protein depends 
broadly on the location of the ribosome on which it is being synthesized. 

In eukaryotic cells, a ribosome remains free in the cytoplasm unless it is 
directed to the endoplasmic reticulum (ER), the extensive membrane system 
that comprises about half the total membrane of a cell. The region that binds 
ribosomes is called the rough ER because of its studded appearance, in con- 
trast with the smooth ER, which is devoid of ribosomes (Figure 30.33). Free 
ribosomes synthesize proteins that remain within the cell, either within the 
cytoplasm or directed to organelles bounded by a double membrane, such as 
the nucleus, mitochondria, and chloroplasts. Ribosomes bound to the ER 
usually synthesize proteins destined to leave the cell or to at least contact the 
cell exterior from a position in the cell membrane. These proteins fall into 
three major classes: secretory proteins (proteins exported by the cell), lysosomal 
proteins, and proteins spanning the plasma membrane. Virtually all integral 
membrane proteins of the cell, except those located in the membranes of 
mitochondria and chloroplasts, are formed by ribosomes bound to the ER. 

A variety of strategies are used to send proteins synthesized by free 
ribosomes to the nucleus, peroxisomes, mitochondria, and chloroplasts of 
eukaryotic cells. However, in this section, we will focus on the targeting of 
proteins produced by ribosomes bound to the endoplasmic reticulum. 


Signal sequences mark proteins for translocation across 
the endoplasmic reticulum membrane 


The synthesis of proteins destined to leave the cell or become embedded in 
the plasma membrane begins on a free ribosome but, shortly after synthesis 

Figure 30.32 Castor beans. The seeds of 
castor beans from Ricinus communis are a 
rich source of oils with a wide variety of uses, 
including the production of biodiesel fuels. 
The seeds are also rich in the toxin ricin. [Ted 
Kinsman/Photo Researchers.] 


Figure 30.33 Ribosomes are bound to 
the endoplasmic reticulum. In this 
electron micrograph, ribosomes appear as 
small black dots binding to the cytoplasmic 
side of the endoplasmic reticulum to give a 
rough appearance. In contrast, the smooth 
endoplasmic reticulum is devoid of 
ribosomes. [From G. K. Voeltz, M. M. 
Rolls, and T. A. Rapoport, EMBO Rep. 
3:944-950, 2002.] 

Figure 30.35 The signal-recognition 
particle. The signal-recognition particle (SRP) 
consists of six proteins (one of which is 
SRP54) and one 300-nucleotide RNA 
molecule. The RNA has a complex structure 
with many double-helical stretches 
punctuated by single-stranded regions, shown 
as circles. [After H. Lodish et al., Molecular 
Cell Biology, 5th ed. (W. H. Freeman and 
Company, 2004). See K. Strub et al., Mol. 
Cell Biol. 11:3949-3959, 1991, and S. High 
and B. Dobberstein, J. Cell Biol. 
113:229-233, 1991.] 


[Figure 30.34 sequence panel; only fragments survive. "|" marks the cleavage site.] 
Human growth hormone       MATGSRTSL PI AEGI LQEGSA | FPT 
(label not recovered)      MALWMRLLPLLALLALWGPDPAAA | FVN 
(label not recovered)      MKWVTFISLLLFSSAYS | RGV 
Zea maize protein 19       ... | SIF 
Yeast invertase            ... | SMT 
Human influenza virus A    ... | DQ 

Figure 30.34 Amino-terminal signal sequences of some eukaryotic secretory and 
plasma-membrane proteins. The hydrophobic core (yellow) is preceded by basic residues 
(blue) and followed by a cleavage site (red) for signal peptidase. 


begins, it is halted until the ribosome is directed to the cytoplasmic side of 
the endoplasmic reticulum. When the ribosome docks with the ER mem- 
brane, protein synthesis begins again. As the newly forming peptide chain 
exits the ribosome, it is transported, cotranslationally, through the mem- 
brane into the lumen of the endoplasmic reticulum. 

Free ribosomes that are synthesizing proteins for use in the cell are 
identical with those attached to the ER. What is the process that directs the 
ribosome synthesizing a protein destined to enter the ER to bind to the ER? 
The translocation machinery consists of four components. 


1. The Signal Sequence. The signal sequence is a sequence of 9 to 12 
hydrophobic amino acid residues, sometimes containing positively charged 
amino acids (Figure 30.34). This sequence is usually near the amino termi- 
nus of the nascent polypeptide chain. The presence of the signal sequence 
identifies the nascent peptide as one that must cross the ER membrane. 
Some signal sequences are maintained in the mature protein, whereas others 
are cleaved by a signal peptidase on the lumenal side of the ER membrane 
(see Figure 30.36). 


2. The Signal-Recognition Particle (SRP). The signal-recognition particle 
recognizes the signal sequence and binds the sequence and the ribosome as 
soon as the signal sequence exits the ribosome. SRP then shepherds the 
ribosome and its nascent polypeptide chain to the ER membrane. SRP is 
a ribonucleoprotein consisting of a 7S RNA and six different proteins 
(Figure 30.35). One protein, SRP54, is a GTPase that is crucial for SRP 
function. SRP binds all ribosomes but binds tightly only to ribosomes that 
display the signal sequence. SRP samples ribosomes until it locates one 
exhibiting a signal sequence. After SRP is bound to the signal sequence, 
interactions between the ribosome and the SRP occlude the elongation- 
factor-binding site, thereby halting protein synthesis. 


3. The SRP Receptor (SR). The SRP-ribosome complex diffuses to the 
endoplasmic reticulum, where SRP binds the SRP receptor, an integral 
membrane protein consisting of two subunits, SRα and SRβ. SRα is, like 
SRP54, a GTPase. 


4. The Translocon. The SRP-SR complex delivers the ribosome to the ER 
membrane. There it docks with the translocation machinery, called the 
translocon, a multisubunit assembly of integral and peripheral membrane 
proteins. The translocon is a protein-conducting channel. This channel 

Figure 30.36 The SRP targeting cycle. (1) Protein synthesis begins on free ribosomes. (2) After 
the signal sequence has exited the ribosome, it is bound by the SRP, and protein synthesis halts. 
(3) The SRP-ribosome complex docks with the SRP receptor in the ER membrane. (4) The SRP 
and the SRP receptor simultaneously hydrolyze bound GTPs. Protein synthesis resumes and the 
SRP is free to bind another signal sequence. (5) The signal peptidase may remove the signal 
sequence as it enters the lumen of the ER. (6) Protein synthesis continues as the protein is 
synthesized directly into the ER. (7) On completion of protein synthesis, the ribosome is released. 
(8) The protein tunnel in the translocon closes. [After H. Lodish et al., Molecular Cell Biology, 
5th ed. (W. H. Freeman and Company, 2004), Fig. 16.6.] 


opens when the translocon and ribosome bind to each other. Protein 
synthesis resumes with the growing polypeptide chain passing through the 
translocon channel into the lumen of the ER. 
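
The signal sequence described in item 1 can be illustrated with a short Python sketch that looks for the longest uninterrupted run of hydrophobic residues near the amino terminus. This is only an illustration under stated assumptions: the set of residues treated as hydrophobic, the 30-residue window, and the function name are hypothetical choices, not a validated prediction method. The test sequence is the second one listed in Figure 30.34.

```python
# Hypothetical helper: find the longest uninterrupted run of hydrophobic
# residues near the amino terminus of a protein sequence.
HYDROPHOBIC = set("AVLIMFWC")  # assumed residue set, chosen for illustration


def longest_hydrophobic_run(sequence, n_terminal_window=30):
    """Return (start_index, length) of the longest hydrophobic run
    within the first n_terminal_window residues (0-based start)."""
    best_start, best_len = 0, 0
    run_start, run_len = 0, 0
    for i, residue in enumerate(sequence[:n_terminal_window].upper()):
        if residue in HYDROPHOBIC:
            if run_len == 0:
                run_start = i  # a new run begins here
            run_len += 1
            if run_len > best_len:
                best_start, best_len = run_start, run_len
        else:
            run_len = 0  # any other residue interrupts the run
    return best_start, best_len


# Second sequence listed in Figure 30.34, including the three residues
# that follow the cleavage site.
start, length = longest_hydrophobic_run("MALWMRLLPLLALLALWGPDPAAAFVN")
print(start, length)  # prints "9 8"
```

With this residue set, the strict run found is 8 residues long, slightly shorter than the 9 to 12 residues quoted above, because a proline and a glycine bound the run on either side. A more tolerant scan would need an explicit rule for which interruptions to accept.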


The interactions of the components of the translocation machinery are 
shown in Figure 30.36. For the SRP-SR complex to form, both the SRP54 
and the SRα subunits of SR must bind GTP. For the SRP-SR complex to 
then deliver the ribosome to the translocon, the two GTP molecules—one in 
SRP and the other in SR—are aligned in what is essentially an active site 
shared by the two proteins. After the ribosome has been passed along to the 
translocon, the GTPs are hydrolyzed, SRP and SR dissociate, and SRP is free 
to search for another signal sequence to begin the cycle anew. Thus, SRP acts 
catalytically. The signal peptidase, which is associated with the translocon in 
the lumen of the ER, removes the signal sequence from most proteins. 


Transport vesicles carry cargo proteins to their final destination 


As the proteins are synthesized, they fold to form their three-dimensional 
structures in the lumen of the ER. Some proteins are modified by the 
attachment of N-linked carbohydrates (Section 11.3). Finally, the proteins 
must be sorted and transported to their final destinations. Regardless of the 
destination, the principles of transport are the same. Transport is mediated 
by transport vesicles that bud off the endoplasmic reticulum (Figure 30.37). 
Transport vesicles from the ER carry their cargo (the proteins) to the Golgi 
complex, where the vesicles fuse and deposit the cargo inside the complex. 

Figure 30.37 Protein-sorting pathways. Newly synthesized proteins in the lumen of the ER are 
collected into membrane buds. These buds pinch off to form transport vesicles. The transport 
vesicles carry the cargo proteins to the Golgi complex where the cargo proteins are modified. 
Transport vesicles then carry the cargo to the final destination as directed by the v-SNARE and 
t-SNARE proteins. 


There the cargo proteins are modified—for instance, by the attachment of 
carbohydrates. From the Golgi complex, transport vesicles carry the cargo 
proteins to their final destinations, as shown in Figure 30.37. 

How does a protein end up at the correct destination? A newly synthesized 
protein will float inside the ER lumen until it binds to an integral membrane 
protein called a cargo receptor. This binding sequesters the cargo protein into a 
small region of the membrane that can subsequently form a membrane bud. 
The bud will carry the protein to a specific destination—plasma membrane, 
lysosome, or cell exterior. The key to ensuring that the protein reaches the 
proper destination is that the protein must bind to a receptor in the ER region 
associated with the protein’s destination. To ensure the proper match of protein 
with ER region, cargo receptors recognize various characteristics of the cargo 
protein, such as a particular amino acid sequence or an added carbohydrate. 

The formation of buds is facilitated by the binding of coat proteins (COPs) 
to the cytoplasmic side of the bud. The coat proteins associate with one another 
to pinch off the vesicle. After the transport vesicle has formed and is 
released, the coat proteins are shed to reveal another integral protein called 
v-SNARE (“v” for vesicle). v-SNARE will bind to a particular t-SNARE (“t” 
for target) in the target membrane. This binding leads to the fusion of the 
transport vesicle to the target membrane, and the cargo is delivered. Thus, the 
assignment of identical v-SNARE proteins to the same region of the ER 
membrane causes an ER region to be associated with a particular destination. 


Summary 


30.1 Protein Synthesis Requires the Translation of Nucleotide Sequences 
into Amino Acid Sequences 

Protein synthesis is called translation because information present as 
a nucleic acid sequence is translated into a different language, the 
sequence of amino acids in a protein. This complex process is medi- 
ated by the coordinated interplay of more than a hundred macromol- 
ecules, including mRNA, rRNAs, tRNAs, aminoacyl-tRNA syn- 
thetases, and protein factors. Given that a protein typically comprises 
from 100 to 1000 amino acids, the frequency at which an incorrect 
amino acid is incorporated in the course of protein synthesis must be 
less than 10⁻⁴. Transfer RNAs are the adaptors that make the link 
between a nucleic acid and an amino acid. These molecules, single 
chains of about 80 nucleotides, have an L-shaped structure. 
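
The reasoning behind the 10⁻⁴ figure can be checked with a little arithmetic. If each residue is misincorporated independently with probability ε, the probability that a protein of n residues contains no error is (1 − ε)^n. The sketch below tabulates this value; the independence assumption and the function name are illustrative choices rather than part of the text.

```python
def fraction_error_free(error_rate, n_residues):
    """Probability that all n_residues are correct, assuming each residue
    is misincorporated independently with probability error_rate."""
    return (1.0 - error_rate) ** n_residues


# Rows: error frequency per residue; columns: proteins of 100, 300, 1000 residues.
for error_rate in (1e-2, 1e-3, 1e-4, 1e-5):
    row = [fraction_error_free(error_rate, n) for n in (100, 300, 1000)]
    print(f"{error_rate:.0e}", " ".join(f"{p:.3f}" for p in row))
```

At an error frequency of 10⁻⁴, about 90 percent of 1000-residue proteins are made without any error, whereas at 10⁻² almost none are.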


30.2 Aminoacyl Transfer RNA Synthetases Read the Genetic Code 

Each amino acid is activated and linked to a specific transfer RNA by 
an enzyme called an aminoacyl-tRNA synthetase. Such an enzyme 
links the carboxyl group of an amino acid to the 2’- or 3’-hydroxyl 
group of the adenosine unit of a CCA sequence at the 3’ end of the 
tRNA by an ester linkage. There is at least one specific aminoacyl- 
tRNA synthetase and at least one specific tRNA for each amino acid. 
A synthetase utilizes both the functional groups and the shape of its 
cognate amino acid to prevent the attachment of an incorrect amino 
acid to a tRNA. Some synthetases have a separate active site at which 
incorrectly linked amino acids are removed by hydrolysis. A syn- 
thetase recognizes the anticodon, the acceptor stem, and sometimes 
other parts of its tRNA substrate. By specifically recognizing both 
amino acids and tRNAs, aminoacyl-tRNA synthetases implement the 
instruction of the genetic code. 

The codons of messenger RNA recognize the anticodons of trans- 
fer RNAs rather than the amino acids attached to the tRNAs. A codon 
on mRNA forms base pairs with the anticodon of the tRNA. Some 
tRNAs are recognized by more than one codon because pairing of 
the third base of a codon is less crucial than that of the other two (the 
wobble mechanism). There exist two evolutionarily distinct classes of 
synthetases, each recognizing 10 amino acids. The two classes recog- 
nize opposite faces of tRNA molecules. 
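
The wobble mechanism mentioned above can be made concrete with a small Python sketch that lists the codons a given anticodon can read. The pairing table encodes the commonly cited wobble rules, which the summary does not spell out, so it should be read as an assumption supplied for illustration; the function name is likewise hypothetical.

```python
# Pairing allowed between the first (5') base of the anticodon and the
# third (3') base of the codon. These are the commonly cited wobble rules,
# supplied here as an assumption; I denotes inosine.
WOBBLE = {"C": "G", "A": "U", "U": "AG", "G": "UC", "I": "UCA"}
WATSON_CRICK = {"A": "U", "U": "A", "G": "C", "C": "G"}


def codons_recognized(anticodon):
    """Return the codons (written 5'->3') that an anticodon (5'->3') can read."""
    first, second, third = anticodon.upper()
    # Codon and anticodon pair antiparallel: codon base 1 pairs with
    # anticodon base 3, and codon base 2 pairs with anticodon base 2.
    codon_1 = WATSON_CRICK[third]
    codon_2 = WATSON_CRICK[second]
    return [codon_1 + codon_2 + codon_3 for codon_3 in WOBBLE[first]]


print(codons_recognized("GAA"))  # ['UUU', 'UUC'], both phenylalanine codons
print(codons_recognized("IGC"))  # ['GCU', 'GCC', 'GCA'], all alanine codons
```

Only the third codon position is allowed to vary, which is why the codons read by a single tRNA differ only in their last base.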


30.3 The Ribosome Is the Site of Protein Synthesis 

Protein synthesis takes place on ribosomes, ribonucleoprotein particles 
(about two-thirds RNA and one-third protein) consisting of large 
and small subunits. In E. coli, the 70S ribosome (2500 kd) is made up 
of 30S and 50S subunits. The 30S subunit consists of 16S ribosomal 
RNA and 21 different proteins; the 50S subunit consists of 23S and 5S 
rRNA and 34 different proteins. The ribosome includes three sites for 
tRNA binding called the A (aminoacyl) site, the P (peptidyl) site, and 
the E (exit) site. 

Protein synthesis takes place in three phases: initiation, elongation, 
and termination. In prokaryotes, mRNA, formylmethionyl-tRNAf (the 
special initiator tRNA that recognizes AUG), and a 30S ribosomal 
subunit come together with the assistance of initiation factors to form 
a 30S initiation complex. A 50S ribosomal subunit then joins this complex 
to form a 70S initiation complex, in which fMet-tRNAf occupies 
the P site of the ribosome. 

Elongation factor Tu delivers the appropriate aminoacyl-tRNA 
to the ribosome’s A (aminoacyl) site as an EF-Tu–aminoacyl-tRNA–GTP 
ternary complex. EF-Tu serves both to protect the aminoacyl-tRNA 
from premature cleavage and to increase the fidelity of protein 
synthesis by ensuring that the correct anticodon–codon pairing has 
taken place before hydrolyzing GTP and releasing aminoacyl-tRNA 
into the A site. A peptide bond is formed when the amino group of 
the aminoacyl-tRNA nucleophilically attacks the ester linkage of the 
peptidyl-tRNA. On peptide-bond formation, the tRNAs and mRNA 
must be translocated for the next cycle to begin. The deacylated tRNA 
moves to the E site and then leaves the ribosome, and the peptidyl-tRNA 
moves from the A site into the P site. Elongation factor G uses 
the free energy of GTP hydrolysis to drive translocation. Protein 
synthesis is terminated by release factors, which recognize the termination 
codons UAA, UGA, and UAG and cause the hydrolysis of the 
ester bond between the polypeptide and tRNA. 


30.4 Eukaryotic Protein Synthesis Differs from Prokaryotic Protein Synthesis 
Primarily in Translation Initiation 

The basic plan of protein synthesis in eukaryotes is similar to that in 
prokaryotes, but there are some significant differences between them. 
Eukaryotic ribosomes (80S) consist of a 40S small subunit and a 60S 
large subunit. The initiating amino acid is again methionine, but it is 
not formylated. The initiation of protein synthesis is more complex in 
eukaryotes than in prokaryotes. In eukaryotes, the AUG closest to the 
5’ end of mRNA is nearly always the start site. The 40S ribosome finds 
this site by binding to the 5’ cap and then scanning the RNA until 
AUG is reached. The regulation of translation in eukaryotes provides 
a means for regulating gene expression. 
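
The scanning rule summarized above amounts to a simple search from the 5’ end for the first AUG. The Python sketch below shows that search; the function name and the example sequence are hypothetical, and the sketch ignores the exceptions implied by "nearly always" as well as IRES-dependent initiation.

```python
def first_start_codon(mrna):
    """Return the 0-based index of the AUG closest to the 5' end,
    or -1 if the sequence contains no AUG."""
    # Accept DNA-style input by converting T to U before searching.
    return mrna.upper().replace("T", "U").find("AUG")


# Hypothetical mRNA written 5'->3'; it contains two AUG triplets.
example = "GGCACUAUGGCUAUGUAA"
print(first_start_codon(example))  # prints 6, the AUG nearest the 5' end
```

The second AUG at position 12 is never reported, mirroring the preference of the scanning 40S subunit for the AUG closest to the cap.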


30.5 A Variety of Antibiotics and Toxins Can Inhibit Protein Synthesis 
Many clinically important antibiotics function by inhibiting protein 
synthesis. All steps of protein synthesis are susceptible to inhibition 
by one antibiotic or another. Diphtheria toxin inhibits protein synthe- 
sis by covalently modifying an elongation factor, thereby preventing 
elongation. Ricin, a toxin from castor beans, inhibits elongation by 
removing a crucial adenine residue from rRNA. 


30.6 Ribosomes Bound to the Endoplasmic Reticulum Manufacture 
Secretory and Membrane Proteins 

Proteins contain signals that determine their ultimate destination. The 
synthesis of all proteins begins on free ribosomes in the cytoplasm. In 
eukaryotes, protein synthesis continues in the cytoplasm unless the 
nascent chain contains a signal sequence that directs the ribosome to 
the endoplasmic reticulum. Amino-terminal signal sequences consist 
of a hydrophobic segment of 9 to 12 residues preceded by a positively 
charged amino acid. Signal-recognition particle, a ribonucleoprotein 
assembly, recognizes signal sequences and brings ribosomes bearing 
them to the ER. A GTP–GDP cycle releases the signal sequence from 
SRP and then detaches SRP from its receptor. The nascent chain is 
then translocated across the ER membrane. Proteins are transported 
throughout the cell in transport vesicles. 


Key Terms 


translation (p. 887) 
ribosome (p. 887) 
aminoacyl-tRNA synthetase (p. 888) 
transfer RNA (tRNA) (p. 889) 
codon (p. 889) 
anticodon (p. 889) 
wobble hypothesis (p. 892) 
30S subunit (p. 897) 
50S subunit (p. 897) 
Shine-Dalgarno sequence (p. 901) 
initiation factor (p. 902) 
elongation factor Tu (EF-Tu) (p. 902) 
elongation factor Ts (EF-Ts) (p. 903) 
peptidyl transferase center (p. 903) 
elongation factor G (EF-G) (translocase) (p. 904) 
polysome (p. 906) 
release factor (RF) (p. 906) 
signal sequence (p. 912) 
signal peptidase (p. 912) 
signal-recognition particle (SRP) (p. 912) 
SRP receptor (SR) (p. 912) 
translocon (p. 912) 
transport vesicle (p. 913) 
coat proteins (p. 914) 
v-SNARE (p. 914) 
t-SNARE (p. 914) 


Problems 


1. Babel fish. Why is protein synthesis also called 
translation? 


2. Careful, but not too careful. Why is it crucial that protein 
synthesis has an error frequency of 10⁻⁴? 


3. Commonalities. What features are common to all tRNA 
molecules? 


4. The ol’ two step. What two reaction steps are required for 
the formation of an aminoacyl-tRNA? 


5. The same but different. Why must tRNA molecules have 
both unique structural features and common structural 
features? 


6. Charge it. In the context of protein synthesis, what is 
meant by an activated amino acid? 


7. Synthetase mechanism. The formation of isoleucyl-tRNA 
proceeds through the reversible formation of an enzyme-bound 
Ile-AMP intermediate. Predict whether ³²P-labeled 
ATP is formed from ³²PPi when each of the following sets 
of components is incubated with the specific activating 
enzyme: 


(a) ATP and ³²PPi 
(b) tRNA, ATP, and ³²PPi 
(c) Isoleucine, ATP, and ³²PPi 


8. 1 = 2, for sufficiently large values of 1. The energetic 
equivalent of two molecules of ATP is used to activate an 
amino acid, yet only one molecule of ATP is used. Explain. 


9. Sieves. Using threonyl-tRNA synthetase as an example, 
account for the specificity of threonyl-tRNA formation. 


10. Use all available information. Suggest a reason why 
there are two classes of aminoacyl-tRNA synthetases, with 
each class recognizing a different face of the tRNA. 


11. Going wobbly. Explain how it is possible that some 
tRNA molecules recognize more than one codon. 


12. Light and heavy ribosomes. Ribosomes were isolated 
from bacteria grown in a “heavy” medium (¹³C and ¹⁵N) 
and from bacteria grown in a “light” medium (¹²C and 
¹⁴N). These 70S ribosomes were added to an in vitro system 
engaged in protein synthesis. An aliquot removed several 
hours later was analyzed by density-gradient centrifugation. 
How many bands of 70S ribosomes would you expect 
to see in the density gradient? 


13. The price of protein synthesis. What is the smallest num- 
ber of molecules of ATP and GTP consumed in the synthe- 
sis of a 200-residue protein, starting from amino acids? 
Assume that the hydrolysis of PPi is equivalent to the 
hydrolysis of ATP for this calculation. 


14. Contrasting modes of elongation. The two basic mecha- 
nisms for the elongation of biomolecules are represented in 
the adjoining illustration. In type 1, the activating group 
(X) is released from the growing chain. In type 2, the acti- 
vating group is released from the incoming unit as it is 
added to the growing chain. Indicate whether each of the 
following biosyntheses is by means of a type 1 or a type 2 
mechanism: 


[Illustration: Type 1 and Type 2 elongation mechanisms] 


(a) Glycogen synthesis 
(b) Fatty acid synthesis 
(c) C5 → C10 → C15 in cholesterol synthesis 
(d) DNA synthesis 
(e) RNA synthesis 
(f) Protein synthesis 


15. Correct phasing. What is meant by the phrase reading 
frame? 


16. Suppressing frameshifts. The insertion of a base in a cod- 
ing sequence leads to a shift in the reading frame, which in 
most cases produces a nonfunctional protein. Propose a 
mutation in a tRNA that might suppress frameshifting. 


17. Tagging a ribosomal site. Design an affinity-labeling 
reagent for one of the tRNA-binding sites in E. coli 
ribosomes. 


18. Viral mutation. An mRNA transcript of a T7 phage 
gene contains the base sequence 


1 
5'-AACUGCACGAGGUAACACAAGAUGGCU-3’ 


Predict the effect of a mutation that changes the G marked 
by an arrow to A. 


19. A new translation. A transfer RNA with a UGU anti- 
codon is enzymatically conjugated to ¹⁴C-labeled cysteine. 
The cysteine unit is then chemically modified to alanine. 
The altered aminoacyl-tRNA is added to a protein-synthe- 
sizing system containing normal components except for 
this tRNA. The mRNA added to this mixture contains the 
following sequence: 


5'-UUUUGCCAUGUUUGUGCU-3' 


What is the sequence of the corresponding radiolabeled 
peptide? 


20. Two synthetic modes. Compare and contrast protein 
synthesis by ribosomes with protein synthesis by the solid- 
phase method (see Section 3.5). 


21. Enhancing fidelity. Compare the accuracy of DNA 
replication, RNA synthesis, and protein synthesis. Which 
mechanisms are used to ensure the fidelity of each of these 
processes? 


22. Triggered GTP hydrolysis. Ribosomes markedly accel- 
erate the hydrolysis of GTP bound to the complex of 
EF-Tu and aminoacyl-tRNA. What is the biological 
significance of this enhancement of GTPase activity by 
ribosomes? 


23. Blocking translation. Devise an experimental strategy 
for switching off the expression of a specific mRNA with- 
out changing the gene encoding the protein or the gene’s 
control elements. 


24. Directional problem. Suppose that you have a protein- 
synthesis system that is synthesizing a protein designated 
A. Furthermore, you know that protein A has four trypsin-sensitive
sites, equally spaced in the protein, that, on digestion
with trypsin, yield the peptides A1, A2, A3, A4, and A5.
Peptide A1 is the amino-terminal peptide, and A5 is the
carboxyl-terminal peptide. Finally, you know that your
system requires 4 minutes to synthesize a complete protein
A. At t = 0, you add all 20 amino acids, each carrying a ¹⁴C
label.


(a) At t = 1 minute, you isolate intact protein A from the 
system, cleave it with trypsin, and isolate the five peptides. 
Which peptide is most heavily labeled? 


(b) At t = 3 minutes, what will be the order of the labeling 
of peptides from greatest to least? 


(c) What does this experiment tell you about the direction 
of protein synthesis? 


25. Translator. Aminoacyl-tRNA synthetases are the only 
component of gene expression that decodes the genetic 
code. Explain. 


26. A timing device. EF-Tu, a member of the G-protein 
family, plays a crucial role in the elongation process of 
translation. Suppose that a slowly hydrolyzable analog of 
GTP were added to an elongating system. What would be 
the effect on the rate of protein synthesis? 


27. Not just RNA. What are the roles of the protein factors 
required for protein synthesis? 


28. Membrane transport. What four components are 
required for the translocation of proteins across the endo- 
plasmic reticulum membrane? 


29. Push. Don’t pull. What is the energy source that powers 
the cotranslational movement of proteins across the endo- 
plasmic reticulum? 


30. You have to know where to look. Prokaryotic messenger 
RNAs usually contain many AUG codons. How does the 
ribosome identify the AUG specifying initiation? 


31. Fundamentally the same, yet . . . List the differences
between prokaryotic and eukaryotic protein synthesis.


32. Like a border collie. What is the role of the signal- 
recognition particle in protein translocation? 


33. An assembly line. Why is the fact that protein synthesis 
takes place on polysomes advantageous? 


34. Match 'em.
(a) Initiation
(b) Elongation
(c) Termination

1. GTP
2. AUG
3. fMet
4. RRF
5. IF2
6. Shine-Dalgarno
7. EF-Tu
8. Peptidyl transferase
9. UGA
10. Transformylase


35. Wasted effort? Transfer RNA molecules are quite large, 
given that the anticodon consists of only three nucleotides. 
What is the purpose of the rest of the tRNA molecule? 


Mechanism Problems 


36. Molecular attack. What is the nucleophile in the reac- 
tion catalyzed by peptidyl transferase? Suggest a plausible 
mechanism for this reaction. 


37. Evolutionary amino acid choice. Ornithine is structurally 
similar to lysine except ornithine’s side chain is one methylene 
group shorter than that of lysine. Attempts to chemically 
synthesize and isolate ornithinyl-tRNA proved unsuccess- 
ful. Propose a mechanistic explanation. (Hint: Six-membered 
rings are more stable than seven-membered rings.) 


Chapter Integration Problems 


38. Déja vu. Which protein in G-protein cascades plays a 
role similar to that of elongation factor Ts? 


39. Family resemblance. Eukaryotic elongation factor 2 is inhib- 
ited by ADP ribosylation catalyzed by diphtheria toxin. What 
other G proteins are sensitive to this mode of inhibition? 


40. The exceptional E. coli. In contrast with E. coli, most 
bacteria do not have a full complement of aminoacyl-tRNA 
synthetases. For instance, Helicobacter pylori, the cause 
of stomach ulcers, has tRNA^Gln but no Gln-tRNA
synthetase. However, glutamine is a common amino acid 
in H. pylori proteins. Suggest a means by which glutamine 
can be incorporated into proteins in H. pylori. (Hint: 
Glu-tRNA synthetase can misacylate tRNA^Gln.)


41. The final step. What aspect of primary structure allows 
the transfer of linear nucleic acid information into the func- 
tional three-dimensional structure of proteins? 


Data Interpretation Problems 


42. Helicase helper. The initiation factor eIF-4 displays
ATP-dependent RNA helicase activity. Another initiation
factor, eIF-4H, has been proposed to assist the action of
eIF-4. Graph A shows some of the experimental results
from an assay that can measure the activity of eIF-4 helicase
in the presence of eIF-4H.

(a) What are the effects on eIF-4 helicase activity in the
presence of eIF-4H?

(b) Why did measuring the helicase activity of eIF-4H
alone serve as an important control?

(c) The initial rate of helicase activity of 0.2 μM of eIF-4
was then measured with varying amounts of eIF-4H
(graph B). What ratio of eIF-4H to eIF-4 yielded optimal
activity?

[Graphs A and B: initial rate of helix unwinding (fmol minute^-1)]

(d) Next, the effect of RNA-RNA helix stability on the
initial rate of unwinding in the presence and absence of
eIF-4H was tested (graph C). How does the effect of
eIF-4H vary with helix stability?


[Graph C: ln (initial rate of helix unwinding) versus helix stability (ΔG in kcal mol^-1), for eIF-4A + eIF-4H and for eIF-4A only]

(e) How might eIF-4H affect the helicase activity of eIF-4A?


[Data after N. J. Richter, G. W. Rodgers, Jr., J. O. Hensold, 
and W. C. Merrick. Further biochemical and kinetic char- 
acterization of human eukaryotic initiation factor 4H. 
J. Biol. Chem. 274:35415-35424, 1999. ] 


43. Size separation. The protein-synthesizing machinery was
isolated from eukaryotic cells and briefly treated with a low 
concentration of RNase. The sample was then subjected to 
sucrose gradient centrifugation. The gradient was fractionated 
and the absorbance, or optical density (OD), at 254 nm was 
recorded for each fraction. The following plot was obtained. 


(a) What do the three peaks of absorbance in graph A 
represent? 

The experiment was repeated except that, this time, the
RNase treatment was omitted.

(b) Why is the centrifugation pattern in graph B more
complex? What do the series of peaks near the bottom of 
the centrifuge tube represent? 


[Graph B: OD 254 across the gradient, from top to bottom]


Before the isolation of the protein-synthesizing machinery, 
the cells were grown in low concentrations of oxygen 
(hypoxic conditions). Again the experiment was repeated 
without RNase treatment (graph C). 


[Graph C: OD 254 across the gradient, from top to bottom]


(c) What is the effect of growing cells under hypoxic 
conditions? 


[Data after M. Koritzinsky et al. EMBO J. 25:1114-1125,
2006.]


CHAPTER 31


The Control of Gene Expression 
in Prokaryotes 


Bacteria respond to changes in their environments. A micrograph of the light organ of a 
newly hatched squid (Euprymna scolopes) is shown on the left. The light spots are due to 
colonies of the bacteria Vibrio fischeri that live symbiotically within these organs. These
bacteria become luminescent when they reach an appropriately high density. The density is 
sensed by the circuit shown on the right in which each bacterium releases a small molecule 
into the environment. The molecule is subsequently taken up by other bacterial cells, which 
start a signaling cascade that stimulates the expression of specific genes. [S. V. Nyholm et al. 
PNAS 97(2000): 10231-10235. Copyright 2000 National Academy of Sciences.]


OUTLINE

31.1 Many DNA-Binding Proteins Recognize Specific DNA Sequences

31.2 Prokaryotic DNA-Binding Proteins Bind Specifically to Regulatory Sites in Operons

31.3 Regulatory Circuits Can Result in Switching Between Patterns of Gene Expression

31.4 Gene Expression Can Be Controlled at Posttranscriptional Levels


Even simple prokaryotic cells must respond to changes in their metabolism
or in their environments. Much of this response takes place through
changes in gene expression. A gene is expressed when it is transcribed into
RNA and, for most genes, translated into proteins. Genomes comprise
thousands of genes. Some of these genes are expressed all the time. These genes
are subject to constitutive expression. Many other genes are expressed only
under some circumstances, that is, under a particular set of physiological
conditions. These genes are subject to regulated expression. For example, the
level of expression of some genes in bacteria may vary more than 1000-fold
in response to the supply of nutrients or to environmental challenges.

In this chapter, we will examine gene-regulation mechanisms in prokaryotes,
particularly E. coli, because many of these processes were first discovered
in this organism. In Chapter 32, we will turn to gene-regulation
mechanisms in eukaryotes. We shall see both substantial similarities and
fundamental differences in comparing gene-regulatory mechanisms of the
two types of organisms.

How is gene expression controlled? Gene activity is controlled first and
foremost at the level of transcription. Whether a gene is transcribed is determined
largely by the interplay between specific DNA sequences and certain proteins
that bind to these sequences. Most often, these proteins repress the
expression of specific genes by blocking the access of RNA polymerase to their
promoters. In some cases, however, the proteins can activate the expression
of specific genes. We shall learn about several different strategies that allow
the coordinated regulation of sets of genes. Some genes are also controlled at
stages beyond the level of transcription, and we shall examine several
mechanisms at these stages. Finally, we will examine several important examples of
how gene expression is regulated in response to changes in the
concentrations of specific molecules in the environment of prokaryotic cells.


31.1 Many DNA-Binding Proteins Recognize 
Specific DNA Sequences 


How do regulatory systems distinguish the genes that need to be activated 
or repressed from genes that are constitutive? After all, the DNA sequences 
of genes themselves do not have any distinguishing features that would 
allow regulatory systems to recognize them. Instead, gene regulation 
depends on other sequences in the genome. In prokaryotes, these regulatory 
sites are close to the region of the DNA that is transcribed. Regulatory sites 
are usually binding sites for specific DNA-binding proteins, which can 
stimulate or repress gene expression. These regulatory sites were first
identified in E. coli in studies of changes in gene expression. In the
presence of the sugar lactose, the bacterium starts to express a gene encoding
β-galactosidase, an enzyme that can process lactose for use as a carbon and
energy source. The sequence of the regulatory site for this gene is shown in
Figure 31.1. The nucleotide sequence of this site shows a nearly perfect
inverted repeat, indicating that the DNA in this region has an approximate
twofold axis of symmetry. Recall that cleavage sites for restriction enzymes
such as EcoRV have similar symmetry properties (Section 9.3). Symmetry
in such regulatory sites usually corresponds to symmetry in the protein
that binds the site. Symmetry matching is a recurring theme in protein-DNA
interactions.

Figure 31.1 Sequence of the lac regulatory site. The nucleotide sequence of
this regulatory site shows a nearly perfect inverted repeat, corresponding to
twofold rotational symmetry in the DNA. Parts of the sequences that are
related by this symmetry are shown in the same color.

5'-...TGTGTGGAATTGTGAGCGGATAACAATTTCACACA...-3'
3'-...ACACACCTTAACACTCGCCTATTGTTAAAGTGTGT...-5'
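
The inverted-repeat character of this site can be checked directly. The following is a minimal sketch, not part of the original analysis: it compares the top strand of Figure 31.1 with its own reverse complement and counts the positions that agree. A perfect inverted repeat of even length would agree everywhere, whereas a nearly perfect one agrees at most positions. The function names are illustrative assumptions.

```python
# Watson-Crick partner for each DNA base.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Top strand of the lac regulatory site as printed in Figure 31.1.
LAC_SITE = "TGTGTGGAATTGTGAGCGGATAACAATTTCACACA"


def reverse_complement(seq: str) -> str:
    """Return the sequence of the opposite strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def inverted_repeat_matches(seq: str) -> int:
    """Count positions at which a strand equals its own reverse complement.

    For an odd-length site the central base can never match itself,
    because no base is its own complement.
    """
    partner = reverse_complement(seq)
    return sum(a == b for a, b in zip(seq, partner))


if __name__ == "__main__":
    matches = inverted_repeat_matches(LAC_SITE)
    print(f"{matches} of {len(LAC_SITE)} positions obey the twofold symmetry")
```

A restriction site such as GAATTC, by contrast, is a perfect inverted repeat: `reverse_complement("GAATTC")` returns the same string.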


To understand these protein-DNA interactions in detail, scientists 
examined the structure of the complex between an oligonucleotide that 
includes this site and the DNA-binding unit that recognizes it (Figure 31.2). 


Figure 31.2 The lac repressor-DNA complex. The DNA-binding domain from a
gene-regulatory protein, the lac repressor, binds to a DNA fragment containing
its preferred binding site (referred to as operator DNA) by inserting an α helix
into the major groove of operator DNA. Notice that a specific contact forms
between an arginine residue of the repressor and a G-C base pair in the
binding site. [Drawn from 1EFA.pdb.]



The DNA-binding unit comes from a protein called the lac repressor, which 
represses the expression of the lactose-processing gene. As expected, this 
DNA-binding unit binds as a dimer, and the twofold axis of symmetry 
of the dimer matches the symmetry of the DNA. An α helix from each
monomer of the protein is inserted into the major groove of the DNA, 
where amino acid side chains make specific contacts with exposed edges of 
the base pairs. For example, the side chain of an arginine residue of the pro- 
tein forms a pair of hydrogen bonds with a guanine residue of the DNA, 
which would not be possible with any other base. This interaction and 
similar ones allow the lac repressor to bind more tightly to this site than to 
the wide range of other sites present in the E. coli genome. 


The helix-turn-helix motif is common to many prokaryotic 
DNA-binding proteins 


Are similar strategies utilized by other prokaryotic DNA-binding proteins? 
The structures of many such proteins have now been determined, and 
amino acid sequences are known for many more. Strikingly, the DNA-binding
surfaces of many, but not all, of these proteins consist of a pair of α
helices separated by a tight turn (Figure 31.3). In complexes with DNA, the
second of these two helices (often called the recognition helix) lies in the 
major groove, where amino acid side chains make contact with the edges of 
base pairs. In contrast, residues of the first helix participate primarily in 
contacts with the DNA backbone. Helix-turn-helix motifs are present on 
many proteins that bind DNA as dimers, and thus two of the units will be 
present, one on each monomer. 

Although the helix-turn-helix motif is the most commonly observed 
DNA-binding unit in prokaryotes, not all regulatory proteins bind DNA 
through such units. A striking example is provided by the E. coli methion- 
ine repressor (Figure 31.4). This protein binds DNA through the insertion 
of a pair of β strands into the major groove.


31.2 Prokaryotic DNA-Binding Proteins Bind Specifically to 
Regulatory Sites in Operons 


A historically important example reveals many common principles of gene 
regulation by DNA-binding proteins. Bacteria such as E. coli usually rely 
on glucose as their source of carbon and energy, even when other sugars are 
available. However, when glucose is scarce, E. coli can use lactose as their 
carbon source, even though this disaccharide does not lie on any major 
metabolic pathways. An essential enzyme in the metabolism of lactose is 

Figure 31.3 Helix-turn-helix motif. These structures show three
sequence-specific DNA-binding proteins that interact with DNA through a
helix-turn-helix motif (highlighted in yellow). Notice that, in each case, the
helix-turn-helix units within a protein dimer are approximately 34 Å apart,
corresponding to one full turn of DNA. [Drawn from 1EFA, 1RUN, and 1TRO.pdb.]

Figure 31.4 DNA recognition through β strands. A methionine repressor is
shown bound to DNA. Notice that residues in β strands, rather than in α helices,
participate in the crucial interactions between the protein and the DNA.
[Drawn from 1CMA.pdb.]

Figure 31.6 β-Galactosidase induction. The addition of lactose to an E. coli
culture causes the production of β-galactosidase to increase from very low
amounts to much larger amounts. The increase in the amount of enzyme
parallels the increase in the number of cells in the growing culture.
β-Galactosidase constitutes 6.6% of the total protein synthesized in the
presence of lactose.

Figure 31.5 Monitoring the β-galactosidase reaction. The galactoside substrate X-Gal
produces a colored product on cleavage by β-galactosidase. The appearance of this colored product
provides a convenient means for monitoring the amount of the enzyme both in vitro and in vivo.


β-galactosidase, which hydrolyzes lactose into galactose and glucose. These
products are then metabolized by pathways discussed in Chapter 16.

[Reaction scheme: lactose + H2O → galactose + glucose]

This reaction can be conveniently followed in the laboratory through the
use of alternative galactoside substrates that form colored products such as
X-Gal (Figure 31.5). An E. coli cell growing on a carbon source such as
glucose or glycerol contains fewer than 10 molecules of β-galactosidase. In
contrast, the same cell will contain several thousand molecules of the enzyme
when grown on lactose (Figure 31.6). The presence of lactose in the culture
medium induces a large increase in the amount of β-galactosidase by
eliciting the synthesis of new enzyme molecules rather than by activating a
preexisting but inactive precursor.

A crucial clue to the mechanism of gene regulation was the observation
that two other proteins are synthesized in concert with β-galactosidase,
namely, galactoside permease and thiogalactoside transacetylase. The permease
is required for the transport of lactose across the bacterial cell membrane 
(Section 13.3). The transacetylase is not essential for lactose metabolism 
but appears to play a role in the detoxification of compounds that also 
may be transported by the permease. Thus, the expression levels of a set of 
enzymes that all contribute to the adaptation to a given change in the environ- 
ment change together. Such a coordinated unit of gene expression is called 
an operon. 


An operon consists of regulatory elements and protein-encoding genes 


The parallel regulation of β-galactosidase, the permease, and the
transacetylase suggested that the expression of genes encoding these enzymes is
controlled by a common mechanism. François Jacob and Jacques Monod
proposed the operon model to account for this parallel regulation as well as 
the results of other genetic experiments. The genetic elements of the model 
are a regulator gene that encodes a regulatory protein, a regulatory DNA 
sequence called an operator site, and a set of structural genes (Figure 31.7). 
The regulator gene encodes a repressor protein that binds to the operator 
site. The binding of the repressor to the operator prevents transcription of the 
structural genes. The operator and its associated structural genes constitute 
the operon. For the lactose (lac) operon, the i gene encodes the repressor,
o is the operator site, and the z, y, and a genes are the structural genes for
β-galactosidase, the permease, and the transacetylase, respectively. The
operon also contains a promoter site (denoted by p), which directs the RNA
polymerase to the correct transcription initiation site. The z, y, and a genes
are transcribed to give a single mRNA molecule that encodes all three
proteins. An mRNA molecule encoding more than one protein is known as a
polygenic or polycistronic transcript.

Figure 31.7 Operons. (A) The general structure of an operon as conceived by Jacob and Monod.
(B) The structure of the lactose operon. In addition to the promoter, p, in the operon, a second
promoter is present in front of the regulator gene, i, to drive the synthesis of the regulator.


The lac repressor protein in the absence of lactose binds to
the operator and blocks transcription 


In the absence of lactose, the lactose operon is repressed. How does the lac 
repressor mediate this repression? The lac repressor exists as a tetramer of 
37-kd subunits with two pairs of subunits coming together to form the 
DNA-binding unit previously discussed. In the absence of lactose, the 
repressor binds very tightly and rapidly to the operator. When the lac 
repressor is bound to DNA, the repressor prevents RNA polymerase from 
transcribing the protein-coding genes inasmuch as the operator site is 
directly adjacent to and downstream of the promoter site where the repres- 
sor would block the progress of RNA polymerase. 

How does the lac repressor locate the operator site in the E. coli
chromosome? The lac repressor binds 4 × 10^6 times as strongly to operator DNA
as it does to random sites in the genome. This high degree of selectivity
allows the repressor to find the operator efficiently even with a large excess
(4.6 × 10^6) of other sites within the E. coli genome. The dissociation
constant for the repressor-operator complex is approximately 0.1 pM
(10^-13 M). The rate constant for association (≈10^10 M^-1 s^-1) is strikingly
high, indicating that the repressor finds the operator primarily by diffusing
along a DNA molecule (a one-dimensional search) rather than encountering
it from the aqueous medium (a three-dimensional search). This diffusion
has been confirmed by studies that monitored the behavior of fluorescently
labeled single molecules of lac repressor inside living E. coli cells.
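
To see what these numbers imply, the following rough sketch works through two back-of-the-envelope estimates. Both rest on simplifying assumptions that are ours, not the text's: (1) a single DNA-binding unit partitions at equilibrium between one operator and the competing sites, each weighted by relative affinity, and (2) binding is a simple one-step process, so that the dissociation rate constant equals K_d multiplied by the association rate constant. The function names are illustrative.

```python
import math

# Values quoted in the text.
SPECIFICITY_RATIO = 4e6       # operator affinity relative to a random site
COMPETING_SITES = 4.6e6       # number of other sites in the E. coli genome
KD_OPERATOR_M = 1e-13         # dissociation constant, 0.1 pM
K_ON_PER_M_PER_S = 1e10       # association rate constant


def fraction_on_operator(ratio: float, n_sites: float) -> float:
    """Equilibrium share of time spent on the operator (assumption 1).

    The operator carries statistical weight `ratio`; each competing
    site carries weight 1.
    """
    return ratio / (ratio + n_sites)


def dissociation_rate(kd: float, k_on: float) -> float:
    """k_off = K_d * k_on for a one-step binding model (assumption 2)."""
    return kd * k_on


def half_life_s(k_off: float) -> float:
    """Half-life of the complex for first-order dissociation."""
    return math.log(2) / k_off


if __name__ == "__main__":
    share = fraction_on_operator(SPECIFICITY_RATIO, COMPETING_SITES)
    k_off = dissociation_rate(KD_OPERATOR_M, K_ON_PER_M_PER_S)
    print(f"share of time on operator: {share:.2f}")
    print(f"k_off: {k_off:.1e} per second")
    print(f"half-life of complex: {half_life_s(k_off):.0f} s")
```

Under these assumptions, the selectivity of 4 × 10^6 roughly offsets the 4.6 × 10^6-fold excess of competing sites, which is why the single operator can be occupied for a substantial share of the time.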

Inspection of the complete E. coli genome sequence reveals two sites 
within 500 bp of the primary operator site that approximate the sequence of 
the operator. When one dimeric DNA-binding unit binds to the operator 
site, the other DNA-binding unit of the lac repressor tetramer can bind to 
one of these sites with similar sequences. The DNA between the two bound 
sites forms a loop. No other sites that closely match the sequence of the 
lac operator site are present in the rest of the E. coli genome sequence. Thus, 
the DNA-binding specificity of the lac repressor is sufficient to specify a nearly 
unique site within the E. coli genome. 

The three-dimensional structure of the lac repressor has been deter- 
mined in various forms. Each monomer consists of a small amino-terminal 
domain that binds DNA and a larger domain that mediates the formation of 
the dimeric DNA-binding unit and the tetramer (Figure 31.8). A pair of the 
amino-terminal domains come together to form the functional DNA-binding
unit. Each monomer has a helix-turn-helix unit that interacts with the 
major groove of the bound DNA. 

Figure 31.8 Structure of the lac repressor. A lac repressor dimer is shown
bound to DNA. Notice that the amino-terminal domain binds to DNA, whereas the
carboxyl-terminal domain forms a separate structure. A part of the structure
that mediates the formation of lac repressor tetramers is not shown.
[Drawn from 1EFA.pdb.]

Figure 31.9 Effects of IPTG on lac repressor structure. The structure of the
lac repressor bound to the inducer isopropylthiogalactoside (IPTG), shown
in orange, is superimposed on the structure of the lac repressor bound to DNA,
shown in purple. Notice that the binding of IPTG induces structural changes
that alter the relation between the two DNA-binding domains so that they
cannot interact effectively with DNA. The DNA-binding domains of the lac
repressor bound to IPTG are not shown, because these regions are not well
ordered in the crystals studied.

Figure 31.10 Induction of the lac operon. (A) In the absence of lactose, the
lac repressor binds DNA and represses transcription from the lac operon.
(B) Allolactose or another inducer binds to the lac repressor, leading to its
dissociation from DNA and to the production of lac mRNA.


Ligand binding can induce structural changes in regulatory proteins 


In the situation just described, glucose is present and lactose is absent, 
and the lac operon is repressed. How does the presence of lactose trigger 
the relief of this repression and, hence, the expression of the lac operon? 
Interestingly, lactose itself does not have this effect; rather, allolactose, a
combination of galactose and glucose with a β-1,6 rather than a β-1,4
linkage, does. Allolactose is thus referred to as the inducer of the lac
operon. Allolactose is a side product of the β-galactosidase reaction and
is produced at low levels by the few molecules of β-galactosidase that
are present before induction. Some other β-galactosides such as
isopropylthiogalactoside (IPTG) are potent inducers of β-galactosidase
expression, although they are not substrates of the enzyme. IPTG is useful in
the laboratory as a tool for inducing gene expression in engineered bac- 
terial strains. 

The inducer triggers gene expression by preventing the lac repressor 
from binding the operator. The inducer binds to the lac repressor and 
thereby greatly reduces the repressor’s affinity for operator DNA. An 
inducer molecule binds in the center of the large domain within each 
monomer. This binding leads to conformational changes that modify 
the relation between the two small DNA-binding domains (Figure 31.9). 
These domains can no longer easily contact DNA simultaneously, leading 
to a dramatic reduction in DNA-binding affinity. 

Let us recapitulate the processes that regulate gene expression in the 
lactose operon (Figure 31.10). In the absence of inducer, the lac repres- 
sor is bound to DNA in a manner that blocks RNA polymerase from 
transcribing the z, y, and a genes. Thus, very little β-galactosidase,
permease, or transacetylase is produced. The addition of lactose to the
environment leads to the formation of allolactose. This inducer binds to 
the lac repressor, leading to conformational changes and the release of 
DNA by the lac repressor. With the operator site unoccupied, RNA 
polymerase can then transcribe the other lac genes and the bacterium 
will produce the proteins necessary for the efficient use of lactose. 


The structure of the large domain of the lac repressor is similar to
those of a large class of proteins that are present in E. coli and other
bacteria. This family of homologous proteins binds ligands such as sug- 
ars and amino acids at their centers. Remarkably, domains of this fam- 
ily are utilized by eukaryotes in taste proteins and in neurotransmitter 
receptors, as will be discussed in Chapter 33. 


The operon is a common regulatory unit in prokaryotes 


Many other gene-regulatory networks function in ways analogous to 
those of the lac operon. For example, genes taking part in purine and, to 
a lesser degree, pyrimidine biosynthesis are repressed by the pur repressor.
This dimeric protein is 31% identical in sequence with the lac repressor and 
has a similar three-dimensional structure. However, the behavior of the pur 
repressor is opposite that of the lac repressor: whereas the lac repressor is 
released from DNA by binding to a small molecule, the pur repressor binds 
DNA specifically, blocking transcription, only when bound to a small molecule. 
Such a small molecule is called a corepressor. For the pur repressor, the 
corepressor can be either guanine or hypoxanthine. The dimeric pur 
repressor binds to inverted-repeat DNA sites of the form
5'-ANGCAANCGNTTNCNT-3', in which the bases shown in boldface
type are particularly important. Examination of the E. coli genome
sequence reveals the presence of more than 20 such sites, regulating 19 
operons and including more than 25 genes (Figure 31.11). 
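
A search of this kind can be sketched in a few lines. The example below is an illustration only: it treats N in the consensus as "any base," scans a single strand for exact matches, and uses a short made-up test sequence rather than real genome data. The function names and the toy sequence are hypothetical.

```python
import re

# Consensus for pur repressor binding sites, as given in the text.
PUR_CONSENSUS = "ANGCAANCGNTTNCNT"


def consensus_to_pattern(consensus: str) -> str:
    """Translate a consensus into a regular expression; N matches any base."""
    return "".join("[ACGT]" if base == "N" else base for base in consensus)


def find_sites(sequence: str, consensus: str = PUR_CONSENSUS) -> list[int]:
    """Return 0-based start positions of all matches on the given strand.

    A lookahead is used so that overlapping matches are also reported.
    """
    pattern = re.compile(f"(?=({consensus_to_pattern(consensus)}))")
    return [match.start() for match in pattern.finditer(sequence)]


if __name__ == "__main__":
    # Hypothetical sequence containing one site that fits the consensus.
    toy_sequence = "GGG" + "ACGCAAACGTTTTCGT" + "CCC"
    print(find_sites(toy_sequence))
```

A realistic genome scan would also examine the complementary strand and would usually tolerate mismatches at the less important positions; both refinements are omitted here.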


Because the DNA-binding sites for these regulatory proteins are short,
it is likely that they evolved independently and are not related by
divergence from an ancestral regulatory site. Once a ligand-regulated 
DNA-binding protein is present in a cell, binding sites for the protein may 
arise by mutation adjacent to additional genes. Binding sites for the pur 
repressor have evolved in the regulatory regions of a wide range of genes 
taking part in nucleotide biosynthesis. All such genes can then be regulated 
in a concerted manner. 

The organization of prokaryotic genes into operons is useful for the 
analysis of completed genome sequences. Sometimes a gene of unknown 
function is discovered to be part of an operon containing well-characterized 
genes. Such associations can provide powerful clues to the biochemical and 
physiological functions of the uncharacterized gene. 


Transcription can be stimulated by proteins that contact 
RNA polymerase 


All the DNA-binding proteins discussed thus far function by inhibiting 
transcription until some environmental condition, such as the presence of 
lactose, is met. There are also DNA-binding proteins that stimulate tran- 
scription. One particularly well studied example is a protein in E. coli that 
stimulates the expression of catabolic enzymes. 

E. coli grown on glucose, a preferred energy source, have very low levels 
of catabolic enzymes for metabolizing other sugars. Clearly, the synthesis of 
these enzymes when glucose is abundant would be wasteful. Glucose has 
an inhibitory effect on the genes encoding these enzymes, an effect called 
catabolite repression. It is due to the fact that glucose lowers the concentra- 
tion of cyclic AMP in E. coli. When its concentration is high, cAMP stimu- 
lates the concerted transcription of many catabolic enzymes by acting 
through a protein called the catabolite activator protein (CAP), which is also 
known as the cAMP receptor protein (CRP). 

When bound to cAMP, CAP stimulates the transcription of lactose- and 
arabinose-catabolizing genes. CAP is a sequence-specific DNA-binding 
protein. Within the lac operon, CAP binds to an inverted repeat that is 
centered near position −61 relative to the start site for transcription
(Figure 31.12). This site is approximately 70 base pairs from the operator site. 
As expected from the symmetry of the binding site, CAP functions as a 
dimer of identical subunits. 

The CAP-cAMP complex stimulates the initiation of transcription by 
approximately a factor of 50. Energetically favorable contacts between 
CAP and RNA polymerase increase the likelihood that transcription will be 
initiated at sites to which the CAP-cAMP complex is bound (Figure 31.13). 

Figure 31.11 Binding-site distributions. 
The E. coli genome contains only a single 
region that closely matches the sequence of 
the lac operator (shown in blue). In contrast, 
20 sites match the sequence of the pur 
operator (shown in red). Thus, the pur 
repressor regulates the expression of many 
more genes than does the lac repressor. 


Figure 31.12 Binding site for catabolite 
activator protein (CAP). This protein binds 
as a dimer to an inverted repeat that is at 
the position −61 relative to the start site of 
transcription. The CAP-binding site on DNA 
is adjacent to the position at which RNA 
polymerase binds. 

Figure 31.13 Structure of a dimer of 
CAP bound to DNA. The residues shown in 
yellow in each CAP monomer have been 
implicated in direct interactions with RNA 
polymerase. [Drawn from 1RUN.pdb.] 


Figure 31.14 Structure of the λ 
repressor bound to DNA. The λ repressor 
binds to DNA as a dimer. The amino-terminal 
domain of one subunit is shown in red and 
the carboxyl-terminal domain is shown in 
blue. In the other subunit, both domains are 
shown in yellow. Notice how α helices on the 
amino-terminal domains fit into the major 
groove of the DNA. [Drawn from 3DBN.pdb.] 


Thus, in regard to the lac operon, gene expression is maximal when the 
binding of allolactose relieves the inhibition by the lac repressor and the 
CAP-cAMP complex stimulates the binding of RNA polymerase. 
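
The combined logic of the lac repressor and CAP can be summarized in a short Python sketch. It is a qualitative illustration only: the function name and the three output labels are assumptions made here, and the sketch treats the cAMP level as simply high whenever glucose is absent.

```python
def lac_operon_state(lactose_present: bool, glucose_present: bool) -> str:
    """Return a qualitative expression level for the lac operon.

    The labels "repressed", "low", and "maximal" are illustrative,
    not measured values.
    """
    # Allolactose, a derivative of lactose, releases the lac repressor
    # from the operator; without lactose the repressor stays bound.
    repressor_bound = not lactose_present
    # Glucose lowers the cAMP concentration; CAP stimulates
    # transcription only when it is bound to cAMP.
    cap_camp_bound = not glucose_present

    if repressor_bound:
        return "repressed"
    if cap_camp_bound:
        return "maximal"
    return "low"


for lactose in (False, True):
    for glucose in (False, True):
        print("lactose:", lactose, "glucose:", glucose,
              "->", lac_operon_state(lactose, glucose))
```

Only the combination of lactose present and glucose absent yields the "maximal" label, which matches the statement that expression is greatest when the repressor is released and the CAP-cAMP complex is bound.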

The E. coli genome contains many CAP-binding sites in positions 
appropriate for interactions with RNA polymerase. Thus, an increase in the 
cAMP level inside an E. coli bacterium results in the formation of CAP-cAMP 
complexes that bind to many promoters and stimulate the transcription 
of genes encoding a variety of catabolic enzymes. 


31.3 Regulatory Circuits Can Result in Switching Between 
Patterns of Gene Expression 


The study of viruses that infect bacteria has led to significant advances in 
our understanding of the processes that control gene expression. Again, 
sequence-specific DNA-binding proteins play key roles in these processes. 
Investigations of bacteriophage λ have been particularly revealing. We 
examined the alternative infection modes of λ phage in Chapter 5. In the 
lytic pathway, most of the genes in the viral genome are transcribed, initiat- 
ing the production of many virus particles and leading to the eventual lysis 
of the bacterial cell with the concomitant release of approximately 100 virus 
particles. In the lysogenic pathway, the viral genome is incorporated into the 
bacterial DNA where most of the viral genes remain unexpressed, allowing 
the viral genome to be carried along as the bacteria replicate. Two key pro- 
teins and a set of regulatory sequences in the viral genome are responsible 
for the switch that determines which of these two pathways is followed. 


Lambda repressor regulates its own expression 


The first protein that we shall consider is the λ repressor, sometimes known 
as the λ cI protein. This protein is key because it blocks, either directly or 
indirectly, the transcription of almost all genes encoded by the virus. The 
one exception is the gene that encodes the λ repressor itself. The λ repressor 
consists of an amino-terminal DNA-binding domain and a carboxyl-terminal 
domain that participates in protein oligomerization (Figure 31.14). This 
protein binds to a number of key sites in the λ phage genome. The sites of 
greatest interest for our present discussion are in the so-called right operator 
(Figure 31.15). This region includes three binding sites for the λ repressor 
dimer as well as two promoters within a region of approximately 80 base 
pairs. One promoter drives the expression of the gene for the λ repressor 


Figure 31.15 Sequence of the λ right operator. The three operator sites (OR1, OR2, and OR3) 
are shaded yellow with their centers indicated. The start sites for the λ repressor mRNA and the Cro 
mRNA are indicated, as are their −10 and −35 sequences. 


itself, whereas the other drives the expression of a number of other viral 
genes. The λ repressor does not have the same affinity for the three sites; it 
binds the site OR1 with the highest affinity. In addition, the binding to 
adjacent sites is cooperative so that, after a λ repressor dimer has bound at 
OR1, the likelihood that a protein will bind to the adjacent site OR2 increases 
by approximately 25-fold. Thus, when λ repressor is present in the cell at 
moderate concentrations, the most likely configuration has λ repressor 
bound at OR1 and OR2, but not at OR3. In this configuration, the λ repressor 
dimer bound at OR1 blocks access to the promoter on the right side of 
the operator sites, repressing transcription of the adjacent gene, which 
encodes a protein termed Cro (controller of repressor and others), while the 
repressor dimer at OR2 can be in contact with RNA polymerase and stimulate 
transcription from the promoter of the gene 
that encodes the λ repressor itself. Thus, the λ repressor stimulates its own 
production. As the concentration of the λ repressor increases further, an 
additional repressor dimer can bind to the OR3 site, blocking the other promoter 
and repressing the production of additional repressor. Thus, the right 
operator serves to maintain the λ repressor in a narrow, stable concentration 
range (Figure 31.16). The λ repressor also blocks other promoters in the λ 
phage genome so that the repressor is the only phage protein produced, 
which corresponds to the lysogenic state. 
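
The way site affinities and cooperativity favor particular configurations can be illustrated with a small statistical-weight calculation. This is a sketch under stated assumptions: the relative dissociation constants below are hypothetical values chosen only so that the first site binds most tightly, concentrations are in arbitrary units, and only the roughly 25-fold cooperativity between the first two sites comes from the text. The sites are written OR1, OR2, and OR3.

```python
from itertools import product

# Hypothetical relative dissociation constants (arbitrary units);
# only their ordering (OR1 tightest) reflects the text.
K_D = {"OR1": 1.0, "OR2": 10.0, "OR3": 50.0}
# Cooperativity between dimers bound at OR1 and OR2 (about 25-fold in the text).
COOPERATIVITY = 25.0


def configuration_probabilities(repressor_conc):
    """Map each (OR1, OR2, OR3) occupancy tuple to its probability."""
    sites = list(K_D)
    weights = {}
    for occupied in product([False, True], repeat=len(sites)):
        weight = 1.0
        for site, bound in zip(sites, occupied):
            if bound:
                weight *= repressor_conc / K_D[site]
        # Extra stabilization when both OR1 and OR2 are occupied.
        if occupied[0] and occupied[1]:
            weight *= COOPERATIVITY
        weights[occupied] = weight
    total = sum(weights.values())
    return {config: w / total for config, w in weights.items()}


def most_likely_configuration(repressor_conc):
    """Return the occupancy tuple with the highest probability."""
    probabilities = configuration_probabilities(repressor_conc)
    return max(probabilities, key=probabilities.get)


for conc in (0.1, 2.0, 200.0):
    print(conc, most_likely_configuration(conc))
```

With these assumed numbers, a moderate concentration favors OR1 and OR2 occupied with OR3 empty, and a much higher concentration fills all three sites, mirroring the qualitative behavior described above.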


A circuit based on lambda repressor and Cro forms a genetic switch 


What stimulates the switch to the lytic pathway? Changes such as DNA 
damage initiate the cleavage of the λ repressor at a specific bond between 
the DNA-binding and oligomerization domains. This process is mediated 
by the E. coli RecA protein (Section 28.5). After this cleavage has taken place, 
the affinity of the λ repressor for DNA is reduced. After the λ repressor is no 
longer bound to the OR1 site, the Cro gene can be transcribed. Cro is a small 
protein that binds to the same sites as the λ repressor does, but with a 
different order of affinity for the three sites in the right operator. In particular, 
Cro has the highest affinity for OR3. Cro bound in this site blocks the 
production of new repressor. The absence of repressor leads to the expression 
of other phage genes, leading to the production of virus particles and 
the eventual lysis of the host cells. Thus, this genetic circuit acts as a switch 
with two stable states: (1) repressor high, Cro low, corresponding to the 
lysogenic state, and (2) Cro high, repressor low, corresponding to the lytic 
state (Figure 31.17). Regulatory circuits with different DNA-binding proteins 
controlling the expression of each other’s genes constitute a common 
motif for controlling gene expression. 
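
The idea of a switch with two stable states can be illustrated with a generic mutual-repression model. The following is a toy sketch and not a quantitative model of bacteriophage λ: the rate law, the parameter values (`max_rate`, `hill`), and the function name are all assumptions introduced here, and the model leaves out the repressor's stimulation of its own synthesis.

```python
def simulate_switch(repressor0, cro0, max_rate=10.0, hill=2,
                    dt=0.01, steps=5000):
    """Integrate a two-protein mutual-repression model with Euler steps.

    Each protein is synthesized at a rate that falls as the other
    protein accumulates and is removed in proportion to its own level.
    All quantities are in arbitrary units.
    """
    repressor, cro = repressor0, cro0
    for _ in range(steps):
        d_repressor = max_rate / (1.0 + cro ** hill) - repressor
        d_cro = max_rate / (1.0 + repressor ** hill) - cro
        repressor += dt * d_repressor
        cro += dt * d_cro
    return repressor, cro


# Starting with more repressor settles into a "repressor high, Cro low" state.
print(simulate_switch(5.0, 0.1))
# Starting with more Cro settles into the opposite state.
print(simulate_switch(0.1, 5.0))
```

In this sketch the final state depends only on which protein starts ahead. An event that sharply lowers the active repressor level, as RecA-mediated cleavage does, corresponds to restarting the simulation from a low-repressor state.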


Many prokaryotic cells release chemical signals that regulate 
gene expression in other cells 


Prokaryotic cells have been traditionally viewed as solitary single cells. 
However, it is becoming increasingly clear that, in many circumstances, 

Figure 31.16 The λ repressor controls its 
own synthesis. When λ repressor levels are 
relatively low, the repressor binds to sites OR1 
and OR2 and stimulates the transcription of 
the gene that encodes the λ repressor itself. 
When λ repressor levels are higher, the 
repressor also binds to site OR3, blocking 
access to its promoter and repressing 
transcription from this gene. 

Figure 31.17 The λ repressor and Cro 
form a genetic circuit. The λ repressor 
blocks the production of Cro by binding most 
favorably to site OR1 whereas Cro blocks the 
production of the λ repressor by binding most 
favorably to site OR3. This circuit forms a 
switch that determines whether the lysogenic 
or the lytic pathway is followed. 

Figure 31.18 Autoinducer structure. The 
structure of the acyl-homoserine lactone (AHL) 
N-3-oxo-hexanoyl homoserine lactone, the 
autoinducer from V. fischeri. The autoinducers 
from other bacterial species can have different 
acyl groups (shown in red). 


Figure 31.19 Quorum-sensing gene regulator. The structure of a 
homolog of LuxR (TraR from the bacterium Agrobacterium tumefaciens) is 
shown. Notice that the dimeric protein binds to DNA through an α-helical 
domain, whereas the autoinducer binds to a separate domain. 


prokaryotic cells live in complex communities, interacting with other cells 
of their own and different species. These social interactions change the 
patterns of gene expression within the cells. 

An important type of interaction is called quorum sensing. This phenomenon 
was discovered in Vibrio fischeri, a species of bacterium 
that can live inside a specialized light organ in the bobtail squid. In this 
symbiotic relation, the bacteria produce luciferase and emit light, providing 
protection for the squid (by preventing it from being backlit by moonlight) in 
exchange for a protected place to live and reproduce. When these bacteria 
are grown in culture at low density, they are not luminescent. However, when 
the cell density reaches a critical level, the gene for luciferase is expressed 
and the cells emit light. A key observation was that, when V. fischeri cells 
were transferred to a sterile medium in which other V. fischeri cells had been 
grown to high density, the cells became luminescent even at low cell density. 
This experiment revealed that a chemical, subsequently shown to be 
N-3-oxo-hexanoyl homoserine lactone (hereafter AHL for acyl-homoserine 
lactone), had been released into the medium and triggers the development 
of luminescence (Figure 31.18). This compound and other compounds 
that play similar roles are termed autoinducers. 

Cells of V. fischeri release the autoinducer into their environment and 
other V. fischeri cells take up the chemical. V. fischeri cells express a DNA- 
binding protein LuxR that serves as the receptor for the autoinducer. LuxR 
comprises two domains, one of which binds AHL and the other of which 
binds DNA through a helix-turn-helix motif (Figure 31.19). After the 
AHL concentration inside the cell has increased to an appropriate level, a 
substantial fraction of the LuxR molecules bind AHL. When bound to 
AHL, LuxR dimers bind to specific sites on DNA and increase the rate of 
transcription initiation at specific genes. These target genes include an 
operon that contains LuxA and LuxB, which together encode the luciferase 
enzyme, and LuxI, which encodes an enzyme that catalyzes the formation 
of more AHL. 

Because each cell produces only a small amount of the autoinducer, this 
regulatory system allows each V. fischeri cell to determine the density of the 
V. fischeri population in its environment—hence the term quorum sensing for 
this process. Studies of other prokaryotic cells are revealing an elaborate 
chemical language of different autoinducers (as well as 
autorepressors that serve to repress specific genes). 
The “words” in this language include other acyl- 
homoserine lactone molecules with different acyl chain 
lengths and functionalities as well as other distinct 
classes of molecules. 
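
The reasoning behind quorum sensing, in which each cell contributes a small amount of autoinducer and a response follows only when enough cells are present, can be sketched as a simple threshold calculation. Everything numerical here is an assumption for illustration: the release per cell, the threshold, and the units are arbitrary, and the function names are hypothetical.

```python
def autoinducer_level(cell_density, release_per_cell=1.0e-9,
                      preexisting_ahl=0.0):
    """Autoinducer level as the sum of what the cells release and
    whatever is already in the medium (arbitrary units)."""
    return cell_density * release_per_cell + preexisting_ahl


def luciferase_expressed(cell_density, threshold=1.0,
                         release_per_cell=1.0e-9, preexisting_ahl=0.0):
    """True when the autoinducer level reaches the assumed threshold."""
    level = autoinducer_level(cell_density, release_per_cell, preexisting_ahl)
    return level >= threshold


print(luciferase_expressed(1.0e6))    # sparse culture
print(luciferase_expressed(1.0e10))   # dense culture
# Sparse culture placed in medium conditioned by a dense culture:
print(luciferase_expressed(1.0e6, preexisting_ahl=5.0))
```

The third call mimics the key experiment described above: cells at low density respond once autoinducer left behind by a dense culture is already present in the medium.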


Biofilms are complex communities of prokaryotes 


Many species of prokaryotes can be found in special- 
ized structures termed biofilms that can form on sur- 
faces. Biofilms are of considerable medical importance 
because organisms within them are often quite resis- 
tant to the immune response of the host as well as to 
antibiotics. Quorum sensing appears to play a major 
role in the formation of biofilms in that cells are able to 
sense other cells in their environments and to promote 
the formation of communities with particular compositions. 
Some genes controlled by quorum-sensing 
mechanisms promote the formation of specific molecules 
that serve as scaffolds for the biofilm. An intriguing 
recent discovery is that many of the organisms that are present in biofilms 
on or in our bodies (perhaps 95% or more) have not been grown in 
culture. Through DNA-sequencing methods, we are developing a better 
census of our microbiome and are working toward understanding the gene- 
regulatory mechanisms that support these complex communities. 


31.4 Gene Expression Can Be Controlled at 
Posttranscriptional Levels 


The modulation of the rate of transcription initiation is the most common 
mechanism of gene regulation. However, other stages of transcription also 
can be targets for regulation. In addition, the process of translation provides 
other points of intervention for regulating the level of a protein produced in 
a cell. In Chapter 29, we considered riboswitches that control transcription 
termination (Section 29.1). Other riboswitches control gene expression by 
other mechanisms such as the formation of structures that inhibit transla- 
tion. Additional mechanisms for posttranscriptional gene regulation have 
been discovered. 


Attenuation is a prokaryotic mechanism for regulating transcription 
through the modulation of nascent RNA secondary structure 


A means for regulating transcription in bacteria was discovered by Charles 
Yanofsky and his colleagues as a result of their studies of the tryptophan 
operon. This operon encodes five enzymes that convert chorismate into 
tryptophan. Analysis of the 5’ end of trp mRNA revealed the presence of a 
leader sequence of 162 nucleotides before the initiation codon of the first 
enzyme. The next striking observation was that bacteria produced a tran- 
script consisting of only the first 130 nucleotides when the tryptophan level 
was high, but they produced a 7000-nucleotide trp mRNA, including the 
entire leader sequence, when tryptophan was scarce. Thus, when tryptophan 
is plentiful and the biosynthetic enzymes are not needed, transcription is 
abruptly broken off before any mRNA coding for the enzymes is produced. 
The site of termination is called the attenuator, and this mode of regulation 
is called attenuation. 

Attenuation depends on features at the 5’ end of the mRNA product 
(Figure 31.20). The first part of the leader sequence encodes a 14-amino-acid 
leader peptide. Following the open reading frame for the peptide is the 
attenuator, a region of RNA that is capable of forming several alternative 
structures. Recall that transcription and translation are tightly coupled in 
bacteria. Thus, the translation of the trp mRNA begins soon after the 
ribosome-binding site has been synthesized. 

Figure 31.20 Leader region of trp mRNA. 
(A) The nucleotide sequence of the 5’ end of 
trp mRNA includes a short open reading 
frame that encodes a peptide comprising 14 
amino acids; the leader encodes two 
tryptophan residues and has an untranslated 
attenuator region (blue and red nucleotides). 
(B and C) The attenuator region can adopt 
either of two distinct stem-loop structures. 

Figure 31.21 Attenuation. (A) In the presence of adequate concentrations of tryptophan (and, 
hence, Trp-tRNA), translation proceeds rapidly and an RNA structure forms that terminates 
transcription. (B) At low concentrations of tryptophan, translation stalls while awaiting Trp-tRNA, 
giving time for an alternative RNA structure to form that does not terminate transcription efficiently. 


How does the level of tryptophan alter transcription of the trp operon? 
An important clue was the finding that the 14-amino-acid leader peptide 
includes two adjacent tryptophan residues. A ribosome is able to translate 
the leader region of the mRNA product only in the presence of adequate 
concentrations of tryptophan. When enough tryptophan is present, a stem- 
loop structure forms in the attenuator region, which leads to the release of 
RNA polymerase from the DNA (Figure 31.21). However, when tryptophan 
is scarce, transcription is terminated less frequently. Little tryptophanyl- 
tRNA is present, and so the ribosome stalls at the tandem UGG codons 
encoding tryptophan. This delay leaves the adjacent region of the mRNA 
exposed as transcription continues. An alternative RNA structure that does 
not function as a terminator is formed, and transcription continues into and 
through the coding regions for the enzymes. Thus, attenuation provides an 
elegant means of sensing the supply of tryptophan required for protein 
synthesis. 


Several other operons for the biosynthesis of amino acids in E. coli also 
are regulated by attenuator sites. The leader peptide of each contains 
an abundance of the amino acid residues of the type synthesized by the 
operon (Figure 31.22). For example, the leader peptide for the phenylala- 
nine operon includes 7 phenylalanine residues among 15 residues. The 
threonine operon encodes enzymes required for the synthesis of both threo- 
nine and isoleucine; the leader peptide contains 8 threonine and 4 isoleucine 
residues in a 16-residue sequence. The leader peptide for the histidine 
operon includes 7 histidine residues in a row. In each case, low levels of 
the corresponding charged tRNA cause the ribosome to stall, trapping the 


(A) Met - Lys - Arg - Ile - Ser - Thr - Thr - Ile - Thr - Thr - Thr - Ile - Thr - Ile - Thr - Thr - 
    AUG AAA CGC AUU AGC ACC ACC AUU ACC ACC ACC AUC ACC AUU ACC ACA 3’ 


(B) Met - Lys - His - Ile - Pro - Phe - Phe - Phe - Ala - Phe - Phe - Phe - Thr - Phe - Pro - Stop 
    AUG AAA CAC AUA CCG UUU UUC UUC GCA UUC UUU UUU ACC UUC CCC UGA 3’ 


(C) Met - Thr - Arg - Val - Gln - Phe - Lys - His - His - His - His - His - His - His - Pro - Asp - 
    AUG ACA CGC GUU CAA UUU AAA CAC CAC CAU CAU CAC CAU CAU CCU GAC 3’ 

Figure 31.22 Leader peptide sequences. Amino acid sequences and the corresponding mRNA 
nucleotide sequences of the (A) threonine operon, (B) phenylalanine operon, and (C) histidine 
operon. In each case, an abundance of one amino acid in the leader peptide sequence leads to 
attenuation. 


nascent mRNA in a state that can form a structure that allows RNA 
polymerase to read through the attenuator site. Evolution has apparently 
converged on this strategy repeatedly as a mechanism for controlling amino 
acid biosynthesis. 
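
The residue counts quoted above can be checked directly against the mRNA sequences in Figure 31.22. The following Python sketch translates the three leader sequences and tallies the residues. The codon table is deliberately partial (it lists only the codons that occur in these three sequences), and the function and variable names are illustrative choices rather than part of any standard library.

```python
from collections import Counter

# Partial codon table: only codons that appear in the three leaders.
CODON_TABLE = {
    "AUG": "Met", "AAA": "Lys", "CGC": "Arg", "AGC": "Ser",
    "AUU": "Ile", "AUC": "Ile", "AUA": "Ile",
    "ACC": "Thr", "ACA": "Thr",
    "CAC": "His", "CAU": "His",
    "CCG": "Pro", "CCC": "Pro", "CCU": "Pro",
    "UUU": "Phe", "UUC": "Phe",
    "GCA": "Ala", "GUU": "Val", "CAA": "Gln", "GAC": "Asp",
    "UGA": "Stop",
}

# Leader mRNA sequences as printed in Figure 31.22.
LEADERS = {
    "threonine": "AUG AAA CGC AUU AGC ACC ACC AUU ACC ACC ACC AUC ACC AUU ACC ACA",
    "phenylalanine": "AUG AAA CAC AUA CCG UUU UUC UUC GCA UUC UUU UUU ACC UUC CCC UGA",
    "histidine": "AUG ACA CGC GUU CAA UUU AAA CAC CAC CAU CAU CAC CAU CAU CCU GAC",
}


def translate(mrna):
    """Translate space-separated codons, stopping at a stop codon."""
    residues = []
    for codon in mrna.split():
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "Stop":
            break
        residues.append(amino_acid)
    return residues


def longest_run(residues, target):
    """Length of the longest consecutive run of one residue type."""
    best = current = 0
    for residue in residues:
        current = current + 1 if residue == target else 0
        best = max(best, current)
    return best


for operon, mrna in LEADERS.items():
    peptide = translate(mrna)
    print(operon, len(peptide), dict(Counter(peptide)))
```

Counting in this way reproduces the figures given in the text: 7 phenylalanine residues among 15, 8 threonine and 4 isoleucine residues in the 16 residues shown, and 7 consecutive histidine residues.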


Summary 


31.1 Many DNA-Binding Proteins Recognize Specific DNA Sequences 

The regulation of gene expression depends on the interplay between 
specific sequences within the genome and proteins that bind specifically 
to these sites. Specific DNA-binding proteins recognize regulatory 
sites that usually lie adjacent to the genes whose transcription is 
regulated by these proteins. The proteins of the largest family contain 
a helix-turn-helix motif. The first helix of this motif inserts into the 
major groove of DNA and makes specific hydrogen-bonding and 
other contacts with the edges of the base pairs. 


31.2 Prokaryotic DNA-Binding Proteins Bind Specifically to Regulatory Sites 
in Operons 

In prokaryotes, many genes are clustered into operons, which are units 
of coordinated genetic expression. An operon consists of control sites 
(an operator and a promoter) and a set of structural genes. In addition, 
regulator genes encode proteins that interact with the operator and 
promoter sites to stimulate or inhibit transcription. The treatment of 
E. coli with lactose induces an increase in the production of β-galactosidase 
and two additional proteins that are encoded in the lactose 
operon. In the absence of lactose or a similar galactoside inducer, the 
lac repressor protein binds to an operator site on the DNA and blocks 
transcription. The binding of allolactose, a derivative of lactose, to the 
lac repressor induces a conformational change that leads to dissociation 
from DNA. RNA polymerase can then move through the operator 
to transcribe the lac operon. 

Some proteins activate transcription by directly contacting RNA 
polymerase. For example, cyclic AMP stimulates the transcription of 
many catabolic operons by binding to the catabolite activator protein. 
The binding of the cAMP-CAP complex to a specific site in the promoter 
region of an inducible catabolic operon enhances the binding of 
RNA polymerase and the initiation of transcription. 


31.3 Regulatory Circuits Can Result in Switching Between Patterns 
of Gene Expression 

The study of bacterial viruses, particularly bacteriophage λ, has 
revealed key aspects of gene-regulatory networks. Bacteriophage λ can 
develop by either a lytic or a lysogenic pathway. A key regulatory protein, 
the λ repressor, regulates its own expression, promoting transcription 
of the gene that encodes the repressor when repressor levels 
are low and blocking transcription of the gene when levels are high. 
This behavior depends on the λ right operator, which includes three 
sites to which λ repressor dimers can bind. Cooperative binding of the 
λ repressor to two of the sites stabilizes the state in which two λ repressor 
dimers are bound. The Cro protein binds to the same sites as does 
the λ repressor, but with reversed affinities. When Cro is present at 
sufficient concentrations, it blocks the transcription of the gene for the 
λ repressor while allowing the transcription of its own gene. Thus, 
these two proteins and the operator form a genetic switch that can exist 
in either of two states. 

Some prokaryotic species participate in quorum sensing. This 
process includes the release of chemicals called autoinducers into 
the medium surrounding the cells. These autoinducers are often, but 
not always, acyl homoserine lactones. Autoinducers are taken up by 
surrounding cells. When the autoinducer concentration reaches an 
appropriate level, it is bound by receptor proteins that activate the 
expression of genes, including those that promote the synthesis of 
more autoinducer. These chemically mediated social interactions 
allow these prokaryotes to change their gene-expression patterns 
in response to the number of other cells in their environments. 
Biofilms are complex communities of prokaryotes that are promoted 
by quorum-sensing mechanisms. 


31.4 Gene Expression Can Be Controlled at Posttranscriptional Levels 


Gene expression can also be regulated at the level of translation. In 
prokaryotes, many operons important in amino acid biosynthesis are 
regulated by attenuation, a process that depends on the formation of 
alternative structures in mRNA, one of which favors the termination 
of transcription. Attenuation is mediated by the translation of a leader 
region of mRNA. A ribosome stalled by the absence of an aminoacyl-tRNA 
needed to translate the leader mRNA alters the structure of 
mRNA, allowing RNA polymerase to transcribe the operon beyond 
the attenuator site. 


Key Terms 

helix-turn-helix motif (p. 923) 
β-galactosidase (p. 924) 
operon model (p. 924) 
repressor (p. 924) 
lac repressor (p. 925) 
lac operator (p. 925) 
inducer (p. 926) 
isopropylthiogalactoside (IPTG) (p. 926) 
pur repressor (p. 927) 
corepressor (p. 927) 
catabolite repression (p. 927) 
catabolite activator protein (CAP) (p. 927) 
quorum sensing (p. 930) 
autoinducer (p. 930) 
biofilm (p. 930) 
attenuation (p. 931) 


Problems 


1. Missing genes. Predict the effects of deleting the following 
regions of DNA: 

(a) The gene encoding lac repressor 
(b) The lac operator 
(c) The gene encoding CAP 


2. Minimal concentration. Calculate the concentration of lac 
repressor, assuming that one molecule is present per cell. 
Assume that each E. coli cell has a volume of 10⁻¹² cm³. 
Would you expect the single molecule to be free or bound to 
DNA? 


3. Counting sites. Calculate the expected number of times 
that a given 8-base-pair DNA site should be present in the 
E. coli genome. Assume that all four bases are equally probable. 
Repeat for a 10-base-pair site and a 12-base-pair site. 


4. The same but not the same. The lac repressor and the pur 
repressor are homologous proteins with very similar 
three-dimensional structures, yet they have different effects on 
gene expression. Describe two important ways in which the 
gene-regulatory properties of these proteins differ. 


5. The opposite direction. Some compounds called anti- 
inducers bind to repressors such as the lac repressor and 
inhibit the action of inducers; that is, transcription is 
repressed and higher concentrations of inducer are required 
to induce transcription. Propose a mechanism of action for 
anti-inducers. 


6. Inverted repeats. Suppose that a nearly perfect 20-base-pair 
inverted repeat is observed in a DNA sequence. Provide 
two possible explanations. 


7. Broken operators. Consider a hypothetical mutation in 
OR2 that blocks both λ repressor and Cro binding. How 
would this mutation affect the likelihood of bacteriophage λ 
entering the lytic phase? 


8. Promoters. Compare the −10 and −35 sequences for the 
λ repressor and Cro genes in the right operator. How many 
differences are there between these sequences? 


9. Positive and negative feedback. What is the effect of an 
increased Cro concentration on the expression of the gene 
for the λ repressor? Of an increased concentration of λ 
repressor on the expression of the Cro gene? Of an increased 
concentration of λ repressor on the expression of the λ 
repressor gene? 


10. Leaderless. The mRNA for the λ repressor begins with 
5’-AUG-3’, which encodes the methionine residue that 
begins the protein. What is unusual about this beginning? 
Would it cause the mRNA to translate efficiently or not? 


11. Quorum count. Suppose you have a series of compounds 
that you wish to test for the autoinducer activity in Vibrio 
fischeri. Propose a simple assay, assuming that you can grow 
V. fischeri cultures at low cell densities. 


12. Codon utilization. There are four codons that encode 
threonine. Consider the leader sequence in Figure 31.22A. 
What codons are used and with what frequency? 


Mechanism Problem 


13. Follow the stereochemistry. The hydrolysis of lactose is 
catalyzed by β-galactosidase. 


[Reaction scheme: lactose + H2O → galactose + glucose] 


Does the overall reaction proceed with retention or inver- 
sion of configuration? Given that each step likely proceeds 
with inversion of configuration, what does the overall 
change in stereochemistry suggest about the mechanism? A 
key residue in the reaction has been identified to be Glu 
537. Propose an overall mechanism for the hydrolysis of 
lactose. 


Data Interpretation Problem 


14. Leaving tracks. A powerful method for examining 
protein-DNA interactions is called DNA footprinting. In 
this method, a DNA fragment containing a potential bind- 
ing site is radiolabeled on one end. The labeled DNA is then 
treated with a DNA-cleaving agent such as DNase I such 
that each DNA molecule within the population is cut only 
once. The same cleavage process is carried out in the pres- 
ence of the DNA-binding protein. The bound protein 
protects some sites within the DNA from cleavage. The 
patterns of DNA fragments in the cleaved pool of DNA 
molecules are then examined by electrophoresis followed 
by autoradiography. 

This method is applied to a DNA fragment containing a 
single binding site for the λ repressor in the presence of different 
concentrations of the λ repressor. The results are 
shown below: 

Estimate the dissociation constant for the λ repressor-DNA 
complex and the standard free energy of binding. 


The Control of Gene 
Expression in Eukaryotes 


Complex biological processes often require coordinated control of the expression of 
many genes. The maturation of a tadpole into a frog is largely controlled by thyroid 
hormone. This hormone regulates gene expression by binding to a protein, the 
thyroid-hormone receptor, as shown at the right. In response to the hormone’s binding, 
this protein binds to specific DNA sites in the genome and modulates the expression 
of nearby genes. [(Right) Sharon Cummings/Dembinsky Photo Associates] 


Many of the most important and intriguing features in modern biology 
and medicine, such as the pathways crucial for the development of 
multicellular organisms, the changes that distinguish normal cells and cancer 
cells, and the evolutionary changes leading to new species, entail networks 
of gene-regulatory pathways. Gene regulation in eukaryotes is significantly 
more complex than in prokaryotes in several ways. First, the genomes being 
regulated are significantly larger. The E. coli genome consists of a single, 
circular chromosome containing 4.6 Mb. This genome encodes approximately 
2000 proteins. In comparison, one of the simplest eukaryotes, Saccharomyces 
cerevisiae (baker’s yeast), contains 16 chromosomes ranging in size from 
0.2 to 2.2 Mb (Figure 32.1). The yeast genome totals 12 Mb and encodes 
approximately 6000 proteins. The genome within a human cell contains 
23 pairs of chromosomes ranging in size from 50 to 250 Mb. Approximately 
23,000 genes are present within the 3000 Mb of human DNA. 

Second, whereas prokaryotic genomic DNA is relatively accessible, 
eukaryotic DNA is packaged into chromatin, a complex between the DNA 
and a special set of proteins (Figure 32.2). Although the principles for the 
construction of chromatin are relatively simple, the chromatin structure for 
a complete genome is quite complex. Importantly, in a given eukaryotic cell, 


CHAPTER 32 


OUTLINE 


32.1 Eukaryotic DNA Is Organized 
into Chromatin 


32.2 Transcription Factors Bind DNA 
and Regulate Transcription 
Initiation 


32.3 The Control of Gene Expression 
Can Require Chromatin 
Remodeling 


32.4 Eukaryotic Gene Expression 
Can Be Controlled at 
Posttranscriptional Levels 

(if double stranded) or 10⁶ bases (if single 
stranded). 

Figure 32.1 Yeast chromosomes. Pulsed-field 
electrophoresis allows the separation of 
16 yeast chromosomes. [From G. Chu, D. 
Vollrath, and R. W. Davis. Science 234:1582-1585, 
1986.] 

Figure 32.2 Chromatin structure. An electron micrograph of chromatin showing its “beads on a 
string” character. The beads correspond to DNA complexes with specific proteins. [Courtesy of Dr. 
Ada Olins and Dr. Donald Olins.] 


some genes and their associated regulatory regions are relatively accessible 
for transcription and regulation, whereas other genes are tightly packaged 
and are thus rendered inactive. Eukaryotic gene regulation frequently 
requires the manipulation of chromatin structure. 

A manifestation of this complexity is the presence of many different cell 
types in most eukaryotes. A liver cell, a pancreatic cell, and an embryonic 
stem cell contain the same DNA sequences, yet the subset of genes highly 
expressed in cells from the pancreas, which secretes digestive enzymes, 
differs markedly from the subset highly expressed in the liver, the site of 
lipid transport and energy transduction. Embryonic stem cells do not 
express any subset of genes at high levels; the most highly expressed genes 
are “housekeeping” genes involved in the cytoskeleton and processes such 
as translation (Table 32.1). The existence of stable cell types is due to 
differences in the epigenome, that is, differences in chromatin structure and 
covalent modifications of the DNA, not in the DNA sequence itself. 
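
The contrast among the three cell types can be made concrete by summing the percentages listed in Table 32.1. The short Python sketch below is only an illustration: the dictionary and function names are our own, and the numbers are copied from the table.

```python
# Percent of the total mRNA pool for the ten most highly expressed
# protein-encoding genes in each cell type (values copied from Table 32.1).
TOP10_PERCENT = {
    "pancreas": [7.6, 5.5, 4.4, 3.7, 2.4, 1.9, 1.9, 1.7, 1.7, 1.4],
    "liver": [3.5, 2.8, 2.5, 2.1, 1.5, 1.1, 1.1, 1.0, 0.9, 0.9],
    "stem cells": [0.7, 0.6, 0.5, 0.5, 0.4, 0.4, 0.3, 0.3, 0.2, 0.2],
}


def top10_share(cell_type):
    """Return the summed share (in percent) of the ten listed genes."""
    return round(sum(TOP10_PERCENT[cell_type]), 1)


for cell_type in TOP10_PERCENT:
    print(f"{cell_type}: {top10_share(cell_type)}% of the mRNA pool")
```

The ten listed genes account for roughly a third of the mRNA pool in the pancreas and about a sixth in the liver, but only a few percent in embryonic stem cells, consistent with the absence of a dominant, highly expressed gene set in stem cells.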

In addition, eukaryotic genes are not generally organized into operons. 
Instead, genes that encode proteins for steps within a given pathway are 
often spread widely across the genome. This characteristic requires that 
other mechanisms function to regulate genes in a coordinated way. 

Despite these differences, some aspects of gene regulation in eukaryotes 
are quite similar to those in prokaryotes. In particular, activator and repres- 
sor proteins that recognize specific DNA sequences are central to many 
gene-regulatory processes. In this chapter, we will focus first on chromatin 
structure. We will then turn to transcription factors—DNA-binding 
proteins similar in many ways to the prokaryotic proteins that we encoun- 
tered in the preceding chapter. Eukaryotic transcription factors can act directly 
by interacting with the transcriptional machinery or indirectly by influencing 
chromatin structure. Finally, we examine selected posttranscriptional gene- 
regulatory mechanisms, including those based on microRNAs, an important 
class of gene-regulatory molecules discovered in recent years. 


32.1 Eukaryotic DNA Is Organized into Chromatin 


Eukaryotic DNA is tightly bound to a group of small basic proteins called 
histones. In fact, histones constitute half the mass of a eukaryotic chromo- 
some. The entire complex of a cell’s DNA and associated protein is called 


Table 32.1 Highly expressed protein-encoding genes of the pancreas, liver, and 
embryonic stem cells (as percentage of total mRNA pool) 

Rank  Pancreas                     %    Liver                 %    Stem cells                                %
1     Procarboxypeptidase A1       7.6  Albumin               3.5  Glyceraldehyde-3-phosphate dehydrogenase  0.7
2     Pancreatic trypsinogen 2     5.5  Apolipoprotein A-I    2.8  Translation elongation factor 1 α1        0.6
3     Chymotrypsinogen             4.4  Apolipoprotein C-I    2.5  Tubulin α                                 0.5
4     Pancreatic trypsin 1         3.7  Apolipoprotein C-III  2.1  Translationally controlled tumor protein  0.5
5     Elastase IIIB                2.4  ATPase 6/8            1.5  Cyclophilin A                             0.4
6     Protease E                   1.9  Cytochrome oxidase 3  1.1  Cofilin                                   0.4
7     Pancreatic lipase            1.9  Cytochrome oxidase 2  1.1  Nucleophosmin                             0.3
8     Procarboxypeptidase B        1.7  α1-Antitrypsin        1.0  Connexin 43                               0.3
9     Pancreatic amylase           1.7  Cytochrome oxidase 1  0.9  Phosphoglycerate mutase                   0.2
10    Bile-salt-stimulated lipase  1.4  Apolipoprotein E      0.9  Translation elongation factor 1 β2        0.2

Sources: Data for pancreas from V. E. Velculescu, L. Zhang, B. Vogelstein, and K. W. Kinzler. Science 
270:484-487, 1995. Data for liver from T. Yamashita, S. Hashimoto, S. Kaneko, S. Nagai, N. Toyoda, T. 
Suzuki, K. Kobayashi, and K. Matsushima. Biochem. Biophys. Res. Commun. 269:110-116, 2000. Data for 
stem cells from M. Richards, S. P. Tan, J. H. Tan, W. K. Chan, and A. Bongso. Stem Cells 22:51-64, 2004. 


chromatin. Chromatin serves to compact and organize eukaryotic DNA and 
its presence has dramatic consequences for gene regulation. 


Nucleosomes are complexes of DNA and histones 


Chromatin is made up of repeating units, each containing 200 bp of DNA 
and two copies each of four histone proteins: H2A, H2B, H3, and H4. 
Histones have strikingly basic properties because a quarter of the residues 
in each histone are either arginine or lysine, positively charged amino acids 
that strongly interact with the negatively charged DNA. The protein 
complex is called the histone octamer. The repeating units of the histone 
octamer and the associated DNA are known as nucleosomes. Chromatin 
viewed with the electron microscope has the appearance of beads on a string 
(see Figure 32.2); each bead has a diameter of approximately 100 Å. Partial 
digestion of chromatin with DNase yields the isolated beads. These 
particles consist of fragments of DNA about 200 bp in length bound to the 
histone octamer. More-extensive digestion yields a shorter DNA fragment 
of 145 bp bound to the octamer. The smaller complex formed by the histone 
octamer and the 145-bp DNA fragment is the nucleosome core particle. The 
DNA connecting core particles in undigested chromatin is called linker 
DNA. Histone H1 binds, in part, to the linker DNA. 


DNA wraps around histone octamers to form nucleosomes 


The overall structure of the nucleosome was revealed through electron 
microscopic and x-ray crystallographic studies pioneered by Aaron Klug 
and his colleagues. More recently, the three-dimensional structure of a 
reconstituted nucleosome core particle (Figure 32.3) was determined to 
higher resolution by x-ray diffraction methods. The four types of histone 
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Amino-terminal 


Figure 32.3 Nucleosome core particle. The structure consists of a core of eight histone 
proteins surrounded by DNA. (A) A view showing the DNA wrapping around the histone core. 
(B) A view related to that in part A by a 90-degree rotation. Notice that the DNA forms a 
left-handed superhelix as it wraps around the core. (C) A schematic view. [Drawn from 1AOI.pdb.] 


that make up the protein core are homologous and similar in structure 
(Figure 32.4). The eight histones in the core are arranged into a (H3)2(H4)2 
tetramer and a pair of H2A-H2B dimers. The tetramer and dimers come 
together to form a left-handed superhelical ramp around which the DNA 
wraps. In addition, each histone has an amino-terminal tail that extends out 
from the core structure. These tails are flexible and contain a number of 
lysine and arginine residues. As we shall see, covalent modifications of these 
tails play an essential role in regulating gene expression. 

The DNA forms a left-handed superhelix as it wraps around the outside 
of the histone octamer. The protein core forms contacts with the inner 
surface of the DNA superhelix at many points, particularly along the phos- 
phodiester backbone and the minor groove. Nucleosomes will form on 
almost all DNA sites, although some sequences are preferred because the 
dinucleotide steps are properly spaced to favor bending around the histone 
core. A histone with a different structure from that of the others, called 


Figure 32.4 Homologous histones. Histones H2A, H2B, H3, and H4 adopt a similar 
three-dimensional structure as a consequence of common ancestry. Some parts of the tails at the 
termini of the proteins are not shown. [Drawn from 1AOI.pdb.] 


histone H1, seals off the nucleosome at the location at which the linker 
DNA enters and leaves. The amino acid sequences of histones, including 
their amino-terminal tails, are remarkably conserved from yeast to human 
beings. 

The winding of DNA around the nucleosome core contributes to the 
packing of DNA by decreasing its linear extent. An extended 200-bp stretch 
of DNA would have a length of about 680 Å. Wrapping this DNA around 
the histone octamer reduces the length to approximately 100 Å along the 
long dimension of the nucleosome. Thus the DNA is compacted by a factor 
of 7. However, human chromosomes in metaphase, which are highly 
condensed, are compacted by a factor of 10⁴. Clearly, the nucleosome is just the 
first step in DNA compaction. What is the next step? The nucleosomes 
themselves are arranged in a helical array approximately 360 Å across, 
forming a series of stacked layers approximately 110 Å apart (Figure 32.5). The 
folding of these fibers of nucleosomes into loops further compacts DNA. 
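
The compaction arithmetic in this paragraph can be checked with a few lines of Python. This is a minimal sketch: the rise of 3.4 Å per base pair is the standard value for B-form DNA and is assumed here to account for the 680 Å figure, and the constant and function names are our own.

```python
RISE_PER_BP_ANGSTROM = 3.4          # axial rise per base pair of B-DNA (assumed)
NUCLEOSOME_REPEAT_BP = 200          # DNA per nucleosome repeat, from the text
NUCLEOSOME_LENGTH_ANGSTROM = 100.0  # long dimension of the nucleosome, from the text
METAPHASE_COMPACTION = 1e4          # overall metaphase compaction, from the text


def extended_length(bp):
    """Length in angstroms of a fully extended stretch of B-DNA."""
    return bp * RISE_PER_BP_ANGSTROM


def compaction_factor(bp, packed_length):
    """Ratio of the extended length to the packed length."""
    return extended_length(bp) / packed_length


nucleosome_factor = compaction_factor(NUCLEOSOME_REPEAT_BP, NUCLEOSOME_LENGTH_ANGSTROM)
# Compaction that higher-order folding must still supply.
remaining_factor = METAPHASE_COMPACTION / nucleosome_factor

print(f"extended length: {extended_length(NUCLEOSOME_REPEAT_BP):.0f} angstroms")
print(f"nucleosome compaction: about {nucleosome_factor:.1f}-fold")
print(f"still required from higher-order folding: about {remaining_factor:.0f}-fold")
```

The nucleosome-level factor comes out at about 6.8, which is rounded to 7 above. More than a thousandfold further compaction must therefore come from higher-order structures.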

The wrapping of DNA around the histone octamer as a left-handed 
helix also stores negative supercoils; if the DNA in a nucleosome is 
straightened out, the DNA will be underwound. This underwinding is 
exactly what is needed to separate the two DNA strands during replication 
and transcription. 


32.2 Transcription Factors Bind DNA and 
Regulate Transcription Initiation 


DNA-binding transcription factors are key to gene regulation in eukaryotes, 
just as they are in prokaryotes. However, the roles of eukaryotic transcrip- 
tion factors are different in several ways. First, whereas the DNA-binding 
sites crucial for the control of gene expression in prokaryotes are usually 
quite close to promoters, those in eukaryotes can be farther away from pro- 
moters and can exert their action at a distance. Second, most prokaryotic 
genes are regulated by single transcription factors, and multiple genes in a 
pathway are expressed in a coordinated fashion because such genes are often 
transcribed as part of a polycistronic mRNA. In eukaryotes, the expression 
of each gene is typically controlled by multiple transcription factors, and 
the coordinated expression of different genes depends on having similar 
transcription-factor-binding sites in each gene in the set. Third, in prokary- 
otes, transcription factors usually interact directly with RNA polymerase. 
In eukaryotes, some transcription factors interact directly with RNA poly- 
merase, whereas others interact with other proteins associated with RNA 
polymerase and still others act by modifying the chromatin structure. Let us 
now examine eukaryotic transcription factors in more detail. 

Eukaryotic transcription factors usually consist of several domains. The 
DNA-binding domain binds to regulatory sequences that can either be adja- 
cent to the promoter or at some distance from it. Most commonly, transcrip- 
tion factors include additional domains that help activate transcription. 
When a transcription factor is bound to the DNA, its activation domain 
promotes transcription by interacting with RNA polymerase II, by inter- 
acting with other associated proteins, or by modifying the local structure of 
chromatin. 


A range of DNA-binding structures are employed by 
eukaryotic DNA-binding proteins 


The structures of many eukaryotic DNA-binding proteins have been deter- 
mined and a range of structural motifs have been observed, but we will 
focus on three that reveal the common features and the diversity of these 
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360A 
(36 nm) 


Figure 32.5 Higher-order chromatin 
structure. A proposed model for chromatin 
arranged in a helical array consisting of six 
nucleosomes per turn of helix. The DNA 
double helix (shown in red) is wound around 
each histone octamer (shown in blue). [After 
J. T. Finch and A. Klug. Proc. Natl. Acad. 
Sci. U. S. A. 73:1897-1901, 1976.] 


Figure 32.6 Homeodomain structure. The structure of a 
heterodimer formed from two different DNA-binding domains, each 
based on a homeodomain. Notice that each homeodomain has a 
helix-turn-helix motif with one helix inserted into the major groove of 
DNA. [Drawn from 1AKH.pdb.] 


Figure 32.7 Basic-leucine zipper. 
This heterodimer comprises two basic-leucine 
zipper proteins. Notice that the basic region 
lies in the major groove of DNA. The 
leucine-zipper part stabilizes the protein 
dimer. [Drawn from 1FOS.pdb.] 


Figure 32.8 Zinc-finger domains. A 
DNA-binding domain comprising three 
Cys2His2 zinc-finger domains (shown in 
yellow, blue, and red) is shown in a complex 
with DNA. Each zinc-finger domain is 
stabilized by a bound zinc ion (shown in 
green) through interactions with two cysteine 
residues and two histidine residues. Notice 
how the protein wraps around the DNA in the 
major groove. [Drawn from 1AAY.pdb.] 
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motifs. The first class of eukaryotic DNA-binding unit 
that we will consider is the homeodomain (Figure 32.6). 
The structure of this domain and its mode of recognition 
of DNA are very similar to those of the prokaryotic helix- 
turn-helix proteins. In eukaryotes, homeodomain proteins 
often form heterodimeric structures, sometimes with other 
homeodomain proteins, that recognize asymmetric DNA 
sequences. 

The second class of eukaryotic DNA-binding unit comprises 
the basic-leucine zipper (bZip) proteins (Figure 32.7). 
This DNA-binding unit consists of a pair of long α helices. 
The first part of each α helix is a basic region that lies in the 
major groove of the DNA and makes contacts responsible 
for DNA-site recognition. The second part of each α helix 
forms a coiled-coil structure with its partner. Because these 
units are often stabilized by appropriately spaced leucine 
residues, these structures are often referred to as leucine 
zippers. 

The final class of eukaryotic DNA-binding units that 
we will consider here are the Cys2His2 zinc-finger domains 
(Figure 32.8). A DNA-binding unit of this class comprises tandem sets of 
small domains, each of which binds a zinc ion through conserved sets of two 
cysteine and two histidine residues. These domains, often called zinc-finger 
domains, form a string that follows the major groove of DNA. An α helix 
from each domain makes specific contact with the edges of base pairs 
within the groove. Some proteins contain arrays of 10 or more zinc-finger 
domains, potentially enabling them to contact long stretches of DNA. The 
human genome encodes several hundred proteins that contain zinc-finger 
domains of this class. We will encounter another class of zinc-based 
DNA-binding domain when we consider nuclear hormone receptors in 
Section 32.3. 


Activation domains interact with other proteins 


The activation domains of transcription factors generally recruit other pro- 
teins that promote transcription. In some cases, these activation domains 
interact directly with RNA polymerase II or closely associated proteins. 
In other cases, the activation domains act through intermediary proteins that bridge 
between the transcription factors and the polymerase. An important target 
of activators is mediator, a complex of 25 to 30 subunits conserved from 


yeast to human beings, that acts as a bridge between transcription factors 
and promoter-bound RNA polymerase II (Figure 32.9). 

Activation domains are less conserved than DNA-binding domains. In 
fact, very little sequence similarity has been found. For example, they may 
be acidic, hydrophobic, glutamine rich, or proline rich. However, certain 
features are common to activation domains. First, they are often redundant; 
that is, a part of the activation domain can be deleted without loss of func- 
tion. Second, they are modular and can activate transcription when paired 
with a variety of DNA-binding domains. Third, activation domains can act 
synergistically: two activation domains acting together create a stronger 
effect than either acting separately. 

We have been considering the case in which gene control increases the 
expression level of a gene. In many cases, the expression of a gene must be 
decreased by blocking transcription. The agents in such cases are transcrip- 
tional repressors. Like activators, transcriptional repressors act in many 
cases by altering chromatin structure. 


Figure 32.9 Mediator. Mediator, a large 
complex of protein subunits, acts as a bridge 
between transcription factors bearing 
activation domains and RNA polymerase II. 
These interactions help recruit and stabilize 
RNA polymerase II near specific genes that 
are then transcribed. 

Multiple transcription factors interact with eukaryotic regulatory regions 


The basal transcription complex described in Chapter 29 initiates transcrip- 
tion at a low frequency. Recall that several general transcription factors join 
with RNA polymerase II to form the basal transcription complex. Additional 
transcription factors must bind to other sites that can be near the promoter 
or quite distant for a gene to achieve a higher rate of mRNA synthesis. In 
contrast with the regulators of prokaryotic transcription, few eukaryotic 
transcription factors have any effect on transcription on their own. Instead, 
each factor recruits other proteins to build up large complexes that interact 
with the transcriptional machinery to activate transcription. 

A major advantage of this mode of regulation is that a given regulatory 
protein can have different effects, depending on what other proteins are 
present in the same cell. This phenomenon, called combinatorial control, is 
crucial to multicellular organisms that have many different cell types. Even 
in unicellular eukaryotes such as yeast, combinatorial control allows the 
generation of distinct cell types. 
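
Combinatorial control can be pictured as simple set logic. The sketch below is a deliberately simplified model, not a description of any real regulatory network: the gene and factor names are hypothetical, and we assume that a gene is activated only when every factor it requires is present.

```python
# Hypothetical regulatory regions: each gene lists the transcription factors
# that must all be present for activation. Names are illustrative only.
REQUIRED_FACTORS = {
    "gene_1": {"TF_A", "TF_B"},
    "gene_2": {"TF_A", "TF_C"},
    "gene_3": {"TF_B", "TF_C"},
}


def active_genes(factors_present):
    """Return the genes whose required factors are all present in the cell."""
    present = set(factors_present)
    return sorted(
        gene for gene, needed in REQUIRED_FACTORS.items() if needed <= present
    )


# The same factor, TF_A, contributes to different outcomes in different cells.
print(active_genes({"TF_A", "TF_B"}))
print(active_genes({"TF_A", "TF_C"}))
```

In this toy model, a cell containing TF_A and TF_B activates only gene_1, whereas a cell containing TF_A and TF_C activates only gene_2, so the effect of TF_A depends on its partners.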


Enhancers can stimulate transcription in specific cell types 


Transcription factors can often act even if their binding sites lie at a consid- 
erable distance from the promoter. These distant regulatory sites are called 
enhancers (Chapter 29). Enhancers function by serving as binding sites for 
specific transcription factors. An enhancer is effective only in the specific 
cell types in which appropriate regulatory proteins are expressed. In many 
cases, these DNA-binding proteins influence transcription initiation by 
perturbing the local chromatin structure to expose a gene or its regulatory 
sites rather than by direct interactions with RNA polymerase. This mecha- 
nism accounts for the ability of enhancers to act at a distance. 

The properties of enhancers are illustrated by studies of the enhancer 
controlling the muscle isoform of creatine kinase (Figure 32.10). The 
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Start 

site 
TATA 
CAGCTG 
CAGCTG 

Enhancer CAGCTG 
region TTATAATTAA 

CCATGTAAGG 


Figure 32.10 Enhancer binding sites. A 
schematic structure for the region 1 kb 
upstream of the start site for the muscle 
creatine kinase gene. One binding site of the 
form 5’-CAGCTG-3’ is present near the TATA 
box. The enhancer region farther upstream 
contains two binding sites for the same 
protein and two additional binding sites for 
other proteins. 


Figure 32.11 An experimental 
demonstration of enhancer function. A 
promoter for muscle creatine kinase artificially 
drives the transcription of β-galactosidase in 
a zebrafish embryo. Only specific sets of 
muscle cells produce β-galactosidase, as 
visualized by the formation of the blue 
product on treatment of the embryo with 
X-Gal. [From F. Miller, D. W. Williamson, 

J. Kobolak, L. Gauvry, G. Goldspink, L. Orban, 
and N. MacLean. Mol. Reprod. Dev. 
47:404-412, 1997.] 


Figure 32.12 Induced pluripotent stem 
cells. A micrograph of human induced 
pluripotent stem cells stained green for a 
transcription factor that is characteristic of 
pluripotent cells. [From K. Takahashi et al., Cell 
131:861-872, 2007; with permission from 
Elsevier, courtesy of Shinya Yamanaka, Kyoto 
University.] 
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results of mutagenesis and other studies revealed the presence of an 
enhancer located between 1350 and 1050 base pairs upstream of the start 
site of the gene for this enzyme. Experimentally inserting this enhancer 
near a gene not normally expressed in muscle cells is sufficient to cause the 
gene to be expressed at high levels in muscle cells but not in other cells 
(Figure 32.11). 


Induced pluripotent stem cells can be generated by introducing 
four transcription factors into differentiated cells 


An important application illustrating the power of transcription 
factors is the development of induced pluripotent stem (iPS) cells. 
Pluripotent stem cells have the ability to differentiate into many different 
cell types on appropriate treatment. Previously isolated cells derived from 
embryos show a very high degree of pluripotency. Over time, researchers 
identified dozens of genes in embryonic stem cells that contributed to this 
pluripotency when expressed. In a remarkable experiment reported for 
mouse cells in 2006 and human cells in 2007, Shinya Yamanaka demon- 
strated that just four genes out of this entire set could induce pluripotency 
in already-differentiated skin cells. Yamanaka introduced genes encoding 
four transcription factors into skin cells called fibroblasts. The fibroblasts 
de-differentiated into cells that appeared to have characteristics very nearly 
identical with those of embryonic stem cells (Figure 32.12). 

These iPS cells represent powerful new research tools and, potentially, a 
new class of therapeutic agents. The proposed concept is that a sample of a 
patient’s fibroblasts could be readily isolated and converted into iPS cells. 
These iPS cells could then be treated to differentiate into a desired cell type 
that could then be transplanted into the patient. For example, such an 
approach might be used to restore a particular class of nerve cells that had 
been depleted by a neurodegenerative disease. Although the field of iPS cell 
research is still in its very early stages, it holds great promise as a possible 
approach to treatment for many common and difficult-to-treat diseases. 


32.3 The Control of Gene Expression Can 
Require Chromatin Remodeling 


Early observations suggested that chromatin structure plays a major role in 
controlling eukaryotic gene expression. DNA that is densely packaged into 
chromatin is less susceptible to cleavage by the nonspecific DNA-cleaving 
enzyme DNase I. Regions adjacent to genes that are being transcribed are 
more sensitive to cleavage by DNase I than are other sites in the genome, 
suggesting that the DNA in these regions is less compacted than it is else- 
where and more accessible to proteins. In addition, some sites, usually 
within 1 kb of the start site of an active gene, are exquisitely sensitive to 
DNase I and other nucleases. These hypersensitive sites correspond to 
regions that have few nucleosomes or contain nucleosomes in an altered 
conformational state. Hypersensitive sites are cell-type specific and develop- 
mentally regulated. For example, globin genes in the precursors of erythroid 
cells from 20-hour-old chicken embryos are insensitive to DNase I. 
However, when hemoglobin synthesis begins at 35 hours, regions adjacent 
to these genes become highly susceptible to digestion. In tissues such as the 
brain that produce no hemoglobin, the globin genes remain resistant to 
DNase I throughout development and into adulthood. These studies sug- 
gest that a prerequisite for gene expression is a relaxing of the chromatin 
structure. 


Recent experiments even more clearly revealed the role of chromatin 
structure in regulating access to DNA binding sites. Genes required for 
galactose utilization in yeast are activated by a transcription factor called 
GAL4, which recognizes DNA binding sites with two 5’-CGG-3’ sequences 
on complementary strands separated by 11 base pairs (Figure 32.13). 
Approximately 4000 potential GAL4 binding sites of the form 
5’-CGG(N)11CCG-3’ are present in the yeast genome, but only 10 of them 
regulate genes necessary for galactose metabolism. How is GAL4 targeted 
to only a small fraction of the potential binding sites? This question is 
addressed through the use of a technique called chromatin immunoprecipita- 
tion (ChIP; Figure 32.14). GAL4 is first cross-linked to its DNA binding 
sites in chromatin. The DNA is then fragmented into small pieces, and 
antibodies to GAL4 are used to isolate the chromatin fragments containing 
GAL4. The cross-linking is reversed, and the DNA is isolated and charac- 
terized. The results of these studies reveal that only approximately 10 of the 
4000 potential GAL4 sites are occupied by GAL4 when the cells are grow- 
ing on galactose; more than 99% of the sites appear to be blocked, presum- 
ably by the local chromatin structure. Thus, whereas in prokaryotes all sites 
appear to be equally accessible, chromatin structure shields a large number 
of the potential binding sites in eukaryotic cells. GAL4 is thereby prevented 
from binding to sites that are unimportant in galactose metabolism. These 
lines of evidence and others reveal that chromatin structure is altered in 
active genes compared with inactive ones. 
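
Locating potential GAL4 sites is a pattern-matching problem, and the ChIP result is a matter of simple proportion. The Python sketch below illustrates both under stated assumptions: the function names and the toy sequence are our own, and the regular expression simply encodes two CGG triplets on complementary strands separated by 11 arbitrary base pairs.

```python
import re

# 5'-CGG(N)11CCG-3'. The lookahead lets overlapping candidate sites be reported.
# The pattern is its own reverse complement, so scanning one strand suffices.
GAL4_SITE = re.compile(r"(?=(CGG[ACGT]{11}CCG))")


def find_gal4_sites(sequence):
    """Return 0-based start positions of candidate GAL4 sites."""
    return [match.start() for match in GAL4_SITE.finditer(sequence.upper())]


def occupied_fraction(occupied, potential):
    """Fraction of potential sites that are actually bound."""
    return occupied / potential


# Toy sequence (not from a real genome) containing one candidate site.
toy_sequence = "TTT" + "CGG" + "ATATATATATA" + "CCG" + "TTT"
print(find_gal4_sites(toy_sequence))

# Numbers from the text: about 10 of roughly 4000 potential sites are occupied.
print(f"occupied: {occupied_fraction(10, 4000):.2%}")
```

With the numbers quoted above, only 0.25% of the potential sites are occupied, which is the basis for the statement that more than 99% of the sites appear to be blocked.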


The methylation of DNA can alter patterns of gene expression 


The degree of methylation of DNA provides another mechanism, in addi- 
tion to packaging with histones, for inhibiting gene expression inappro- 
priate to a specific cell type. Carbon 5 of cytosine can be methylated by 
specific methyltransferases. About 70% of the 5’-CpG-3’ sequences (where 
“p” represents the phosphate residue in the DNA backbone) in mammalian 
genomes are methylated. However, the distribution of these methylated 
cytosines varies, depending on the cell type. Consider the β-globin gene. In 
cells that are actively expressing hemoglobin, the region from approxi- 
mately 1 kb upstream of the start site to approximately 100 bp downstream 
of the start site is less methylated than the corresponding region in cells that 
do not express this gene. The relative absence of 5-methylcytosines near the 
start site is referred to as hypomethylation. The methyl group of 5-methyl- 
cytosine protrudes into the major groove where it could easily interfere with 
the binding of proteins that stimulate transcription. 
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Figure 32.13 GAL4 binding sites. The 
yeast transcription factor GAL4 binds to DNA 
sequences of the form 5’-CGG(N)11CCG-3’. 
Two zinc-based domains are present in the 
DNA-binding region of this protein. Notice that 
these domains contact the 5’-CGG-3’ 
sequences, leaving the center of the site 
uncontacted. [Drawn from 1D66.pdb.] 
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Figure 32.14 Chromatin immunoprecipitation. Cells or isolated nuclei are treated with 
formaldehyde to cross-link proteins to DNA. The cells are then lysed and the DNA is fragmented by 
sonication. DNA fragments bound to a particular protein are isolated through the use of an antibody 
specific for that protein. The cross-links are then reversed and the DNA fragments are characterized. 
Steps shown: formaldehyde-treated cells; isolate chromatin (chromatin with proteins cross-linked to 
DNA); sonicate (fragmented DNA with cross-linked proteins); precipitate with antibody specific for 
GAL4 (DNA fragments with GAL4 bound); reverse cross-links (isolated DNA); analyze DNA sequences 
that bound GAL4 inside cells. 
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Expression in Eukaryotes 


Estradiol 
(an estrogen) 

The distribution of CpG sequences in mammalian genomes is not 
uniform. Many CpG sequences have been converted into TpG through 
mutation by the deamination of 5-methylcytosine to thymine. However, 
sites near the 5’ ends of genes have been maintained because of their role in 
gene expression. Thus, most genes are found in CpG islands, regions of the 
genome that contain approximately four times as many CpG sequences as 
does the remainder of the genome. 
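
One common way to quantify CpG depletion or enrichment, not described in this chapter and offered here only as an illustrative sketch, is the ratio of observed CpG dinucleotides to the number expected from the C and G content of the sequence. The function name and the toy sequences are our own.

```python
def cpg_observed_over_expected(sequence):
    """Ratio of observed CpG count to the count expected from base composition.

    Expected count = (number of C) * (number of G) / length.
    Returns 0.0 when the sequence lacks C or G.
    """
    seq = sequence.upper()
    c_count = seq.count("C")
    g_count = seq.count("G")
    if c_count == 0 or g_count == 0:
        return 0.0
    observed = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    expected = c_count * g_count / len(seq)
    return observed / expected


# Toy sequences (not from a real genome).
print(cpg_observed_over_expected("CGCGCGCG"))  # CpG-rich
print(cpg_observed_over_expected("TGTGCACA"))  # CpG converted into TpG
```

A ratio near 1 means that CpG occurs about as often as base composition predicts. Values well below 1 reflect the loss of CpG by deamination, and higher values are typical of the CpG-rich regions near the 5’ ends of genes.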


Steroids and related hydrophobic molecules pass through 
membranes and bind to DNA-binding receptors 


We next look at an example that illustrates how transcription factors can 
stimulate changes in chromatin structure that affect transcription. We will 
consider in some detail the system that detects and responds to estrogens. 
Synthesized and released by the ovaries, estrogens, such as estradiol, are 
cholesterol-derived steroid hormones (Section 26.4). They are required for 
the development of female secondary sex characteristics and, along with 
progesterone, participate in the ovarian cycle. 

Because they are hydrophobic molecules, estrogens easily diffuse across 
cell membranes. When inside a cell, estrogens bind to highly specific, solu- 
ble receptor proteins. Estrogen receptors are members of a large family of 
proteins that act as receptors for a wide range of hydrophobic molecules, 
including other steroid hormones, thyroid hormones, and retinoids. 


All-trans-retinoic acid             Thyroxine 
(a retinoid)                        (L-3,5,3’,5’-Tetraiodothyronine) 
                                    (a thyroid hormone) 


The human genome encodes approximately 50 members of this family, 
often referred to as nuclear hormone receptors. The genomes of other multi- 
cellular eukaryotes encode similar numbers of nuclear hormone receptors, 
although they are absent in yeast. 

All these receptors have a similar mode of action. On binding of the 
signal molecule (called, generically, a ligand), the ligand-receptor complex 
modifies the expression of specific genes by binding to control elements in 
the DNA. Estrogen receptors bind to specific DNA sites (referred to as 
estrogen response elements or EREs) that contain the consensus sequence 
5’-AGGTCANNNTGACCT-3’. As expected from the symmetry of 
this sequence, an estrogen receptor binds to such sites as a dimer. 
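
The symmetry of the ERE consensus sequence can be verified directly: a site that reads the same on both strands is identical to its own reverse complement. The Python sketch below uses helper names of our own choosing and treats N as matching any base.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G", "N": "N"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (N pairs with N)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def is_palindromic(seq):
    """True if the sequence reads the same 5' to 3' on both strands."""
    return reverse_complement(seq) == seq.upper()


ERE_CONSENSUS = "AGGTCANNNTGACCT"
print(reverse_complement(ERE_CONSENSUS))
print(is_palindromic(ERE_CONSENSUS))
```

Each half-site, AGGTCA, is the reverse complement of the other, TGACCT, so the two identical subunits of a receptor dimer can each contact one half-site in the same way.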

A comparison of the amino acid sequences of members of this family 
reveals two highly conserved domains: a DNA-binding domain and a 
ligand-binding domain (Figure 32.15). The DNA-binding domain lies 
toward the center of the molecule and consists of a set of zinc-based domains 
different from the Cys2His2 zinc-finger proteins introduced in Section 32.2. 
These zinc-based domains bind to specific DNA sequences by virtue of an 
α helix that lies in the major groove in the specific DNA complexes formed 
by estrogen receptors. 


Nuclear hormone receptors regulate transcription by 
recruiting coactivators to the transcription complex 


The second highly conserved domain of the nuclear receptor proteins lies 
near the carboxyl terminus and is the ligand-binding site. This domain folds 
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Figure 32.15 Structure of two 
nuclear hormone-receptor domains. 
Nuclear hormone receptors contain two 
crucial conserved domains: (1) a DNA-binding 
domain toward the center of the sequence 
and (2) a ligand-binding domain toward the 
carboxyl terminus. The structure of a dimer of 
the DNA-binding domain bound to DNA is 
shown, as is one monomer of the normally 
dimeric ligand-binding domain. [Drawn from 
1HCQ and 1LBD.pdb.] 


into a structure that consists almost entirely of α helices, arranged in three 
layers. The ligand binds in a hydrophobic pocket that lies in the center of 
this array of helices (Figure 32.16). This domain changes conformation 
when it binds its ligand, estrogen. How does ligand binding lead to changes 
in gene expression? The simplest model would have the binding of ligand 
alter the DNA-binding properties of the receptor, analogously to the lac 
repressor in prokaryotes. However, experiments with purified nuclear 
hormone receptors revealed that ligand binding does not significantly alter 
DNA-binding affinity and specificity. Another mechanism is operative. 
Because ligand binding does not alter the ability of nuclear hormone 
receptors to bind DNA, investigators sought to determine whether specific 
proteins might bind to the nuclear hormone receptors only in the presence 
of ligand. Such searches led to the identification of several related proteins 
called coactivators, such as SRC-1 (steroid receptor coactivator-1), GRIP-1 
(glucocorticoid receptor interacting protein-1), and NcoA-1 (nuclear 
hormone receptor coactivator-1). These coactivators are referred to as the 


Figure 32.16 Ligand binding to nuclear hormone receptor. The ligand (estradiol) lies completely 
surrounded within a pocket in the ligand-binding domain. Notice that the last α helix, helix 12 
(shown in purple), folds into a groove on the side of the structure on ligand binding. [Drawn from 
1LDB and 1ERE.pdb.] 


Figure 32.17 Coactivator 
recruitment. The binding of 
ligand to a nuclear hormone 
receptor induces a 
conformational change in the 
ligand-binding domain. This 
change in conformation 
generates favorable sites for the 
binding of a coactivator. 


p160 family because of their size. The binding of ligand to the receptor 
induces a conformational change that allows the recruitment of a coactivator 
(Figure 32.17). In many cases, these coactivators are enzymes that catalyze 
reactions that lead to the modification of chromatin structure. 


Steroid-hormone receptors are targets for drugs 


Molecules such as estradiol that bind to a receptor and trigger signaling
pathways are called agonists. Athletes sometimes take natural and
synthetic agonists of the androgen receptor, a member of the family of 
nuclear hormone receptors, because their binding to the androgen receptor 
stimulates the expression of genes that enhance the development of lean 
muscle mass. 


[Structures: androstenedione (a natural androgen); Dianabol (methandrostenolone, a synthetic androgen)]


Referred to as anabolic steroids, such compounds used in excess are not 
without side effects. In men, excessive use leads to a decrease in the secretion 
of testosterone, to testicular atrophy, and sometimes to breast enlargement 
(gynecomastia) if some of the excess androgen is converted into estrogen. In 
women, excess testosterone causes a decrease in ovulation and estrogen 
secretion; it also causes breast regression and growth of facial hair. 

Other molecules bind to nuclear hormone receptors but do not effec- 
tively trigger signaling pathways. Such compounds are called antagonists 
and are, in many ways, like competitive inhibitors of enzymes. Some impor- 
tant drugs are antagonists that target the estrogen receptor. For example, 
tamoxifen and raloxifene are used in the treatment and prevention of breast 
cancer, because some breast tumors rely on estrogen-mediated pathways for 
growth. Because some of these compounds have distinct effects on different 
forms of the estrogen receptor, they are referred to as selective estrogen 
receptor modulators (SERMs).

Figure 32.18 Estrogen
receptor-tamoxifen complex. Tamoxifen binds in
the pocket normally occupied by estrogen. However, 
notice that part of the tamoxifen structure extends 
from this pocket, and so helix 12 cannot pack in its 
usual position. Instead, this helix blocks the 
coactivator-binding site. [Drawn from 3ERT.pdb.] 


[Structure: tamoxifen]


The determination of the structures of complexes between the estro- 
gen receptor and these drugs revealed the basis for their antagonist effect 
(Figure 32.18). Tamoxifen binds to the same site as estradiol does. However, 
tamoxifen has a group that extends out of the normal ligand-binding pocket, 
as do other antagonists. These groups block the normal conformational 
changes induced by estrogen. Tamoxifen blocks the binding of coactivators 
and thus inhibits the activation of gene expression. 
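
The comparison of antagonists with competitive inhibitors can be made roughly quantitative. The sketch below is not taken from the text: it assumes a simple one-site equilibrium in which agonist and antagonist compete for the same pocket, and the concentrations and dissociation constants are hypothetical values in arbitrary units. It shows only that adding antagonist lowers the fraction of receptor occupied by agonist; it says nothing about the structural effect on helix 12.

```python
def agonist_occupancy(agonist, kd_agonist, antagonist=0.0, kd_antagonist=1.0):
    """Fraction of receptor bound to agonist when a competitive antagonist
    shares the same binding site (simple one-site equilibrium model)."""
    # A competitive ligand raises the apparent dissociation constant.
    apparent_kd = kd_agonist * (1.0 + antagonist / kd_antagonist)
    return agonist / (apparent_kd + agonist)

# Hypothetical concentrations and dissociation constants (arbitrary units).
without_antagonist = agonist_occupancy(agonist=1.0, kd_agonist=1.0)
with_antagonist = agonist_occupancy(agonist=1.0, kd_agonist=1.0,
                                    antagonist=9.0, kd_antagonist=1.0)
print(without_antagonist, with_antagonist)
```

In this illustration, half of the receptor is occupied by agonist in the absence of antagonist, and about one-eleventh is occupied when the antagonist is present at nine times its dissociation constant.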


Chromatin structure is modulated through covalent 
modifications of histone tails 


We have seen that nuclear receptors respond to signal molecules by recruit- 
ing coactivators. Now we can ask, How do coactivators modulate transcrip- 
tional activity? These proteins act to loosen the histone complex from the DNA, 
exposing additional DNA regions to the transcription machinery. 

Much of the effectiveness of coactivators appears to result from their 
ability to covalently modify the amino-terminal tails of histones as well as 
regions on other proteins. Some of the p160 coactivators and the proteins 
that they recruit catalyze the transfer of acetyl groups from acetyl CoA to 
specific lysine residues in these amino-terminal tails.

[Reaction scheme: lysine in histone tail + acetyl CoA]

Enzymes that catalyze such reactions are called histone acetyltransferases
(HATs). The histone tails are readily extended; so they can fit into the HAT
active site and become acetylated (Figure 32.19).

What are the consequences of histone acetylation? Lysine bears a posi- 
tively charged ammonium group at neutral pH. The addition of an acetyl 
group generates an uncharged amide group. This change dramatically 
reduces the affinity of the tail for DNA and modestly decreases the affinity 
of the entire histone complex for DNA, loosening the histone complex from 
the DNA. 
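
The effect of acetylation on charge can be illustrated with a rough count. The sketch below uses the first 20 residues of the histone H4 sequence given in Problem 1, numbered without the initial methionine (an assumption, chosen so that lysines fall at positions 5 and 12 as in the text). It counts +1 for each lysine or arginine and -1 for each aspartate or glutamate, treats histidine as uncharged, and ignores the termini. The function name and the choice of acetylated positions are illustrative only.

```python
# First 20 residues of the histone H4 tail (initial Met omitted), taken
# from the H4 sequence given in Problem 1 of this chapter.
H4_TAIL = "SGRGKGGKGLGKGGAKRHRK"

def net_charge(seq, acetylated=()):
    """Approximate net charge at neutral pH: +1 per Lys/Arg, -1 per Asp/Glu.
    His is treated as uncharged and the termini are ignored. `acetylated`
    holds 1-based positions of lysines converted into neutral amides."""
    charge = 0
    for position, residue in enumerate(seq, start=1):
        if residue == "K" and position in acetylated:
            continue  # acetyllysine carries no charge
        if residue in "KR":
            charge += 1
        elif residue in "DE":
            charge -= 1
    return charge

unmodified = net_charge(H4_TAIL)
tetra_acetylated = net_charge(H4_TAIL, acetylated={5, 8, 12, 16})
print(unmodified, tetra_acetylated)  # 8 4
```

Under these assumptions, acetylation of four lysines halves the positive charge of this segment of the tail, which is consistent with a reduced affinity for the negatively charged DNA.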

In addition, the acetylated lysine residues interact with a specific
acetyllysine-binding domain that is present in many proteins that regulate
eukaryotic transcription. This domain, termed a bromodomain, comprises
approximately 110 amino acids that form a four-helix bundle containing a
peptide-binding site at one end (Figure 32.20).

Figure 32.19 Structure of histone
acetyltransferase. The amino-terminal tail of
histone H3 extends into a pocket in which
a lysine side chain can accept an acetyl group
from acetyl CoA bound in an adjacent site.
[Drawn from 1QSN.pdb.]

Figure 32.20 Structure of a
bromodomain. This four-helix-bundle
domain binds peptides containing acetyllysine.
Notice that an acetylated peptide from histone
H4 is bound in the structure. [Drawn from
1EGL.pdb.]


Figure 32.21 Chromatin remodeling. 
Eukaryotic gene regulation begins with an 
activated transcription factor bound to a 
specific site on DNA. One scheme for the 
initiation of transcription by RNA polymerase II 
requires five steps: (1) recruitment of a 
coactivator, (2) acetylation of lysine residues 
in the histone tails, (3) binding of a 
remodeling-engine complex to the acetylated 
lysine residues, (4) ATP-dependent 
remodeling of the chromatin structure to 
expose a binding site for RNA polymerase or 
for other factors, and (5) recruitment of RNA 
polymerase. Only two subunits are shown for 
each complex, although the actual complexes 
are much larger. Other schemes are possible.

Bromodomain-containing proteins are components of two large complexes
essential for transcription. One is a complex of more than 10 polypeptides
that binds to the TATA-box-binding protein. Recall that the
TATA-box-binding protein is an essential transcription factor for many genes
(Section 29.2). Proteins that bind to the TATA-box-binding protein are
called TAFs (for TATA-box-binding protein associated factors). In particular,
TAF1 contains a pair of bromodomains near its carboxyl terminus.
The two domains are oriented such that each can bind one of two acetyllysine 
residues at positions 5 and 12 in the histone H4 tail. Thus, acetylation of the 
histone tails provides a mechanism for recruiting other components of the 
transcriptional machinery. 

Bromodomains are also present in some components of large complexes 
known as chromatin-remodeling complexes or chromatin-remodeling engines. 
These complexes, which also contain domains homologous to those of heli- 
cases, utilize the free energy of ATP hydrolysis to shift the positions of 
nucleosomes along the DNA and to induce other conformational changes in 
chromatin (Figure 32.21). Histone acetylation can lead to a reorganization 
of the chromatin structure, potentially exposing binding sites for other 
factors. Thus, histone acetylation can activate transcription through a combi- 
nation of three mechanisms: by reducing the affinity of the histones for DNA, 
by recruiting other components of the transcriptional machinery, and by initiat- 
ing the remodeling of the chromatin structure. 

Nuclear hormone receptors also include regions that interact with com- 
ponents of the mediator complex. Thus, two mechanisms of gene regulation 
can work in concert. Modification of histones and chromatin remodeling 
can open up regions of chromatin into which the transcription complex can 
be recruited through protein-protein interactions. 


Histone deacetylases contribute to transcriptional repression 


Just as in prokaryotes, some changes in a cell’s environment lead to the 
repression of genes that had been active. The modification of histone tails 
again plays an important role. However, in repression, a key reaction 
appears to be the deacetylation of acetylated lysine, catalyzed by specific 
histone deacetylase enzymes. 

In many ways, the acetylation and deacetylation of lysine residues in his- 
tone tails (and, likely, in other proteins) is analogous to the phosphorylation 
and dephosphorylation of serine, threonine, and tyrosine residues in other
stages of signaling processes. Like the addition of phosphoryl groups, the
addition of acetyl groups can induce conformational changes and generate 
novel binding sites. Without a means of removal of these groups, however, 
these signaling switches will become stuck in one position and lose their 
effectiveness. Like phosphatases, deacetylases help reset the switches. 

Acetylation is not the only modification of histones and other pro- 
teins in gene-regulation processes. The methylation of specific lysine 
and arginine residues also can be important. Some of the more common 
modifications are shown in Table 32.2. The elucidation of the roles of 
these processes is a very active area of research at present. The relation 
between various histone modifications and their roles in controlling 
gene expression is sometimes referred to as “the histone code.” 
Although important generalizations have been discovered, this code 
clearly is subtle and complicated rather than being a set of hard and fast 
rules. 


32.4 Eukaryotic Gene Expression Can Be Controlled 
at Posttranscriptional Levels 


Just as in prokaryotes, gene expression in eukaryotes can be regulated sub- 
sequent to transcription. We shall consider two examples. The first is the 
regulation of genes taking part in iron metabolism through key features in 
RNA secondary structure, similar in many ways to prokaryotic posttran- 
scriptional regulation (Section 31.4). The second entails an entirely new 
mechanism, first glimpsed with the discovery of RNA interference (p. 165). 
Certain small regulatory RNA molecules allow the regulation of gene expres- 
sion through interaction with a range of mRNA molecules. Remarkably, 
this mechanism, discovered only recently, affects the expression of approxi- 
mately 60% of all human genes. 


Genes associated with iron metabolism are translationally 
regulated in animals 


RNA secondary structure plays a role in the regulation of iron metabolism 
in eukaryotes. Iron is an essential nutrient, required for the synthesis of 
hemoglobin, cytochromes, and many other proteins. However, excess iron 
can be quite harmful because, untamed by a suitable protein environment, 
iron can initiate a range of free-radical reactions that damage proteins, 
lipids, and nucleic acids. Animals have evolved sophisticated systems for 
the accumulation of iron in times of scarcity and for the safe storage of 
excess iron for later use. Key proteins include transferrin, a transport protein 
that carries iron in the serum, transferrin receptor, a membrane protein that 
binds iron-loaded transferrin and initiates its entry into cells, and ferritin, 
an impressively efficient iron-storage protein found primarily in the liver 
and kidneys. Twenty-four ferritin polypeptides form a nearly spherical shell 
that encloses as many as 2400 iron atoms, a ratio of one iron atom per amino 
acid (Figure 32.22). 

Ferritin and transferrin-receptor expression levels are reciprocally related 
in their responses to changes in iron levels. When iron is scarce, the amount 
of transferrin receptor increases and little or no new ferritin is synthesized. 
Interestingly, the extent of mRNA synthesis for these proteins does not 
change correspondingly. Instead, regulation takes place at the level of 
translation. 

Consider ferritin first. Ferritin mRNA includes a stem-loop structure
termed an iron-response element (IRE) in its 5’ untranslated region
(Figure 32.23). This stem-loop binds a 90-kd protein, called an IRE-binding
protein (IRP), that blocks the initiation of translation when bound, because
the IRE lies on the 5’ side of the coding region. When the iron level
increases, the IRP binds iron as a 4Fe-4S cluster. The IRP bound to iron
cannot bind RNA, because the binding sites for iron and RNA substantially
overlap. Thus, in the presence of iron, ferritin mRNA is released from the
IRP and translated to produce ferritin, which sequesters the excess iron.

Table 32.2 Selected histone modifications

Modification                 Associated with
H4 K8 acetylation            Activation
H3 K14 acetylation           Activation
H3 K27 monomethylation       Activation
H3 K27 trimethylation        Repression
H3 R17 methylation           Activation
H2B S14 phosphorylation      DNA repair
H2B K120 ubiquitination      Activation

Figure 32.22 Structure of ferritin.
(A) Twenty-four ferritin polypeptides form a
nearly spherical shell. (B) A cutaway view
reveals the core that stores iron as an iron
oxide-hydroxide complex. [Drawn from
1IES.pdb.]

Figure 32.23 Iron-response element. Ferritin mRNA includes a stem-loop structure, termed an
iron-response element (IRE), in its 5’ untranslated region. The IRE binds a specific protein that
blocks the translation of this mRNA under low iron conditions.

Figure 32.24 Transferrin-receptor mRNA.
This mRNA has a set of iron-response
elements (IREs) in its 3’ untranslated region.
The binding of the IRE-binding protein to these
elements stabilizes the mRNA but does not
interfere with translation.

An examination of the nucleotide sequence of transferrin-receptor 
mRNA reveals the presence of several IRE-like regions. However, these
regions are located in the 3’ untranslated region rather than in the 5’ 
untranslated region (Figure 32.24). Under low-iron conditions, IRP binds 
to these IREs. However, given the location of these binding sites, the 
transferrin-receptor mRNA can still be translated. What happens when the 
iron level increases and the IRP no longer binds transferrin-receptor 
mRNA? Freed from the IRP, transferrin-receptor mRNA is rapidly 
degraded. Thus, an increase in the cellular iron level leads to the destruction 
of transferrin-receptor mRNA and, hence, a reduction in the production of 
transferrin-receptor protein. 
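
The reciprocal behavior of the two mRNAs follows from a single switch, the iron state of the IRP. The sketch below restates the logic of the preceding paragraphs as a small truth table. It is a qualitative illustration only; the function and key names are invented for this example, and real cells respond in a graded rather than an all-or-none fashion.

```python
def iron_response(iron_high):
    """Qualitative model of IRP-mediated control as described in the text."""
    # The IRP carrying its 4Fe-4S cluster (high iron) cannot bind RNA.
    irp_binds_ire = not iron_high
    return {
        # 5' IRE in ferritin mRNA: bound IRP blocks initiation of translation.
        "ferritin_translated": not irp_binds_ire,
        # 3' IREs in transferrin-receptor mRNA: bound IRP protects the mRNA
        # from degradation without interfering with translation.
        "transferrin_receptor_mrna_stable": irp_binds_ire,
    }

low_iron = iron_response(iron_high=False)
high_iron = iron_response(iron_high=True)
print(low_iron)
print(high_iron)
```

The same protein thus has opposite consequences for the two messages, depending only on whether the IRE lies on the 5’ or the 3’ side of the coding region.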




The purification of the IRP and the cloning of its cDNA were sources
of truly remarkable insight into evolution. The IRP was found to be 
approximately 30% identical in amino acid sequence with the citric acid 
cycle enzyme aconitase from mitochondria (p. 513). Further analysis revealed 
that the IRP is, in fact, an active aconitase enzyme; it is a cytoplasmic 
aconitase that had been known for a long time, but its function was not well 
understood (Figure 32.25). The iron-sulfur center at the active site of the 
IRP is rather unstable, and loss of the iron triggers significant changes in 
protein conformation. Thus, this protein can serve as an iron-sensing 
factor. 

Other mRNAs, including those taking part in heme synthesis, have
been found to contain IREs. Thus, genes encoding proteins required for
iron metabolism acquired sequences that, when transcribed,
provided binding sites for the iron-sensing protein. An environmental
signal (the concentration of iron) controls the translation
of proteins required for the metabolism of this metal. Thus,


Small RNAs regulate the expression of many eukaryotic
genes


Genetic studies of development in C. elegans revealed that a
gene called lin-4 encodes an RNA molecule 61 nucleotides long
that can regulate the expression of certain other genes. The
61-nucleotide lin-4 RNA does not encode a protein, but rather
is cleaved into a 22-nucleotide RNA that possesses the regulatory
activity. This discovery was the first view of a large class of
regulatory RNA molecules that are now called microRNAs or
miRNAs. The key to the activity of microRNAs and their
specificity for particular genes is their ability to form
Watson-Crick base-pair-stabilized complexes with the mRNAs of those
genes.

These miRNAs do not function on their own. Instead, the miRNAs bind
to members of a class of proteins called the Argonaute family (Figure 32.26).
These Argonaute-miRNA complexes can then bind mRNAs that have
sequences that are substantially complementary to the miRNAs. Once
bound, such an mRNA can be cleaved by the Argonaute-miRNA complex
through the action of a magnesium-based active site. Thus, the miRNAs
serve as guide RNAs that determine the specificity of the Argonaute complex
(Figure 32.27). Cleavage of mRNA by the Argonaute-miRNA complex is
similar to the mechanism of RNA interference (p. 165). The small
RNA molecules participating in RNAi, however, come from a different
source. In RNAi, double-stranded RNAs are cleaved into 21-nucleotide
fragments, single-stranded components of which are bound by members of
the Argonaute family to form a RISC complex that cleaves complementary
mRNAs. For miRNAs, the single-stranded RNAs are generated from
larger genetically encoded precursors, as described in Chapter 28.

The occurrence of gene regulation by miRNAs was originally thought to
be limited to a relatively small number of species. However, subsequent
studies have revealed that this mode of gene regulation is nearly ubiquitous
in eukaryotes. Indeed, more than 700 miRNAs encoded by the human
genome have been identified. Each miRNA can regulate many different
genes because many different target sequences are present in each mRNA.
An estimated 60% of all human genes are regulated by one or more
miRNAs. As one example, consider the human miRNA called miR-206.
This miRNA down-regulates the expression of one isoform of the estrogen
receptor. In addition, this miRNA appears to down-regulate the expression of
several different coactivators that interact with the estrogen receptor. Thus,
this miRNA can mute the influence of estrogen by blocking the estrogen-
initiated signaling pathway at several different steps.

Figure 32.25 The IRP is an aconitase. (A) Aconitase
contains an unstable 4Fe-4S cluster at its center. (B) Under
conditions of low iron, the 4Fe-4S cluster dissociates and
appropriate RNA molecules can bind in its place. [Drawn from
1C96.pdb.]

Figure 32.26 MicroRNA-Argonaute
complex. The miRNA (shown in red) is
bound by the Argonaute protein. Notice that
the miRNA serves as a guide that binds the
RNA substrate (shown in gray) through the
formation of a double helix. Magnesium
ions are shown in green. [Drawn from
3HK2.pdb.]

Figure 32.27 MicroRNA action. MicroRNAs bind to members of the Argonaute family where
they serve to target specific mRNA molecules for cleavage.

The microRNA pathway has tremendous implications for the evolution
of gene-regulatory pathways. Most of the target sites for miRNAs
are present in the 3’-untranslated regions of mRNAs. These sequences are 
quite free to mutate because they do not encode proteins and are not 
required to fold into any specific structures. Thus, in the context of a set of 
expressed miRNAs, mutations in this region of any particular gene could, 
in principle, increase or decrease the affinity for one or more miRNAs and, 
hence, alter the regulation of the gene. 
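
The sensitivity of miRNA regulation to mutation can be illustrated by counting base pairs. The sketch below uses the miRNA sequence given in Problem 11 and a hypothetical 3’ untranslated region built around its perfect complement. It slides the complementary sequence along the UTR and reports the best window, first for the intact site and then after a single point mutation. Only Watson-Crick pairs are counted, which is a simplifying assumption; real target recognition tolerates partial complementarity and is considerably more subtle.

```python
PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}

def reverse_complement(rna):
    """Return the sequence that pairs with `rna`, written 5' to 3'."""
    return "".join(PAIR[base] for base in reversed(rna))

def best_site(mirna, utr):
    """Slide the fully complementary target sequence along the UTR and
    return (number of Watson-Crick pairs, 0-based start) for the best window."""
    target = reverse_complement(mirna)
    best = (0, -1)
    for start in range(len(utr) - len(target) + 1):
        window = utr[start:start + len(target)]
        pairs = sum(a == b for a, b in zip(window, target))
        if pairs > best[0]:
            best = (pairs, start)
    return best

MIRNA = "GCCUAGCCUUAGCAUUGAUUGG"            # sequence from Problem 11
SITE = reverse_complement(MIRNA)
utr = "AAAAA" + SITE + "AAAAA"               # hypothetical 3' untranslated region
mutant = utr[:10] + "G" + utr[11:]           # point mutation (C to G) within the site

print(best_site(MIRNA, utr))     # (22, 5)
print(best_site(MIRNA, mutant))  # (21, 5)
```

A single substitution removes one base pair from the duplex; mutations of this kind, accumulating in untranslated regions, could strengthen or weaken regulation by a given miRNA.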


Summary 


32.1 Eukaryotic DNA Is Organized into Chromatin 

Eukaryotic DNA is tightly bound to basic proteins called histones; the 
combination is called chromatin. DNA wraps twice around an octamer 
of core histones to form a nucleosome. The four core histones are 
homologous and fold into similar structures. Each core histone has an 
amino-terminal tail rich in lysine and arginine residues. Nucleosomes 
are the first stage of compaction of eukaryotic DNA. Chromatin blocks 
access to many potential DNA binding sites. Changes in chromatin 
structure play a major role in regulating gene expression. 


32.2 Transcription Factors Bind DNA and Regulate Transcription Initiation 
Most eukaryotic genes are not expressed unless they are activated by 
the binding of specific proteins, called transcription factors, to sites on 
the DNA. These specific DNA-binding proteins interact directly or 
indirectly with RNA polymerases or their associated proteins. 
Eukaryotic transcription factors are modular: they consist of separate 
DNA-binding and activation domains. Important classes of DNA- 
binding proteins include the homeodomains, the basic-leucine zipper 
proteins, and Cys2His2 zinc-finger proteins. Each of these classes of
proteins uses an α helix to make specific contacts with DNA.
Activation domains interact with RNA polymerases or their associated 
factors or with other protein complexes such as mediator. Enhancers 
are DNA elements that can modulate gene expression from more than 
1000 bp away from the start site of transcription. Enhancers are often 
specific for certain cell types, depending on which DNA-binding 
proteins are present. The introduction of genes for a specific set of 
four transcription factors into fibroblasts can cause these cells to de- 
differentiate into induced pluripotent stem cells. 


32.3 The Control of Gene Expression Can Require Chromatin Remodeling 
Chromatin structure is crucial to the control of gene expression; chro- 
matin structure is more open near the transcription start sites of 
actively transcribed genes. Steroids such as estrogens bind to eukary- 
otic transcription factors called nuclear hormone receptors. These 
proteins are capable of binding DNA whether or not ligands are 
bound. The binding of ligands induces a conformational change that 
allows the recruitment of additional proteins called coactivators. 
Among the most important functions of coactivators is to catalyze the 
addition of acetyl groups to lysine residues in the tails of histone pro- 
teins. Histone acetylation decreases the affinity of the histones for 
DNA, making additional genes accessible for transcription. In addi- 
tion, acetylated histones are targets for proteins containing specific 
binding units called bromodomains. Bromodomains are components 
of two classes of large complexes: (1) chromatin-remodeling engines 
and (2) factors associated with RNA polymerase II. These complexes
open up sites on chromatin and initiate transcription. 


32.4 Eukaryotic Gene Expression Can Be Controlled at
Posttranscriptional Levels

Genes encoding proteins that transport and store iron are regulated at 
the translational level. Iron-response elements, structures that are 
present in certain mRNAs, are bound by an IRE-binding protein 
when this protein is not binding iron. Whether the expression of a 
gene is stimulated or inhibited in response to changes in the iron status 
of a cell depends on the location of the IRE within the mRNA. The
IRE-binding protein is a cytoplasmic aconitase that loses its iron-sulfur
center under low-iron conditions. MicroRNAs are specialized
RNA molecules encoded as parts of larger RNA precursors. MicroRNAs
bind to proteins of the Argonaute family; the bound miRNAs
function as guides that help bind specific mRNA molecules that are
then cleaved.


Key Terms 


cell type (p. 938)
histone (p. 938)
chromatin (p. 939)
nucleosome (p. 939)
nucleosome core particle (p. 939)
transcription factor (p. 941)
homeodomain (p. 942)
basic-leucine zipper (bZip) protein (p. 942)
Cys2His2 zinc-finger domain (p. 942)
mediator (p. 942)
combinatorial control (p. 943)
enhancer (p. 943)
induced pluripotent stem (iPS) cell (p. 944)
hypersensitive site (p. 944)
chromatin immunoprecipitation (ChIP) (p. 945)
hypomethylation (p. 945)
CpG island (p. 946)
nuclear hormone receptor (p. 946)
estrogen response element (ERE) (p. 946)
coactivator (p. 947)
agonist (p. 948)
anabolic steroid (p. 948)
antagonist (p. 948)
selective estrogen receptor modulator (SERM) (p. 948)
histone acetyltransferase (HAT) (p. 949)
acetyllysine-binding domain (p. 949)
bromodomain (p. 949)
TATA-box-binding protein associated factor (TAF) (p. 950)
chromatin-remodeling complex (p. 950)
histone deacetylase (p. 950)
transferrin (p. 951)
transferrin receptor (p. 951)
ferritin (p. 951)
iron-response element (IRE) (p. 951)
IRE-binding protein (IRP) (p. 951)
microRNA (miRNA) (p. 953)
Argonaute family proteins (p. 953)


Problems 


1. Charge neutralization. Given the histone amino acid 
sequences illustrated below, estimate the charge of a his- 
tone octamer at pH 7. Assume that histidine residues are 
uncharged at this pH. How does this charge compare with 
the charge on 150 base pairs of DNA? 


Histone H2A
MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNA
ARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPKKTESHHKAKGK

Histone H2B
MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDI
FERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK

Histone H3
MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTELLIRKLPFQR
LVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA

Histone H4
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDA
VTYTEHAKRKTVTAMDWVYALKRQGRTLYGFGG


2. Chromatin immunoprecipitation. You have used the technique
of chromatin immunoprecipitation to isolate DNA
fragments containing a DNA-binding protein of interest.
Suppose that you wish to know whether a particular known
DNA fragment is present in the isolated mixture. How
might you detect its presence?


3. Around we go. Assuming that 145 base pairs of DNA
wrap around the histone octamer 1 3/4 times, estimate the
radius of the histone octamer. Assume 3.4 Å per base pair
and simplify the calculation by assuming that the wrapping
is in two rather than three dimensions and neglecting the
thickness of the DNA.


4. Nitrogen substitution. Growth of mammalian cells in the 
presence of 5-azacytidine results in the activation of some 
normally inactive genes. Propose an explanation. 


[Structure: 5-azacytidine, shown attached to deoxyribose]


5. A new domain. A protein domain that recognizes 5-methylcytosine
in the context of double-stranded DNA has been
characterized. What role might proteins containing such 
a domain play in regulating gene expression? Where on a 
double-stranded DNA molecule would you expect such a 
domain to bind? 


6. Hybrid receptor. Through recombinant DNA methods, a 
modified steroid hormone receptor was prepared that con- 
sists of an estrogen receptor with its ligand-binding domain 
replaced by the ligand-binding domain from the progester- 
one receptor. Predict the expected responsiveness of gene 
expression for cells treated with estrogen or with progester- 
one. 


7. Different modifications. What is the effect of acetylation 
of a lysine residue on the charge of a histone protein? Of 
lysine methylation? 


8. Transformer. The following amino acid sequence is that of
one of the four transcription factors used to generate iPS
cells:

HTCDYAGCGKTYTKSSHLKAHLRTHTGEKPYHCDWDGCGWKFARSDELTRHYRKHTGHRPFQCQKCD
RAFSRSDHLALHMKRHF


This transcription factor belongs to one of the three struc- 
tural classes discussed in Section 32.2. Identify the class. 


9. Coverage. What percentage of the DNA sites in yeast are 
accessible, assuming that the fraction of sites observed for 
GAL4 is typical? To how many base pairs of the 12-Mb 
yeast genome does this percentage correspond? 


10. Iron regulation. What effect would you expect from the 
addition of an IRE to the 5’ end of a gene that is not nor- 
mally regulated by iron levels? To the 3’ end? 


11. Predicting microRNA regulation. Suppose that you
have identified an miRNA that has the sequence
5’-GCCUAGCCUUAGCAUUGAUUGG-3’. Propose a
strategy for identifying mRNAs that might be regulated by
this miRNA, given the sequences of all mRNAs encoded
by the human genome.


Mechanism Problem 


12. Acetyltransferases. Propose a mechanism for the trans- 
fer of an acetyl group from acetyl CoA to the amino group 
of lysine. 


Data Interpretation Problem 


13. Limited restriction. The restriction enzyme HpaII is a
powerful tool for analyzing DNA methylation. This enzyme
cleaves sites of the form 5’-CCGG-3’ but will not cleave
such sites if the DNA is methylated on any of the cytosine
residues. Genomic DNA from different organisms is treated
with HpaII and the results are analyzed by gel electrophoresis
(see the adjoining patterns). Provide an explanation for
the observed patterns.

[Gel lanes: Mouse, Drosophila, E. coli]


Sensory Systems


Color perception requires specific photoreceptors. The photoreceptor rhodopsin (right), which absorbs light in 
the process of vision, consists of the protein opsin and a bound vitamin A derivative, retinal. The amino acids 
(shown in red) that surround the retinal determine the color of light that is most efficiently absorbed. A person 
lacking a light-absorbing photoreceptor for the color green will see a colorful fruit stand (left) as mostly yellows 
(middle). [(Left and middle) From L. T. Sharpe et al. In Color Vision: from Genes to Perception, K. Gegenfurtner
and L. T. Sharpe, Eds. (Cambridge University Press, 1999), pp. 3-51.]


Our senses provide us with means for detecting a diverse set of external
signals, often with incredible sensitivity and specificity. For example, 
when fully adapted to a darkened room, our eyes allow us to sense very low 
levels of light, down to a limit of less than 10 photons. With more light, we 
are able to distinguish millions of colors. Through our senses of smell and 
taste, we are able to detect thousands of chemicals in our environment and 
sort them into categories: pleasant or unpleasant? healthful or toxic? Finally, 
we can perceive mechanical stimuli in the air and around us through our 
senses of hearing and touch. 

How do our sensory systems work? How are the initial stimuli detected? 
How are these initial biochemical events transformed into perceptions and 
experiences? We have already encountered systems that sense and respond 
to chemical signals—namely, receptors that bind to growth factors and 
hormones. Our knowledge of these receptors and their associated signal- 
transduction pathways provides us with concepts and tools for unraveling 
some of the workings of sensory systems. For example, 7TM receptors
(seven-transmembrane receptors; Section 14.1) play key roles in olfaction, 
taste, and vision. Ion channels that are sensitive to mechanical stress are 
essential for hearing and touch. 

In this chapter, we focus on the five major sensory systems found in
human beings and other mammals: olfaction (the sense of smell, i.e., the
detection of small molecules in the air), taste, or gustation (the detection of
selected organic compounds and ions by the tongue), vision (the detection
of light), hearing (the detection of sound, or pressure waves in the
air), and touch (the detection of changes in pressure, temperature,
and other factors by the skin). Each of these primary sensory systems
contains specialized sensory neurons that transmit nerve impulses to
the central nervous system (Figure 33.1). In the central nervous system,
these signals are processed and combined with other information
to yield a perception that may trigger a change in behavior. By
these means, our senses allow us to detect changes in our environments
and to adjust our behavior appropriately.

Figure 33.1 Sensory connections to the brain.
Sensory nerves connect sensory organs to the brain and ...


33.1 A Wide Variety of Organic Compounds Are
Detected by Olfaction


Human beings can detect and distinguish thousands of different
compounds by smell, often with considerable sensitivity and specificity.
Most odorants are small organic compounds with sufficient
volatility that they can be carried as vapors into the nose. For example,
a major component responsible for the odor of almonds is the
simple aromatic compound benzaldehyde, whereas the sulfhydryl
compound 3-methylbutane-1-thiol is a major component of the odor
of skunks.

[Structures: 3-methylbutane-1-thiol (skunk); geraniol (rose); zingiberene (ginger)]


What properties of these molecules are responsible for their odors? First, 
the shape of the molecule rather than its other physical properties is crucial. We 
can most clearly see the importance of shape by comparing molecules such 
as those responsible for the odors of spearmint and caraway. These com- 
pounds are identical in essentially all physical properties such as hydropho- 
bicity because they are exact mirror images of one another. Thus, the odor 
produced by an odorant depends not on a physical property but on the com- 
pound’s interaction with a specific binding surface, most likely a protein 
receptor. Second, some human beings (and other animals) suffer from 
specific anosmias; that is, they are incapable of smelling specific compounds 
even though their olfactory systems are otherwise normal. Such anosmias 
are often inherited. These observations suggest that mutations in individ- 
ual receptor genes lead to the loss of the ability to detect a small subset of 
compounds. 


Olfaction is mediated by an enormous family of 
seven-transmembrane-helix receptors 


Odorants are detected in a specific region of the nose, called the main 
olfactory epithelium, that lies at the top of the nasal cavity (Figure 33.2). 
Approximately 1 million sensory neurons line the surface of this region. 
Cilia containing the odorant-binding protein receptors project from these 
neurons into the mucous lining of the nasal cavity. 

Biochemical studies in the late 1980s examined isolated cilia from rat
olfactory epithelium that had been treated with odorants. Exposure to the
odorants increased the cellular level of cyclic AMP, and this increase was
observed only in the presence of GTP. On the basis of what was known
about signal-transduction systems, the participation of cAMP and GTP
strongly suggested the involvement of a G protein and, hence, 7TM receptors.
Indeed, Randall Reed purified and cloned a G-protein α subunit, termed
G(olf), which is uniquely expressed in olfactory cilia. The involvement of
7TM receptors suggested a strategy for identifying the olfactory receptors
themselves. Complementary DNAs were sought that (1) were expressed
primarily in the sensory neurons lining the nasal epithelium, (2) encoded
members of the 7TM-receptor family, and (3) were present as a large
and diverse family to account for the range of odorants. Through the use of
these criteria, cDNAs for odorant receptors from rats were identified in
1991 by Richard Axel and Linda Buck.


The odorant receptor (hereafter, OR) family is even larger than expected:
more than 1000 OR genes are [...] thus one of the largest gene families in
human beings. Further analysis of primate OR genes reveals that the
fraction of pseudogenes is greater in species more closely
related to human beings (Figure 33.3). Thus, we may have
a glimpse at the evolutionary loss of acuity in the sense of
smell as higher mammals presumably became less dependent
on this sense for survival. For rodents that are highly
dependent on their sense of smell, essentially all OR
genes encode functional proteins.

Figure 33.2 The main nasal epithelium.
This region of the nose, which lies at the top
of the nasal cavity, contains approximately 1
million sensory neurons. Nerve impulses
generated by odorant molecules binding to
receptors on the cilia travel from the sensory
neurons to the olfactory bulb.

The OR proteins are typically 20% identical in
sequence with the β-adrenergic receptor (Section 14.1)
and from 30% to 60% identical with one another. Several
specific sequence features are present in most or all
OR family members (Figure 33.4). The central region,
particularly transmembrane helices 4 and 5, is highly variable,
suggesting that this region is the site of odorant binding.
That site must be different in odorant receptors that
bind distinct odorant molecules.

Figure 33.3 Evolution of odorant receptors. Odorant receptors
appear to have lost function through conversion into pseudogenes in
the course of primate evolution. The percentage of OR genes that
appear to be functional for each species is given in parentheses.

Figure 33.4 Conserved and variant regions in odorant
receptors. Odorant receptors are members of the 7TM-receptor
family. The green cylinders represent the seven presumed
transmembrane helices. Strongly conserved residues characteristic of
this protein family are shown in blue, whereas highly variable residues
are shown in red.

What is the relation between OR gene expression and
the individual neuron? Interestingly, each olfactory neuron
expresses only a single OR gene, among hundreds available.
Apparently, the precise OR gene expressed is determined
largely at random. After one OR gene is expressed and a
functional OR protein is produced, the expression of all
other OR genes is suppressed by a feedback mechanism
that remains to be fully elucidated.

The binding of an odorant to an OR on the neuronal
surface initiates a signal-transduction cascade that results
in an action potential (Figure 33.5). The ligand-bound OR
activates G(olf), the specific G protein mentioned earlier.
G(olf) is initially in its GDP-bound form. When activated,
it releases GDP, binds GTP, and releases its associated βγ
subunits. The α subunit then activates a specific adenylate
cyclase, increasing the intracellular concentration of
cAMP. The rise in the intracellular concentration of cAMP
activates a nonspecific cation channel that allows calcium
and other cations into the cell. The flow of cations through the channel
depolarizes the neuronal membrane and initiates an action potential. This
action potential, combined with those from other olfactory neurons, leads to
the perception of a specific odor.

Figure 33.5 The olfactory signal-transduction cascade. The binding of odorant to the olfactory
receptor activates a signaling pathway similar to those initiated in response to the binding of some
hormones to their receptors. The final result is the opening of cAMP-gated ion channels and the
initiation of an action potential.


Odorants are decoded by a combinatorial mechanism 


An obvious challenge presented to an investigator by the large size of the
OR family is to match each OR with the one or more odorant molecules to
which it binds. Exciting progress has been made in this regard. Initially, an
OR was matched with odorants by overexpressing a single, specific OR
gene in rats. This OR responded to straight-chain aldehydes, most favorably
to n-octanal and less strongly to n-heptanal and n-hexanal. More
dramatic progress was made by taking advantage of our knowledge of the
OR signal-transduction pathway and the power of the polymerase chain
reaction (PCR). A section of nasal epithelium from a mouse was loaded with
the calcium-sensitive dye Fura-2 (Section 14.1). The tissue was then treated
with different odorants, one at a time, at a specific concentration. If the
odorant had bound to an OR and activated it, that neuron
could be detected under a microscope by the change
in fluorescence caused by the influx of calcium that takes
place as part of the signal-transduction process. To
determine which OR was responsible for the response,
cDNA was generated from mRNA that had been isolated
from single identified neurons. The cDNA was
then subjected to PCR with the use of primers that are
effective in amplifying most or all OR genes. The
sequence of the PCR product from each neuron was then
determined and analyzed.

Using this approach, investigators analyzed the
responses of neurons to a series of compounds having
varying chain lengths and terminal functional groups
(Figure 33.6). The results of these experiments appear
surprising at first glance (Figure 33.7). Importantly,
there is not a simple 1:1 correspondence between odorants
and receptors. Almost every odorant activates a number
of receptors (usually to different extents) and almost
every receptor is activated by more than one odorant. Note,
however, that each odorant activates a unique combination
of receptors. In principle, this combinatorial mechanism
allows even a small array of receptors to distinguish
a vast number of odorants.

Figure 33.6 Four series of odorants tested for olfactory-receptor
activation.

[Figure 33.7 grid: columns are receptors 1 to 14; rows are the odorants of
Figure 33.6 (carboxylic acids, alcohols, bromocarboxylic acids, and
dicarboxylic acids of varying chain length).]

Figure 33.7 Patterns of olfactory-receptor activation. Fourteen
different receptors were tested for responsiveness to the compounds
shown in Figure 33.6. A colored box indicates that the receptor at the
top responded to the compound at the left. Darker colors indicate that
the receptor was activated at a lower concentration of odorant.

How is the information about which receptors have
been activated transmitted to the brain? Recall that
each neuron expresses only one OR and that the pattern
of expression appears to be largely random. A
substantial clue to the connections between receptors 
and the brain has been provided by the creation of mice 
that express a gene for an easily detectable colored marker in conjunction 
with a specific OR gene. Olfactory neurons that express the OR—marker- 
protein combination were traced to their destination in the brain, a struc- 
ture called the olfactory bulb (Figure 33.8). The processes from neurons 
that express the same OR gene were found to connect to the same location 
in the olfactory bulb. Moreover, this pattern of neuronal connection was 
found to be identical in all mice examined. Thus, neurons that express spe- 
cific ORs are linked to specific sites in the brain. This property creates a 
spatial map of odorant-responsive neuronal activity within the olfactory 
bulb. 
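
The combinatorial scheme described above can be illustrated with a minimal Python sketch. It is not taken from the experiments themselves: the odorant names and receptor assignments in `RESPONSES` are hypothetical, and each receptor is simplified to an on/off response, whereas real receptors respond to different extents.

```python
def max_distinguishable(n_receptors, levels=2):
    """Count the distinct non-silent activation patterns of a receptor array.

    Each receptor is assumed to report one of `levels` response levels
    (level 0 means no response). The all-silent pattern is excluded because
    an odorant that activates no receptor cannot be detected.
    """
    return levels ** n_receptors - 1


# Hypothetical response table: odorant -> set of receptors it activates.
# Receptors are shared between odorants, so there is no 1:1 correspondence.
RESPONSES = {
    "odorant_A": frozenset({1, 4}),
    "odorant_B": frozenset({1, 5, 9}),
    "odorant_C": frozenset({4, 5}),
}


def identify(active_receptors):
    """Return the odorants whose activation pattern matches exactly."""
    pattern = frozenset(active_receptors)
    return [name for name, p in RESPONSES.items() if p == pattern]


print(max_distinguishable(14))  # on/off patterns available to 14 receptors
print(identify({1, 4}))         # the pattern, not a single receptor, names the odorant
```

Under these assumptions, even the fourteen receptors of Figure 33.7 could in principle encode thousands of distinct patterns, and allowing graded responses (`levels` greater than 2) increases the count further.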

Can such a combinatorial mechanism truly distinguish many different
odorants? An electronic “nose” that functions by the same principles provides
compelling evidence that it can (Figure 33.9). The receptors for the
electronic nose are polymers that bind a range of small molecules. Each
polymer binds every odorant, but to varying degrees. Importantly, the electrical
properties of these polymers change on odorant binding. A set of 32 of
these polymer sensors, wired together so that the pattern of responses can be
evaluated, is capable of distinguishing individual compounds such as
n-pentane and n-hexane as well as complex mixtures such as the odors of
fresh and spoiled fruit.

Figure 33.8 Converging olfactory neurons. This section of the nasal cavity is stained to reveal
processes from sensory neurons expressing the same olfactory receptor. The processes converge to
a single location in the olfactory bulb. [From P. Mombaerts, et al. Cell 87:675-689, 1996.]

CHAPTER 33 Sensory Systems

Figure 33.9 The Cyranose 320. The
electronic nose may find uses in the food
industry, animal husbandry, law enforcement,
and medicine. [Courtesy of Cyrano Sciences.]


33.2 Taste Is a Combination of Senses That Function
by Different Mechanisms


The inability to taste food is a common complaint when nasal congestion
reduces the sense of smell. Thus, smell greatly augments our sense of taste
(also known as gustation), and taste is, in many ways, the sister sense to
olfaction. Nevertheless, the two senses differ from each other in several
important ways. First, we are able to sense several classes of compounds by
taste that we are unable to detect by smell; salt and sugar have very little
odor, yet they are primary stimuli of the gustatory system. Second, whereas
we are able to discriminate thousands of odorants, discrimination by taste is
much more modest. Five primary tastes are perceived: bitter, sweet, sour,
salty, and umami (the taste of glutamate and aspartate from the Japanese
word for “deliciousness”). These five tastes serve to classify compounds
into potentially nutritive and beneficial (sweet, salty, umami) or potentially
harmful or toxic (bitter, sour). Tastants (the molecules sensed by taste) are
quite distinct for the different groups (Figure 33.10).


Glucose (sweet), sodium ion (salty), glutamate (umami), quinine (bitter), hydrogen ion (sour)

Figure 33.10 Examples of tastant molecules. Tastants fall into five groups: sweet, salty, umami,
bitter, and sour.

The simplest tastant, the hydrogen ion, is perceived as sour. Other
simple ions, particularly sodium ion, are perceived as salty. The taste called
umami is evoked by the amino acids glutamate and aspartate, the former
of which is often encountered as the flavor enhancer monosodium glutamate (MSG).
In contrast, tastants perceived as sweet and, particularly, bitter are extremely
diverse. Many bitter compounds are alkaloids or other plant products of
which many are toxic. However, they do not have any common structural
elements or other common properties. Carbohydrates such as glucose and
sucrose are perceived as sweet, as are other compounds including some
simple peptide derivatives, such as aspartame, and even some proteins.

Figure 33.11 A taste bud. Each taste bud contains sensory neurons that extend microvilli to the
surface of the tongue, where they interact with tastants.


These differences in specificity among the five tastes are due to differences 
in their underlying biochemical mechanisms. The sense of taste is, in fact, a 
number of independent senses all utilizing the same organ, the tongue, for 
their expression. 

Tastants are detected by specialized structures called taste buds, which 
contain approximately 150 cells, including sensory neurons (Figure 33.11). 
Fingerlike projections called microvilli, which are rich in taste receptors, 
project from one end of each sensory neuron to the surface of the tongue. 
Nerve fibers at the opposite end of each neuron carry electrical impulses to 
the brain in response to stimulation by tastants. Structures called taste 
papillae contain numerous taste buds. 


Sequencing of the human genome led to the discovery of 
a large family of 7TM bitter receptors 


Just as in olfaction, a number of clues pointed to the involvement of G proteins
and, hence, 7TM receptors in the detection of bitter and sweet tastes.
The evidence included the isolation of a specific G-protein α subunit termed
gustducin, which is expressed primarily in taste buds (Figure 33.12). How
could the 7TM receptors be identified? The ability to detect some com- 
pounds depends on specific genetic loci in both human beings and mice. For 
instance, the ability to taste the bitter compound 6-n-propyl-2-thiouracil 
(PROP) was mapped to a region on human chromosome 5 by comparing 
DNA markers of persons who vary in sensitivity to this compound. 

This observation suggested that this region might encode a 7TM 
receptor that responded to PROP. Approximately 450 kilobases in this 
region had been sequenced early in the human genome project. This 
sequence was searched by computer for potential 7TM-receptor genes, and, 
indeed, one was detected and named T2R1. Additional database searches 
detected approximately 30 sequences similar to T2R1 in the human 
genome. The encoded proteins are between 30 and 70% identical with T2R1 
(Figure 33.13). 

Are these proteins, in fact, bitter receptors? Several lines of evidence
suggest that they are. First, their genes are expressed in taste-sensitive
cells; in fact, in many of the same cells that express gustducin. Second, cells
that express individual members of this family respond to specific bitter
compounds. For example, cells that express a specific mouse receptor
(mT2R5) responded when exposed specifically to cycloheximide. Third,
mice that had been found unresponsive to cycloheximide were found to
have point mutations in the gene encoding mT2R5. Finally, cycloheximide
specifically stimulates the binding of GTP analogs to gustducin in the
presence of the mT2R5 protein (Figure 33.14).

Figure 33.12 Expression of gustducin in
the tongue. (A) A section of tongue stained
with a fluorescent antibody reveals the
position of the taste buds. (B) The same
region stained with an antibody directed
against gustducin reveals that this G protein is
expressed in taste buds. [Courtesy of Dr.
Charles S. Zuker.]

Figure 33.13 Conserved and variant regions in bitter receptors. The bitter receptors are
members of the 7TM-receptor family. Strongly conserved residues characteristic of this protein
family are shown in blue, and highly variable residues are shown in red.

Figure 33.14 Evidence that T2R proteins
are bitter taste receptors. Cycloheximide
uniquely stimulates the binding of the GTP
analog GTPγS to gustducin in the presence of
the mT2R protein. [After J. Chandrashekar, et al.
Cell 100:703-711, 2000.]

Importantly, each taste-receptor cell expresses many different members 
of the T2R family. This pattern of expression stands in sharp contrast to the 
pattern of one receptor type per cell that characterizes the olfactory system 
(Figure 33.15). The difference in expression patterns accounts for the much 
greater specificity of our perceptions of odors compared with tastes. We are 
able to distinguish among subtly different odors because each odorant stimulates 
a unique pattern of neurons. In contrast, many tastants stimulate the same 
neurons. Thus, we perceive only “bitter” without the ability to discriminate 
cycloheximide from quinine. 
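
The contrast between the two expression patterns can be made concrete with a short Python sketch. The receptor names and assignments are hypothetical and serve only to illustrate the logic: an olfactory neuron is modeled as carrying one OR, so the brain receives a pattern of neuron types, whereas a bitter-sensing cell is modeled as carrying many T2Rs and reporting only whether any of them was activated.

```python
# Hypothetical receptor assignments, for illustration only.
ODORANT_RECEPTORS = {
    "odorant_X": {"OR1", "OR3"},
    "odorant_Y": {"OR2", "OR3"},
}
BITTER_RECEPTORS = {
    "cycloheximide": {"T2R_a"},
    "quinine": {"T2R_b"},
}


def olfactory_signal(odorant):
    """One OR per neuron: the output is the set of neuron types that fire."""
    return frozenset(ODORANT_RECEPTORS[odorant])


def bitter_signal(tastant, receptors_per_cell=frozenset({"T2R_a", "T2R_b"})):
    """Many T2Rs per cell: the cell fires if any one of its receptors is hit."""
    return bool(BITTER_RECEPTORS[tastant] & receptors_per_cell)


# Two odorants give two different patterns ...
print(olfactory_signal("odorant_X") != olfactory_signal("odorant_Y"))
# ... but two bitter tastants give the same undifferentiated signal.
print(bitter_signal("cycloheximide") == bitter_signal("quinine"))
```

In this simplified model the identity of the bitter tastant is lost as soon as the signals from different receptors are pooled within one cell.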

Figure 33.15 Differing gene-expression and connection patterns in olfactory and bitter
taste receptors. In olfaction, each neuron expresses a single OR gene, and the neurons expressing 
the same OR converge to specific sites in the brain, enabling specific perception of different 
odorants. In gustation, each neuron expresses many bitter receptor genes, and so the identity of 
the tastant is lost in transmission. 


A heterodimeric 7TM receptor responds to
sweet compounds


Most sweet compounds are carbohydrates, energy rich and
easily digestible. Some noncarbohydrate compounds such
as saccharin and aspartame also taste sweet. Members of a
second family of 7TM receptors are expressed in taste-receptor
cells sensitive to sweetness. The three members of
this family, referred to as T1R1, T1R2, and T1R3, are distinguished
by their large extracellular domains compared
with those of the bitter receptors. Studies in knockout mice
have revealed that T1R2 and T1R3 are expressed simultaneously
in mice able to taste carbohydrates (Figure 33.16).
Thus, T1R2 and T1R3 appear to form a specific heterodimeric
receptor responsible for mediating the response to
sugars. This heterodimeric receptor also responds to artificial
sweeteners and to sweet-tasting proteins and therefore
appears to be the receptor responsible for responses to all
sweet tastants. Note that T1R2 and T1R3 do respond to
sweet tastants individually but only at very high concentrations
of tastant.

Figure 33.16 Evidence for a heterodimeric sweet receptor.
The sensitivity to sweetness of mice with genes for either T1R1,
T1R2, T1R3, or both T1R2 and T1R3 was determined by observing
the relative rates at which they licked solutions containing various
amounts of sucrose. These studies revealed that both T1R2 and T1R3
were required for a full response to sucrose. Mice with a disrupted
T1R1 gene were indistinguishable from wild-type mice in this assay
(not shown). [After G. Q. Zhao, et al. Cell 115:255-266, 2003.]


The requirement for an oligomeric 7TM receptor for a fully functional 
response is surprising, considering our previous understanding of 7TM 
receptors. This discovery has at least two possible explanations. First, the 
sweet receptor could be a member of a small subset of the 7TM-receptor 
family that functions well only as oligomers. Alternatively, many 7TM
receptors may function as oligomers, but this notion is not clear, because 
these oligomers contain only one type of 7TM-receptor subunit. Further 
studies will be required to determine which of these explanations is correct. 


Umami, the taste of glutamate and aspartate, is mediated by 
a heterodimeric receptor related to the sweet receptor 


The family of receptors responsible for detecting sweetness is also respon- 
sible for detecting amino acids. In human beings, only glutamate and aspar- 
tate elicit a taste response. Studies similar to those for the sweet receptor 
revealed that the umami receptor consists of T1R1 and T1R3. Thus, this 
receptor has one subunit (T1R3) in common with the sweet receptor but 
has an additional subunit (T1R1) that does not participate in the sweet 
response. This assignment is supported by the observation that mice in
which the gene for T1R1 is disrupted do not respond to aspartate but do 
respond normally to sweet tastants; mice having disrupted genes for both 
T1R1 and T1R3 respond poorly to both umami and sweet tastants. 


Salty tastes are detected primarily by the passage of sodium 
ions through channels 


Salty tastants are not detected by 7TM receptors. Rather, they are detected 
directly by their passage through ion channels expressed on the surface of 
cells in the tongue. Evidence for the role of these ion channels comes from 
examining known properties of Na+ channels characterized in other
biological contexts. One class of channels, characterized first for its role in salt
reabsorption, is thought to be important in the detection of salty tastes 
because these channels are sensitive to the compound amiloride, which mutes 
the taste of salt and significantly lowers sensory-neuron activation in response 
to sodium. 

An amiloride-sensitive Na+ channel comprises four subunits that may be
either identical or distinct but in any case are homologous. An individual 
subunit ranges in length from 500 to 1000 amino acids and includes two 
presumed membrane-spanning helices as well as a large extracellular 
domain in between them (Figure 33.17). The extracellular region includes 
two (or, sometimes, three) distinct regions rich in cysteine residues (and, 
presumably, disulfide bonds). A region just ahead of the second membrane- 
spanning helix appears to form part of the pore in a manner analogous to 
that of the structurally characterized potassium channel. The members of 
the amiloride-sensitive Na+-channel family are numerous and diverse
in their biological roles. We shall encounter them again in the context of the 
sense of touch. 

Sodium ions passing through these channels produce a significant trans- 
membrane current. Amiloride blocks this current, accounting for its effect 
on taste. However, about 20% of the response to sodium remains even in the 
presence of amiloride, suggesting that other ion channels also contribute to 
salt detection. 


Sour tastes arise from the effects of hydrogen ions (acids) on channels 


Like salty tastes, sour tastes are detected by direct interactions with ion channels,
but the incoming ions are hydrogen ions (in high concentrations) rather
than sodium ions. For example, in the absence of high concentrations of
sodium, hydrogen ion flow can induce substantial transmembrane currents
through amiloride-sensitive Na+ channels. However, hydrogen ions are also
sensed by mechanisms other than their direct passage through membranes.
Binding by hydrogen ions blocks some potassium ion channels and activates
other types of channels. Together, these mechanisms lead to changes in membrane
polarization in sensory neurons that produce the sensation of sour
taste. We shall consider an additional receptor related to taste, one responsible
for the “hot” taste of spicy food, when we examine mechanisms of
touch perception.

Figure 33.17 Schematic structure of the
amiloride-sensitive sodium channel.
Only one of the four subunits that constitute
the functional channel is illustrated. The
amiloride-sensitive sodium channel belongs
to a superfamily having common structural
features, including two hydrophobic
membrane-spanning regions, intracellular
amino and carboxyl termini, and a large,
extracellular region with conserved
cysteine-rich domains.

Figure 33.18 The electromagnetic spectrum. Visible light has
wavelengths between 300 and 850 nm.


33.3 Photoreceptor Molecules in the Eye 
Detect Visible Light 


Vision is based on the absorption of light by photoreceptor 
cells in the eye. These cells are sensitive to light in a narrow 
region of the electromagnetic spectrum, the region with 
wavelengths between 300 and 850 nm (Figure 33.18). 
Vertebrates have two kinds of photoreceptor cells, called 
rods and cones because of their distinctive shapes. Cones 
function in bright light and are responsible for color 
vision, whereas rods function in dim light but do not per- 
ceive color. A human retina contains about 3 million cones 
and 100 million rods. Remarkably, a rod cell can respond 
to a single photon, and the brain requires fewer than 10 
such responses to register the sensation of a flash of light. 


Rhodopsin, a specialized 7TM receptor, absorbs visible light 


11-cis-Retinal 


Rods are slender, elongated structures; the outer segment is specialized for 
photoreception (Figure 33.19). It contains a stack of about 1000 discs, 
which are membrane-enclosed sacs densely packed with photoreceptor mol- 
ecules. The photosensitive molecule is often called a visual pigment because 
it is highly colored owing to its ability to absorb light. The photoreceptor 
molecule in rods is rhodopsin (Section 14.1), which consists of the protein
opsin linked to 11-cis-retinal, a prosthetic group.

Figure 33.19 The rod cell. (Left) Scanning electron micrograph of retinal rod cells.
(Right) Schematic representation of a rod cell. [Photograph courtesy of Dr. Deric Bownds.]


Rhodopsin absorbs light very efficiently in the middle of the visible 
spectrum, its absorption being centered on 500 nm, which nicely matches 
the solar output (Figure 33.20). A rhodopsin molecule will absorb a high 
percentage of the photons of the correct wavelength that strike it, as
indicated by the extinction coefficient of 40,000 M⁻¹ cm⁻¹ at 500 nm. The
extinction coefficient for rhodopsin is more than an order of magnitude 
greater than that for tryptophan, the most efficient absorber in proteins that 
lack prosthetic groups. 
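
To give a feel for what an extinction coefficient of this size means in practice, the following Python sketch applies the Beer-Lambert law, A = εcl, to a bulk solution of rhodopsin. Only the value of ε comes from the text; the concentration (10 µM) and the path length (1 cm) are assumed values chosen for illustration.

```python
EPSILON_RHODOPSIN = 40_000.0  # M^-1 cm^-1 at 500 nm (value quoted in the text)


def absorbance(epsilon, conc_molar, path_cm):
    """Beer-Lambert law: A = epsilon * c * l."""
    return epsilon * conc_molar * path_cm


def fraction_absorbed(epsilon, conc_molar, path_cm):
    """Fraction of incident light absorbed: 1 - 10**(-A)."""
    return 1.0 - 10.0 ** (-absorbance(epsilon, conc_molar, path_cm))


# Assumed sample: 10 micromolar rhodopsin in a 1-cm cuvette.
conc = 10e-6   # mol per liter (assumption)
path = 1.0     # cm (assumption)

print(absorbance(EPSILON_RHODOPSIN, conc, path))         # about 0.4
print(fraction_absorbed(EPSILON_RHODOPSIN, conc, path))  # about 0.6
```

Under these assumptions, a dilute solution only 1 cm thick already absorbs well over half of the incident 500-nm light.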

Opsin, the protein component of rhodopsin, is a member of the 
7TM-receptor family. Indeed, rhodopsin was the first member of this fam- 
ily to be purified, its gene was the first to be cloned and sequenced, and its 
three-dimensional structure was the first to be determined. The color of 
rhodopsin and its responsiveness to light depend on the presence of the 
light-absorbing group (chromophore) 11-cis-retinal. This compound is a 
powerful absorber of light because it is a polyene; its six alternating single 
and double bonds constitute a long, unsaturated electron network. Recall 
that alternating single and double bonds account for the chromophoric 
properties of chlorophyll (Section 19.2). The aldehyde group of 11-cis-retinal 
forms a Schiff base (Figure 33.21) with the ε-amino group of lysine residue
296, which lies in the center of the seventh transmembrane helix. Free 
retinal absorbs maximally at 370 nm, and its unprotonated Schiff-base adduct 
absorbs at 380 nm, whereas the protonated Schiff base absorbs at 440 nm or 
longer wavelengths. Thus, the 500-nm absorption maximum for rhodopsin 
strongly suggests that the Schiff base is protonated; additional interactions 
with opsin shift the absorption maximum farther toward the red. The posi- 
tive charge of the protonated Schiff base is compensated by the negative 
charge of glutamate 113 located in helix 2; the glutamate residue closely 
approaches the lysine-retinal linkage in the three-dimensional structure of 
rhodopsin. 

Figure 33.20 Rhodopsin absorption
spectrum. Almost all photons with
wavelengths near 500 nm that strike a
rhodopsin molecule are absorbed.

Figure 33.21 Retinal-lysine linkage.
Retinal is linked to lysine 296 in opsin by a
Schiff-base linkage. In the resting state of
rhodopsin, this Schiff base is protonated.


Light absorption induces a specific isomerization of bound 11-cis-retinal


How does the absorption of light by the retinal Schiff base generate a signal? 
George Wald and his coworkers discovered that light absorption results in the 
isomerization of the 11-cis-retinal group of rhodopsin to its all-trans form 
(Figure 33.22). This isomerization causes the Schiff-base nitrogen atom to 
move approximately 5 Å, assuming that the cyclohexane ring of the retinal
group remains fixed. In essence, the light energy of a photon is converted into 
atomic motion. The change in atomic positions, like the binding of a ligand 
to other 7TM receptors, sets in train a series of events that lead to the
closing of ion channels and the generation of a nerve impulse. 
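
How much energy does a single photon make available for this atomic motion? A brief Python sketch using the relation E = hc/λ gives an estimate for light at 500 nm, the absorption maximum of rhodopsin. The physical constants are standard values; the calculation says nothing about how much of this energy is actually stored in the isomerized retinal.

```python
PLANCK = 6.62607015e-34      # Planck constant, J s
LIGHT_SPEED = 2.99792458e8   # speed of light, m / s
AVOGADRO = 6.02214076e23     # Avogadro constant, 1 / mol


def photon_energy_joules(wavelength_nm):
    """Energy of one photon, E = h * c / wavelength."""
    return PLANCK * LIGHT_SPEED / (wavelength_nm * 1e-9)


def photon_energy_kj_per_mol(wavelength_nm):
    """Energy of one mole of photons, in kJ/mol."""
    return photon_energy_joules(wavelength_nm) * AVOGADRO / 1000.0


print(photon_energy_joules(500))      # roughly 4e-19 J per photon
print(photon_energy_kj_per_mol(500))  # roughly 239 kJ per mole of photons
```

A mole of 500-nm photons thus carries on the order of 240 kJ, an amount comparable to the energies of covalent bonds, which is ample to drive rotation about a double bond.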

The isomerization of the retinal Schiff base takes place within a few
picoseconds of a photon being absorbed. The initial product, termed
bathorhodopsin, contains a strained all-trans-retinal group. Within approximately
1 ms, this intermediate is converted through several additional intermediates
into metarhodopsin II. In metarhodopsin II, the Schiff base is deprotonated
and the opsin protein has undergone significant reorganization.

Figure 33.22 Atomic motion in retinal. The Schiff-base nitrogen atom moves 5 Å as a
consequence of the light-induced isomerization of 11-cis-retinal to all-trans-retinal by rotation about
the bond shown in red.

Figure 33.23 Analogous 7TM receptors.
The conversion of rhodopsin into
metarhodopsin II activates a signal-transduction
pathway analogously to the
activation induced by the binding of other
7TM receptors to appropriate ligands.

Metarhodopsin II (also referred to as R*) is analogous to the ligand-bound 
state of 7TM receptors such as the β₂-adrenergic receptor (Section 14.1) 
and the odorant and tastant receptors discussed previously (Figure 33.23). 
Like these receptors, this form of rhodopsin activates a heterotrimeric 
G protein that propagates the signal. The G protein associated with rhodop- 
sin is called transducin. Metarhodopsin II triggers the exchange of GDP for 
GTP by the α subunit of transducin (Figure 33.24). On the binding of GTP, 
the βγ subunits of transducin are released and the α subunit switches on a 
cGMP phosphodiesterase by binding to an inhibitory subunit and removing 
it. The activated phosphodiesterase is a potent enzyme that rapidly hydro- 
lyzes cGMP to GMP. The reduction in cGMP concentration causes 
cGMP-gated ion channels to close, leading to the hyperpolarization of 
the membrane and neuronal signaling. At each step in this process, the initial 
signal—the absorption of a single photon—is amplified so that it leads to suffi- 
cient membrane hyperpolarization to result in signaling. 


Light-induced lowering of the calcium level coordinates recovery 


As we have seen, the visual system responds to changes in light and color 
within a few milliseconds, quickly enough that we are able to perceive continu- 
ous motion at nearly 1000 frames per second. To achieve a rapid response, 
the signal must also be terminated rapidly and the system must be returned 
to its initial state. First, activated rhodopsin must be blocked from continuing 
to activate transducin. Rhodopsin kinase catalyzes the phosphorylation of 
the carboxyl terminus of R* at multiple serine and threonine residues. 
Arrestin, an inhibitory protein, then binds phosphorylated R* and prevents 
additional interaction with transducin. 
Second, the α subunit of transducin must be returned to its inactive state 
to prevent further signaling. Like other G proteins, the α subunit possesses 
built-in GTPase activity that hydrolyzes bound GTP to GDP. Hydrolysis 
takes place in less than a second when transducin is bound to the 
phosphodiesterase. The GDP form of transducin then leaves the phosphodiesterase 
and reassociates with the βγ subunits, and the phosphodiesterase returns to 
its inactive state. Third, the level of cGMP must be raised to reopen the 
cGMP-gated ion channels. The action of guanylate cyclase accomplishes this 
third step by synthesizing cGMP from GTP. 


Figure 33.24 Visual signal transduction. The light-induced activation of rhodopsin leads to the 
hydrolysis of cGMP, which in turn leads to ion-channel closing and the initiation of an action 
potential. 


Figure 33.25 Cone-pigment absorption 
spectra. The absorption spectra of the cone 
visual pigment responsible for color vision. 
[Horizontal axis: wavelength (nm), 300 to 800.] 


Calcium ion plays an essential role in controlling guanylate cyclase 
because it markedly inhibits the activity of the enzyme. In the dark, Ca²⁺ as 
well as Na⁺ enter the rod outer segment through the cGMP-gated channels. 
Calcium ion influx is balanced by its efflux through an exchanger, a 
transport system that uses the thermodynamically favorable flow of four Na⁺ 
ions into the cell and one K⁺ ion out of the cell to extrude one Ca²⁺ ion. 
After illumination, the entry of Ca²⁺ through the cGMP-gated channels 
stops, but its export through the exchanger continues. Thus, the cytoplasmic 
Ca²⁺ level drops from 500 nM to 50 nM after illumination. This drop 
markedly stimulates guanylate cyclase, rapidly restoring the concentration 
of cGMP to reopen the cGMP-gated channels. 
By controlling the rate of cGMP synthesis, Ca²⁺ levels govern the speed 
with which the system is restored to its initial state. 
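
The ion bookkeeping of the exchanger can be checked with a few lines of arithmetic. The following Python sketch is illustrative only: the class and function names are assumptions made for this example, and the only inputs are the stoichiometry (four Na⁺ in, one K⁺ out, one Ca²⁺ out) and the calcium levels (500 nM and 50 nM) stated in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IonFlux:
    """One ion species moved during a single exchanger cycle (hypothetical helper)."""
    ion: str
    charge: int      # elementary charges carried by each ion
    count: int       # ions moved per cycle
    direction: str   # "in" (into the cell) or "out" (out of the cell)


# Stoichiometry as stated in the text: 4 Na+ in, 1 K+ out, 1 Ca2+ out.
EXCHANGER_CYCLE = (
    IonFlux("Na+", +1, 4, "in"),
    IonFlux("K+", +1, 1, "out"),
    IonFlux("Ca2+", +2, 1, "out"),
)


def net_inward_charge(fluxes) -> int:
    """Sum the charge moved into the cell per cycle (outward movement counts as negative)."""
    total = 0
    for flux in fluxes:
        sign = 1 if flux.direction == "in" else -1
        total += sign * flux.charge * flux.count
    return total


def fold_change(before: float, after: float) -> float:
    """Ratio of a concentration before and after a change."""
    return before / after


if __name__ == "__main__":
    print("Net inward charge per cycle:", net_inward_charge(EXCHANGER_CYCLE))
    print("Fold drop in cytoplasmic Ca2+:", fold_change(500.0, 50.0))
```

With this stoichiometry each cycle carries a net single positive charge into the cell, and the drop from 500 nM to 50 nM is a tenfold decrease in the inhibitor of guanylate cyclase.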


Color vision is mediated by three cone receptors that 
are homologs of rhodopsin 


Cone cells, like rod cells, contain visual pigments. Like 
rhodopsin, these photoreceptor proteins are members of 
the 7TM-receptor family and use 11-cis-retinal as their 
chromophore. In human cone cells, there are three distinct 
photoreceptor proteins with absorption maxima at 426, 
530, and ~560 nm (Figure 33.25). These absorbances cor- 
respond to (in fact, define) the blue, green, and red regions of 
the spectrum. Recall that the absorption maximum for rho- 
dopsin is 500 nm. 
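
The absorption maxima can be translated into photon energies with the relation E = hc/λ. The Python sketch below is an illustration, not part of the original treatment: the function name and the pigment labels are assumptions, and the constants are the defined SI values of the Planck constant, the speed of light, and the Avogadro constant.

```python
# Physical constants (exact SI definitions).
PLANCK_J_S = 6.62607015e-34
LIGHT_SPEED_M_S = 299_792_458.0
AVOGADRO_PER_MOL = 6.02214076e23


def photon_energy_kj_per_mol(wavelength_nm: float) -> float:
    """Energy of one mole of photons, E = N_A * h * c / wavelength, in kJ/mol."""
    if wavelength_nm <= 0:
        raise ValueError("wavelength must be positive")
    wavelength_m = wavelength_nm * 1e-9
    joules_per_photon = PLANCK_J_S * LIGHT_SPEED_M_S / wavelength_m
    return joules_per_photon * AVOGADRO_PER_MOL / 1000.0


# Absorption maxima quoted in the text (the red value is approximate, ~560 nm).
ABSORPTION_MAXIMA_NM = {
    "blue cone": 426,
    "rhodopsin": 500,
    "green cone": 530,
    "red cone": 560,
}

if __name__ == "__main__":
    for pigment, wavelength in ABSORPTION_MAXIMA_NM.items():
        energy = photon_energy_kj_per_mol(wavelength)
        print(f"{pigment}: {wavelength} nm -> {energy:.0f} kJ/mol")
```

Shorter wavelengths correspond to higher photon energies, so the blue pigment absorbs the most energetic photons of the three cone pigments and the red pigment the least energetic.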

The amino acid sequences of the cone photoreceptors 
have been compared with one another and with rhodopsin. 
The result is striking. Each of the cone photoreceptors is 
approximately 40% identical in sequence with rhodopsin. 
Similarly, the blue photoreceptor is 40% identical with each 
of the green and red photoreceptors. The green and red 
photoreceptors, however, are >95% identical with each 
other, differing in only 15 of 364 positions (Figure 33.26). 
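
The ">95% identical" figure follows directly from the count of differing positions. A minimal Python sketch of the calculation is given below; the function names are assumptions made for this example, and the short peptide strings in the usage section are hypothetical, not real opsin sequences.

```python
def percent_identity(differing_positions: int, aligned_length: int) -> float:
    """Percentage of aligned positions that carry the same residue."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    if not 0 <= differing_positions <= aligned_length:
        raise ValueError("differing_positions must lie between 0 and aligned_length")
    return 100.0 * (aligned_length - differing_positions) / aligned_length


def count_differences(seq_a: str, seq_b: str) -> int:
    """Count mismatched positions between two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


if __name__ == "__main__":
    # Values from the text: 15 differences in 364 positions.
    print(f"green vs red: {percent_identity(15, 364):.1f}% identical")

    # Hypothetical toy peptides, used only to show the alignment-based route.
    toy_a, toy_b = "ACDEFGHIKL", "ACDEFGHIKV"
    differences = count_differences(toy_a, toy_b)
    print(f"toy example: {percent_identity(differences, len(toy_a)):.1f}% identical")
```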


Figure 33.26 Comparison of the amino acid sequences of the 
green and red photoreceptors. Open circles correspond to identical 
residues, whereas colored circles mark residues that are different. The 


Figure 33.27 Evolutionary relationships 
among visual pigments. Visual pigments 
have evolved by gene duplication along 
different branches of the animal evolutionary 
tree. The branch lengths of the “trees” 
correspond to the percentage of amino acid 
divergence. [After J. Nathans. Neuron 24:299-312, 
1999; by permission of Cell Press.] 


Homologous recombination 


The exchange of DNA segments with 
substantial sequence similarity at equivalent 
positions between chromosomes. 


Figure 33.28 Recombination pathways 
leading to color blindness. Rearrangements 
in the course of DNA replication may lead to 
(A) the loss of visual pigment genes or (B) 
the formation of hybrid pigment genes that 
encode photoreceptors with anomalous 
absorption spectra. Because the amino acids 
most important for determining absorption 
spectra are in the carboxyl-terminal half of 
each photoreceptor protein, the part of the 
gene that encodes this region most strongly 
affects the absorption characteristics of hybrid 
receptors. [After J. Nathans. Neuron 24:299- 
312, 1999; by permission of Cell Press.] 


First, the green and red photoreceptors are clearly products of a recent 
evolutionary event (Figure 33.27). The green and red pigments appear to have 
diverged in the primate lineage approximately 35 million years ago. Mammals, 
such as dogs and mice, that diverged from primates earlier have only two 
cone photoreceptors, blue and green. They are not sensitive to light as far 
toward the infrared region as we are, and they do not discriminate colors 
as well. In contrast, birds such as chickens have a total of six pigments: 
rhodopsin, four cone pigments, and a pineal visual pigment called pinopsin. 
Birds have highly acute color perception. 

Second, the high level of similarity between the green 
and the red pigments has made the identification of the 
specific amino acid residues responsible for spectral tuning 
possible. Three residues (at positions 180, 277, and 
285) are responsible for most of the difference between the green and the red 
pigments. In the green pigment, these residues are alanine, phenylalanine, 
and alanine, respectively; in the red pigment, they are serine, tyrosine, and 
threonine. A hydroxyl group has been added to each amino acid in the red 
pigment. The hydroxyl groups can interact with the photoexcited state of 
retinal and lower its energy, leading to a shift toward the lower-energy (red) 
region of the spectrum. 


Rearrangements in the genes for the green and red pigments 
lead to “color blindness” 


The genes for the green and red pigments lie adjacent to each other on 
the human X chromosome. These genes are more than 98% identical 
in nucleotide sequence, including introns and untranslated regions as well 
as the protein-coding region. Regions with such high similarity are very 
susceptible to unequal homologous recombination. 

Recombination can take place either between or within transcribed 
regions of the gene (Figure 33.28). If recombination takes place between 
transcribed regions, the product chromosomes will differ in the number of 
pigment genes that they carry. One chromosome will lose a gene and thus 
may lack the gene for, say, the green pigment; the other chromosome will 
gain a gene. Consistent with this scenario, approximately 2% of human 
X chromosomes carry only a single color-pigment gene, approximately 20% 
carry two, 50% carry three, 20% carry four, and 5% carry five or more. 
A person lacking the gene for the green pigment will have trouble distin- 
guishing red and green color, characteristic of the most common form of 
color blindness. Approximately 5% of males have this form of color 
blindness. Recombination can also take place within the transcription units, 
resulting in genes that encode hybrids of the green and red photoreceptors. 
The absorption maximum of such a hybrid lies between that of the red and 
green pigments. A person with such hybrid genes who also lacks either a 
functional red- or a functional green-pigment gene does not discriminate 
color well. 


33.4 Hearing Depends on the Speedy Detection of 
Mechanical Stimuli 


Hearing and touch are based on the detection of mechanical stimuli. 
Although the proteins of these senses have not been as well characterized 
as those of the senses already discussed, anatomical, physiological, and 
biophysical studies have elucidated the fundamental processes. A major clue 
to the mechanism of hearing is its speed. We hear frequencies ranging from 
200 to 20,000 Hz (cycles per second), corresponding to times of 5 to 0.05 ms. 
Furthermore, our ability to locate sound sources, one of the most impor- 
tant functions of hearing, depends on the ability to detect the time delay 
between the arrival of a sound at one ear and its arrival at the other. Given 
the separation of our ears and the speed of sound, we must be able to accu- 
rately sense time differences of 0.7 ms. In fact, human beings can locate 
sound sources associated with temporal delays as short as 0.02 ms. This 
high time resolution implies that hearing must employ direct transduction 
mechanisms that do not depend on second messengers. Recall that, in 
vision, for which speed also is important, the signal-transduction processes 
take place in milliseconds. 
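
The correspondence between frequency and time quoted above is simply the reciprocal relation T = 1/f. The short Python sketch below illustrates it; the function and constant names are assumptions made for this example, and the numbers are the ones given in the text.

```python
def period_ms(frequency_hz: float) -> float:
    """Duration of one cycle, in milliseconds, for a tone of the given frequency."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return 1000.0 / frequency_hz


# Values quoted in the text.
HEARING_RANGE_HZ = (200.0, 20_000.0)
FINEST_DELAY_MS = 0.02  # shortest interaural delay that can be resolved

if __name__ == "__main__":
    for frequency in HEARING_RANGE_HZ:
        print(f"{frequency:>8.0f} Hz -> {period_ms(frequency):.2f} ms per cycle")

    shortest_period = period_ms(max(HEARING_RANGE_HZ))
    ratio = shortest_period / FINEST_DELAY_MS
    print(f"Shortest period is {ratio:.1f} times the finest resolvable delay")
```

Even the shortest period in the audible range is longer than the finest delay that can be resolved, which underlines why a mechanism involving second messengers would be too slow.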


Hair cells use a connected bundle of stereocilia to detect tiny motions 


Sound waves are detected inside the cochlea of the inner ear. The cochlea is 
a fluid-filled, membranous sac that is coiled like a snail shell. The primary 
detection is accomplished by specialized neurons inside the cochlea called 
hair cells (Figure 33.29). Each cochlea contains approximately 16,000 hair 
cells, and each hair cell contains a hexagonally shaped bundle of 20 to 300 
hairlike projections called stereocilia (Figure 33.30). These stereocilia are 
graded in length across the bundle. Mechanical deflection of the hair bun- 
dle, as takes place when a sound wave arrives at the ear, creates a change in 
the membrane potential of the hair cell. 

Micromanipulation experiments have directly probed the connection 
between mechanical stimulation and membrane potential. Displacement 
toward the direction of the tallest part of the hair bundle results in the 
depolarization of the hair cell, whereas displacement in the opposite direc- 
tion results in the hyperpolarization (Figure 33.31). Motion perpendicular 
to the hair-length gradient does not produce any change in resting potential. 


Figure 33.29 Hair cells, the sensory 
neurons crucial for hearing. These 
specialized neurons are capped with hairlike 
projections called stereocilia that are responsible 
for detecting very subtle vibrations. [After A. J. 
Hudspeth. Nature 341:397-404, 1989.] 


Figure 33.30 An electron micrograph 
of a hair bundle. [Courtesy of Dr. A. Jacobs 
and Dr. A. J. Hudspeth.] 


Figure 33.32 Electron micrograph of tip 
links. The tip link between two hair fibers is 
marked by an arrow. [Courtesy of Dr. A. Jacobs 
and Dr. A. J. Hudspeth.] 


Figure 33.33 Model for hair-cell 
transduction. When the hair bundle is tipped 
toward the tallest part, the tip link pulls on an 
ion channel and opens it. Movement in the 
opposite direction relaxes the tension in the 
tip link, increasing the probability that any 
open channels will close. [After A.J. Hudspeth. 
Nature 341:397-404, 1989.] 


Remarkably, displacement of the hair bundle by as little as 3 Å (0.3 nm) results 
in a measurable (and functionally important) change in membrane potential. 
This motion of 0.003 degree corresponds to a 1-inch movement of the top of 
the Empire State Building. 
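
The comparison rests on simple geometry: a tip displacement d produced by a tilt of angle θ implies a lever of length d / tan θ. The Python sketch below works through both cases using only the numbers in the text (0.3 nm, 0.003 degree, 1 inch); the function name is an assumption made for this example, and the bundle is treated as a rigid lever pivoting at its base, which is a simplification.

```python
import math

INCH_M = 0.0254  # one inch in meters (exact by definition)


def lever_length_m(tip_displacement_m: float, tilt_deg: float) -> float:
    """Length of a rigid lever whose tip moves by the given distance when tilted."""
    if tilt_deg <= 0:
        raise ValueError("tilt angle must be positive")
    return tip_displacement_m / math.tan(math.radians(tilt_deg))


if __name__ == "__main__":
    # Hair bundle: 3 angstroms (0.3 nm) of tip motion at 0.003 degree.
    bundle_m = lever_length_m(0.3e-9, 0.003)
    print(f"Implied hair-bundle length: {bundle_m * 1e6:.1f} micrometers")

    # Analogy: 1 inch of tip motion at the same angle.
    tower_m = lever_length_m(INCH_M, 0.003)
    print(f"Implied structure height: {tower_m:.0f} meters")
```

Under these assumptions the bundle comes out at a few micrometers and the analogous structure at a few hundred meters, the scale of a skyscraper, so the two statements in the text are mutually consistent.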

How does the motion of the hair bundle create a change in membrane 
potential? The rapid response, within microseconds, suggests that the 
movement of the hair bundle acts on ion channels directly. An important 
observation is that adjacent stereocilia are linked by individual filaments 
called tip links (Figure 33.32). 

The presence of these tip links suggests a simple mechanical model for 
transduction by hair cells (Figure 33.33). The tip links are coupled to ion 
channels in the membranes of the stereocilia that are gated by mechanical 
stress. In the absence of a stimulus, approximately 15% of these channels are 
open. When the hair bundle is displaced toward its tallest part, the stereocilia 
slide across one another and the tension on the tip links increases, causing 
additional channels to open. The flow of ions through the newly opened 
channels depolarizes the membrane. Conversely, if the displacement is in 
the opposite direction, the tension on the tip links decreases, the open chan- 
nels close, and the membrane hyperpolarizes. Thus, the mechanical motion of the 
hair bundle is directly converted into current flow across the hair-cell membrane. 


Mechanosensory channels have been identified in 
Drosophila and vertebrates 


The search for ion channels that respond to mechanical impulses has been 
pursued in a variety of organisms. Drosophila have sensory bristles used for 
detecting small air currents. These bristles respond to mechanical displace- 
ment in ways similar to those of hair cells; displacement of a bristle in one 
direction leads to substantial transmembrane current. Strains of mutant 
fruit flies that show uncoordinated motion and clumsiness have been exam- 
ined for their electrophysiological responses to displacement of the sensory 
bristles. In one set of strains, transmembrane currents were dramatically 
reduced. The mutated gene in these strains was found to encode a protein of 
1619 amino acids, called NompC for no mechanoreceptor potential. 

The carboxyl-terminal 469 amino acids of NompC resemble a class of 
ion-channel proteins called TRP (transient receptor potential) channels. 
This region includes six putative transmembrane helices with a pore-like 
region between the fifth and the sixth helices. The amino-terminal 1150 
amino acids consist almost exclusively of 29 ankyrin repeats (Figure 33.34). 


Figure 33.34 Ankyrin repeat structure. One ankyrin domain is shown in red in this series 
of four ankyrin repeats. Notice the hairpin loop followed by a helix-turn-helix motif in the red- 
colored ankyrin unit. Ankyrin domains interact with other proteins, primarily through their loops. 
[Drawn from 1AWC.pdb.] 


Ankyrin repeats are structural motifs consisting of a hairpin loop followed 
by a helix-turn-helix. Importantly, in other proteins, regions with tandem 
arrays of these motifs mediate protein-protein interactions, suggesting that 
these arrays couple the motions of other proteins to the activity of the 
NompC channel. 

A candidate for at least one component of the mechanosensory channel 
taking part in hearing has been identified. The protein, TRPA1, is a member 
of the TRP channel family. The sequence of TRPA1 also includes 17 ankyrin 
repeats. TRPA1 is expressed in hair cells, particularly near their tips, and is 
likely to be at least one component of the mechanosensory channel that is 
central to hearing. Further studies are under way to confirm and extend this 
exciting discovery. 


33.5 Touch Includes the Sensing of Pressure, 
Temperature, and Other Factors 


Like taste, touch is a combination of sensory systems that are expressed in a 
common organ—in this case, the skin. The detection of pressure and the 
detection of temperature are two key components. Amiloride-sensitive Na⁺ 
channels, homologous to those of taste, appear to play a role. Other systems 
are responsible for detecting painful stimuli such as high temperature, acid, 
or certain specific chemicals. Although our understanding of this sensory 
system is not as advanced as that of the other sensory systems, recent work 
has revealed a fascinating relation between pain and taste sensation, a rela- 
tion well known to anyone who has eaten “spicy” food. 


Studies of capsaicin reveal a receptor for sensing high temperatures 
and other painful stimuli 


Our sense of touch is intimately connected with the sensation of pain. 
Specialized neurons, termed nociceptors, transmit signals from skin to pain- 
processing centers in the spinal cord and brain in response to the onset of 
tissue damage. What is the molecular basis for the sensation of pain? An 
intriguing clue came from the realization that capsaicin, the chemical 
responsible for the “hot” taste of spicy food, activates nociceptors. 
Early research suggested that capsaicin would act by opening ion 
channels that are expressed in nociceptors. Thus, a cell that expresses the 
capsaicin receptor should take up calcium on treatment with the molecule. 
This insight led to the isolation of the capsaicin receptor with the use of 
cDNA from cells expressing this receptor. Such cells had been detected by 
their fluorescence when loaded with the calcium-sensitive compound 
Fura-2 and then treated with capsaicin or related molecules. Cells expressing 
the capsaicin receptor, which is called VR1 (for vanilloid receptor 1), respond 
to capsaicin below a concentration of 1 µM. The deduced 838-residue 
sequence of VR1 revealed it to be a member of the TRP channel family 
(Figure 33.35). The amino-terminal region of VR1 includes three ankyrin 
repeats. 


Figure 33.35 The membrane topology 
deduced for VR1, the capsaicin receptor. 
The proposed site of the membrane pore is 
indicated in red, and the three ankyrin (A) 
repeats are shown in orange. The 
active receptor comprises four of these 
subunits. [After M. J. Caterina, et al. Nature 
389:816-824, 1997.] 


Figure 33.36 Response of the capsaicin receptor to pH and temperature. The ability of this 
receptor to respond to acid and to increased temperature helps detect potentially noxious situations. 
[After M. Tominaga, et al. Neuron 21:531-543, 1998.] 


Currents through VR1 are also induced by temperatures above 40°C 
and by exposure to dilute acid, with a midpoint for activation at pH 5.4 
(Figure 33.36). Temperatures and acidity in these ranges are associated with 
infection and cell injury. The responses to capsaicin, temperature, and acid- 
ity are not independent. The response to heat is greater at lower pH, for 
example. Thus, VR1 acts to integrate several noxious stimuli. We feel these 
responses as pain and act to prevent the potentially destructive conditions 
that cause the unpleasant sensation. Mice that do not express VR1 suggest 
that this is the case; such mice do not mind food containing high concentra- 
tions of capsaicin and are, indeed, less responsive than control mice to nor- 
mally noxious heat. Plants such as chili peppers presumably gained the 
ability to synthesize capsaicin and other “hot” compounds to protect them- 
selves from being consumed by mammals. Birds, which play the beneficial 
role of spreading pepper seeds into new territory, do not appear to respond 
to capsaicin. 
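
To put the acid midpoint in concrete terms, a pH value can be converted into a hydrogen ion concentration with [H⁺] = 10^(−pH). The Python sketch below is illustrative only: the function name is an assumption, and the reference value of pH 7.4 is an assumed typical physiological pH that does not come from the passage above.

```python
def hydrogen_ion_molar(ph: float) -> float:
    """Hydrogen ion concentration in mol/L for a given pH."""
    return 10.0 ** (-ph)


ACTIVATION_MIDPOINT_PH = 5.4   # midpoint for VR1 activation, from the text
REFERENCE_PH = 7.4             # assumption: typical physiological pH, for comparison only

if __name__ == "__main__":
    midpoint = hydrogen_ion_molar(ACTIVATION_MIDPOINT_PH)
    reference = hydrogen_ion_molar(REFERENCE_PH)
    print(f"[H+] at pH {ACTIVATION_MIDPOINT_PH}: {midpoint * 1e6:.2f} micromolar")
    print(f"Fold increase over pH {REFERENCE_PH}: {midpoint / reference:.0f}")
```

Under the stated assumption, the activation midpoint corresponds to roughly a hundredfold rise in hydrogen ion concentration over the reference condition.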


Because of its ability to stimulate VR1, capsaicin is used in pain 
management for arthritis, neuralgia, and other neuropathies. How 
can a compound that induces pain assist in its alleviation? Chronic expo- 
sure to capsaicin overstimulates pain-transmitting neurons, leading to their 
desensitization. 


More sensory systems remain to be studied 


There may exist other subtle senses that are able to detect environmental 
signals that then influence our behavior. The biochemical basis of these 
senses is now under investigation. One such sense is our ability to respond, 
often without our awareness, to chemical signals called pheromones, 
released by other persons. Another is our sense of time, manifested in our 
daily (circadian) rhythms of activity and restfulness. Daily changes in light 
exposure strongly influence these rhythms. The foundations for these 
senses have been uncovered in other organisms; future studies should reveal 
to what extent these mechanisms apply to human beings as well. 


Summary 


Smell, taste, vision, hearing, and touch are based on signal-transduction 
pathways activated by signals from the environment. These sensory 
systems function similarly to the signal-transduction pathways for 
many hormones. These intercellular signaling pathways appear to 
have been appropriated and modified to process environmental 
information. 


33.1 A Wide Variety of Organic Compounds Are Detected by Olfaction 


The sense of smell, or olfaction, is remarkable in its specificity; it can, 
for example, discern stereoisomers of small organic compounds as 
distinct aromas. The 7TM receptors that detect these odorants operate 
in conjunction with G(olf), a G protein that activates a cAMP cascade 
resulting in the opening of an ion channel and the generation of a nerve 
impulse. An outstanding feature of the olfactory system is its ability to 
detect a vast array of odorants. Each olfactory neuron expresses only 
one type of receptor and connects to a particular region of the olfactory 
bulb. Odors are decoded by a combinatorial mechanism: each odorant 
activates a number of receptors, each to a different extent, and most 
receptors are activated by more than one odorant. 


33.2 Taste Is a Combination of Senses That Function by Different 
Mechanisms 

We can detect only five tastes: bitter, sweet, salt, sour, and umami. 
The transduction pathways that detect taste are, however, diverse. 
Bitter, sweet, and umami tastants are experienced through 7TM 
receptors acting through a special G protein called gustducin. Salty 
and sour tastants act directly through membrane channels. Salty 
tastants are detected by passage through Na⁺ channels, whereas sour 
taste results from the effects of hydrogen ions on a number of types of 
channels. The end point is the same in all cases—membrane polariza- 
tion that results in the transmission of a nerve impulse. 


33.3 Photoreceptor Molecules in the Eye Detect Visible Light 


Vision is perhaps the best understood of the senses. Two classes of 
photoreceptor cells exist: cones, which respond to bright lights and 
colors, and rods, which respond only to dim light. The photoreceptor 
in rods is rhodopsin, a 7TM receptor that is a complex of the protein 
opsin and the chromophore 11-cis-retinal. The absorption of light by 
11-cis-retinal changes its structure into that of all-trans-retinal, setting 
in motion a signal-transduction pathway that leads to the breakdown 
of cGMP, to membrane hyperpolarization, and to a subsequent nerve 
impulse. Color vision is mediated by three distinct 7TM photoreceptors 
that employ 11-cis-retinal as a chromophore and absorb light in 
the blue, green, and red parts of the spectrum. 


33.4 Hearing Depends on the Speedy Detection of Mechanical Stimuli 


The immediate receptors for hearing are found in the hair cells of the 
cochleae, which contain bundles of stereocilia. When the stereocilia 
move in response to sound waves, cation channels will open or close, 
depending on the direction of movement. The mechanical motion of 
the cilia is converted into current flow and then into a nerve impulse. 


33.5 Touch Includes the Sensing of Pressure, Temperature, 
and Other Factors 


Touch, detected by the skin, senses pressure, temperature, and pain. 
Specialized nerve cells called nociceptors transmit signals that are 
interpreted in the brain as pain. A receptor responsible for the 
perception of pain has been isolated on the basis of its ability to bind capsaicin, 
the molecule responsible for the hot taste of spicy food. The capsaicin 
receptor, also called VR1, functions as a cation channel that initiates a 
nerve impulse. 




Key Terms 


main olfactory epithelium (p. 958) 
G(olf) (p. 959) 
gustducin (p. 963) 
amiloride-sensitive Na⁺ channel (p. 965) 
rod (p. 966) 
cone (p. 966) 
rhodopsin (p. 966) 
opsin (p. 966) 
retinal (p. 966) 
chromophore (p. 967) 
transducin (p. 968) 
cGMP phosphodiesterase (p. 968) 
cGMP-gated ion channel (p. 968) 
rhodopsin kinase (p. 968) 
arrestin (p. 969) 
guanylate cyclase (p. 969) 
hair cell (p. 971) 
stereocilium (p. 971) 
tip link (p. 972) 
nociceptor (p. 973) 
capsaicin receptor (VR1) (p. 973) 


Problems 


1. Olfaction in worms. Unlike the olfactory neurons in the 
mammalian systems discussed herein, olfactory neurons in 
the nematode C. elegans express multiple olfactory receptors. 
In particular, one neuron (called AWA) expresses receptors 
for compounds to which the nematode is attracted, whereas 
a different neuron (called AWB) expresses receptors for 
compounds that the nematode avoids. Suppose that a trans- 
genic nematode is generated such that one of the receptors 
for an attractant is expressed in AWB rather than AWA. 
What behavior would you expect in the presence of the 
corresponding attractant? 


2. Odorant matching. A mixture of two of the compounds 
illustrated in Figure 33.6 is applied to a section of olfactory 
epithelium. Only receptors 3, 5, 9, 12, and 13 are activated, 
according to Figure 33.7. Identify the likely compounds in 
the mixture. 


3. Timing. Compare the aspects of taste (bitter, sweet, salty, 
sour) in regard to their potential for rapid time resolution. 


4. Two ears. Our ability to determine the direction from 
which a sound is coming is partly based on the difference in 
time at which our two ears detect the sound. Given the 
speed of sound (350 m s⁻¹) and the separation between our 
ears (0.15 m), what difference is expected in the times at 
which a sound arrives at our two ears? How does this 
difference compare with the time resolution of the human 
hearing system? Would a sensory system that utilized 
7TM receptors and G proteins be capable of adequate time 
resolution? 


5. It’s better to be bitter. Some nontoxic plants taste very 
bitter to us. Suggest one or more explanations. 


6. Of mice and men. In human beings, the umami taste is 
triggered only by glutamate and aspartate. In contrast, mice 
respond to many more amino acids. Design an experiment 
to test which of the subunits (T1R1 or T1R3) determines 
the specificity of this response. Assume that all desired 
mouse strains can be readily produced. 


7. Color discrimination. Some women who have had male 
children who display color blindness have been demonstrated 
to distinguish between colors that are not readily 
distinguished by other people. Propose an explanation. 


8. Combinatorial power. How many different odorants 
could a person distinguish if each odorant is bound by a 
single type of receptor and each receptor binds only a single 
odorant? How many if each odorant binds to two different 
odorant receptors? Three odorant receptors? 


9. Light action. Describe the effect of light absorption on 
11-cis-retinal bound within rhodopsin. 


10. Mirror, mirror. Compounds A and B typically evoke 
different odors, with compound A described as smelling 
like tropical fruit and compound B smelling oniony. 


[Structures of compounds A and B are not reproduced here.] 


Explain how two such similar compounds can smell so 
different. 


11. Channeling. Provide an example of an ion channel in 
vision, taste, and hearing. 


Chapter Integration Problem 


12. Energy and information. The transmission of sensory 
information requires the input of free energy. For each sen- 
sory system (olfaction, gustation, vision, hearing, and 
touch), identify mechanisms for the input of free energy 
that allow the transmission of sensory information. 


Mechanism Problem 


13. Schiff-base formation. Propose a mechanism for the 
reaction between opsin and 11-cis-retinal. 


CHAPTER 34 


The Immune System 


Just as medieval defenders used their weapons and the castle walls to 
defend their city, the immune system constantly battles against foreign 
invaders such as viruses, bacteria, and parasites to defend the organism. 
Antibody molecules provide a key element in the immune system's 
defensive arsenal. For example, specific antibodies can bind to molecules on 
the surfaces of viruses and prevent the viruses from infecting cells. Above 
right, an antibody binds to one subunit on hemagglutinin from the surface of 
influenza virus. [(Left) The Granger Collection.] 


OUTLINE

34.1 Antibodies Possess Distinct Antigen-Binding and Effector Units

34.2 Antibodies Bind Specific Molecules Through Hypervariable Loops

34.3 Diversity Is Generated by Gene Rearrangements

34.4 Major-Histocompatibility-Complex Proteins Present Peptide Antigens
on Cell Surfaces for Recognition by T-Cell Receptors

34.5 The Immune System Contributes to the Prevention and the
Development of Human Diseases


We are constantly exposed to an incredible diversity of bacteria, viruses,
and parasites, many of which would flourish in our cells or extracellular
fluids were it not for our immune system. How does the immune system
protect us? The human body has two lines of defense: an innate immune
system that responds rapidly to features present in many pathogens, and an
adaptive immune system that responds to specific features present only in a
given pathogen. Both the innate and the adaptive immune systems first
identify features on disease-causing organisms and then work to eliminate or
neutralize those organisms. While a thorough description of the immune system
is certainly beyond the scope of this book, this chapter will focus on how
biochemical concepts such as protein structure, receptor-ligand interactions,
and signal transduction are applied to the identification of pathogens.


Figure 34.1 Toll-like receptor. Each
receptor is made up of a set of 18 or more
leucine-rich repeat sequences, followed by a
cysteine-rich domain, a single transmembrane
helix, and an intracellular domain that
functions in signal transduction.


The immune system must meet two tremendous challenges in the
identification of pathogens: (1) to produce a system of receptors diverse
enough to recognize a wide array of potential pathogens and (2) to
distinguish invaders and their disease-causing products from the organism’s own
products (i.e., self- versus nonself-recognition). To meet these challenges,
the innate immune system evolved the ability to recognize structural
elements, such as specific glycolipids or forms of nucleic acid, that are well
conserved in pathogens but absent in the host organism. The repertoire of 
such elements is limited, however, and so some pathogens have strategies to 
escape detection. The adaptive immune system has the remarkable ability to 
produce more than 10⁸ distinct proteins, called antibodies, that can recognize
different foreign molecules, and more than 10¹² receptors on immune
cells, called T-cell receptors (TCRs), each of which presents a different 
surface with the potential to specifically bind a structure from a foreign 
organism. In producing this vast range of defensive molecules, however, the 
adaptive immune system has the potential to create antibodies and 
T-cells that recognize and attack cells or molecules normally present in our 
bodies—a situation that can result in autoimmune diseases. 

This chapter will examine these challenges, focusing first on the struc- 
tures of proteins that recognize foreign organisms and then on the mecha- 
nisms for protecting us from a specific pathogen once it has been recog- 
nized. The chapter will closely examine the modular construction of the 
proteins of the immune system—identifying structural motifs and consider- 
ing how spectacular diversity can arise from modular construction. 


Innate immunity is an evolutionarily ancient defense system 


Innate immunity is an evolutionarily ancient defense system found, at least 
in some form, in all multicellular plants and animals. The innate immune 
system represents the first line of defense against foreign pathogens, relying 
on common features of invading organisms to identify and eliminate these 
threats. Components of the innate immune system include the epithelial 
lining that surrounds host cells and the specialized cells, called phagocytes, 
that can ingest and destroy pathogens without the aid of the adaptive 
immune system. 

The innate immune system also includes a family of receptors that can 
recognize specific features present in most pathogens and yet not respond to 
materials normally present in the host. The best-understood of these recep- 
tors are the Toll-like receptors (TLRs). The name “toll-like” is derived from
a receptor known as Toll encoded in the Drosophila genome; Toll was first 
identified in a screen for genes important for Drosophila development and 
was subsequently shown to play a key role in the innate immune system 
later in development. The TLRs have a common structure (Figure 34.1). 
Each receptor consists of a large extracellular domain built primarily from 
repeated amino acid sequences termed leucine-rich repeats (LRRs). Each 
LRR typically contains 20–30 residues, including 6 that are usually leucine.
The human TLRs have from 18 to 27 LRR repeats that are followed by a 
sequence forming a single transmembrane helix and an intracellular signal- 
ing domain. This signaling domain is not a protein kinase but acts as a dock- 
ing site for other proteins. Most TLRs are expressed in the cell membrane 
for the detection of extracellular pathogens such as fungi and bacteria. 
Other TLRs are located in the membranes of internal compartments for the 
detection of intracellular pathogens such as viruses and some bacteria. 

Each TLR targets a specific molecular characteristic, often called a 
pathogen-associated molecular pattern (PAMP), found primarily on invading 
organisms. Typically, a PAMP is a critical component of the pathogen’s 
function: mutations in these targets cannot easily block recognition by the 
TLR without compromising the activity of the pathogen. One particularly 
important PAMP is lipopolysaccharide (LPS), also referred to as endotoxin, 
a specific class of glycolipid found in the cell walls of Gram-negative bacte- 
ria such as E. coli. LPS is recognized by TLR-4. The response of the innate 
immune system to LPS can be easily demonstrated. Injection of less than 
1 mg of LPS into a human being produces a fever and other signs of
inflammation even though no living organisms are introduced.

How do TLRs recognize PAMPs? The leucine-rich repeat domain from 
human TLR-3 has a remarkable structure (Figure 34.2). Each of its LRR 
units contributes a single β strand to a large parallel β sheet that lines the
inside of a concave structure. This hooklike structure immediately suggests 
a model for how TLRs bind PAMPs—namely, that the PAMP lies on the 
inside of the “hook.” This model is likely accurate for some TLRs. However, 
for other TLRs, the PAMP-binding site appears to lie on one side of the 
structure, and the central hole is blocked by host carbohydrates linked to the 
structure. 

Regardless of the details of the interaction, PAMP binding appears to 
lead to the formation of a specific dimer of the TLR. The cytoplasmic side 
of this dimer is a signaling domain that initiates the signal-transduction 
pathway. Because the TLRs and other components of the innate immune 
system are always expressed, they provide the host organism with a rapid 
response to resist attack by pathogens. However, a number of pathogens 
have evolved the ability to escape detection by the innate immune system. 
For protection against such pathogens, the host relies on the adaptive 
immune system, which is able to target specific pathogens, even those that 
it has never encountered in the course of evolution. 


The adaptive immune system responds by using the 
principles of evolution 


The adaptive immune system comprises two parallel but interrelated sys- 
tems: humoral and cellular immune responses. In the humoral immune 
response, soluble proteins called antibodies (immunoglobulins) function as 
recognition elements that bind to foreign molecules and serve as markers 
signaling foreign invasion. Antibodies are secreted by plasma cells, which 
are derived from B lymphocytes (B cells) (Figure 34.3). A foreign macromol- 
ecule that binds selectively to an antibody is called an antigen. In a physio- 
logical context, if the binding of the foreign molecule stimulates an immune 
response, that molecule is called an immunogen. The specific affinity of an 
antibody is not for the entire macromolecular antigen but for a particular
site on the antigen called the epitope or antigenic determinant. Each B cell
produces just one type of antibody that can recognize a single epitope.


Figure 34.2 PAMP-recognition unit
of the Toll-like receptor. (A) The structure
of the leucine-rich repeat (LRR) domain from
human TLR-3. Notice that the LRR units come
together to form a central parallel β sheet that
curls to form a concave structure. (B) The
structure of a single LRR showing the
positions of the residues that are
approximately conserved. Notice that the
leucine residues come together to form a
hydrophobic core with the single β strand
along one side. [Drawn from 1ZIW.pdb.]


Figure 34.3 Immunoglobulin production.
An electron micrograph of a plasma cell
shows the highly developed rough
endoplasmic reticulum necessary for antibody
secretion. [Courtesy of Lynne Mercer.]

In the cellular immune response, cells called cytotoxic T lymphocytes (also 
commonly called killer T cells) destroy cells that have been invaded by a 
pathogen. Because intracellular pathogens do not leave markings on the 
exteriors of infected cells, vertebrates have evolved a mechanism to mark 
the exterior of cells with a sample of the interior contents, both self and 
foreign. Some of the internal proteins are broken into peptides, which are 
then bound to a complex of integral membrane proteins encoded by the 
major histocompatibility complex (MHC). T cells continually scan the bound
peptides to find and kill cells that display foreign motifs on their surfaces. 
Another class of T cells called helper T lymphocytes contributes to both the 
humoral and the cellular immune responses by stimulating the differentia- 
tion and proliferation of appropriate B cells and cytotoxic T cells. The cel- 
lular immune response is mediated by specific receptors that are expressed 
on the surfaces of the T cells. 

The remarkable ability of the immune system to adapt to an essentially 
limitless set of potential pathogens requires a powerful system for trans- 
forming the immune cells and molecules present in our systems in response 
to the presence of pathogens. This adaptive system operates through the prin- 
ciples of evolution, including reproduction with variation followed by selection 
of the most well suited members of a population. 

If the human genome contains, by the latest estimates, only 25,000 
genes, how can the immune system generate more than 10⁸ different antibody
proteins and 10¹² T-cell receptors? The answer is found in a novel
mechanism for generating a highly diverse set of genes from a limited set of 
genetic building blocks. Linking different sets of DNA regions in a combi- 
natorial manner produces many distinct protein-encoding genes that are 
not present in the genome. A rigorous selection process then leaves for pro- 
liferation only cells that synthesize proteins determined to be useful in the 
immune response. The subsequent reproduction of these cells without 
additional recombination serves to enrich the cell population with members 
expressing particular protein species. 

Critical to the development of the immune response is the selection 
process, which determines which immune cells will reproduce. The process 
comprises several stages. In the early stages of the development of an 
immune response, cells expressing molecules that bind tightly to self- 
molecules are destroyed or silenced, whereas cells expressing molecules 
that do not bind strongly to self-molecules and that have the potential for 
binding strongly to foreign molecules are preserved. The appearance of 
an immunogenic invader at a later time will stimulate cells expressing 
antibodies or T-cell receptors that bind specifically to elements of that 
pathogen to reproduce—in evolutionary terms, such cells are selected for. 
Thus, the immune response is based on the selection of cells expressing 
molecules that are specifically effective against a particular invader; the 
response evolves from a population with wide-ranging specificities to a 
more-focused collection of cells and molecules that are well suited to 
defend the host when confronted with that particular challenge. 

Not only are antibodies and T-cell receptors a result of genetic diversity 
and recombination, but antibodies have highly diverse structures as well. 
Antibodies require many different structural solutions for binding many 
different antigens, each of which has a different form. T-cell receptors, in 
contrast, are not structurally diverse, because they have coevolved with the 
MHC. The docking mode of a T-cell receptor to the peptide bound to 
MHC is similar for all structures. As a consequence of this coevolution, 
every T-cell receptor has an inherent reactivity with every MHC. The 


coevolution ensures that all T-cell receptors can scan all peptide-MHC 
complexes on all tissues. The genetic diversity of the 10¹² different T-cell
receptors is concentrated in a highly diverse set of residues in the center of 
the MHC groove. This localized diversity allows the T-cell receptor to rec- 
ognize the many different foreign peptides bound to the MHC. T-cell 
receptors must survey many different MHC-peptide complexes with rapid
turnover. Therefore, the binding affinities between T-cell receptors and the 
MHC are weaker than those between antibody and antigen. 


34.1 Antibodies Possess Distinct Antigen-Binding 
and Effector Units 


Antibodies are central molecular players in the immune response. In 
Chapter 3, we discussed the application of antibodies as tools to study pro- 
teins; let us now consider the native structures and functions of these 
remarkable molecules. In 1959, Rodney Porter showed that immunoglobulin 
G (IgG), the major antibody in serum, can be cleaved into three 50-kd 
fragments by the limited proteolytic action of papain. Two of these fragments
bind antigen. They are called Fab (F stands for fragment, ab for antigen
binding). The other fragment, called Fc because it crystallizes readily, does
not bind antigen, but it has other important biological activities, including 
the mediation of responses termed effector functions. These functions 
include the initiation of the complement cascade, a process that leads to the 
lysis of target cells. Although such effector functions are crucial to the func- 
tioning of the immune system, they will not be considered further here. 
How do these fragments relate to the three-dimensional structure of 
whole IgG molecules? Immunoglobulin G consists of two kinds of polypep- 
tide chains, a 25-kd light (L) chain and a 50-kd heavy (H) chain (Figure 34.4). 
The subunit composition is L₂H₂. Each L chain is linked to an H chain by a
disulfide bond, and the H chains are linked to each other by at least one disul- 
fide bond. Examination of the amino acid sequences and three-dimensional 
structures of IgG molecules reveals that each L chain comprises two homol- 
ogous domains, termed immunoglobulin domains, to be described in detail in 
Section 34.2. Each H chain has four immunoglobulin domains. Overall, the 
molecule adopts a conformation that resembles the letter Y, in which the 
stem, corresponding to the Fc fragment obtained by cleavage with papain,
consists of the two carboxyl-terminal immunoglobulin domains of each H
chain and in which the two arms of the Y, corresponding to the two Fab
fragments, are formed by the two amino-terminal domains of each H chain and
the two amino-terminal domains of each L chain. The linkers between the
stem and the two arms consist of extended polypeptide regions within the H
chains and are quite flexible.


Figure 34.4 Immunoglobulin G
structure. (A) The three-dimensional
structure of an IgG molecule showing the light
chains in yellow and the heavy chains in blue.
(B) A schematic view of an IgG molecule
indicating the positions of the interchain
disulfide bonds. Abbreviations: N, amino
terminus; C, carboxyl terminus. [Drawn from
1IGT.pdb.]


Figure 34.5 Immunoglobulin G cleavage.
Treatment of intact IgG molecules with the
protease papain results in the formation of
three large fragments: two Fab fragments that
retain antigen-binding capability and one Fc
fragment that does not.


Figure 34.6 Antigen cross-linking.
Because IgG molecules include two
antigen-binding sites, antibodies can cross-link
multivalent antigens such as viral surfaces.


Table 34.1 Properties of immunoglobulin classes

        Serum
        concentration               Light     Heavy
Class   (mg ml⁻¹)       Mass (kd)   chains    chains    Chain structure
IgG     12              150         κ or λ    γ         κ₂γ₂ or λ₂γ₂
IgA     3               180–500     κ or λ    α         (κ₂α₂)ₙ or (λ₂α₂)ₙ
IgM     1               950         κ or λ    μ         (κ₂μ₂)₅ or (λ₂μ₂)₅
IgD     0.1             175         κ or λ    δ         κ₂δ₂ or λ₂δ₂
IgE     0.001           200         κ or λ    ε         κ₂ε₂ or λ₂ε₂

Note: n = 1, 2, or 3. IgM and oligomers of IgA also contain J chains that connect immunoglobulin molecules.
IgA in secretions has an additional component.

Papain cleaves the H chains on the carboxyl-terminal side of the disulfide
bond that links each L and H chain (Figure 34.5). Thus, each Fab
consists of an entire L chain and the amino-terminal half of an H chain,
whereas Fc consists of the carboxyl-terminal halves of both H chains. Each
Fab contains a single antigen-binding site. Because an intact IgG molecule
contains two Fab components and therefore has two binding sites, it can
cross-link multiple antigens (Figure 34.6). Furthermore, the Fc and the two
Fab units of the intact IgG are joined by flexible polypeptide regions that
allow facile variation in the angle between the Fab units through a wide
range (Figure 34.7). This kind of mobility, called segmental flexibility, can
enhance the formation of an antibody-antigen complex by enabling both
combining sites on an antibody to bind an antigen that possesses multiple
binding sites, such as a viral coat composed of repeating identical monomers
or a bacterial cell surface. The combining sites at the tips of the Fab units
simply move to match the distance between specific determinants on the
antigen.
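
The mass bookkeeping behind the papain experiment can be checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are invented here, and it assumes, as an idealization, that papain divides each 50-kd H chain into two halves of equal mass.

```python
# Approximate chain masses quoted in the text, in kilodaltons (kd).
LIGHT_CHAIN_KD = 25
HEAVY_CHAIN_KD = 50

# Intact IgG has the subunit composition L2H2.
igg_kd = 2 * LIGHT_CHAIN_KD + 2 * HEAVY_CHAIN_KD

# Idealization (an assumption of this sketch): papain cuts each H chain
# into an amino-terminal half and a carboxyl-terminal half of equal mass.
half_heavy_kd = HEAVY_CHAIN_KD / 2

fab_kd = LIGHT_CHAIN_KD + half_heavy_kd  # one entire L chain + amino-terminal half of H
fc_kd = 2 * half_heavy_kd                # carboxyl-terminal halves of both H chains

# One IgG molecule yields two Fab fragments and one Fc fragment.
fragment_masses = [fab_kd, fab_kd, fc_kd]

print(f"Fab: {fab_kd:.0f} kd, Fc: {fc_kd:.0f} kd")
print(f"sum of fragments: {sum(fragment_masses):.0f} kd; intact IgG: {igg_kd} kd")
```

Under these assumptions, each of the three fragments comes out at roughly 50 kd and together they account for the full mass of the intact molecule, in keeping with the three 50-kd fragments described above.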

Immunoglobulin G is the antibody present in highest concentration 
in the serum, but other classes of immunoglobulin also are present 
(Table 34.1). Each class includes an L chain (either κ or λ) and a distinct H
chain (Figure 34.8). The heavy chains in IgG are called γ chains, whereas
those in immunoglobulins A, M, D, and E are called α, μ, δ, and ε,
respectively. Immunoglobulin M (IgM) is the first class of antibody to appear in the
serum after exposure to an antigen. The presence of 10 combining sites 
enables IgM to bind especially tightly to antigens containing multiple iden- 
tical epitopes. The strength of an interaction comprising multiple indepen- 
dent binding interactions between partners is termed avidity rather than 
affinity, which denotes the binding strength of a single combining site. 


Figure 34.7 Segmental flexibility. The linkages between the Fab and the Fc regions of an IgG
molecule are flexible, allowing the two antigen-binding sites to adopt a range of orientations with 
respect to each other. This flexibility allows effective interactions with a multivalent antigen without 
requiring that the epitopes on the target be a precise distance apart. 

Immunoglobulin A (IgA) is the major class of antibody in external secretions,
such as saliva, tears, bronchial mucus, and intestinal mucus. Thus,
IgA serves as a first line of defense against bacterial and viral antigens. The 
role of immunoglobulin D (IgD) is not yet known. Immunoglobulin E (IgE) is 
important in conferring protection against parasites, but IgE also participates
in allergic reactions. IgE-antigen complexes form cross-links with
receptors on the surfaces of mast cells to trigger a cascade that leads to 
the release of granules containing pharmacologically active molecules. 
Histamine, one of the agents released, induces smooth-muscle contraction 
and stimulates the secretion of mucus. 


34.2 Antibodies Bind Specific Molecules Through 
Hypervariable Loops 


A comparison of the amino acid sequences of different IgG antibodies from 
human beings or mice shows that the carboxyl-terminal half of the L chains 
and the carboxyl-terminal three-quarters of the H chains are very similar in 
all of the antibodies. Importantly, the amino-terminal domain of each chain 
is more variable, including three stretches of approximately 7 to 12 amino 
acids within each chain that are hypervariable, as shown for the H chain in 
Figure 34.9. The amino-terminal immunoglobulin domain of each chain is thus
referred to as the variable region, whereas the remaining immunoglobulin
domains are much more similar in all antibodies and are referred to as constant
regions (Figure 34.10).


Figure 34.8 Classes of immunoglobulin.
Each of five classes of immunoglobulin has
the same light chain (shown in yellow)
combined with a different heavy chain (γ, α,
μ, δ, or ε). Disulfide bonds are indicated by
green lines. The IgA dimer and the IgM
pentamer have a small polypeptide chain in
addition to the light and heavy chains.


Figure 34.9 Immunoglobulin sequence
diversity. A plot of sequence variability as a
function of position along the sequence of the
amino-terminal immunoglobulin domain of
the H chain of human IgG molecules. Three
regions (in red) show remarkably high levels
of variability. These hypervariable regions
correspond to three loops in the
immunoglobulin domain structure. [After R. A.
Goldsby, T. J. Kindt, and B. A. Osborne, Kuby
Immunology, 4th ed. (W. H. Freeman and
Company, 2000), p. 91.]
[Plot axes: Variability (0 to 150) versus Residue (0 to 120).]


Figure 34.10 Variable and constant
regions. Each L and H chain includes one
immunoglobulin domain at its amino terminus
that is quite variable from one antibody to
another. These domains are referred to as VL
and VH. The remaining domains are more
constant from one antibody to another and
are referred to as constant domains (CL1,
CH1, CH2, and CH3).


Figure 34.11 Immunoglobulin fold.
An immunoglobulin domain consists of a pair
of β sheets linked by a disulfide bond and
hydrophobic interactions. Notice that three
hypervariable loops lie at one end of the
structure. [Drawn from 1DQJ.pdb.]
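
A variability plot such as the one in Figure 34.9 can be computed from aligned sequences. The sketch below uses the Wu-Kabat variability index, defined as the number of different amino acids found at a position divided by the frequency of the most common amino acid at that position. That Figure 34.9 uses exactly this index is an assumption, and the four-residue sequences are invented purely for illustration.

```python
from collections import Counter


def wu_kabat_variability(column):
    """Variability at one aligned position:
    (number of distinct residues) / (frequency of the most common residue)."""
    counts = Counter(column)
    # Written as distinct * total / max_count so the arithmetic stays exact.
    return len(counts) * len(column) / max(counts.values())


# Toy alignment (hypothetical sequences, not real antibody data):
# five sequences, four aligned positions.
toy_alignment = [
    "CASW",
    "CGTW",
    "CDRW",
    "CAKW",
    "CEHW",
]

# zip(*rows) transposes the alignment so that each tuple is one column.
profile = [wu_kabat_variability(col) for col in zip(*toy_alignment)]
print(profile)
```

An invariant position scores 1, and a position at which all 20 amino acids occur equally often would score 400. Hypervariable stretches appear as peaks in such a profile.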


The immunoglobulin fold consists of a beta-sandwich 
framework with hypervariable loops 


An IgG molecule consists of a total of 12 immunoglobulin domains. These 
domains have many sequence features in common and adopt a common 
structure, the immunoglobulin fold (Figure 34.11). Remarkably, this same 
structural domain is found in many other proteins that play key roles in the 
immune system and in nonimmune functions. 

The immunoglobulin fold consists of a pair of β sheets, each built of
antiparallel β strands, that surround a central hydrophobic core. A single
disulfide bond bridges the two sheets. Two aspects of this structure are 
particularly important for its function. First, three loops present at one end 
of the structure form a potential binding surface. These loops contain the 
hypervariable sequences present in antibodies and in T-cell receptors. 
Variation of the amino acid sequences of these loops provides the major 
mechanism for the generation of the vastly diverse set of antibodies and 
T-cell receptors expressed by the immune system. These loops are referred 
to as hypervariable loops or complementarity-determining regions (CDRs). 
Second, the amino terminus and the carboxyl terminus are at opposite ends
of the structure, which allows structural domains to be strung together to 
form chains, as in the L and H chains of antibodies. Such chains are present 
in several other key molecules in the immune system. 


The immunoglobulin fold is one of the most prevalent domains
encoded by the human genome: more than 750 genes encode proteins
with at least one immunoglobulin fold recognizable at the level of amino 
acid sequence. Such domains are also common in other multicellular ani- 
mals such as flies and nematodes. However, from inspection of amino acid 
sequence alone, immunoglobulin-fold domains do not appear to be present 
in yeast or plants, although these organisms possess other structurally simi- 
lar domains, including the key photosynthetic electron-transport protein 
plastocyanin in plants (Section 19.3). Thus, the immunoglobulin-fold 
family appears to have expanded greatly along evolutionary branches lead- 
ing to animals—particularly, vertebrates. 


X-ray analyses have revealed how antibodies bind antigens 


For each class of antibody, the variable domains at the amino-terminal ends 
of the L and H chains (designated VL and VH) come together to form a
binding surface. The positions of the complementarity-determining regions 
are striking. These hypervariable sequences, present in three loops of each 
domain, come together so that all six loops form a single surface at the end 
of each arm (Figure 34.12). Because virtually any VL can pair with any VH,
a very large number of different binding sites can be constructed by their 
combinatorial association. 

The results of x-ray crystallographic studies of several hundred large 
and small antigens bound to Fab molecules have been sources of much
insight into the structural basis of antibody specificity. The binding of 
antigens to antibodies is governed by the same principles that govern the 
binding of substrates to enzymes. The interaction between complementary 
shapes results in numerous contacts between amino acids at the binding 
surfaces of both molecules. Many hydrogen bonds, electrostatic interactions, 
and van der Waals interactions, reinforced by hydrophobic interactions, 
combine to give specific and strong binding. 


Figure 34.12 Variable domains. A
side view of the variable domains of the L 
chain (yellow) and the H chain (blue); the 
complementarity-determining regions (CDRs) 
are shown in red. Notice that the six CDRs 
come together to form a binding surface. The 
specificity of the surface is determined by the 
sequences and structures of the CDRs. 
[Drawn from 1DQJ.pdb.] 


A few aspects of antibody binding merit specific attention, inasmuch as 
they relate directly to the structure of immunoglobulins. The binding site 
on the antibody incorporates some or all of the CDRs in the variable 
domains of the antibody. Small molecules are likely to make contact with 
fewer CDRs, with perhaps 15 residues of the antibody participating in the 
binding interaction. Macromolecules often make more extensive contact, 
sometimes interacting with all six CDRs and 20 or more residues of the 
antibody. Small molecules often bind in a cleft of the antigen-binding 
region. Macromolecules, such as globular proteins, tend to interact across 
larger, fairly flat apposed surfaces bearing complementary protrusions and 
depressions. 

A well-studied case of small-molecule binding is seen in an example of 
phosphorylcholine bound to Fab. Crystallographic analysis revealed
phosphorylcholine bound to a cavity lined by residues from five CDRs: two
from the L chain and three from the H chain (Figure 34.13). The positively 
charged trimethylammonium group of phosphorylcholine is buried inside 
the wedge-shaped cavity, where it interacts electrostatically with two nega- 
tively charged residues, a glutamate and an aspartate. The negatively 
charged phosphoryl group of phosphorylcholine binds to the positively 
charged guanidinium group of an arginine residue at the mouth of the crev- 
ice and is hydrogen bonded to the side chain of a nearby tyrosine residue. 
Numerous van der Waals interactions, such as those made by a tryptophan 
side chain, also stabilize this complex. 


Figure 34.13 Binding of a small antigen. The structure of a complex between an Fab fragment
of an antibody and its target, in this case phosphorylcholine. Notice that residues from the antibody
interact with phosphorylcholine through hydrogen bonding and electrostatic interactions. [Drawn
from 2MCP.pdb.]


Figure 34.14 Antibodies against
lysozyme. The structures of three complexes
(i, ii, iii) between Fab fragments (blue and
yellow) and hen egg-white lysozyme (red)
shown with lysozyme in the same orientation
in each case. Notice that the three antibodies
recognize completely different epitopes on
the lysozyme molecule. [Drawn from 3HFL,
1DQJ, and 1FDL.pdb.]


Figure 34.15 Antibody-protein interactions. Notice
that a single residue of lysozyme, glutamine 121, penetrates
more deeply into the antibody combining site. [Drawn from ...]


Residues from five CDRs participate in the binding of phosphorylcholine
to human Fab. This binding does not significantly change the structure
of the antibody, yet induced fit plays a role in the formation of many
antibody-antigen complexes. A malleable binding site can accommodate
many more kinds of ligands than can a rigid one. Thus, induced fit increases
the repertoire of antibody specificities.


Large antigens bind antibodies with numerous interactions


How do large antigens interact with antibodies? A large collection of
antibodies against hen egg-white lysozyme has been structurally characterized
in great detail (Figure 34.14). Each different antibody binds to a distinct
surface of lysozyme. Let us examine the interactions in one of these
complexes (complex ii in Figure 34.14) in detail. This antibody binds two
polypeptide segments of lysozyme that are widely separated in the primary
structure (Figure 34.15).

All six CDRs of the antibody make contact with this epitope.
The region of contact is quite extensive (about 30 × 20 Å). The
apposed surfaces are rather flat. The only exception is the side
chain of glutamine 121 of lysozyme, which penetrates deeply
into the antibody’s binding site, where it forms a hydrogen bond
with a main-chain carbonyl oxygen atom and is surrounded by
three aromatic side chains. The formation of 12 hydrogen bonds
and numerous van der Waals interactions contributes to the high
affinity (Kd = 20 nM) of this antibody-antigen interaction.
Examination of the Fab molecule without bound protein reveals
that the structures of the VL and VH domains change little on
binding, although they slide 1 Å apart to allow more intimate
contact with lysozyme.
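
To give a feel for what a dissociation constant of 20 nM means, the following sketch computes the fraction of combining sites occupied at several free-antigen concentrations and the corresponding standard free energy of binding. It assumes simple one-to-one binding at equilibrium and a temperature of 298 K. Neither the temperature nor the concentrations sampled come from the structural study, and the function names are hypothetical.

```python
import math

KD_M = 20e-9      # dissociation constant from the text: 20 nM
R_KJ = 8.314e-3   # gas constant, kJ mol^-1 K^-1
TEMP_K = 298.0    # assumed temperature (25 degrees C); not given in the text


def fraction_bound(free_antigen_m, kd_m=KD_M):
    """Fraction of combining sites occupied for simple 1:1 binding."""
    return free_antigen_m / (kd_m + free_antigen_m)


def binding_free_energy_kj(kd_m=KD_M, temp_k=TEMP_K):
    """Standard free energy of association, RT ln(Kd / 1 M), in kJ/mol."""
    return R_KJ * temp_k * math.log(kd_m)


for conc_nm in (2, 20, 200):
    occupancy = fraction_bound(conc_nm * 1e-9)
    print(f"[antigen] = {conc_nm:>3} nM -> fraction bound = {occupancy:.2f}")
print(f"standard free energy of binding: {binding_free_energy_kj():.1f} kJ/mol")
```

Half of the sites are occupied when the free-antigen concentration equals Kd, and a tenfold change in concentration on either side moves the occupancy to roughly 9% or 91%.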


34.3 Diversity Is Generated by Gene Rearrangements


A mammal such as a mouse or a human being can synthesize large amounts
of specific antibody against virtually any foreign determinant within a matter
of days after exposure. We have seen that antibody specificity is determined
by the amino acid sequences of the variable regions of both light
mined by the amino acid sequences of the variable regions of both light 
and heavy chains, which brings us to the key question: How are different 
variable-region sequences generated? 

The discovery of distinct variable and constant regions in the L and H 
chains raised the possibility that the genes that encode immunoglobulins 
have an unusual architecture that facilitates the generation of a diverse set of 
polypeptide products. In 1965, William Dreyer and Claude Bennett pro- 
posed that multiple V (variable) genes are separate from a single C (constant) 
gene in embryonic (germ-line) DNA. According to their model, one of these 
V genes becomes joined to the C gene in the course of differentiation of the 
antibody-producing cell. A critical test of this novel hypothesis had to await 
the isolation of pure immunoglobulin mRNA and the development of techniques
for analyzing mammalian genomes. Eleven years later, in 1976, Susumu
Tonegawa found that V and C genes are indeed far apart in embryonic
DNA but are closely associated in the DNA of antibody-producing cells. 
Thus, immunoglobulin genes are rearranged in the differentiation of 
lymphocytes. 


J (joining) genes and D (diversity) genes increase antibody diversity


Figure 34.16 The κ light-chain locus. This part of human chromosome 2 includes
an array of 40 segments that encode the variable (V) region (approximately
residues 1–97) of the light chain, an array of 5 segments that encode the
joining (J) region (residues 98–110), and a single region that encodes the
constant (C) region.

Sequencing studies carried out by Susumu Tonegawa, Philip Leder, and Leroy
Hood revealed that V genes in embryonic cells do not encode the entire variable
region of L and H chains. Consider, for example, the region that encodes
the κ light-chain family. A tandem array of 40 segments, each of which
encodes approximately the first 97 residues of the variable domain of the L
chain, is present on human chromosome 2 (Figure 34.16).
However, the variable region of the L chain contains 110 residues.
Where is the DNA that encodes the last 13 residues of the variable domain?
For L chains in undifferentiated cells, this stretch of DNA is located in an
unexpected place: near the C gene. It is called the J gene because it joins the
V and C gene segments in a differentiated cell. In fact, a tandem array of five 
J gene segments is located near the C gene in embryonic cells. In the 
differentiation of an antibody-producing cell, a V gene segment becomes 
spliced to a J gene segment to form a complete gene for the variable region 
(Figure 34.17). RNA splicing generates an mRNA molecule for the 
complete L chain by linking the coding regions for the rearranged VJ unit
with that for the C unit (Figure 34.18).

Figure 34.17 VJ recombination. A single V gene (in this case, V2) is linked to a J gene (here, J4)
to form an intact VJ region. The intervening DNA is released in a circular form. Because the V and J
regions are selected at random and the joint between them is not always in exactly the same place,
many VJ combinations can be generated by this process.

Figure 34.18 Light-chain expression. The light-chain protein is expressed by the
transcription of the rearranged gene to produce a pre-RNA molecule with the VJ and
C regions separated. RNA splicing removes the intervening sequences to produce an
mRNA molecule with the VJ and C regions linked. Translation of the mRNA and
processing of the initial protein product produce the light chain.

Figure 34.19 V(D)J recombination. The heavy-chain locus includes an array of 51 V
segments, 27 D segments, and 6 J segments. Gene rearrangement begins with D–J joining,
followed by further rearrangement to link the V segment to the DJ segment.

J gene segments are important contributors to antibody diversity because 
they encode part of the last hypervariable segment (CDR3). In forming a 
continuous variable-region gene, any of the 40 V gene segments can become 
linked to any of 5 J gene segments. Thus, somatic recombination of these 
gene segments amplifies the diversity already present in the germ line. The 
linkage between V and J is not precisely controlled. Recombination between 
these genes can take place at one of several bases near the codon for residue 95,
generating additional diversity. A similar array of V and J gene
segments encoding the λ light chain is present on human chromosome 22.
This region includes 30 Vλ gene segments and four Jλ gene segments. In
addition, this region includes four distinct C genes, in contrast with the
single C gene in the κ locus.

In human beings, the genes encoding the heavy chain are present on 
chromosome 14. Remarkably, the variable domain of heavy chains is assembled
from three rather than two segments. In addition to VH gene segments
that encode residues 1 through 94 and JH gene segments that encode residues
98 through 113, this chromosomal region includes a distinct set of
segments that encode residues 95 through 97 (Figure 34.19). These gene
segments are called D (for diversity). Some 27 D gene segments lie between
51 VH and 6 JH gene segments. The recombination process first joins a D
gene segment to a JH gene segment; a VH gene segment is then joined to
DJH. A greater variety of antigen-binding patches and clefts can be formed
by the H chain than by the L chain because the H chain is encoded by three
rather than two gene segments. Moreover, CDR3 of the H chain is diversified
by the action of terminal deoxyribonucleotidyl transferase, a special
DNA polymerase that requires no template. This enzyme inserts extra
nucleotides between VH and D. The V(D)J recombination of both the L and
the H chains is executed by specific enzymes present in immune cells. 
These proteins, called RAG-1 and RAG-2, recognize specific DNA 
sequences called recombination signal sequences (RSSs) adjacent to the V, D, 
and J gene segments and facilitate the cleavage and religation of the DNA 
segments. 
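The order of events in heavy-chain rearrangement can be made concrete with a toy simulation. The sketch below is only an illustration of the steps described above: the segment counts come from the text, whereas the function name, the segment labels, and the upper limit on untemplated nucleotides are assumptions chosen for the example, and no real sequences are modeled.

```python
import random

# Segment counts for the human heavy-chain locus, as quoted in the text.
N_V, N_D, N_J = 51, 27, 6


def rearrange_heavy_chain(rng, max_n_nucleotides=6):
    """Toy model of heavy-chain V(D)J recombination.

    Step 1 joins a randomly chosen D segment to a randomly chosen J segment.
    Step 2 joins a randomly chosen V segment to the DJ unit.
    Untemplated nucleotides (the terminal transferase step) are placed
    between V and D. max_n_nucleotides is an illustrative assumption,
    not a measured value.
    """
    # Step 1: D-J joining.
    d = rng.randrange(1, N_D + 1)
    j = rng.randrange(1, N_J + 1)
    dj_unit = (f"D{d}", f"J{j}")

    # Step 2: V joins the DJ unit, with random nucleotides at the junction.
    v = rng.randrange(1, N_V + 1)
    n_length = rng.randint(0, max_n_nucleotides)
    n_region = "".join(rng.choice("ACGT") for _ in range(n_length))

    return (f"V{v}", n_region) + dj_unit


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs agree
    for _ in range(3):
        print(rearrange_heavy_chain(rng))
```

Each call returns a tuple of the form (V segment, inserted nucleotides, D segment, J segment). Two cells that happen to choose the same three segments can still differ at the junction, which is the point made in the text about CDR3.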


More than 10^8 antibodies can be formed by combinatorial
association and somatic mutation 


Let us recapitulate the sources of antibody diversity. The germ line contains 
a rather large repertoire of variable-region genes. For κ light chains, there
are about 40 V-segment genes and 5 J-segment genes. Hence, a total of 40 ×
5 = 200 kinds of complete Vκ genes can be formed by the combinations of
V and J. A similar analysis suggests that at least 120 different λ light chains
can be generated. A larger number of heavy-chain genes can be formed
because of the role of the D segments. For 51 V, 27 D, and 6 J gene segments,
the number of complete VH genes that can be formed is 51 × 27 × 6 = 8262. The
association of 320 kinds of L chains with 8262 kinds of H chains would yield
2.6 × 10^6 different antibodies. Variability in the exact points of segment
joining and other mechanisms increases this value by at least two orders of 
magnitude. 
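The arithmetic in this recapitulation is easy to check. The short sketch below simply multiplies the segment counts quoted in the text; the variable names are illustrative, and the factor of 100 stands in for the "at least two orders of magnitude" attributed to imprecise joining and other mechanisms.

```python
# Segment counts quoted in the text.
kappa_chains = 40 * 5          # V x J for the kappa locus
lambda_chains = 30 * 4         # V x J for the lambda locus
light_chains = kappa_chains + lambda_chains
heavy_chains = 51 * 27 * 6     # V x D x J for the heavy-chain locus

# Any light chain can pair with any heavy chain.
antibodies = light_chains * heavy_chains

# Lower bound after allowing two orders of magnitude for junctional
# variability and other mechanisms (the factor is taken from the text).
with_junctional_diversity = antibodies * 100

print(f"kappa light chains:  {kappa_chains}")
print(f"lambda light chains: {lambda_chains}")
print(f"light chains total:  {light_chains}")
print(f"heavy chains:        {heavy_chains}")
print(f"L x H combinations:  {antibodies:.1e}")
print(f"with joining effects: at least {with_junctional_diversity:.1e}")
```

The product of 320 light chains and 8262 heavy chains is 2,643,840, which rounds to the 2.6 × 10^6 given above, and the hundredfold increase brings the total past 10^8.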

Even more diversity is introduced into antibody chains by somatic 
mutation—that is, by the introduction of mutations into the recombined 
genes. In fact, a 1000-fold increase in binding affinity is seen in the course 
of a typical humoral immune response, arising from somatic mutation, a 
process called affinity maturation. The generation of an expanded repertoire 
leads to the selection of antibodies that more precisely fit the antigen. Thus, 
nature draws on each of three sources of diversity—a germ-line repertoire, 
somatic recombination, and somatic mutation—to form the rich variety of 
antibodies that protect an organism from foreign incursions. 
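To put the 1000-fold gain in affinity on an energy scale, one can use the standard relation between an equilibrium constant and the standard free energy of binding, ΔG° = −RT ln K. The sketch below is a worked example under stated assumptions: the temperature of 298.15 K and the function name are choices made for illustration and do not come from the text.

```python
import math

R_KCAL = 1.987e-3     # gas constant in kcal mol^-1 K^-1
TEMPERATURE = 298.15  # assumed temperature in kelvins (25 degrees C)


def binding_energy_change(fold_increase, temperature=TEMPERATURE):
    """Change in standard free energy of binding (kcal/mol) that
    corresponds to a given fold increase in the association constant.

    Follows from delta-G = -RT ln K applied to the ratio of two constants.
    """
    return -R_KCAL * temperature * math.log(fold_increase)


if __name__ == "__main__":
    change = binding_energy_change(1000)
    print(f"1000-fold tighter binding: {change:.2f} kcal/mol")
```

Under these assumptions, a 1000-fold increase corresponds to roughly 4 kcal/mol of additional binding energy, an amount comparable to that contributed by a small number of favorable noncovalent contacts.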


The oligomerization of antibodies expressed on the surfaces of 
immature B cells triggers antibody secretion 


The processes heretofore described generate a highly diverse set of antibody 
molecules—a key first step in the generation of an immune response. The 
next stage is the selection of a particular set of antibodies directed against a 
specific invader. How is this selection accomplished? Each immature B cell, 
produced in the bone marrow, expresses a specific monomeric form of IgM 
attached to its surface (Figure 34.20). Each cell expresses approximately 10^5
IgM molecules, but all of these molecules are identical in amino acid sequence 
and, hence, in antigen-binding specificity. Thus, the selection of a particular 
immature B cell for growth will lead to the amplification of an antibody with 
a unique specificity. The selection process begins with the binding of an 
antigen to the membrane-bound antibody. 

Associated with each membrane-linked IgM molecule are two molecules 
of a heterodimeric membrane protein called Ig-α–Ig-β (see Figure 34.20).
Examination of the amino acid sequences of Ig-α and Ig-β
is highly instructive. The amino terminus of each protein 
lies outside the cell and corresponds to a single immuno- 
globulin fold, and the carboxyl terminus, which lies inside 
the cell, includes a sequence of 18 amino acids called an 
immunoreceptor tyrosine-based activation motif (ITAM; see 
Figure 34.20). As its name suggests, each ITAM includes 
key tyrosine residues, which are subject to phosphory- 
lation by particular protein kinases present in immune- 
system cells. 

A fundamental observation with regard to the mecha- 
nism by which the binding of antigen to membrane-bound 
antibody triggers the subsequent steps of the immune 
response is that oligomerization or clustering of the antibody 
molecules is required (Figure 34.21). The requirement for 
oligomerization is reminiscent of the dimerization of 
receptors triggered by epidermal growth factor and insulin
encountered in Section 14.3; indeed, the associated signaling
mechanisms appear to be quite similar. The oligomerization
of the membrane-bound antibodies results in the
phosphorylation of the tyrosine residues within the ITAMs
by protein tyrosine kinases including Lyn, a homolog of
Src (Section 14.5). The phosphorylated ITAMs serve as
docking sites for a protein kinase termed spleen tyrosine
kinase (Syk), which has two SH2 domains that interact
with the pair of phosphorylated tyrosine residues in each
ITAM. Syk, when activated by phosphorylation, proceeds to phosphorylate
other signal-transduction proteins including an inhibitory subunit of a
transcription factor called NF-κB and an isoform of phospholipase C. The
signaling processes continue downstream to activate gene expression, leading
to the stimulation of cell growth and initiating further B-cell differentiation.

Figure 34.20 B-cell receptor. This complex consists of a membrane-bound IgM
molecule noncovalently bound to two Ig-α–Ig-β heterodimers. The intracellular
domains of each of the Ig-α and Ig-β chains include an immunoreceptor
tyrosine-based activation motif (ITAM).

Figure 34.21 B-cell activation. The binding of a multivalent antigen
such as a bacterial or viral surface links membrane-bound IgM
molecules. This oligomerization triggers the phosphorylation of tyrosine
residues in the ITAM sequences by protein tyrosine kinases such as
Lyn. After phosphorylation, the ITAMs serve as docking sites for Syk, a
protein kinase that phosphorylates a number of targets, including
transcription factors.


Drugs that modulate the immune system have served as sources of
insight into immune-system signaling pathways. For example,
cyclosporin, a powerful suppressor of the immune system, acts by blocking
a phosphatase called calcineurin, which normally activates a transcription
factor called NF-AT by dephosphorylating it.
The potent inhibition of the immune system, or immunosuppression, induced
by cyclosporin reveals how crucial the activity of NF-AT is to the develop- 
ment of an immune response. Without such drugs, organ transplantation 
would be extremely difficult because transplanted tissue expresses a wide 
range of foreign antigens, causing the host immune system to reject the new 
tissue. 

The role of oligomerization in the B-cell signaling pathway is illuminated 
when we consider the nature of many antigens presented by pathogens. The 
surfaces of many viruses, bacteria, and parasites are characterized by arrays 
of identical membrane proteins or membrane-linked carbohydrates. Thus, 
most pathogens present multiple binding surfaces that will naturally cause 
membrane-associated antibodies to oligomerize as they bind adjacent 
epitopes. In addition, the mechanism accounts for the observation that most 
small molecules do not induce an immune response; however, coupling 
multiple copies of a small molecule to a large oligomeric protein such as 
keyhole limpet hemocyanin (KLH), which has a molecular mass of close to 
1 million daltons or more, promotes antibody oligomerization and, hence, the 
production of antibodies against the small-molecule epitope. The large protein 
is called the carrier of the attached chemical group, which is called a haptenic 
determinant. The small foreign molecule by itself is called a hapten. Antibodies 
elicited by attached haptens will bind unattached haptens as well. 


Different classes of antibodies are formed by the hopping of VH genes


The development of an effective antibody-based immune response depends 
on the secretion into the blood of antibodies that have appropriate effector 
functions. At the beginning of this response, an alternative mRNA-splicing
pathway is activated so that the production of membrane-linked IgM is
supplanted by the synthesis of secreted IgM. As noted in Section 33.1,
secreted IgM is pentameric and has a high avidity for antigens containing
multiple identical epitopes. Later, the antibody-producing cell makes either
IgG, IgA, IgD, or IgE of the same specificity as that of the initially secreted
IgM. In this switch, the light chain and the variable region of the heavy
chain are unchanged. Only the constant region of the heavy chain changes.
This step in the differentiation of an antibody-producing cell is called class
switching (Figure 34.22). In undifferentiated cells, the genes for the constant
region of each class of heavy chain, called Cμ, Cδ, Cγ, Cε, and Cα, are next
to one another. There are eight in all, including four genes for the constant
regions of γ chains. A complete gene for the heavy chains of IgM antibody
is formed by the translocation of a VH gene segment to a DJH gene
segment.

How are other heavy chains formed? Class switching is mediated by a 
gene-rearrangement process that moves a VDJ gene from a site near one C 
gene to a site near another C gene. Importantly, the antigen-binding specificity
is conserved in class switching because the entire VHDJH gene is translocated in
an intact form. For example, the antigen-combining specificity of IgA
produced by a particular cell is the same as that of IgM synthesized at an
earlier stage of its development. The biological significance of CH switching
is that a whole recognition domain (the variable domain) is shifted from the
early constant region (Cμ) to one of several other constant regions that
mediate different effector functions. 


34.4 Major-Histocompatibility-Complex Proteins Present 
Peptide Antigens on Cell Surfaces for Recognition 
by T-Cell Receptors 


Soluble antibodies are highly effective against extracellular 
pathogens, but they confer little protection against microorgan- 
isms that are predominantly intracellular, such as some viruses 
and mycobacteria (which cause tuberculosis and leprosy). These 
pathogens are shielded from antibodies by the host-cell mem- 
brane (Figure 34.23). A different and more subtle strategy, cell- 
mediated immunity, evolved to cope with intracellular viral 
pathogens. T cells continually scan the surfaces of all cells and 
kill those that exhibit foreign markings. The task is not simple; 
intracellular microorganisms are not so obliging as to intention- 
ally leave telltale traces on the surface of their host. Quite the 
contrary, successful pathogens are masters of the art of 
camouflage. Vertebrates have evolved an ingenious mecha- 
nism—cut and display—to reveal the presence of stealthy 
intruders. Nearly all vertebrate cells exhibit on their surfaces a
sample of peptides derived from the digestion of proteins in their
cytoplasm. These peptides are displayed by integral membrane proteins
that are encoded by the major histocompatibility complex (MHC). Specifically,
peptides derived from cytoplasmic proteins are bound to and displayed by
class I MHC proteins. The dendritic cells of the innate immune system that
subject pathogens to phagocytosis migrate to lymphatic tissue, where they
use an MHC-like mechanism to present foreign peptides or lipid components
to T cells, thus linking the innate and adaptive immune responses to
pathogens.

Figure 34.22 Class switching. Further rearrangement of the heavy-chain locus
results in the generation of genes for antibody classes other than IgM. In the
case shown, rearrangement places the VDJ region next to the Cγ1 region,
resulting in the production of IgG1. Note that no further rearrangement of
the VDJ region takes place, and so the specificity of the antibody is not
affected.

Figure 34.23 Intracellular pathogen. An electron micrograph showing
mycobacteria (arrows) inside an infected macrophage. [Courtesy of Dr. Stanley
Falkow.]

Figure 34.24 Presentation of peptides from cytoplasmic proteins. Class I MHC
proteins on the surfaces of most cells display peptides that are derived from
cytoplasmic proteins by proteolysis.

Figure 34.25 Class I MHC protein. A protein of this class consists of two
chains. Notice that the α chain begins with two domains (α1, α2) that include
α helices and continues with an immunoglobulin domain (α3), a transmembrane
domain, and a cytoplasmic tail. The second chain, β2-microglobulin, adopts an
immunoglobulin fold. [Drawn from 1HHK.pdb.]

How are these peptides generated and delivered to the plasma mem- 
brane? The process starts in the cytoplasm with the degradation of proteins— 
self-proteins as well as those of pathogens (Figure 34.24). Digestion is 
carried out by proteasomes (Section 23.2). The resulting peptide fragments 
are transported from the cytoplasm into the lumen of the endoplasmic 
reticulum by the TAP protein (for transporter associated with antigen pro- 
cessing), a member of the ABC transporter family of ATP-driven pumps 
(Section 13.2). In the ER, peptides combine with nascent class I MHC 
proteins; these complexes are then targeted to the plasma membrane. 

MHC proteins embedded in the plasma membrane tenaciously grip 
their bound peptides so that they can be touched and scrutinized by T-cell 
receptors on the surface of a killer cell. Foreign peptides bound to class I 
MHC proteins signal that a cell is infected and mark it for destruction by 
cytotoxic T cells. An assembly consisting of the foreign peptide–MHC
complex, the T-cell receptor, and numerous accessory proteins triggers a 
cascade that induces apoptosis in the infected cell. Strictly speaking, infected 
cells are not killed but, instead, are triggered to commit suicide to aid the 
organism. 


Peptides presented by MHC proteins occupy a deep groove 
flanked by alpha helices 


The three-dimensional structure of a large extracellular fragment of a 
human MHC class I protein, human leukocyte antigen A2 (HLA-A2), was 
solved in 1987 by Don Wiley and Pamela Bjorkman. Class I MHC proteins 
consist of a 44-kd α chain noncovalently bound to a 12-kd polypeptide
called β2-microglobulin. The α chain has three extracellular domains (α1, α2,
and α3), a transmembrane segment, and a tail that extends into the cytoplasm
(Figure 34.25). The β2-microglobulin and the α3 domains have immunoglobulin
folds, although the pairing of the two domains differs from that in
antibodies. The α1 and α2 domains exhibit a novel and remarkable architecture.
They associate intimately to form a deep groove that serves as the
peptide-binding site (Figure 34.26). The floor of the groove, which is about
25 Å long and 10 Å wide, is formed by eight β strands, four from each
domain. A long helix contributed by the α1 domain forms one side, and a
helix contributed by the α2 domain forms the other side. This groove is the
binding site for the presentation of peptides.

Figure 34.26 Class I MHC peptide-binding site. The α1 and α2 domains come together
to form a groove in which peptides are displayed. Notice that the peptide is surrounded on
three sides by a β sheet and two α helices, but it is accessible from the top of the structure.
[Drawn from 1HHK.pdb.]

The groove can be filled by a peptide from 8 to 10 residues long in an
extended conformation. As we shall see (p. 998), MHC proteins are remarkably
diverse in the human population; each person expresses as many as six
distinct class I MHC proteins, and many different forms are present in different
people. The first structure determined, HLA-A2, binds peptides
that almost always have leucine in the second position and valine in the last
position (Figure 34.27). Side chains from the MHC molecule interact with
the amino and carboxyl termini and with the side chains in these two key
positions. These two residues are often referred to as the anchor residues.
The other residues are highly variable. Thus, many millions of different
peptides can be presented by this particular class I MHC protein; the identities
of only two of the nine residues are crucial for binding. Each class of
MHC molecules requires a unique set of anchor residues. Thus, a tremendous
range of peptides can be presented by these molecules. Note that one
face of the bound peptide is exposed to solution, where it can be examined by
other molecules, particularly T-cell receptors. An additional remarkable feature
of MHC–peptide complexes is their kinetic stability; once bound, a
peptide is not released, even after a period of days.

Figure 34.27 Anchor residues. (A) The amino acid sequences of three peptides that
bind to the class I MHC protein HLA-A2 are shown. Each of these peptides has leucine in
the second position and valine in the carboxyl-terminal position. (B) Comparison of the
structures of these peptides reveals that the amino and carboxyl termini, as well as the
side chains of the leucine and valine residues, are in essentially the same positions in each
peptide, whereas the remainder of the structures are quite different.

Figure 34.28 T-cell receptor. This protein consists of an α chain and a β chain,
linked by a disulfide bond. Notice that each chain consists of two immunoglobulin
domains on the cell surface, a membrane-spanning domain, and a short cytoplasmic tail.
[Drawn from 1BD2.pdb.]

Figure 34.29 T-cell receptor–class I MHC complex. The T-cell receptor binds to
a class I MHC protein containing a bound peptide. Notice that the T-cell receptor
contacts both the MHC protein and the peptide. [Drawn from 1BD2.pdb.]
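The anchor-residue rule for HLA-A2 described above can be written as a simple test on a peptide sequence. The sketch below is a deliberately simplified illustration: the function name is invented for the example, the peptide sequences are hypothetical rather than taken from any figure or database, and the rule ignores the "almost always" qualification in the text.

```python
def fits_hla_a2_anchor_motif(peptide):
    """Return True if a peptide (one-letter code) satisfies the simplified
    HLA-A2 rule from the text: 8 to 10 residues long, leucine (L) in the
    second position, and valine (V) in the last position.
    """
    peptide = peptide.upper()
    return (
        8 <= len(peptide) <= 10
        and peptide[1] == "L"
        and peptide[-1] == "V"
    )


# With two of nine positions fixed, seven positions remain free to vary
# among the 20 standard amino acids.
NINE_RESIDUE_PEPTIDES = 20 ** 7

if __name__ == "__main__":
    # Hypothetical sequences chosen only to exercise the rule.
    for candidate in ["ALAGSTKEV", "AAAGSTKEV", "ALAGSTKEL", "ALV"]:
        print(candidate, fits_hla_a2_anchor_motif(candidate))
    print(f"nine-residue peptides that fit the motif: {NINE_RESIDUE_PEPTIDES:.2e}")
```

Counting the free positions gives 20^7, or about 1.3 × 10^9, nine-residue sequences that satisfy the motif, which is consistent with the statement that many millions of different peptides can be presented by this one protein.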


T-cell receptors are antibody-like proteins containing 
variable and constant regions 


We are now ready to consider the receptor that recognizes peptides dis- 
played by MHC proteins on target cells. The T-cell receptor consists of a 
43-kd α chain joined by a disulfide bond to a 43-kd β chain (Figure 34.28).
Each chain spans the plasma membrane and has a short carboxyl-terminal
region on the cytoplasmic side. A small proportion of T cells express a
receptor consisting of γ and δ chains in place of α and β. The α and β chains
of the T-cell receptor, like immunoglobulin L and H chains, consist of variable
and constant regions. Indeed, these domains of the T-cell receptor are
homologous to the V and C domains of immunoglobulins. Furthermore, hypervariable
sequences present in the V regions of the α and β chains of the
T-cell receptor form the binding site for the epitope.

The genetic architecture of these proteins is similar to that of immuno- 
globulins, though the antibody genetic diversity is distributed over all the 
CDR loops, whereas T-cell-receptor genetic diversity is concentrated in the 
CDR3 loop that interacts with the peptide bound to the MHC. The variable 
region of the T-cell receptor a chain is encoded by about 50 V gene seg- 
ments and 70 J gene segments. The T-cell receptor B chain is encoded by 
two D gene segments in addition to 57 V gene segments and 13 J gene seg- 
ments. Again, the diversity of gene segments and the use of slightly impre- 
cise modes of joining them increase the number of distinct proteins formed. 
At least 10^12 different specificities could arise from combinations of this repertoire
of genes. Thus, T-cell receptors, like immunoglobulins, can recognize a
very large number of different epitopes. All the receptors on a particular T 
cell have the same specificity. 
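The same kind of counting used for antibodies can be applied to the T-cell receptor. The sketch below multiplies the segment counts quoted above; the dictionary layout and function name are illustrative choices, and the comparison with 10^12 is offered only to show how much of the total must come from sources other than segment choice.

```python
import math

# Gene-segment counts for the T-cell receptor, as quoted in the text.
ALPHA_SEGMENTS = {"V": 50, "J": 70}
BETA_SEGMENTS = {"V": 57, "D": 2, "J": 13}


def segment_combinations(segment_counts):
    """Number of distinct ways to pick one segment of each kind."""
    return math.prod(segment_counts.values())


alpha_chains = segment_combinations(ALPHA_SEGMENTS)
beta_chains = segment_combinations(BETA_SEGMENTS)
receptor_pairs = alpha_chains * beta_chains

if __name__ == "__main__":
    print(f"alpha chains from V x J:      {alpha_chains}")
    print(f"beta chains from V x D x J:   {beta_chains}")
    print(f"alpha-beta pairs:             {receptor_pairs:.1e}")
    print(f"shortfall relative to 10^12:  {1e12 / receptor_pairs:.1e}-fold")
```

Segment choice alone gives about 5 × 10^6 receptor pairs. On this accounting, most of the estimated diversity would have to arise from the slightly imprecise modes of joining mentioned in the text rather than from the number of gene segments.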

How do T cells recognize their targets? The variable regions of the α and
β chains of the T-cell receptor form a binding site that recognizes a combined
epitope: foreign peptide bound to an MHC protein (Figure 34.29).
Neither the foreign peptide alone nor the MHC protein alone forms a 
complex with the T-cell receptor. Thus, fragments of an intracellular patho- 
gen are presented in a context that allows their detection, leading to the 
initiation of an appropriate response. 


CD8 on cytotoxic T cells acts in concert with T-cell receptors 


The T-cell receptor does not act alone in recognizing and mediating the fate 
of target cells. Cytotoxic T cells also express a protein termed CD8 on their 
surfaces that is crucial for the recognition of the class I MHC–peptide complex.
The abbreviation CD stands for cluster of differentiation, referring to a
cell-surface marker that is used to identify a lineage or stage of differentia- 
tion. Antibodies specific for particular CD proteins have been invaluable in 
following the development of leukocytes and in discovering new interac- 
tions between specific cell types. 

Each chain in the CD8 dimer contains a domain that resembles an 
immunoglobulin variable domain (Figure 34.30). CD8 interacts primarily 
with the constant α3 domain of class I MHC proteins. This interaction
further stabilizes the interactions between the T cell and its target. The
cytoplasmic tail of CD8 contains a docking site for Lck, a cytoplasmic
tyrosine kinase akin to Src. The T-cell receptor itself is associated with six
polypeptides that form the CD3 complex (Figure 34.31). The γ, δ, and ε
chains of CD3 are homologous to Ig-α and Ig-β associated with the B-cell
receptor (see Figure 34.20); each chain consists of an extracellular immunoglobulin
domain and an intracellular ITAM region. These chains associate
into CD3-γε and CD3-δε heterodimers. An additional component, the
CD3-ζ chain, has only a small extracellular domain and a larger intracellular
domain containing three ITAM sequences.

Figure 34.30 The coreceptor CD8. This dimeric protein extends from the surface
of a cytotoxic T cell and binds to class I MHC molecules that are expressed on the
surface of the cell that is bound to the T cell. The dashed lines represent extended
polypeptide chains that link the immunoglobulin domains of CD8 to the membrane.
Notice that the coreceptor interacts primarily with the constant α3 domain of the
class I MHC protein. [Drawn from 1AKJ.pdb.]

On the basis of these components, a model for T-cell activation can 
be envisaged that is closely parallel to the pathway for B-cell activation 
(Section 34.3; Figure 34.32). The binding of the T-cell receptor to the class 
I MHC-peptide complex and the concomitant binding of CD8 from the 
T-cell to the MHC molecule link the kinase Lck to the ITAM substrates of 
the components of the CD3 complex. The phosphorylation of the tyrosine 
residues in the ITAM sequences generates docking sites for a protein kinase 
called ZAP-70 (for 70-kd zeta-associated protein) that is homologous to 
Syk in B cells. Docked by its two SH2 domains, ZAP-70 phosphorylates
downstream targets in the signaling cascade. Additional molecules, including
a membrane-bound protein phosphatase called CD45 and a cell-surface
protein called CD28, play ancillary roles in this process.

Figure 34.31 T-cell receptor complex. The T-cell receptor is associated with six CD3
molecules: a CD3-γ–CD3-ε heterodimer, a CD3-δ–CD3-ε heterodimer, and two chains of
CD3-ζ. Single ITAM sequences are present in the cytoplasmic domains of CD3-γ, CD3-δ,
and CD3-ε, whereas three such sequences are found in each CD3-ζ chain.

Figure 34.32 T-cell activation. The interaction between the T-cell receptor and a
class I MHC–peptide complex results in the binding of CD8 to the MHC protein, the
recruitment of the protein tyrosine kinase Lck, and the phosphorylation of tyrosine
residues in the ITAM sequences of the CD3 chains. After phosphorylation, the ITAM
regions serve as docking sites for the protein kinase ZAP-70, which phosphorylates
protein targets to transmit the signal.

T-cell activation has two important consequences. First, the activation
of cytotoxic T cells results in the secretion of several proteins, including
perforin and granzymes. Perforin is a 70-kd protein that destabilizes the
plasma membrane of the target cell, enabling the entry of granzymes into the
cytoplasm of the target cell. Granzymes are serine proteases (Section 9.1)
that initiate the pathway of apoptosis, leading to the death of the target cell
and the fragmentation of its DNA, including any viral DNA that may be
present. Second, after it has stimulated its target cell to commit suicide, the
activated T cell disengages and is stimulated to reproduce. Thus, additional
T cells that express the same T-cell receptor are generated to continue the
battle against the invader after these T cells have been identified as a suitable
weapon.


Helper T cells stimulate cells that display foreign peptides
bound to class II MHC proteins


Not all T cells are cytotoxic. Helper T cells, a different class, stimulate the
proliferation of specific B lymphocytes and cytotoxic T cells and thereby serve
as partners in determining the immune responses that are produced. The
importance of helper T cells is graphically revealed by the devastation
wrought by AIDS, a condition that destroys these cells. Helper T cells, like
cytotoxic T cells, detect foreign peptides that are presented on cell surfaces
by MHC proteins. However, the source of the peptides, the MHC proteins
that bind them, and the transport pathway are different.

Helper T cells recognize peptides bound to MHC molecules
referred to as class II. Their helping action is focused on B cells,
macrophages, and dendritic cells. Class II MHC proteins are
expressed only by these antigen-presenting cells, unlike class I MHC
proteins, which are expressed on nearly all cells. The peptides
presented by class II MHC proteins do not come from the cytoplasm.
Rather, they arise from the degradation of proteins that have
been internalized by endocytosis. Consider, for example, a virus
particle that is captured by membrane-bound immunoglobulins
on the surface of a B cell (Figure 34.33). This complex is delivered
to an endosome, a membrane-enclosed acidic compartment,
where it is digested. The resulting peptides become associated
with class II MHC proteins, which move to the cell surface.
Peptides from the cytoplasm cannot reach class II proteins,
whereas peptides from endosomal compartments cannot reach
class I proteins. This segregation of displayed peptides is biologically
critical. The association of a foreign peptide with a class II
MHC protein signals that a cell has encountered a pathogen and
serves as a call for help. In contrast, association with a class I
MHC protein signals that a cell has succumbed to a pathogen and
is a call for destruction.

Figure 34.33 Presentation of peptides from internalized proteins. Antigen-presenting
cells bind and internalize foreign proteins and display peptides that are formed from
the digestion of these proteins in class II MHC proteins.


Helper T cells rely on the T-cell receptor and CD4 to 
recognize foreign peptides on antigen-presenting cells 


The overall structure of a class II MHC molecule is remarkably similar to 
that of a class I molecule. Class II molecules consist of a 33-kd α chain and 
a noncovalently bound 30-kd β chain (Figure 34.34). Each contains two 
extracellular domains, a transmembrane segment, and a short cytoplasmic 
tail. The peptide-binding site is formed by the α1 and β1 domains, each of 
which contributes a long helix and part of a β sheet. Thus, the same structural 
elements are present in class I and class II MHC molecules, but they 
are combined into polypeptide chains in different ways. The peptide-binding 
site of a class II molecule is open at both ends, and so this groove can accommodate 
longer peptides than can be bound by class I molecules; typically, 
peptides between 13 and 18 residues long are bound. The peptide-binding 
specificity of each class II molecule depends on binding pockets that 
recognize particular amino acids, also known as anchor residues, in specific 
positions along the sequence. 
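
The way anchor residues restrict which peptides can be presented can be illustrated with a short sketch. The motif and the sequence below are hypothetical and chosen only for illustration; real class II binding pockets differ from one MHC allele to another and are not specified in the text. The function simply enumerates every window of 13 to 18 residues and keeps those whose residues at the chosen positions match the assumed pockets.

```python
def candidate_peptides(sequence, anchors, min_len=13, max_len=18):
    """Return (start, peptide) pairs whose residues satisfy every anchor.

    anchors maps a 0-based position within the peptide to the set of
    residues accepted by the (hypothetical) binding pocket at that position.
    """
    hits = []
    for length in range(min_len, max_len + 1):
        for start in range(len(sequence) - length + 1):
            peptide = sequence[start:start + length]
            if all(peptide[pos] in allowed for pos, allowed in anchors.items()):
                hits.append((start, peptide))
    return hits


# Hypothetical motif: a hydrophobic residue at the first position and a
# basic residue at the ninth position of the bound peptide.
HYPOTHETICAL_ANCHORS = {0: set("LIVFM"), 8: set("KR")}

# Made-up 19-residue sequence, not taken from any real protein.
EXAMPLE_SEQUENCE = "GLSTEQAAGKDPEWTAVNQ"

if __name__ == "__main__":
    for start, peptide in candidate_peptides(EXAMPLE_SEQUENCE, HYPOTHETICAL_ANCHORS):
        print(start, len(peptide), peptide)
```

Because the groove is open at both ends, several windows of different lengths that share the same anchor positions all pass the filter in this sketch, which mirrors the range of peptide lengths described above.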

Helper T cells express T-cell receptors that are produced from the 
same genes as those on cytotoxic T cells. These T-cell receptors interact 
with class II MHC molecules in a manner that is analogous to T-cell-receptor 
interaction with class I MHC molecules. Nonetheless, helper T cells and 
cytotoxic T cells are distinguished by other proteins that they express on 
their surfaces. In particular, helper T cells express a protein called CD4 
instead of expressing CD8. CD4 consists of four immunoglobulin domains 
that extend from the T-cell surface, as well as a small cytoplasmic region 
(Figure 34.35). The amino-terminal immunoglobulin domains of CD4 
interact with the base of the class II MHC molecule. Thus, helper T cells 
bind cells expressing class II MHC specifically because of the interactions 
with CD4 (Figure 34.36). 


[Figure 34.36 panel labels: (A) Cell presenting cytoplasmically derived foreign peptide in class I MHC; cytotoxic T cell. (B) Antigen-presenting cell; class II MHC; peptide; helper T cell.] 


Figure 34.34 Class II MHC protein. A 
class II MHC protein consists of homologous 
α and β chains, each of which has an amino- 
terminal domain that constitutes half of the 
peptide-binding structure, as well as a 
carboxyl-terminal immunoglobulin domain. 
Notice the trough-like peptide-binding site, 
which is similar to that in class I MHC proteins 
except that it is open at both ends, allowing 
class II MHC proteins to bind longer peptides 
than those bound by class I. [Drawn from 
1DLH.pdb.] 


Figure 34.35 Coreceptor CD4. 
This protein comprises four tandem 
immunoglobulin domains that extend from 
the surface of a helper T cell. [Drawn from 
1WIO.pdb.] 


Figure 34.36 Variations on a theme. 
(A) Cytotoxic T cells recognize foreign peptides 
presented in class I MHC proteins with the aid 
of the coreceptor CD8. (B) Helper T cells 
recognize peptides presented in class II MHC 
proteins by specialized antigen-presenting cells 
with the aid of the coreceptor CD4. 


Figure 34.37 Helper-T-cell action. The 
engagement of the T-cell receptor in helper T 
cells results in the secretion of cytokines. 
These cytokines bind to cytokine receptors 
expressed on the surface of the antigen- 
presenting cell, stimulating cell growth, 
differentiation, and, in regard to a B cell, 
antibody secretion. 


When a helper T cell binds to an antigen-presenting cell expressing an 
appropriate class II MHC-peptide complex, signaling pathways analogous 
to those in cytotoxic T cells are initiated by the action of the kinase Lck on 
ITAMs in the CD3 molecules associated with the T-cell receptor. However, 
rather than triggering events leading to the death of the attached cell, these 
signaling pathways result in the secretion of cytokines from the helper cell. 
Cytokines are a family of molecules that includes, among others, interleukin-2 
and interferon-γ. Cytokines bind to specific receptors on the antigen-presenting 
cell and stimulate growth, differentiation, and, in regard to plasma 
cells, which are derived from B cells, antibody secretion (Figure 34.37). 
Thus, the internalization and presentation of parts of a foreign pathogen 
help to generate a local environment in which cells taking part in the defense 
against this pathogen can flourish through the action of helper T cells. 


MHC proteins are highly diverse 


MHC class I and II proteins, the presenters of peptides to T cells, 
were discovered because of their role in transplantation rejection. A 
tissue transplanted from one person to another or from one mouse to another 
is usually rejected by the immune system. In contrast, tissues transplanted 
from one identical twin to another or between mice of an inbred strain 
are accepted. Genetic analyses revealed that rejection occurs when tissues 
are transplanted between individual organisms having different genes in the 
major histocompatibility complex, a cluster of more than 75 genes playing 
key roles in immunity. The 3500-kb span of the MHC is nearly the length 
of the entire E. coli chromosome. The MHC encodes class I proteins (presenters 
to cytotoxic T cells) and class II proteins (presenters to helper T 
cells), as well as class III proteins (components of the complement cascade) 
and many other proteins that play key roles in immunity. 

Human beings express six different class I genes (three from each parent) 
and six different class II genes. The three loci for class I genes are called 
HLA-A, -B, and -C; those for class II genes are called HLA-DP, -DQ, and 
-DR. These loci are highly polymorphic: many alleles of each are present in 
the population. For example, more than 50 each of HLA-A, -B, and -C 
alleles are known; the numbers discovered increase each year. Hence, the 
likelihood that two unrelated persons have identical class I and II proteins is 
very small (<10⁻⁴), accounting for transplantation rejection unless the 
genotypes of donor and acceptor are closely matched in advance. 

Differences between class I proteins are located mainly in the α1 and α2 
domains, which form the peptide-binding site (Figure 34.38). The α3 
domain, which interacts with a constant β2-microglobulin, is largely conserved. 
Similarly, the differences between class II proteins cluster near the 
peptide-binding groove. Why are MHC proteins so highly variable? Their 
diversity makes the presentation of a very wide range of peptides to T cells 
possible. A particular class I or class II molecule may not be able to bind any 
of the peptide fragments of a viral protein. The likelihood of a fit is markedly 
increased by having several kinds (usually six) of each class of presenters in 
each individual organism. If all members of a species had identical class I or 
class II molecules, the population would be much more vulnerable to devastation 
by a pathogen that had mutated and thereby evaded presentation. 
The evolution of the diverse human MHC repertoire has been driven by the 
selection for individual members of the species who resist infections to 
which other members of the population may be susceptible. 
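
The benefit of carrying several kinds of presenters can be made concrete with a small calculation. The sketch below assumes, purely for illustration, that each kind of MHC molecule has the same independent probability p of binding at least one peptide from a given pathogen; neither the independence assumption nor any particular value of p comes from the text. Under that assumption, the chance that at least one of n presenters fits is 1 - (1 - p)^n.

```python
def prob_at_least_one_fit(p_single, n_presenters):
    """Probability that at least one of n presenters binds a pathogen peptide.

    Assumes each presenter binds independently with probability p_single
    (an illustrative simplification, not a measured quantity).
    """
    if not 0.0 <= p_single <= 1.0:
        raise ValueError("p_single must lie between 0 and 1")
    if n_presenters < 0:
        raise ValueError("n_presenters must be non-negative")
    return 1.0 - (1.0 - p_single) ** n_presenters


if __name__ == "__main__":
    # Compare a single presenter with the six per class mentioned in the text,
    # for a few hypothetical single-presenter probabilities.
    for p in (0.1, 0.3, 0.5):
        print(p, prob_at_least_one_fit(p, 1), prob_at_least_one_fit(p, 6))
```

Whatever value of p is assumed, the probability of a fit rises with the number of distinct presenters, which is the qualitative point made in the paragraph above.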


Human immunodeficiency viruses subvert the 
immune system by destroying helper T cells 


In 1981, the first cases of a new disease now called acquired immune 
deficiency syndrome (AIDS) were recognized. The victims died of rare 
infections because their immune systems were crippled. The cause was 
identified 2 years later by Luc Montagnier and coworkers. AIDS is pro- 
duced by human immunodeficiency virus (HIV), of which two major classes 
are known: HIV-1 and the much less common HIV-2. Like other retroviruses, 
HIV contains a single-stranded RNA genome that is replicated through a 
double-stranded DNA intermediate. This viral DNA becomes integrated 
into the genome of the host cell. In fact, viral genes are transcribed only 
when they are integrated into the host DNA. 

The HIV virion is enveloped by a lipid-bilayer membrane containing 
two glycoproteins: gp41 spans the membrane and is associated with gp120, 
which is located on the external face (Figure 34.39). The core of the virus 
contains two copies of the RNA genome and associated transfer RNAs, as 
well as several molecules of reverse transcriptase. They are surrounded by 
many copies of two proteins called p18 and p24. The host cell for HIV is the 
helper T cell. The gp120 molecules on the membrane of HIV bind to CD4 
molecules on the surface of the helper T cell (Figure 34.40). This interaction 
allows the associated viral gp41 to insert its amino-terminal head into the 
host-cell membrane. The viral membrane and the helper-T-cell membrane 
fuse, and the viral core is released directly into the cytoplasm. Infection by 


Figure 34.40 HIV receptor. 
A complex between a modified 
form of the envelope glycoprotein 
gp120 from HIV and a peptide 
corresponding to the two 
amino-terminal domains from the 
helper-T-cell protein CD4 reveals 
how viral infection of helper T cells 
is initiated. [Drawn from 1GC1.pdb.] 


Figure 34.38 Polymorphism in class 
I MHC proteins. Notice that the positions of 
sites with a high degree of polymorphism in 
the human population are displayed as red 
spheres on the structure of the amino- 
terminal part of a class I MHC protein. [Drawn 
from 1HHK.pdb.] 


Figure 34.39 Human immunodeficiency 
virus. A schematic representation of HIV 
reveals its proteins and nucleic acid 
components. The membrane-envelope 
glycoproteins gp41 and gp120 are shown in 
dark and light green. The viral RNA is shown 
in red, and molecules of reverse transcriptase 
are shown in blue. [After R. C. Gallo. The AIDS 
virus. Copyright © 1987 by Scientific 
American, Inc. All rights reserved.] 


Figure 34.41 T-cell selection. A population 
of thymocytes is subjected first to positive 
selection to remove cells that express T-cell 
receptors that will not bind to MHC proteins 
expressed by the individual organism. The 
surviving cells are then subjected to negative 
selection to remove cells that bind strongly to 
MHC complexes bound to self-peptides. 


HIV leads to the destruction of helper T cells because the permeability of 
the host plasma membrane is markedly increased by the insertion of viral 
glycoproteins and the budding of virus particles. The influx of ions and 
water disrupts the ionic balance, causing osmotic lysis. 


34.5 The Immune System Contributes to the Prevention 
and the Development of Human Diseases 


The primary function of the immune system is to protect the host from 
invasion by foreign organisms. But how does the immune system prevent 
itself from mounting attacks against the host organism? In other words, how 
does the immune system distinguish between self and nonself? Clearly, pro- 
teins from the organism itself do not bear some special tag identifying them. 
Instead, selection processes early in the developmental pathways for 
immune cells kill or suppress those immune cells that react strongly with 
self-antigens. The evolutionary paradigm still applies; immune cells that 
recognize self-antigens are generated, but selective mechanisms eliminate 
such cells in the course of development. 


T cells are subjected to positive and negative selection in the thymus 


T cells derive their name from the location of their production: the thymus, 
a small organ situated just above the heart. Examination of the developmental 
pathways leading to the production of mature cytotoxic and helper T 
cells reveals the selection mechanisms that are crucial for distinguishing self 
from nonself. These selection criteria are quite stringent; approximately 
98% of the thymocytes, the precursors of T cells, die before the completion 
of the maturation process. 

Thymocytes produced in the bone marrow do not express the T-cell 
receptor complex, CD4, or CD8. On relocation to the thymus and rear- 
rangement of the T-cell-receptor genes, the immature thymocyte expresses 
all of these molecules. These cells are first subjected to positive selection 
(Figure 34.41). Cells for which the T-cell receptor can bind with reasonable 
affinity to either class I or class II MHC molecules survive this selection; 
those for which the T-cell receptor does not participate in such an interac- 
tion undergo apoptosis and die. The role of the positive selection step is to 
prevent the production of T cells that will not bind to any MHC complex 
present, regardless of the peptide bound. 

The cell population that survives positive selection is subjected to a 
second step, negative selection. Here, T cells that bind with high affinity to 
MHC complexes bound to self-peptides expressed on the surfaces of 
antigen-presenting cells in the thymus undergo apoptosis or are otherwise 
suppressed. Those that do not bind too avidly to any such MHC complex 
complete development and become mature cytotoxic T cells (which express 
only CD8) or helper T cells (which express only CD4). The negative 
selection step leads to self-tolerance; cells that bind an MHC-self-peptide 
complex are removed from the T-cell population. Similar mechanisms 
apply to developing B cells, suppressing B cells that express antibodies that 
interact strongly with self-antigens. 
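
The two selection steps can be summarized as a pair of filters applied in sequence. The sketch below is a deliberately simplified model: each thymocyte is reduced to two hypothetical numbers in arbitrary units (its affinity for MHC molecules in general and its affinity for MHC bound to self-peptides), and the threshold values are invented for illustration rather than taken from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thymocyte:
    label: str
    mhc_affinity: float   # affinity for MHC molecules in general (arbitrary units)
    self_affinity: float  # affinity for MHC-self-peptide complexes (arbitrary units)


def positive_selection(cells, min_mhc_affinity):
    """Keep cells whose receptor binds some MHC molecule with reasonable affinity."""
    return [c for c in cells if c.mhc_affinity >= min_mhc_affinity]


def negative_selection(cells, max_self_affinity):
    """Remove cells that bind MHC-self-peptide complexes too strongly."""
    return [c for c in cells if c.self_affinity <= max_self_affinity]


def select_mature(cells, min_mhc_affinity=0.3, max_self_affinity=0.7):
    """Apply positive selection first, then negative selection."""
    survivors = positive_selection(cells, min_mhc_affinity)
    return negative_selection(survivors, max_self_affinity)


# Hypothetical thymocytes illustrating the three possible fates.
EXAMPLE_THYMOCYTES = [
    Thymocyte("no MHC binding", 0.05, 0.0),   # fails positive selection
    Thymocyte("self-reactive", 0.9, 0.95),    # fails negative selection
    Thymocyte("useful", 0.6, 0.2),            # survives both steps
]

if __name__ == "__main__":
    for cell in select_mature(EXAMPLE_THYMOCYTES):
        print(cell.label)
```

In this model only cells in an intermediate window survive: those that bind MHC well enough to be useful but do not bind self-peptide complexes strongly enough to be dangerous.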


Autoimmune diseases result from the generation of immune responses 
against self-antigens 


Although thymic selection is remarkably efficient in suppressing the 
immune response to self-antigens, failures do occur. Such failures 
result in autoimmune diseases. These diseases include common illnesses such 
as insulin-dependent diabetes mellitus, multiple sclerosis, and rheumatoid 
arthritis. In these illnesses, immune responses against self-antigens result in 
damage to selective tissues that express the antigen (Figure 34.42). 

In many cases, the cause of the generation of self-reactive antibodies or 
T cells is unclear. However, in other cases, infectious organisms such as 
bacteria or viruses may play a role. Infection leads to the generation of anti- 
bodies and T cells that react with many different epitopes from the infectious 
organism. If one of these antigens closely resembles a self-antigen, an 
autoimmune response can result. For example, Streptococcus infections 
sometimes lead to rheumatic fever owing to the production of antibodies 
to streptococcal antigens that cross-react with exposed epitopes in heart 
muscle. 


Figure 34.42 Consequences of autoimmunity. Photomicrographs of an islet of Langerhans 
(A) in the pancreas of a normal mouse and (B) in the pancreas of a mouse with an immune response 
against pancreatic β cells, which results in a disease resembling insulin-dependent diabetes mellitus in 
human beings. Notice that the relatively pale cellular area in the center of part A is populated with the 
dark nuclei of inflammatory cells in part B. [From M. A. Atkinson and N. K. Maclaren. What causes 
diabetes? Copyright © 1990 by Scientific American, Inc. All rights reserved.] 


The immune system plays a role in cancer prevention 


The development of immune responses against proteins encoded by 
our own genomes can be beneficial in some circumstances. Cancer 
cells have undergone significant changes that often result in the expression 
of proteins that are not normally expressed. For example, the mutation of 
genes can generate proteins that do not correspond in amino acid sequence 
to any normal protein. Such proteins may be recognized as foreign, and an 
immune response will be generated specifically against the cancer cell. 
Alternatively, cancer cells often produce proteins that are expressed during 
embryonic development but are not expressed or are expressed at very low 
levels after birth. For example, a membrane glycoprotein called carcinoembryonic 
antigen (CEA) appears in the gastrointestinal cells of developing 
fetuses but is not normally expressed at significant levels after birth. More 
than 50% of patients with colorectal cancer have elevated serum levels of 
CEA. Immune cells recognizing epitopes from such proteins will not be 
subjected to negative selection and, hence, will be present in the adult 
immune repertoire. These cells may play a cancer surveillance role, killing 
cells that overexpress antigens such as CEA and preventing genetically 
damaged cells from developing into tumors. 


Vaccines are a powerful means to prevent and eradicate disease 


The discovery and utilization of vaccines, biological preparations that 
stimulate immunity to a particular pathogen or disease, represent a 
significant milestone in modern medical history. The pioneering work of 
Edward Jenner and Louis Pasteur in the eighteenth and nineteenth centu- 
ries revealed that inoculation with inactivated forms of a pathogen could 
protect against subsequent infection with the active pathogen. Treatment of 
a person with a vaccine, or immunization, leads to the development of immu- 
nological memory, the ability of the immune system to respond more rapidly 
and effectively to pathogens that have been encountered previously. At the 
cellular level, memory arises because positively selected B cells and T cells 
that have a high affinity for their target persist within the organism as 
memory B cells and memory T cells. 

In order for a vaccine to be effective, it must generate a sustained protec- 
tive response appropriate for the targeted pathogen. For example, patho- 
gens such as Clostridium tetani, the causative agent of tetanus, cause disease 
by the release of destructive proteins called toxins. Hence, an effective teta- 
nus vaccine should generate a strong B-cell-mediated response. In contrast, 
a vaccine designed to prevent disease caused by an intracellular pathogen 
should generate a strong T-cell-mediated response. The active component 
of a vaccine that is responsible for stimulating this immune response can 
take many forms: 


Killed or inactivated vaccines contain pathogens that have been rendered 
harmless by treatment with chemicals or high heat. 


Live attenuated vaccines contain live pathogens that have accumulated 
mutations so that they are no longer virulent to human cells. These vaccines 
are most commonly generated by repeated infection of the pathogen in cul- 
tured cells until its virulence is lost. 


Subunit vaccines contain a purified protein component of the pathogen. 
Such proteins can be either isolated from infected material (such as blood 
from chronically infected patients) or generated by recombinant methods. 


Toxoid vaccines are used against pathogens that employ an extracellular 
toxin to cause disease. These vaccines contain a form of the toxin that has 
been inactivated by treatment with chemicals or high heat. 


The effectiveness of immunization efforts has made the global eradica- 
tion of certain diseases possible. In 1966, 20 million cases of smallpox were 
reported worldwide. By 1980, coordinated vaccination efforts in regions of 
endemic disease had enabled the World Health Assembly to declare that 
smallpox had been eradicated. Nevertheless, despite the success of vaccines 
in the prevention of many devastating diseases, several pathogens have 
posed a significant challenge to vaccine development. For example, the 
development of an effective HIV vaccine has been complicated by the antigenic 
diversity of HIV strains. Because its mechanism for replication is 
prone to error, a population of HIV presents an ever-changing array of coat 
proteins. Indeed, the mutation rate of HIV is more than 65 times higher 
than that of influenza virus. Nevertheless, the identification of protective 
antibodies in individuals resistant to HIV and advancements in vaccine 
design suggest that the development of an effective HIV vaccine remains a 
real possibility. 


Summary 


Two lines of defense against pathogens are the innate immune system 
and the adaptive immune system. The innate immune system targets 
features present on many different pathogens but misses those pathogens 
lacking the targeted features. The adaptive immune system is 
both more specific and wide reaching. To respond effectively to a vast 
array of pathogens, this type of immune system must be tremendously 
adaptable. Adaptation by the adaptive immune system follows the 
principles of evolution: an enormously diverse set of potentially useful 
proteins is generated; these proteins are then subjected to intense 
selection so that only cells that express useful proteins flourish and 
continue development, until an effective immune response to a specific 
invader is generated. 


34.1 Antibodies Possess Distinct Antigen-Binding and Effector Units 

The major immunoglobulin in the serum is immunoglobulin G. An 
IgG protein is a heterotetramer with two heavy chains and two light 
chains. Treatment of IgG molecules with proteases such as papain 
produces three fragments: two Fab fragments that retain antigen-binding 
activity and an Fc fragment that retains the ability to activate 
effector functions such as the initiation of the complement cascade. 
The Fab fragments include the L chain and the amino-terminal half of 
the H chain; the Fc domain is a dimer consisting of the carboxyl-terminal 
halves of two H chains. Five different classes of antibody 
(IgG, IgM, IgA, IgD, and IgE) differ in their heavy chains and, hence, 
in their effector functions. 


34.2 Antibodies Bind Specific Molecules Through Hypervariable Loops 

One particular protein fold is found in many of the key proteins of the 
immune system. The immunoglobulin fold consists of a pair of β 
sheets that pack against each other, linked by a single disulfide bond. 
Loops projecting from one end of the structure form a binding surface 
that can be varied by changing the amino acid sequences within the 
loops. Domains with immunoglobulin folds are linked to form antibodies 
and other classes of proteins in the immune system, including 
T-cell receptors. Two chains come together to form the binding surface 
of an antibody. Three loops from each domain, the complementarity-determining 
regions, form an essentially continuous surface that can 
vary tremendously in shape, charge, and other characteristics to allow 
particular antibodies to bind to molecules ranging from small molecules 
to large protein surfaces. 


34.3 Diversity Is Generated by Gene Rearrangements 

The tremendous diversity of the amino acid sequences of antibodies is 
generated by segmental rearrangements of genes. For antibody κ light 
chains, 1 of 40 variable regions is linked to 1 of 5 joining regions. The 
combined VJ unit is then linked to the constant region. Thousands of 
different genes can be generated in this manner. Similar arrays are 
rearranged to form the genes for the heavy chains, but an additional 
region called the diversity region lies between the V and the J regions. 
The combination of L and H chains, each obtained through such rearranged 
genes, can produce more than 10⁸ distinct antibodies. Different 
classes of antibodies are also generated by gene rearrangements that 
lead to class switching. Oligomerization of membrane-bound anti- 
body molecules initiates a signal-transduction cascade inside B cells. 
Key steps in this signaling process include the phosphorylation of 
specific tyrosine residues in sequences termed immunoreceptor 
tyrosine-based activation motifs, present in proteins that associate 
with the membrane-bound antibodies. 
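
The arithmetic behind combinatorial joining is simple multiplication, as the following sketch shows. The segment counts for κ light chains (40 variable and 5 joining regions) come from the summary above; the helper name is an invention for this example, and no heavy-chain counts are assumed because the summary does not state them.

```python
import math


def count_combinations(*segment_counts):
    """Number of distinct genes obtainable by picking one segment from each family."""
    if any(count < 0 for count in segment_counts):
        raise ValueError("segment counts must be non-negative")
    return math.prod(segment_counts)


if __name__ == "__main__":
    # kappa light chain: 1 of 40 V regions joined to 1 of 5 J regions
    kappa_vj = count_combinations(40, 5)
    print(kappa_vj)  # 200
```

Segment choice alone gives 200 V-J pairings for κ chains, so the larger figures quoted in the summary presumably reflect additional sources of variation beyond the choice of segments. The same function accepts three counts, so it can be applied to heavy-chain V, D, and J families once their sizes are known.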


34.4 Major-Histocompatibility-Complex Proteins Present Peptide Antigens 
on Cell Surfaces for Recognition by T-Cell Receptors 

Intracellular pathogens such as viruses and mycobacteria cannot 
be easily detected. Intracellular proteins are constantly being cut 
into small peptides by proteasomes and displayed in class I major- 
histocompatibility-complex proteins on cell surfaces. Such peptides lie 
in a groove defined by two helices in the class I MHC proteins. The 
combination of MHC protein and peptide can be bound by an appro- 
priate T-cell receptor. T-cell receptors resemble the antigen-binding 
domains of antibodies in structure, and diversity in T-cell-receptor 
sequence is generated by V(D)J gene rearrangements. The T-cell recep- 
tor recognizes features of both the peptide and the MHC molecule that 
presents it. Cytotoxic T cells initiate apoptosis in cells to which they 
bind through interactions between T-cell receptors and class I MHC-peptide 
complexes aided by interactions with the coreceptor molecule 
CD8. Helper T cells recognize peptides presented in class II MHC 
proteins, a distinct type of MHC protein expressed only on antigen-presenting 
cells, such as B cells and macrophages. Helper T cells 
express the coreceptor CD4 rather than CD8. CD4 interacts with class 
II MHC proteins present on antigen-presenting cells. Signaling path- 
ways, analogous to those in B cells, are initiated by interactions between 
MHC-peptide complexes and T-cell receptors and the CD8 and CD4 
coreceptors. Human immunodeficiency virus damages the immune 
system by infecting cells that express CD4, such as helper T cells. 


34.5 The Immune System Contributes to the Prevention and the 
Development of Human Diseases 

In principle, the immune system is capable of generating antibodies 
and T-cell receptors that bind to self-molecules—that is, molecules that 
are normally present in a healthy and uninfected individual organism. 
Selection mechanisms prevent such self-directed molecules from being 
expressed at high levels. The selection process includes both positive 
selection, to enrich the population of cells that express molecules that 
have the potential to bind foreign antigens in an appropriate context, 
and negative selection, which eliminates cells that express molecules 
with too high an affinity for self-antigens. Autoimmune diseases such 
as insulin-dependent diabetes mellitus can result from the amplification 
of a response against a self-antigen. Vaccines stimulate immunological 
memory so as to prevent, and even eradicate, disease. 


Problems 


1. First things first. Distinguish between the innate and 
adaptive immune systems. 


2. Antibody diversity. What are the mechanisms used by 
B cells to generate antibody diversity? 


3. Hang in there. Explain the difference between affinity 
and avidity. For which immunoglobulin class might avidity 
be particularly important in antigen recognition? 


4. Innate abilities. A strain of mice has been identified that 
does not respond to LPS. This lack of response is due to a 
single amino acid change in the intracellular domain of 
mouse TLR-4. Propose an explanation for the lack of 
response. 


5. TLR ligands. The PAMP recognized by TLR-3 is 
double-stranded RNA (dsRNA). Against which pathogens 
would TLR-3 be an effective immune receptor? 


6. Energetics and kinetics. Suppose that the dissociation 
constant of an Fab-hapten complex is 3 × 10⁻⁷ M at 25°C. 


(a) What is the standard free energy of binding? 


(b) Immunologists often speak of affinity (Ka), the reciprocal 
of the dissociation constant, in comparing antibodies. 
What is the affinity of this Fab? 


(c) The rate constant for the release of hapten from the complex 
is 120 s⁻¹. What is the rate constant for association? What 
does the magnitude of this value imply about the extent of 
structural change in the antibody on binding hapten? 


human leukocyte antigen (HLA) (p. 992) 

β2-microglobulin (p. 992) 


T-cell receptor (p. 994) 
CD8 (p. 994) 
perforin (p. 996) 
granzyme (p. 996) 
helper T cell (p. 996) 
class II MHC protein (p. 996) 
CD4 (p. 997) 
human immunodeficiency virus (HIV) 
(p. 999) 
positive selection (p. 1000) 
negative selection (p. 1000) 
autoimmune disease (p. 1001) 
carcinoembryonic antigen (CEA) 
(p. 1002) 
vaccine (p. 1002) 
immunological memory (p. 1002) 
killed, or inactivated, vaccines (p. 1002) 
live attenuated vaccines (p. 1002) 
subunit vaccines (p. 1002) 
toxoid vaccines (p. 1002) 


7. A brilliant emitter. Certain naphthalene derivatives, such 
as the dansyl group, exhibit a weak yellow fluorescence 
when they are in a highly polar environment (such as water) 
and an intense blue fluorescence when they are in a markedly 
nonpolar environment (such as hexane). The binding 
of ε-dansyl-lysine to specific antibody is accompanied by a 
marked increase in its fluorescence intensity and a shift in 
color from yellow to blue. What does this finding reveal 
about the hapten-antibody complex? 


8. Miniantibody. The Fab fragment of an antibody molecule 
has essentially the same affinity for a monovalent hapten as 
does intact IgG. 


(a) What is the smallest unit of an antibody that can retain 
the specificity and binding affinity of the whole protein? 


(b) Design a compact single-chain protein that is likely to 
specifically bind antigen with high affinity. 


9. Turning on B cells. B lymphocytes, the precursors of 
plasma cells, are triggered to proliferate by the binding of 
multivalent antigens to receptors on their surfaces. The cell- 
surface receptors are transmembrane immunoglobulins. 
Univalent antigens, in contrast, do not activate B cells. 


(a) What do these findings reveal about the mechanism of 
B-cell activation? 


(b) How might antibodies be used to activate B cells? 


10. An ingenious cloning strategy. In the cloning of the gene 
for the α chain of the T-cell receptor, T-cell cDNAs were 
hybridized with B-cell mRNAs. What was the purpose of 
this hybridization step? Can the principle be applied 
generally? 


11. Pathogen susceptibility. Patients carrying specific muta- 
tions in the gene encoding the TLR-4 protein are susceptible 
to infections from Gram-negative bacteria. Why are these 
patients vulnerable to this particular type of pathogen? 


12. Matchmaker, matchmaker. Why is it important to 
match HLA alleles between donor and recipient in organ 
transplantation? 


13. Instruction. Before the mechanism for generating anti- 
body diversity had been established, a mechanism based on 
protein folding around an antigen was proposed, primarily 
by Linus Pauling. In this model, antibodies that had differ- 
ent specificities had the same amino acid sequence but were 
folded in different ways. Propose a test of this model. 


14. Dealing with nonsense. Cells, including immune cells, 
degrade mRNA molecules in which no long open reading 
frame is present. The process is called nonsense-mediated 
RNA decay. Suggest a role for this process in immune cells. 


15. Down, but not out. To understand the genes responsible 
for growth and infectivity in a disease-causing bacterial 
strain, you perform chemical mutagenesis on a culture of 
these bacteria. In the course of your investigation into the 
properties of the resulting mutants, you identify a set of 
mutant bacteria that are still viable but whose virulence is significantly 
impaired. How might these mutants be useful for 
vaccine development? 


16. Presentation. The amino acid sequence of a small 
protein is 


MSRLASKNLIRSDHAGGLLQATYSAVSS- 
IKNTMSFGAWSNAALNDSRDA 


Predict the most likely peptide to be presented by the class 
I MHC molecule HLA-A2. 


Mechanism Problem 


17. Catalytic antibody. Antibody is generated against a 
transition state for the hydrolysis of the following ester. 


Transition-state analog 


Some of these antibodies catalyze the hydrolysis of the ester. 
What amino acid residue might you expect to find in the 
binding site on the antibody? 


Chapter Integration Problem 


18. Signaling. Protein tyrosine phosphatases, such as the 
molecule CD45 expressed in both B cells and T cells, play 
important roles in activating such protein tyrosine kinases 
as Fyn and Lck, which are quite similar to Src. Suggest a 
mechanism for the activation of such protein kinases by the 
removal of a phosphoryl group from a phosphotyrosine 
residue. 


Data Interpretation Problem 


19. Affinity maturation. A mouse is immunized with an 
oligomeric human protein. Shortly after immunization, a 
cell line that expresses a single type of antibody molecule 
(antibody A) is derived. The ability of antibody A to bind 
the human protein is assayed with the results shown in the 
graph below. 
After repeated immunizations with the same protein, 
another cell line is derived that expresses a different anti- 
body (antibody B). The results of analyzing the binding of 
antibody B to the protein also are shown. From these data, 
estimate 


(a) the dissociation constant ($K_d$) for the complex between 
the protein and antibody A. 


(b) the dissociation constant for the complex between the 
protein and antibody B. 


Comparison of the amino acid sequences of antibody A and 
antibody B reveals them to be identical except for a single 
amino acid. What does this finding suggest about the 
mechanism by which the gene encoding antibody B was 
generated? 


Molecular Motors 


Organisms, from human beings to bacteria, move to adapt to changes in 
their environments, navigating toward food and away from danger. 
Cells themselves are not static but are bustling assemblies of moving pro- 
teins, nucleic acids, and organelles. This motion is enabled by two elements: 
molecular-motor proteins and complex networks of filamentous proteins 
termed the cytoskeleton (Figure 35.1). The dynamic networks that deter- 
mine the shape and mobility of cells are among the most active areas of 
investigation in modern cell biology. Remarkably, the fundamental bio- 
chemical mechanisms that produce contractions in our muscles are the same 
as those that propel organelles along the cytoskeleton. In fact, many of the 
proteins that play key roles in converting chemical energy into kinetic 
energy are members of the same protein family, the P-loop NTPases, the 
hugely important group of proteins that we first examined in Chapter 9. 
These molecular motors are homologous to proteins that we have encoun- 
tered in other contexts, including the G proteins in protein synthesis, sig- 
naling, and other processes. Once again, we see the economy of evolution in 
adapting existing proteins to perform new functions. 

Molecular motors operate by small increments, converting changes in 
protein conformation into directed motion. Orderly motion across distances 
requires a track that steers the motion of the motor assembly. Indeed, we 
have already encountered a class of molecular motors that utilize mechanisms 
The powerful muscles of the horse, like the muscles of all animals, are 
powered by the molecular-motor protein myosin. A part of myosin moves 
dramatically (as shown above) in response to ATP binding, hydrolysis, and 
product release, propelling myosin along an actin filament. This molecular 
movement is translated into movement of the entire animal, vividly depicted 
in da Vinci's rearing horse. [(Left) Leonardo da Vinci's study of a rearing 
horse for the Battle of Anghiari (ca. 1504) from The Royal Collection © Her 
Royal Majesty Queen Elizabeth II.] 


OUTLINE 


35.1 Most Molecular-Motor Proteins 
Are Members of the P-Loop 
NTPase Superfamily 


35.2 Myosins Move Along Actin 
Filaments 


35.3 Kinesin and Dynein Move Along 
Microtubules 


35.4 A Rotary Motor Drives Bacterial 
Motion 
Figure 35.1 Motion within cells. This 
high-voltage electron micrograph shows 
the mitotic apparatus in a metaphase 
mammalian cell. The large cylindrical objects 
are chromosomes, and the threadlike structures 
stretched across the center are microtubules, 
key components of the cytoskeleton. Microtubules 
serve as tracks for the molecular motors that 
move chromosomes. Many processes, 
including chromosome segregation in mitosis, 
depend on the action of molecular-motor 
proteins. [Courtesy of Dr. J. R. McIntosh.] 


Figure 35.2 Myosin structure at low 
resolution. Electron micrographs of myosin 
molecules reveal a two-headed structure with 
a long, thin tail. [Courtesy of Dr. Paula Flicker, 
Dr. Theo Walliman, and Dr. Peter Vibert.] 
that we will examine here, namely, the helicases that move along DNA 
during DNA replication (Section 28.1). The proteins on which we will focus 
in this chapter move along actin and microtubules—protein filaments com- 
posed of repeating subunits. The motor proteins cycle between forms hav- 
ing high or low affinity for the filament tracks in response to ATP binding 
and hydrolysis, enabling a bind, pull, and release mechanism that generates 
motion. 

We will also consider a completely different strategy for generating 
motion, one used by bacteria such as E. coli. A set of flagella act as propel- 
lers, rotated by a motor in the bacterial cell membrane. This rotary motor is 
driven by a proton gradient across the membrane, rather than by ATP 
hydrolysis. The mechanism for coupling the proton gradient to rotatory 
motion is analogous to that used by the F0 subunit of ATP synthase. Thus, 
both of the major modes for storing biochemical energy—namely, ATP 
and ion gradients—have been harnessed by evolution to drive organized 
molecular motion. 


35.1 Most Molecular-Motor Proteins Are Members of the 
P-Loop NTPase Superfamily 


Eukaryotic cells contain three major families of motor proteins: myosins, 
kinesins, and dyneins. Members of each of these classes move along compo- 
nents of the cytoskeleton, but, at first glance, these protein families appear 
to be quite different from one another. Myosin, first characterized on the 
basis of its role in muscle, moves along filaments of the protein actin. Each 
molecule of muscle myosin consists of two copies each of a heavy chain with 
a molecular mass of 220 kd, an essential light chain, and a regulatory light 
chain. The human genome encodes more than 40 distinct myosins; some 
function in muscle contraction, and others participate in a variety of other 
processes. Kinesins, which have roles in protein, mRNA, and vesicle trans- 
port as well as construction of the mitotic spindle and chromosome segrega- 
tion, are generally dimers of two polypeptides. The human genome encodes 
more than 40 kinesins. Dyneins power the motion of cilia and flagella, and a 
general cytoplasmic dynein contributes to a variety of motions in all cells, 
including vesicle transport and various transport events in mitosis. Dyneins 
are enormous, with heavy chains of molecular mass greater than 500 kd. 
The human genome encodes approximately 10 dyneins. 

Initially, comparison of the amino acid sequences of myosins, kinesins, 
and dyneins did not reveal significant relationships between these protein 
families but, after their three-dimensional structures were determined, 
members of the myosin and kinesin families were found to have remarkable 
similarities. In particular, both myosin and kinesin contain P-loop NTPase 
cores homologous to those found in G proteins. Sequence analysis of the 
dynein heavy chain reveals it to be a member of the AAA subfamily of 
P-loop NTPases that we encountered in the context of the 19S proteasome 
(Section 23.2). Dynein has six sequences encoding such P-loop NTPase 
domains arrayed along its length, although only four actually appear to bind 
nucleotides. Thus, we expect similarities in the mechanisms of action, and 
we can draw on our knowledge of P-loop NTPases in general as we analyze 
the mechanisms of action of these motor proteins. 


Molecular motors are generally oligomeric proteins with an ATPase 
core and an extended structure 


Let us first consider the structure of myosin, which we examined briefly in 
Chapter 9. The results of electron microscopic studies of skeletal-muscle 
myosin show it to be a two-headed structure linked to a long stalk 
(Figure 35.2). The treatment of myosin with trypsin and papain results in 
the formation of four fragments: two S1 fragments, one S2 fragment, and a 
fragment called light meromyosin (LMM). Heavy meromyosin (HMM) consists 
of the two S1 fragments together with S2 (Figure 35.3). Each S1 
fragment corresponds to one of the heads of the intact structure and includes 
850 amino-terminal amino acids from one of the two heavy chains as well as 
one copy of each of the light chains. Examination of the structure of an S1 
fragment at high resolution reveals the presence of the P-loop NTPase-domain 
core that is the site of ATP binding and hydrolysis (Figure 35.4). 
We examined the structure and mechanism of action of this motor domain 
in Chapter 9. 


Figure 35.3 Myosin dissection. Treatment of muscle myosin with proteases forms stable 
fragments, including subfragments S1 and S2 and light meromyosin. Each S1 fragment includes a 
head (shown in yellow or purple) from the heavy chain and one copy of each light chain (shown in 
blue and orange). 


Figure 35.4 Myosin structure at high 
resolution. The structure of the S1 fragment 
from muscle myosin reveals the presence of a 
P-loop NTPase domain (shaded in purple). 
Notice that an α helix extending from this 
domain is the binding site for the two light 
chains. [Drawn from 1DFL.pdb.] 


Extending away from this structure is a long α helix from the heavy chain. 
This helix is the binding site for the two light chains. The light chains are mem- 
bers of the EF-hand family, similar to calmodulin (see Figure 14.16), although 
most of the EF hands in light chains do not bind metal ions (Figure 35.5). 


Figure 35.5 Myosin light chains. The 
structures of the essential and regulatory light 
chains of muscle myosin are compared with 
the structure of calmodulin. Notice the 
similarities in the structures that allow each 
of these homologous proteins to bind an 
α helix (not shown) by wrapping around it. 
[Drawn from 1DFL.pdb and 1CM1.pdb.] 


Figure 35.6 Myosin two-stranded 
coiled coil. The two α helices form 
left-handed supercoiled structures that spiral 
around each other. Such structures are 
stabilized by hydrophobic residues at the 
contact points between the two helices. 
[Drawn from 2TMA.pdb.] 


Figure 35.8 Structure of head domain 
of kinesin at high resolution. Notice that 
the head domain of kinesin has the structure 
of a P-loop NTPase core (indicated by purple 
shading). [Drawn from 1I6I.pdb.] 
Like calmodulin, these proteins wrap around an α helix, serving to thicken 
and stiffen it. The remaining fragments of myosin (S2 and light meromyosin) 
are largely α helical, forming two-stranded coiled coils created by the 
remaining lengths of the two heavy chains wrapping around each other 
(Figure 35.6). These structures, together extending approximately 1700 Å, 
link the myosin heads to other structures. In muscle myosin, several LMM 
domains come together to form higher-order bundles. 

Conventional kinesin (kinesin 1), the first kinesin discovered, has several 
structural features in common with myosin. The dimeric protein has two 
heads connected by an extended structure (Figure 35.7). The size of the 
head domain is approximately one-third that of myosin. Determination of 
the three-dimensional structure of a kinesin fragment revealed that the 
head domain also is built around a P-loop NTPase core (Figure 35.8). The 
myosin domain is so much larger than that of kinesin because of two large 
insertions in the myosin domain that bind to actin filaments. For conven- 
tional kinesin, a region of approximately 500 amino acids extends from the 
head domain. Like the corresponding region in myosin, the extended part 
of kinesin forms an α-helical coiled coil. Conventional kinesin also has light 
chains, but, unlike those of myosin, these light chains bind near the car- 
boxyl terminus of the heavy chain and are thought to link the motor to 
intracellular cargo. 


Figure 35.7 Structure of kinesin. The elongated structure has the head domains at one end and 
the cargo-binding domains at the other, linked by a long coiled-coil region. 


Dynein has a somewhat different structure. As noted earlier, the dynein 
heavy chain includes six regions that are homologous to the AAA subfamily 
of ATPase domains. Although no crystallographic data are yet available, 
the results of electron microscopic studies and comparison with known 
structures of other AAA ATPases have formed the basis for the construc- 
tion of a model of the dynein structure (Figure 35.9). The head domain is 
appended to a region of approximately 1300 amino acids that forms an 
extended structure that links dynein units together to form oligomers and 
interacts with other proteins. 

Although the structures of these three classes of molecular motors have 
significant differences, some common features emerge. Each structure is 
dimeric with two head domains, has regions of extended but quite rigid 
structures, and has regions for interacting with other proteins. As we shall 
see, these structures are suitable for actions that resemble climbing a rope, 
hand over hand. The regions for interacting with other proteins represent 
the grasping hands, the extended structures represent the arms that act as 
levers to promote larger-scale motion, and the head domains are the engines 
that provide the necessary mechanical energy. 


ATP binding and hydrolysis induce changes in the conformation and 
binding affinity of motor proteins 


In Chapter 9, we examined the conformational changes that take place in 
the myosin ATPase domain from the slime mold Dictyostelium. The structures 
of myosin ATPase domains from other sources have been elucidated in a 
variety of forms as well. The S1 fragment of myosin from scallop muscle 
provides a striking example of the changes observed (Figure 35.10). The 
structure of this S1 fragment has been determined in a number of forms, 
including that without bound nucleotide and that bound to a complex 
formed of ADP and vanadate (VO₄³⁻), which, as mentioned in Chapter 9, 
is an analog of the ATP-hydrolysis transition state. The long helix that 
binds the light chains (hereafter referred to as the lever arm) protrudes 
outward from the head domain. Comparison of the structures reveals that the 
lever arm has rotated by nearly 90 degrees in the ADP–VO₄³⁻ complex 
compared with its position in the nucleotide-free form. How does the 
species in the nucleotide-binding site cause this dramatic transition? Two 
regions around the nucleotide-binding site (termed switch I and switch II) 
tightly conform to the shape of the γ-phosphoryl group of the ATP analog 
and adopt a looser conformation when the γ-phosphoryl group is absent 
(Figure 35.11). This conformational change allows a long α helix (termed 
the relay helix) to adjust its position. The carboxyl-terminal end of the 
relay helix interacts with structures at the base of the lever arm, and 
so a change in the position of the relay helix leads to a 
reorientation of the lever arm. 


Figure 35.9 Dynein structure. (A) The overall structure of dynein. (B) A model of the motor 
domain of dynein. Notice the six P-loop NTPase domains, some of which bind to and hydrolyze ATP. 
[Drawn from 1HN5.pdb.] 


(A) Myosin–ADP–VO₄³⁻ complex (B) Myosin–ADP complex 


Figure 35.10 Lever-arm motion. Two forms of the S1 fragment of scallop-muscle myosin. 
Notice the dramatic conformational changes when the identity of the bound nucleotide changes 
from the ADP–VO₄³⁻ complex to nucleotide-free form or vice versa, including a nearly 90-degree 
reorientation of the lever arm. [Drawn from 1DFL.pdb and 1SR6.pdb.] 

Analogous conformational changes take place in kinesin. 
The kinesins also have a relay helix that can adopt 
different configurations when kinesin binds different 
nucleotides. Kinesin lacks an α-helical lever arm, however. 
Instead, a relatively short segment termed the neck 
linker changes conformation in response to nucleotide 
binding (Figure 35.12). The neck linker binds to the head 
domain of kinesin when ATP is bound but is released 
when the nucleotide-binding site is vacant or occupied by 
ADP. 


Figure 35.11 Relay helix. A superposition of key elements in two 
forms of scallop myosin reveals the structural changes that are 
transmitted by the relay helix from the switch I and switch II loops to the 
base of the lever arm. Notice that the switch I and switch II loops interact 
with VO₄³⁻ in the position that would be occupied by the γ-phosphoryl 
group of ATP. The structure of the myosin–ADP–VO₄³⁻ complex is 
shown in lighter colors. [Drawn from 1DFL.pdb and 1SR6.pdb.] 


35.2 Myosins Move Along Actin Filaments 


Myosins, kinesins, and dyneins move by cycling between states with 
different affinities for the long, polymeric macromolecules that serve as 
tracks along which they move. For myosin, the molecular track is a 
polymeric form of actin, a 42-kd protein that is among the most abundant 
proteins in eukaryotic cells, typically accounting for as much as 10% of the 
total protein. We begin with a general discussion of the polymeric structure 
of actin and its assembly. We then examine the interactions between myosin 
and actin, including both structure and the dynamic interactions between 
these two proteins. Finally, we turn to the structure of muscle and the roles 
of myosin and actin in muscle contraction. 


Figure 35.12 Neck linker. A comparison of the structures of a kinesin 
bound to ADP and bound to an ATP analog. 
Notice that the neck linker (orange), which 
connects the head domain to the remainder 
of the kinesin molecule, is bound to the head 
domain in the presence of the ATP analog but 
is free in the presence of ADP only. [Drawn 
from 1I6I.pdb and 1I5S.pdb.] 


Actin is a polar, self-assembling, dynamic polymer 


The structure of the actin monomer has been determined to atomic resolu- 
tion by x-ray crystallography and has been used to interpret the structure of 
actin filaments, already somewhat understood through electron microscopy 
studies at lower resolution. Each actin monomer comprises four domains 
(Figure 35.13). These domains come together to surround a bound nucleotide, 
either ATP or ADP. The ATP form can be converted into the ADP form by 
hydrolysis. 

Actin monomers (often called G-actin for globular) come together to 
form actin filaments (often called F-actin; see Figure 35.13). F-actin has a 
helical structure; each monomer is related to the preceding one by a 
translation of 27.5 Å and a rotation of 166 degrees around the helical axis. 
Because the rotation is nearly 180 degrees, F-actin resembles a two-stranded cable. 
Note that each actin monomer is oriented in the same direction along the 
F-actin filament, and so the structure is polar, with discernibly different ends. 
One end is called the barbed (plus) end, and the other is called the pointed 
(minus) end. The names “barbed” and “pointed” refer to the appearance of 
an actin filament when myosin S1 fragments are bound to it. 
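
The helical parameters quoted above (a translation of 27.5 Å and a rotation of 166 degrees per monomer) can be turned into subunit positions with simple arithmetic. The following Python sketch is only an illustration; the function names are hypothetical and chosen for this example, and the model treats the filament as an ideal helix.

```python
RISE_PER_SUBUNIT = 27.5    # translation along the helical axis per monomer, in angstroms (from the text)
TWIST_PER_SUBUNIT = 166.0  # rotation about the helical axis per monomer, in degrees (from the text)


def subunit_position(i):
    """Return (axial position in angstroms, azimuthal angle in degrees) of monomer i."""
    axial = i * RISE_PER_SUBUNIT
    angle = (i * TWIST_PER_SUBUNIT) % 360.0
    return axial, angle


def angular_offset(i):
    """Smallest angular distance, in degrees, between monomer i and monomer 0."""
    angle = (i * TWIST_PER_SUBUNIT) % 360.0
    return min(angle, 360.0 - angle)


def nearest_repeat(max_subunits=20):
    """Monomer (1..max_subunits) whose orientation is closest to that of monomer 0."""
    return min(range(1, max_subunits + 1), key=angular_offset)


if __name__ == "__main__":
    for i in range(4):
        print(i, subunit_position(i))
    n = nearest_repeat()
    print("closest repeat:", n, "offset (deg):", angular_offset(n),
          "axial distance (angstroms):", n * RISE_PER_SUBUNIT)
```

With these parameters, monomer 2 is offset by only 28 degrees from monomer 0, which is why every second monomer appears to lie along the same strand of a two-stranded cable. Monomer 13 comes within 2 degrees of the starting orientation, 357.5 Å farther along the axis.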

How are actin filaments formed? Like many biological structures, actin 
filaments self-assemble; that is, under appropriate conditions, actin mono- 
mers will come together to form well-structured, polar filaments. The 
aggregation of the first two or three monomers to form a filament is highly 
unfavorable. Thus, specialized protein complexes, including one called 
Arp2/3, serve as nuclei for actin assembly in cells. Once such a filament 
nucleus exists, the addition of subunits is more favorable. Let us consider 
the polymerization reaction in more detail. We designate an actin filament 
with n subunits $A_n$. This filament can bind an additional actin monomer, 
A, to form $A_{n+1}$. 


$$A_n + A \rightleftharpoons A_{n+1} \qquad K_d = \frac{[A_n][A]}{[A_{n+1}]}$$


The dissociation constant, $K_d$, for this reaction defines the monomer 
concentrations at which the polymerization reaction will take place, because 
the concentration of polymers of length n + 1 will be essentially equal to 
that for polymers of length n. Thus, 


$$[A_n] \approx [A_{n+1}] \quad \text{and} \quad K_d \approx [A]$$
Figure 35.13 Actin structure. (Left) 
Schematic view of actin monomers (one in 
blue) of an actin filament. (Right) The domains 
in the four-domain structure of an actin 
monomer are identified by different shades of 
blue. Notice the nucleotide-binding site at the 
center of the structure. [Drawn from 1J6Z.pdb.] 
Figure 35.14 The structure of myosin 
bound to actin. (A) The gray surface 
represents the structure observed by 
cryoelectron microscopy, with the green 
space-filling model representing one actin 
subunit. The ribbon diagram shows the 
structure of the S1 fragment of myosin 
docked into the cryoelectron microscopic 
structure. Notice that some of the myosin 
structure lies outside the gray surface. (B) The 
structure after the structure of the myosin S1 
fragment has been allowed to adjust to more 
closely match the structure observed by 
cryoelectron microscopy. Notice that the 
myosin structure now more closely matches the 
gray surface. [M. Lorenz and K. C. Holmes. 
PNAS 107:12529-12534, 2010. Copyright 
2010 National Academy of Sciences, U.S.A.] 


In other words, the polymerization reaction will proceed until the monomer 
concentration is reduced to the value of $K_d$. If the monomer concentration 
is below the value of $K_d$, the polymerization reaction will not proceed at all; 
indeed, existing filaments will depolymerize until the monomer concentration 
reaches the value of $K_d$. Because of these phenomena, $K_d$ is referred to 
as the critical concentration for the polymer. Recall that actin contains a 
nucleotide-binding site that can contain either ATP or ADP. The critical 
concentration for the actin–ATP complex is approximately 20-fold lower 
than that for the actin–ADP complex; actin–ATP polymerizes more readily 
than does actin–ADP. 
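
The consequence of a critical concentration can be illustrated with a short calculation. The Python sketch below assumes that filament nuclei are already present and that the system reaches equilibrium; the concentrations are hypothetical values in arbitrary units chosen for illustration, and only the 20-fold ratio between the two nucleotide states comes from the text.

```python
def free_monomer_at_equilibrium(total_actin, critical_conc):
    """Free monomer concentration once net polymerization has stopped.

    Filaments grow while free monomer exceeds the critical concentration and
    shrink while it is below, so free monomer settles at the critical
    concentration unless the total actin available is less than that.
    """
    return min(total_actin, critical_conc)


def polymerized_actin(total_actin, critical_conc):
    """Concentration of actin subunits held in filaments at equilibrium."""
    return total_actin - free_monomer_at_equilibrium(total_actin, critical_conc)


# Hypothetical values in arbitrary concentration units.
CC_ATP = 1.0            # assumed critical concentration for actin-ATP
CC_ADP = 20.0 * CC_ATP  # 20-fold higher for actin-ADP, as stated in the text
TOTAL = 10.0            # assumed total actin

if __name__ == "__main__":
    print("actin-ATP in filaments:", polymerized_actin(TOTAL, CC_ATP))
    print("actin-ADP in filaments:", polymerized_actin(TOTAL, CC_ADP))
```

Under these assumed numbers, most of the actin–ATP ends up in filaments, whereas the same total amount of actin–ADP stays below its critical concentration and does not polymerize at all.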

Actin filaments inside cells are highly dynamic structures that are con- 
tinually gaining and losing monomers. Nucleation by complexes such as 
Arp2/3 can initiate the polymerization of actin-ATP. In contrast, the 
hydrolysis of bound ATP to ADP favors actin depolymerization. This reac- 
tion acts as a timer to make actin filaments kinetically unstable. Proteins 
that bind actin monomers or promote the severing of actin filaments also 
play roles. Polymerization reactions can exert force, pushing or pulling on 
cell membranes. Regulated actin polymerization is central to the changes in 
cell shape associated with cell motility in amebae as well as in human cells such 
as macrophages. 


Myosin head domains bind to actin filaments 


It has not been possible to determine the in vivo structure of a complex 
between actin and myosin at sufficiently high resolution to discern molecular 
details. However, treatment of actin filaments with myosin S1 fragments 
in the absence of ATP results in a complex referred to as decorated actin for 
which the structure has been determined by cryoelectron microscopy to a 
resolution of 13 Å. Although a structure at this resolution alone would not 
be adequate to observe molecular details, superimposition of the high-resolution 
structures of actin monomers and the myosin S1 fragment on the 
structure of decorated actin can be a source of insight into the details of its 
structure (Figure 35.14). The myosin head domain is in a conformation 
close to that observed for the nucleotide-free form. This structure also 
reveals the interaction surfaces between myosin and actin. The modeling 
suggests that the myosin head-domain conformation changes somewhat to 
increase its interaction with the actin filament. These conformational 
changes result in a slight opening of the nucleotide-binding site in myosin. 
This observation has implications for the mechanism by which myosin 
moves along actin filaments. 


Motions of single motor proteins can be directly observed 


Now that we understand the conformational changes behind myosin’s 
action, we can explore how myosin “walks” along its actin track. Studies of 
single myosin molecules moving relative to actin filaments have been sources 
of deep insight into the mechanisms underlying muscle contraction and 
other complex processes. A powerful tool for these studies, called an optical 
trap, relies on highly focused laser beams (Figure 35.15). Small beads can 
be caught in these traps and held in place in solution. 

The position of the beads can be monitored with nanometer precision. 
James Spudich and his coworkers designed an experimental arrangement 
consisting of an actin filament that had a bead attached to each end. Each 
bead could be caught in an optical trap (one at each end of the filament) and 
the actin filament could be pulled taut over a microscope slide containing 
other beads that had been coated with fragments of myosin such as the 
heavy meromyosin fragment (see Figure 35.15). On the addition of ATP, 
transient displacements of the actin filament were observed along its long 
axis. The size of the displacement steps was fairly uniform with an average 
size of 11 nm (110 Å). 

The results of these studies, performed in the presence of varying con- 
centrations of ATP, are interpreted as showing that individual myosin 
heads bind the actin filament and undergo a conformational change (the 
power stroke) that pulls the actin filament, leading to the displacement of the 
beads. After a period of time, the myosin head releases the actin, which then 
snaps back into place. 


Phosphate release triggers the myosin power stroke 


How does ATP hydrolysis drive the power stroke? A key observation is that 
the addition of ATP to a complex of myosin and actin results in the disso- 
ciation of the complex. Thus, ATP binding and hydrolysis cannot be 
directly responsible for the power stroke. We can combine this fact with the 
structural observations described earlier to construct a mechanism for the 
motion of myosin along actin (Figure 35.16). Let us begin with nucleotide-free 
myosin bound to actin. The binding of ATP to myosin results in the 
dissociation of myosin from actin. With ATP bound and free of actin, the 
myosin domain can undergo the conformational change associated with the 
formation of the transition state for ATP hydrolysis. This conformational 
change results in the reorientation of the lever arm. In this form, the myosin 
head can dock onto the actin filament; phosphate is released with concomitant 
motion of the lever arm. This conformational change represents the power 
stroke and moves the body of the myosin molecule relative to the actin filament 
by approximately 110 Å. The release of ADP completes the cycle. 
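
The cycle just described can be written down as an ordered list of states, which makes it easy to see that displacement arises only at the step in which the lever arm swings while the head is attached to actin. The Python sketch below is a bookkeeping aid, not a kinetic model; the class name, the labels "pre-stroke" and "post-stroke", and the function name are hypothetical choices made for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MyosinState:
    nucleotide: str       # contents of the nucleotide-binding site
    bound_to_actin: bool  # whether the head is attached to the filament
    lever_arm: str        # "pre-stroke" or "post-stroke" (labels chosen for this sketch)


# States in the order given in the text, each with the event that produces it.
CYCLE = [
    ("start: nucleotide-free myosin bound to actin", MyosinState("none", True, "post-stroke")),
    ("ATP binds; myosin dissociates from actin", MyosinState("ATP", False, "post-stroke")),
    ("hydrolysis transition state; lever arm reorients", MyosinState("ADP + Pi", False, "pre-stroke")),
    ("myosin head docks onto actin", MyosinState("ADP + Pi", True, "pre-stroke")),
    ("Pi released; power stroke", MyosinState("ADP", True, "post-stroke")),
    ("ADP released; cycle complete", MyosinState("none", True, "post-stroke")),
]

STEP_ANGSTROMS = 110.0  # displacement per power stroke, from the text


def displacement_per_cycle(cycle):
    """Sum displacement over transitions in which the lever arm swings while attached."""
    total = 0.0
    for (_, before), (_, after) in zip(cycle, cycle[1:]):
        swings = before.lever_arm == "pre-stroke" and after.lever_arm == "post-stroke"
        if swings and before.bound_to_actin and after.bound_to_actin:
            total += STEP_ANGSTROMS
    return total


if __name__ == "__main__":
    for event, state in CYCLE:
        print(f"{event:50s} {state}")
    print("displacement per cycle (angstroms):", displacement_per_cycle(CYCLE))
```

The lever arm also reorients earlier in the cycle, but the head is detached at that point, so that motion resets the motor without moving the filament.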


Muscle is a complex of myosin and actin 


The mechanism of moving a single myosin molecule relative to an actin 
filament explains how muscles contract. Vertebrate muscle that is under 
voluntary control, such as the biceps and triceps in your upper arm, has a 
banded (striated) appearance when examined under a light microscope. It 
consists of multinucleated cells that are bounded by an electrically excitable 
Figure 35.15 Watching a single motor 
protein in action. (A) An actin filament 
(blue) is placed above a heavy meromyosin 
(HMM) fragment (yellow) that projects from a 
bead on a glass slide. A bead attached to 
each end of the actin filament is held in an 
optical trap produced by a focused, intense 
infrared laser beam (orange). The position of 
these beads can be measured with 
nanometer precision. (B) Recording of the 
displacement of an actin filament due to a 
myosin derivative attached to a bead, 
influenced by the addition of ATP. Note the 
fairly uniform step sizes that are observed. 
[(A) After J. T. Finer, R. M. Simmons, and J. A. 
Spudich. Nature 368:113-119, 1994. 
(B) From R. S. Rock, M. Rief, A. D. Mehta, and 
J. A. Spudich. Methods 22:378-381, 2000.] 
Figure 35.16 Myosin motion along 
actin. A myosin head (yellow) in the 
apo form is bound to an actin filament 
(blue). The binding of ATP (1) results 
in the release of myosin from actin. 
The reversible hydrolysis of ATP bound 
to myosin (2) can result in the 
reorientation of the lever arm. With ATP 
hydrolyzed but the products still bound to myosin, 
myosin can bind actin (3). The release 
of Pi (4) results in the reorientation of 
the lever arm and the concomitant 
motion of actin relative to myosin. The 
release of ADP (5) completes the 
cycle. 
plasma membrane. A muscle cell contains many parallel 
myofibrils, each about 1 μm in diameter. The 
functional unit, called a sarcomere, typically repeats 
every 2.3 μm (23,000 Å) along the fibril axis in 
relaxed muscle (Figure 35.17). A dark A band and a 
light I band alternate regularly. The central region of 
the A band, termed the H zone, is less dense than the 
rest of the band. The I band is bisected by a very 
dense, narrow Z line. 

The underlying molecular plan of a sarcomere is 
revealed by cross sections of a myofibril. These cross 
sections show the presence of two kinds of interacting 
protein filaments. The thick filaments have diameters 
of about 15 nm (150 Å) and consist primarily of 
myosin. The thin filaments have diameters of approximately 
8 nm (80 Å) and consist of actin as well as 
tropomyosin and the troponin complex. Muscle contraction 
is achieved through the sliding of the thin 
filaments along the length of the thick filaments, 
driven by the hydrolysis of ATP (Figure 35.18). 

To form the thick filaments, myosin molecules 
self-assemble into thick bipolar structures, with the 
myosin heads protruding at both ends of a bare 
region in the center (Figure 35.19A). Approximately 
500 head domains line the surface of each thick fila- 
ment. Each head-rich region associates with two 
actin filaments, one on each side of the myosin mol- 
ecules (Figure 35.19B). The interaction of individual 
myosin heads with actin units creates the sliding 
force that gives rise to muscle contraction. 


Figure 35.17 Sarcomere. (A) Structure of muscle cell and myofibril 
containing sarcomeres. (B) Electron micrograph of a longitudinal 
section of a skeletal-muscle myofibril showing a single sarcomere. 
(C) Schematic representation of the sarcomere corresponding to the 
regions in the micrograph. [(B) Courtesy of Dr. Hugh Huxley.] 


Tropomyosin and the troponin complex regulate this sliding in response 
to nerve impulses. Under resting conditions, tropomyosin blocks the inti- 
mate interaction between myosin and actin. A nerve impulse leads to an 
increase in calcium ion concentration within the muscle cell. A component 
of the troponin complex senses the increase in Ca²⁺ and, in response, 
relieves the inhibition of myosin–actin interactions by tropomyosin. 

How does the myosin reaction cycle apply to muscle contraction? Recall
that hundreds of head domains project from the ends of each thick
filament. The head domains are paired in myosin dimers, but the two
heads within each dimer act independently. Actin filaments associate
with each head-rich region, with the barbed ends of actin toward the
Z line. In the presence of normal levels of ATP, most of the myosin
heads are detached from actin. Each head can independently hydrolyze
ATP, bind to actin, release Pᵢ, and undergo its power stroke. Because
few other heads are attached, the actin filament is relatively free to
slide. Each head cycles approximately five times per second with a
movement of 110 Å per cycle. However, when hundreds of heads are
interacting with the same actin filament, the overall rate of movement
of myosin relative to the actin filament may reach 80,000 Å per second,
allowing a sarcomere to contract rapidly from its fully relaxed to its
fully contracted form. Having many myosin heads briefly and
independently attaching and moving an actin filament allows for much
greater speed than could be achieved by a single motor protein.
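
The comparison between one head and the whole ensemble can be checked
with a few lines of arithmetic. The sketch below uses only the figures
quoted in the paragraph above; the function name and the reading of
"cycles per second times distance per cycle" as an upper bound for a
lone head are illustrative assumptions, not part of the original
treatment.

```python
# Figures quoted in the text (1 Å = 0.1 nm).
CYCLES_PER_SECOND = 5.0        # ATP-hydrolysis cycles per myosin head per second
STEP_ANGSTROM = 110.0          # filament displacement per cycle
ENSEMBLE_SPEED = 80_000.0      # Å/s, maximum rate quoted for many heads


def single_head_speed(cycles_per_s: float, step_angstrom: float) -> float:
    """Speed (Å/s) if one head alone moved the filament one step per cycle."""
    return cycles_per_s * step_angstrom


head_speed = single_head_speed(CYCLES_PER_SECOND, STEP_ANGSTROM)
speedup = ENSEMBLE_SPEED / head_speed

print(f"single head: {head_speed:.0f} Å/s")
print(f"ensemble:    {ENSEMBLE_SPEED:.0f} Å/s ({speedup:.0f}-fold faster)")
```

Under these assumptions a lone head would manage 550 Å per second, so
the quoted ensemble rate is roughly 145 times as fast.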
Figure 35.19 Thick filament. (A) An electron micrograph of a reconstituted thick filament reveals
the presence of myosin head domains at each end and a relatively narrow central region. A
schematic view below shows how myosin molecules come together to form the thick filament.
(B) A diagram showing the interaction of thick and thin filaments in skeletal-muscle contraction.
[(A, top) Courtesy of Dr. Hugh Huxley.]

Figure 35.18 Sliding-filament model.
Muscle contraction depends on the motion of
thin filaments (blue) relative to thick filaments
(red). [After H. E. Huxley. The mechanism of
muscular contraction. Copyright © 1965 by
Scientific American, Inc. All rights reserved.]

Figure 35.20 Myosin lever-arm length.
Examination of the rates of actin movement
supported by a set of myosin mutants with
different numbers of light-chain binding sites
revealed a linear relation; the greater the
number of light-chain binding sites (and,
hence, the longer the lever arm), the faster
the sliding velocity. [After T. Q. P. Uyeda, P. D.
Abramson, and J. A. Spudich. Proc. Natl. Acad.
Sci. U.S.A. 93:4459–4464, 1996.]

Figure 35.21 Microtubule structure.
Schematic views of the helical structure of a
microtubule. α-Tubulin is shown in dark red
and β-tubulin in light red. (A) Top view.
(B) Side view.


The length of the lever arm determines motor velocity


A key feature of myosin motors is the role of the lever arm as an
amplifier. The lever arm amplifies small structural changes at the
nucleotide-binding site to achieve the 110-Å movement along the actin
filament that takes place in each ATP hydrolysis cycle. A strong
prediction of the mechanism proposed for the movement of myosin along
actin is that the length traveled per cycle should depend on the length
of this lever arm. Thus, the length of the lever arm should influence
the overall rate at which actin moves relative to a collection of
myosin heads.

This prediction was tested with the use of mutated forms of myosin with 
lever arms of different lengths. The lever arm in muscle myosin includes 
binding sites for two light chains (Section 35.1). Thus investigators short- 
ened the lever arm by deleting the sequences that correspond to one or both 
of these binding sites. They then examined the rates at which actin filaments 
were transported along collections of these mutated myosins (Figure 35.20). 
As predicted, the rate decreased as the lever arm was shortened. A mutated 
form of myosin with an unusually long lever arm was generated by inserting 
23 amino acids corresponding to the binding site for an additional regula- 
tory light chain. Remarkably, this form was found to support actin move- 
ment that was faster than that of the wild-type protein. These results strong- 
ly support the proposed role of the lever arm in contributing to myosin 
motor activity. 


35.3 Kinesin and Dynein Move Along Microtubules 


In addition to actin, the cytoskeleton includes other components, notably 
intermediate filaments and microtubules. Microtubules serve as tracks for 
two classes of motor proteins—namely, kinesins and dyneins. Kinesins 
moving along microtubules usually carry cargo such as organelles and 
vesicles from the center of a cell to its periphery. Dyneins are important in 
sliding microtubules relative to one another during the beating of cilia and
flagella on the surfaces of some eukaryotic cells. Additionally, dynein 
carries cargo from the cell periphery to the cell center. 


Some members of the kinesin family are crucial to the transport of
organelles and other cargo to nerve endings at the peripheries of neu- 
rons. It is not surprising, then, that mutations in these kinesins can lead to 
nervous system disorders. For example, mutations in a kinesin called KIF1B 
can lead to the most common peripheral neuropathy (weakness and pain in the 
hands and feet), Charcot-Marie-Tooth disease, which affects 1 in 2500 people. 
A glutamine-to-leucine mutation in the P-loop of the motor domain of this 
kinesin has been found in some affected persons. Knockout mice with a dis- 
ruption of the orthologous gene have been generated. Mice heterozygous for 
the disruption show symptoms similar to those observed in human beings; 
homozygotes die shortly after birth. Mutations in other kinesin genes have 
been linked to human spastic paraplegia. In these disorders, defects in kinesin- 
linked transport may impair nerve function directly, and the decrease in the 
activity of specific neurons may lead to other degenerative processes. 


Microtubules are hollow cylindrical polymers 


Microtubules are a major component of the cytoskeleton. They are built
from two kinds of homologous 50-kd subunits, α- and β-tubulin, which
assemble in a helical array of alternating tubulin types to form the
wall of a hollow cylinder (Figure 35.21). Alternatively, a microtubule
can be regarded as 13 protofilaments that run parallel to its long
axis. The outer diameter of a microtubule is 30 nm, much larger than
that of actin (8 nm). Like actin, microtubules are polar structures.
The minus end of a microtubule is anchored near the center of a cell,
whereas the plus end extends toward the cell surface.

Microtubules are also key components of cilia and flagella present on 
some eukaryotic cells. For example, sperm propel themselves through the 
motion of flagella containing microtubules. The microtubules present in 
these structures adopt a common architecture (Figure 35.22). A bundle of 
microtubules called an axoneme is surrounded by a membrane contiguous 
with the plasma membrane. The axoneme is composed of a peripheral 
group of nine microtubule pairs surrounding two singlet microtubules. This 
recurring motif is often called a 9 + 2 array. Dynein drives the motion of one 
member of each outer pair relative to the other, causing the overall structure 
to bend. 

Microtubules are important in determining the shapes of cells and in
separating daughter chromosomes in mitosis. They are highly dynamic
structures that grow through the addition of α- and β-tubulin to the
ends of existing structures. Like actin, tubulins bind and hydrolyze
nucleoside triphosphates, although, for tubulin, the nucleotide is GTP
rather than ATP. The critical concentration for the polymerization of
the GTP forms of tubulin is lower than that for the GDP forms. Thus, a
newly formed microtubule consists primarily of GTP-tubulins. Through
time, the GTP is hydrolyzed to GDP. The GDP-tubulin subunits in the
interior length of a microtubule remain stably polymerized, whereas GDP
subunits exposed at an end have a strong tendency to dissociate. Marc
Kirschner and Tim Mitchison found that some microtubules in a
population lengthen while others simultaneously shorten. This property,
called dynamic instability, arises from random fluctuations in the
number of GTP- or GDP-tubulin subunits at the plus end of the polymer.
The dynamic character of microtubules is crucial for processes such as
mitosis, which require the assembly and disassembly of elaborate
microtubule-based structures such as spindle fibers.
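
The coexistence of growing and shrinking microtubules can be
illustrated with a toy simulation. This is only a sketch under stated
assumptions: each filament is reduced to two states (a growing state
standing in for a GTP-tubulin end, a shrinking state standing in for an
exposed GDP-tubulin end), and the switching probabilities, growth and
shrinkage increments, and starting length are all hypothetical values
chosen for illustration rather than measured parameters.

```python
import random


def simulate_microtubule(steps, p_cap_loss, p_cap_regain,
                         grow=1, shrink=3, start_length=50, rng=None):
    """Two-state toy model; returns filament length (in subunits) over time.

    All parameters are hypothetical. p_cap_loss is the per-step chance that
    a growing end switches to shrinking; p_cap_regain is the reverse.
    """
    if rng is None:
        rng = random.Random()
    length = start_length
    growing = True
    history = [length]
    for _ in range(steps):
        if growing:
            length += grow
            if rng.random() < p_cap_loss:
                growing = False          # end now behaves as GDP-tubulin
        else:
            length = max(0, length - shrink)
            if length == 0 or rng.random() < p_cap_regain:
                growing = True           # end behaves as GTP-tubulin again
        history.append(length)
    return history


rng = random.Random(0)
population = [simulate_microtubule(200, 0.05, 0.10, rng=rng)
              for _ in range(20)]

# Compare each filament with itself one step earlier at a single instant.
t = 100
n_growing = sum(h[t] > h[t - 1] for h in population)
n_shrinking = sum(h[t] < h[t - 1] for h in population)
print(f"at step {t}: {n_growing} lengthening, {n_shrinking} shortening")
```

The point of the sketch is qualitative: because switching at the end of
each filament is random and independent, a snapshot of the population
generally contains both lengthening and shortening filaments.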


The structure of tubulin was determined at high resolution by electron
crystallographic methods (Figure 35.23). As expected from their 40%
sequence identity, α- and β-tubulin have very similar three-dimensional
structures. Further analysis revealed that the tubulins are members of
the P-loop NTPase family and contain a nucleotide-binding site adjacent
to the P-loop. Tubulins are present only in eukaryotes, although a
prokaryotic homolog has been found. Sequence analysis identified a
prokaryotic protein called FtsZ (for filamentous temperature-sensitive
mutant Z) that is quite similar to the tubulins. The homology was
confirmed when the structure was determined by x-ray crystallography.
Interestingly, this protein participates in bacterial cell division,
forming ring-shaped structures at the constriction that arises when a
cell divides. These observations suggest that tubulins may have evolved
from an ancient cell-division protein.


The continual lengthening and shortening of microtubules is essential
to their role in cell division. Taxol, a compound isolated from the
bark of the Pacific yew tree, was discovered through its ability to
interfere with cell proliferation. Taxol binds to microtubules and
stabilizes the polymerized form. Taxol and its derivatives have been
developed as anticancer agents because they preferentially affect
rapidly dividing cells, such as those in tumors.

Figure 35.22 Microtubule arrangement.
Electron micrograph of a cross section of a
flagellar axoneme shows nine microtubule
doublets surrounding two singlets. [Courtesy
of Dr. Joel Rosenbaum.]

Figure 35.23 Tubulin. Microtubules can be viewed as an
assembly of α-tubulin–β-tubulin dimers. The structures of α-tubulin
and β-tubulin are quite similar. Notice that each includes a P-loop
NTPase domain (shown in purple) and a bound guanine nucleotide.
[Drawn from 1JFF.pdb.]

Figure 35.24 Monitoring movements
mediated by kinesin. (A) The movement of
beads or vesicles, carried by individual kinesin
dimers along a microtubule, can be directly
observed. (B) A trace shows the displacement
of a bead carried by a kinesin molecule.
Multiple steps are taken in the 6-s interval.
The average step size is about 8 nm (80 Å).
[(B) After K. Svoboda et al. Nature 365:721–727, 1993.]


Kinesin motion is highly processive 


Kinesins are motor proteins that move along microtubules. We have seen
that myosin moves along actin filaments by a process in which actin is
released in each cycle; a myosin head group acting independently
dissociates from actin after every power stroke. In contrast, when a
kinesin molecule moves along a microtubule, the two head groups of the
kinesin molecule operate in tandem: one binds, and then the next one
does. A kinesin molecule may take many steps before both head groups
are dissociated at the same time. In other words, the motion of kinesin
is highly processive. Single-molecule measurements allow processive
motion to be observed (Figure 35.24). A single kinesin molecule will
typically take 100 or more steps toward the plus end of a microtubule
in a period of seconds before the molecule becomes detached from the
microtubule. These measurements also revealed that the average step
size is approximately 80 Å, a value that corresponds to the distance
between consecutive α- or β-tubulin subunits along each protofilament.

An additional fact is crucial to the development of a mechanism for
kinesin motion: the addition of ATP strongly increases the affinity of
kinesin for microtubules. This behavior stands in contrast with the
behavior of myosin; ATP binding to myosin promotes its dissociation
from actin. Do these differences imply that kinesin and myosin operate
by completely different mechanisms? Indeed not. Kinesin-generated
movement appears to proceed by a mechanism that is quite similar to
that used by myosin (Figure 35.25). Let us begin with a two-headed
kinesin molecule in its ADP form, dissociated from a microtubule.
Recall that the neck linker binds the head domain when ATP is bound and
is released when ADP is bound. The initial interaction of one of the
head domains with a tubulin dimer on a microtubule stimulates the
release of ADP from this head domain and the subsequent binding of ATP.
The binding of ATP triggers a conformational change in the head domain
that leads to two important events. First, the affinity of the head
domain for the microtubule increases, essentially locking this head
domain in place. Second, the neck linker binds to the head domain. This
change, transmitted through the coiled-coil domain that connects the
two kinesin monomers, repositions the other head domain. In its new
position, the second head domain is close to a second tubulin dimer,
80 Å along the microtubule in the direction of the plus end. Meanwhile,
the intrinsic ATPase activity of the first head domain hydrolyzes the
ATP to ADP and Pᵢ. When the second head domain binds to the
microtubule, the second head releases ADP and binds ATP. Again, ATP
binding favors a conformational change that pulls the first domain
forward. This process can continue for many cycles until, by chance,
both head domains are in the ADP form simultaneously and kinesin
dissociates from the microtubule. Because of the relative rates of the
component reactions, a simultaneous dissociation takes place
approximately every 100 cycles. Thus, kinesin “walks” hand over hand
along the microtubule for approximately 100 steps before both “hands”
simultaneously let go. Although recent studies have led to a structural
model for the dynein power stroke, the mechanism by which it works is
unknown and an interesting research frontier.
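
The statement that kinesin lets go about once every 100 cycles can be
turned into a rough estimate of run length. The sketch below assumes,
purely for illustration, that detachment is an independent event with a
constant probability of 1/100 per cycle, so that run lengths follow a
geometric distribution; the constant and function names are
hypothetical, and the real kinetics are set by the relative rates of
the component reactions.

```python
STEP_ANGSTROM = 80.0            # step size per cycle, from the text
P_DETACH_PER_CYCLE = 1 / 100    # assumption: constant chance of letting go


def mean_run_steps(p_detach: float) -> float:
    """Mean number of cycles up to and including the detaching one."""
    return 1.0 / p_detach


def prob_run_at_least(n_steps: int, p_detach: float) -> float:
    """Probability that the motor is still attached after n_steps cycles."""
    return (1.0 - p_detach) ** n_steps


mean_steps = mean_run_steps(P_DETACH_PER_CYCLE)
mean_distance_nm = mean_steps * STEP_ANGSTROM / 10.0   # 10 Å per nm
p_100 = prob_run_at_least(100, P_DETACH_PER_CYCLE)

print(f"mean run: {mean_steps:.0f} steps, about {mean_distance_nm:.0f} nm")
print(f"chance of surviving 100 steps: {p_100:.2f}")
```

Under this assumption the mean run covers about 800 nm, and a little
over a third of runs last longer than 100 steps.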

Figure 35.25 Kinesin moving along a
microtubule. (1) One head of a two-headed
kinesin molecule, initially with both heads in
the ADP form, binds to a microtubule. (2) The
release of ADP and the binding of ATP results
in a conformational change that locks the
head to the microtubule and pulls the neck
linker (orange) to the head domain, throwing
the second domain toward the plus end of
the microtubule. (3) ATP undergoes hydrolysis
while the second head interacts with the
microtubule. (4) The exchange of ATP for ADP
in the second head pulls the first head off the
microtubule, releasing Pᵢ and moving the first
domain along the microtubule. (5) The cycle
repeats, moving the kinesin dimer farther
down the microtubule.

Figure 35.26 Bacterial flagella. Electron
micrograph of Salmonella typhimurium shows
flagella in a bundle. [Courtesy of Dr. Daniel
Koshland, Jr.]

Figure 35.27 Structure of flagellin. A
bacterial flagellum is a helical polymer of the
protein flagellin. Notice that each subunit
corresponds to a bent structure with a
relatively flat surface facing the hollow core of
the flagellum. [Drawn from 1IO1.pdb.]

Kinesin hydrolyzes ATP at a rate of approximately 80 molecules per
second. Thus, given the step size of 80 Å per molecule of ATP, kinesin
moves along a microtubule at a speed of 6400 Å per second. This rate is
considerably lower than the maximum rate for myosin, which moves
relative to actin at 80,000 Å per second. Recall, however, that myosin
movement depends on the independent action of hundreds of different
head domains working along the same actin filament, whereas the
movement of kinesin is driven by the processive action of kinesin head
groups working in pairs. Muscle myosin evolved to maximize the speed of
the motion, whereas kinesin functions to achieve steady, but slower,
transport in one direction along a filament.
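
The kinesin speed quoted above is simply the ATP turnover rate
multiplied by the step size. A minimal sketch of that arithmetic, using
only numbers from the text (the function and variable names are
illustrative):

```python
def motor_speed(atp_per_second: float, step_angstrom: float) -> float:
    """Speed in Å/s for a motor that advances one step per ATP hydrolyzed."""
    return atp_per_second * step_angstrom


kinesin_speed = motor_speed(80, 80)     # 80 ATP/s, 80 Å per ATP
myosin_max_speed = 80_000               # Å/s, ensemble rate quoted in the text
ratio = myosin_max_speed / kinesin_speed

print(f"kinesin: {kinesin_speed} Å/s")
print(f"myosin ensemble is {ratio} times as fast")
```

The ratio of 12.5 reflects the different designs: hundreds of
independent myosin heads acting on one filament versus a single
processive pair of kinesin heads.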


35.4 A Rotary Motor Drives Bacterial Motion 


In 1 s, a motile bacterium can move approximately 25 μm, or about 10
body lengths. A human being sprinting at a proportional rate would
complete the 100-meter dash in slightly more than 5 s. The motors that
power this impressive motion are strikingly different from the
eukaryotic motors that we have seen so far. In the bacterial motor, an
element spins around a central axis rather than moving along a
polymeric track. The direction of rotation can change rapidly, a
feature that is central to chemotaxis, the process by which bacteria
swim preferentially toward an increasing concentration of certain
useful compounds and away from potentially harmful ones. One type of
flagellar motor, powered by a Na⁺ gradient, turns at a rate of 200,000
revolutions per minute.
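
The sprinting comparison can be reproduced with a short calculation.
The bacterial figures come from the text; the human body length of
1.8 m is an assumption introduced here, so the resulting time is only
an estimate.

```python
BACTERIUM_SPEED_UM_PER_S = 25.0    # from the text
BODY_LENGTHS_PER_S = 10.0          # from the text
HUMAN_BODY_LENGTH_M = 1.8          # assumption: a typical adult height
MOTOR_RPM = 200_000                # Na+-driven motor, from the text

# Body length implied by the two bacterial figures.
bacterium_length_um = BACTERIUM_SPEED_UM_PER_S / BODY_LENGTHS_PER_S

# A human covering the same number of body lengths per second.
human_speed_m_per_s = BODY_LENGTHS_PER_S * HUMAN_BODY_LENGTH_M
dash_time_s = 100.0 / human_speed_m_per_s

# Motor speed converted to revolutions per second.
motor_rev_per_s = MOTOR_RPM / 60.0

print(f"implied bacterial body length: {bacterium_length_um} μm")
print(f"scaled 100-m dash: {dash_time_s:.1f} s")
print(f"motor: about {motor_rev_per_s:.0f} revolutions per second")
```

With the assumed body length, the scaled sprint takes about 5.6 s,
consistent with "slightly more than 5 s", and the motor turns more than
3000 times each second.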


Bacteria swim by rotating their flagella 


Bacteria such as Escherichia coli and Salmonella typhimurium swim by rotat- 
ing flagella that lie on their surfaces (Figure 35.26). When the flagella rotate 
in a counterclockwise direction (viewed from outside a bacterium), the 
separate flagella form a bundle that very efficiently propels the bacterium 
through solution. 

Bacterial flagella are polymers approximately 15 nm in diameter and as
much as 15 μm in length, composed of 53-kd subunits of a protein called
flagellin (Figure 35.27). These subunits associate into a helical
structure that has 5.5 subunits per turn, giving the appearance of 11
protofilaments. Each flagellum has a hollow core. Remarkably, flagella
form not by growing at the base adjacent to the cell body but, instead,
by the addition of new subunits that pass through the hollow core and
add to the free end. Each flagellum is intrinsically twisted in a
left-handed sense. At its base, each flagellum has a rotary motor.




Proton flow drives bacterial flagellar rotation 


Early experiments by Julius Adler demonstrated that ATP is not required
for flagellar motion. What powers these rotary motors? The necessary
free energy is derived from the proton gradient that exists across the
plasma membrane. The flagellar motor is quite complex, containing as
many as 40 distinct proteins (Figure 35.28). Five components
particularly crucial to motor function have been identified through
genetic studies. MotA is a membrane protein that appears to have four
transmembrane helices as well as a cytoplasmic domain. MotB is another
membrane protein with a single transmembrane helix and a large
periplasmic domain. Approximately 11 MotA–MotB pairs form a ring around
the base of the flagellum. The proteins FliG, FliM, and FliN are part
of a disc-like structure called the MS (membrane and supramembrane)
ring, with approximately 30 FliG subunits coming together to form the
ring. The three-dimensional structure of the carboxyl-terminal half of
FliG reveals a wedge-shaped domain with a set of charged amino acids,
conserved among many species, lying along the thick edge of the wedge
(Figure 35.29).

The MotA–MotB pair and FliG combine to create a proton channel that
drives the rotation of the flagellum. How can proton flow across a
membrane drive mechanical rotation? We have seen such a process earlier
in regard to ATP synthase (Section 18.4). Recall that the key to
driving the rotation of the γ subunit of ATP synthase is the a subunit
of the Fo fragment. This subunit appears to have two half-channels;
protons can move across the membrane only by moving into the
half-channel from the side of the membrane with the higher local proton
concentration, binding to a disc-like structure formed by the c
subunits, riding on this structure as it rotates to the opening of the
other half-channel, and exiting to the side with the lower local proton
concentration. Could a similar mechanism apply to flagellar rotation?
Indeed, such a mechanism was first proposed by Howard Berg to explain
flagellar rotation before the rotary mechanism of ATP synthase was
elucidated. Each MotA–MotB pair is conjectured to form a structure that
has two half-channels; FliG serves as the rotating proton carrier,
perhaps with the participation of some of the charged residues
identified in crystallographic studies (Figure 35.30). In this
scenario, a proton from the periplasmic space passes into the outer
half-channel and is transferred to an FliG subunit. The MS ring
rotates, rotating the flagellum with it and allowing the proton to pass
into the inner half-channel and into the cell. Ongoing structural and
mutagenesis studies are testing and refining this hypothesis.

Figure 35.28 Flagellar motor. A schematic view of the
flagellar motor, a complex structure containing as many as
40 distinct types of protein. The approximate positions of the
proteins MotA and MotB (red), FliG (orange), FliN (yellow),
and FliM (green) are shown.


Figure 35.29 Flagellar motor components. Approximately 30 subunits of FliG assemble to
form part of the MS ring. The ring is surrounded by approximately 11 structures consisting of MotA
and MotB. Notice that the carboxyl-terminal domain of FliG includes a ridge lined with charged
residues that may participate in proton transport. [Drawn from 1QC7.pdb.]

Figure 35.30 Proton-transport-coupled
rotation of the flagellum. (A) MotA–MotB
may form a structure having two half-channels.
(B) One model for the mechanism
of coupling rotation to a proton gradient
requires protons to be taken up into the outer
half-channel and transferred to the MS ring.
The MS ring rotates in a counterclockwise
direction, and the protons are released into
the inner half-channel. The flagellum is linked
to the MS ring and so the flagellum rotates as
well.

Figure 35.31 Charting a course. This
projection of the track of an E. coli bacterium
was obtained with a microscope that
automatically follows bacterial motion in three
dimensions. The points show the locations of
the bacterium at 80-ms intervals. [After H. C.
Berg. Nature 254:389–392, 1975.]


Bacterial chemotaxis depends on reversal of the direction of flagellar rotation


Many species of bacteria respond to changes in their environments by
adjusting their swimming behavior. Examination of the paths taken is
highly revealing (Figure 35.31). The bacteria swim in one direction for
some length of time (typically about a second), tumble briefly, and
then set off in a new direction. The tumbling is caused by a brief
reversal in the direction of the flagellar motor. When the flagella
rotate counterclockwise, the helical filaments form a coherent bundle
favored by the intrinsic shape of each filament, and the bacterium
swims smoothly. When the rotation reverses, the bundle flies apart
because the screw sense of the helical flagella does not match the
direction of rotation. Each flagellum then pulls in a different
direction and the cell tumbles.

In the presence of a gradient of certain substances such as glucose, bac- 
teria swim preferentially toward the direction of the higher concentration of 
the substance. Such compounds are referred to as chemoattractants. Bacteria 
also swim preferentially away from potentially harmful compounds such as 
phenol, a chemorepellant. The process of moving in specific directions in 
response to environmental cues is called chemotaxis. In the presence of a 
gradient of a chemoattractant, bacteria swim for longer periods of time 
without tumbling when moving toward higher concentrations of the 
chemoattractant. In contrast, they tumble more frequently when moving 
toward lower concentrations of the chemoattractant. This behavior is reversed 
for chemorepellants. The result of these actions is a biased random walk that 
facilitates net motion toward conditions more favorable to the bacterium. 

Chemotaxis depends on a signaling pathway that terminates at the 
flagellar motor. The signaling pathway begins with the binding of molecules 
to receptors in the plasma membrane (Figure 35.32). In their unoccupied 
forms, these receptors initiate a pathway leading eventually to the phospho- 
rylation of a specific aspartate residue on a soluble protein called CheY. In 
its phosphorylated form, CheY binds to the base of the flagellar motor.
When bound to phosphorylated CheY, the flagellar motor rotates in a 
clockwise rather than a counterclockwise direction, causing tumbling. 

The binding of a chemoattractant to a surface receptor blocks the signal- 
ing pathway leading to CheY phosphorylation. Phosphorylated CheY spon- 
taneously hydrolyzes and releases its phosphoryl group in a process acceler- 
ated by another protein, CheZ. The concentration of phosphorylated CheY 
drops, and the flagella are less likely to rotate in a clockwise direction. Under 
these conditions, bacteria swim smoothly without tumbling. Thus, the revers- 
ible rotary flagellar motor and a phosphorylation-based signaling pathway 
work together to generate an effective means for responding to
environmental conditions.

Bacteria sense spatial gradients of chemoattractants by measurements
separated in time. A bacterium sets off in a random direction and, if the 
concentration of the chemoattractant has increased after the bacterium has 
been swimming for a period of time, the likelihood of tumbling decreases 
and the bacterium continues in roughly the same direction. If the 
concentration has decreased, the tumbling frequency increases and the bac- 
terium tests other random directions. The success of this mechanism once 
again reveals the power of evolutionary problem solving: many possible 
solutions are tried at random, and those that are beneficial are selected and 
exploited. 
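
The biased random walk described above can be sketched as a toy
simulation. The model below is a one-dimensional simplification with
hypothetical parameters: a cell takes unit steps up or down a gradient,
tumbles with a lower probability after a step up the gradient than
after a step down, and picks a fresh random direction on each tumble.
The tumbling probabilities, step counts, and function names are
illustrative assumptions, not measured values.

```python
import random


def run_and_tumble(steps, p_tumble_up, p_tumble_down, rng):
    """Return the net displacement of one cell along the gradient axis.

    direction is +1 (toward higher attractant) or -1 (toward lower).
    After each step the cell tumbles with a probability that depends on
    the direction it has just moved; a tumble picks a new random direction.
    """
    position = 0
    direction = rng.choice((-1, 1))
    for _ in range(steps):
        position += direction
        p_tumble = p_tumble_up if direction > 0 else p_tumble_down
        if rng.random() < p_tumble:
            direction = rng.choice((-1, 1))
    return position


def mean_displacement(n_cells, steps, p_tumble_up, p_tumble_down, seed):
    """Average net displacement over a population of independent cells."""
    rng = random.Random(seed)
    total = sum(run_and_tumble(steps, p_tumble_up, p_tumble_down, rng)
                for _ in range(n_cells))
    return total / n_cells


# No gradient sensed: tumbling is equally likely in both directions.
unbiased = mean_displacement(200, 2000, 0.10, 0.10, seed=1)
# Attractant gradient: fewer tumbles when heading up the gradient.
biased = mean_displacement(200, 2000, 0.05, 0.30, seed=1)

print(f"mean displacement without bias: {unbiased:.1f}")
print(f"mean displacement with bias:    {biased:.1f}")
```

Even though every tumble chooses a direction at random, suppressing
tumbles during favorable runs is enough to produce steady net movement
up the gradient in this model.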


Figure 35.32 Chemotaxis signaling
pathway. Receptors in the plasma
membrane initiate a signaling pathway leading
to the phosphorylation of the CheY protein.
Phosphorylated CheY binds to the flagellar
motor and favors clockwise rotation. When an
attractant binds to the receptor, this pathway
is blocked, and counterclockwise flagellar
rotation and, hence, smooth swimming result.
When a repellant binds, the pathway is
stimulated, leading to an increased
concentration of phosphorylated CheY and,
hence, more-frequent clockwise rotation and
tumbling.


Summary


35.1 Most Molecular-Motor Proteins Are Members of the P-Loop NTPase Superfamily

Eukaryotic cells contain three families of molecular-motor proteins:
myosins, kinesins, and dyneins. These proteins move along tracks
defined by the actin and microtubule cytoskeletons of eukaryotic cells,
contributing to cell and organismal movement and to the intracellular
transport of proteins, vesicles, and organelles. Despite considerable
differences in size and a lack of similarity detectable at the level of
amino acid sequence, these proteins are homologous, containing core
structures of the P-loop NTPase family. The ability of these core
structures to change conformations in response to nucleoside
triphosphate binding and hydrolysis is key to molecular-motor function.
Motor proteins consist of motor domains attached to extended structures
that serve to amplify the conformational changes in the core domains
and to link the core domains to one another or to other structures.


35.2 Myosins Move Along Actin Filaments

The motile structure of muscle consists of a complex of myosin and
actin, along with accessory proteins. Actin, a highly abundant 42-kd
protein, polymerizes to form long filaments. Each actin monomer can
bind either ATP or ADP. A myosin motor domain moves along actin
filaments in a cyclic manner, beginning with nucleotide-free myosin
bound to actin: (1) ATP binds to myosin and the myosin is released from
actin; (2) a reversible conformational change associated with the
hydrolysis of ATP while still bound to myosin leads to a large motion
of a lever arm that extends from the motor domain; (3) myosin with
bound ADP and Pᵢ binds to actin; (4) Pᵢ is released from myosin,
resulting in resetting the position of the lever arm and moving actin
relative to myosin; and (5) the release of ADP returns the motor domain
to its initial state. The length of the lever arm determines the size
of the step taken along actin in each cycle. The ability to monitor
single molecular-motor proteins has provided key tests for hypotheses
concerning motor function. Muscle contraction entails the rapid sliding
of thin filaments, composed of actin, relative to thick filaments,
composed of myosin. The thick filaments consist of multiple myosin
molecules bound together. Each myosin molecule has two heads that can
bind to actin and move it relative to myosin, driven by the hydrolysis
of ATP by myosin. Muscle contraction is regulated by tropomyosin and
the troponin complex. These proteins prevent actin and myosin from
interacting until an increase in calcium concentration associated with
a nerve impulse results in calcium-induced changes in troponin and
tropomyosin.


35.3 Kinesin and Dynein Move Along Microtubules

Kinesin and dynein move along microtubules rather than actin.
Microtubules are polymeric structures composed of α- and β-tubulin, two
very similar guanine-nucleotide-binding proteins. Each microtubule
comprises 13 protofilaments with alternating α- and β-tubulin subunits.
Kinesins move along microtubules by a mechanism quite similar to that
used by myosin to move along actin but with several important
differences. First, ATP binding to kinesin favors motor-domain binding
rather than dissociation. Second, the power stroke is triggered by the
binding of ATP rather than the release of Pᵢ. Finally, kinesin motion
is processive. The two heads of a kinesin dimer work together, taking
turns binding and releasing the microtubule, and many steps are taken
along a microtubule before both heads dissociate. Most kinesins move
toward the plus end of microtubules.


35.4 A Rotary Motor Drives Bacterial Motion

Many motile bacteria use rotating flagella to propel themselves. When
rotating counterclockwise, multiple flagella on the surface of a
bacterium come together to form a bundle that effectively propels the
bacterium through solution. A proton gradient across the plasma
membrane, rather than ATP hydrolysis, powers the flagellar motor. The
mechanism for coupling transmembrane proton transport to macromolecular
rotation appears to be similar to that used by ATP synthase. When
rotating clockwise, the flagella fly apart and the bacterium tumbles.
Bacteria swim preferentially toward chemoattractants in a process
called chemotaxis. When bacteria are swimming in the direction of an
increasing concentration of a chemoattractant, counterclockwise
flagellar motion predominates and tumbling is suppressed, leading to a
biased random walk in the direction of increasing chemoattractant
concentration.


Key Terms

cytoskeleton (p. 1007)
myosin (p. 1008)
kinesin (p. 1008)
dynein (p. 1008)
S1 fragment (p. 1009)
conventional kinesin (p. 1010)
lever arm (p. 1011)
relay helix (p. 1012)
neck linker (p. 1012)
actin (p. 1012)
G-actin (p. 1013)
F-actin (p. 1013)
critical concentration (p. 1014)
optical trap (p. 1015)
myofibril (p. 1016)
sarcomere (p. 1016)
tubulin (p. 1019)


Problems 


1. Diverse motors. Skeletal muscle, eukaryotic cilia, and 
bacterial flagella use different strategies for the conversion 
of free energy into coherent motion. Compare and contrast 
these motility systems with respect to (a) the free-energy 
source and (b) the number of essential components and 
their identity. 


2. You call that slow? At maximum speed, a kinesin molecule
moves at a rate of 6400 Å per second. Given the dimensions
of the motor region of a kinesin dimer of approximately
80 Å, calculate its speed in “body lengths” per second. To
what speed does this body-length speed correspond for an
automobile 10 feet long?


3. Heavy lifting. A single myosin motor domain can gener- 
ate a force of approximately 4 piconewtons (4 pN). How 
many times its “body weight” can a myosin motor domain 
lift? Note that 1 newton = 0.22 pounds (100 g). Assume a 
molecular mass of 100 kd for the motor domain. 


4. Compare and contrast. Describe two similarities and two 
differences between actin filaments and microtubules. 


5. Lighten up. What is the primary role of the light chains 
in myosin? In kinesin? 


6. Rigor mortis. Why does the body stiffen after death? 


7. Now you see it, now you don’t. Under certain stable con- 
centration conditions, actin monomers in their ATP form 
will polymerize to form filaments that disperse again into 
free actin monomers over time. Explain. 


8. Helicases as motors. Helicases can use single-stranded 
DNA as tracks. Consider a helicase that moves one base in 
the 3′ → 5′ direction in each cycle. Assuming that the helicase
can hydrolyze ATP at a rate of 50 molecules per second
in the presence of a single-stranded DNA template, calcu- 
late the velocity of the helicase in micrometers per second. 
How does this velocity compare with that of kinesin? 


9. New moves. When bacteria such as E. coli are starved to 
a sufficient extent, they become nonmotile. However, when 
such bacteria are placed in an acidic solution, they resume 
swimming. Explain. 


power stroke (p. 1015) 


tropomyosin (p. 1016) 
troponin complex (p. 1016) 
microtubule (p. 1018) 


flagellin (p. 1022) 
MotA-MotB pair (p. 1023)
FliG (p. 1023) 
chemoattractant (p. 1024) 
chemorepellant (p. 1024) 
chemotaxis (p. 1024) 

CheY (p. 1024) 


dynamic instability (p. 1019) 


10. Going straight. Suppose you measure the mean distance 
that an E. coli bacterium moves along a straight path before 
it tumbles over a period of time. Would you expect this 
distance to change in the presence of a gradient of a 
chemoattractant? Would it be longer or shorter? 


11. Hauling a load. Consider the action of a single kinesin
molecule in moving a vesicle along a microtubule track. The
force required to drag a spherical particle of radius $a$ at a
velocity $v$ in a medium having a viscosity $\eta$ is


$$F = 6\pi\eta a v$$


Suppose that a 2-μm-diameter bead is carried at a velocity
of 0.6 μm s$^{-1}$ in an aqueous medium ($\eta$ = 0.01 poise =
0.01 g cm$^{-1}$ s$^{-1}$).

(a) What is the magnitude of the force exerted by the kinesin
molecule? Express the value in dynes (1 dyne = 1 g cm s$^{-2}$).


(b) How much work is performed in 1 s? Express the value 
in ergs (1 erg = 1 dyne cm). 


(c) A kinesin motor hydrolyzes approximately 80 mole- 
cules of ATP per second. What is the energy associated 
with the hydrolysis of this much ATP in ergs? Compare this 
value with the actual work performed. 


12. Unusual strides. A publication describes a kinesin mol- 
ecule that is claimed to move along microtubules with a step 
size of 6 nm. You are skeptical. Why? 


13. The sound of one hand clapping. KIF1A is a motor pro- 
tein that moves toward the plus end of microtubules as a 
monomer. KIF1A has only a single motor domain. What 
additional structural elements would you expect to find in 
the KIF1A structure? 


14. Building blocks. Actin filaments, microtubules, and 
bacterial flagella are all built from small subunits. Describe 
three advantages of assembling long filamentous structures 
from subunits rather than from single, long proteins. 


Mechanism Problem 


15. Backward rotation. On the basis of the proposed struc- 
ture in Figure 35.30 for the bacterial flagellar motor, suggest 
a pathway for transmembrane proton flow when the flagellar
motor is rotating clockwise rather than counterclockwise. 


Chapter Integration Problem 


16. Smooth muscle. Smooth muscle, in contrast with skeletal 
muscle, is not regulated by a tropomyosin-troponin mecha- 
nism. Instead, vertebrate smooth-muscle contraction is 
controlled by the degree of phosphorylation of its light 
chains. Phosphorylation induces contraction, and dephos- 
phorylation leads to relaxation. Like that of skeletal muscle, 
smooth-muscle contraction is triggered by an increase in 
the cytoplasmic calcium ion level. Propose a mechanism for 
this action of calcium ion on the basis of your knowledge of 
other signal-transduction processes. 


Data Interpretation Problem 


17. Myosin V. An abundant myosin-family member, 
myosin V is isolated from brain tissue. This myosin has a 
number of unusual properties. First, on the basis of its 
amino acid sequence, each heavy chain has six tandem 
binding sites for calmodulin-like light chains. Second, it 
forms dimers but not higher-order oligomers. Finally, 
unlike almost all other myosin-family members, myosin V 
is highly processive. 

The rate of ATP hydrolysis by myosin has been examined 
as a function of ATP concentration, as shown in graph A. 
(a) Estimate the values of $k_{cat}$ and $K_M$ for ATP.


With the use of optical-trap measurements, the motion 
of single myosin V dimers could be followed, as shown in 
graph B. 
[Based on M. Rief et al. Proc. Natl. Acad. Sci. U. S. A. 97:9482-9486,
2000.]


(b) Estimate the step size for myosin V. 


The rate of ADP release from myosin V is found to be 
approximately 13 molecules per second. 


(c) Combine the observations about the amino acid 
sequence of myosin, the observed step size, and the kinetics 
results to propose a mechanism for the processive motion of 
myosin V. 


Drug Development 


The development of drugs is among the most important interfaces between

biochemistry and medicine. In most cases, drugs act by binding to spe- 
cific receptors or enzymes and inhibiting or otherwise modulating their 
activities. Thus, knowledge of these molecules and the pathways in which 
they participate is crucial to drug development. An effective drug is much 
more than a potent modulator of its target, however. Drugs must be readily 
administered to patients, preferably as small tablets taken orally, and must 
survive within the body long enough to reach their targets. Furthermore, to 
prevent unwanted physiological effects, drugs must not modulate the prop- 
erties of biomolecules other than the target molecules. These requirements 
tremendously limit the number of compounds that have the potential to be 
clinically useful drugs. 

Drugs have been discovered by two fundamentally opposite approaches 
(Figure 36.1). The first approach identifies a substance that has a desirable 
physiological consequence when administered to a human being, to an 
appropriate animal, or to cells. Such substances can be discovered by seren- 
dipity, by the fractionation of plants or other materials known to have 
medicinal properties, or by screening natural products or other “libraries” 
of compounds. In this approach, a biological effect is known before the 
Many drugs are based on natural products. Aspirin (above)
is a chemical derivative of a compound isolated from 
willow bark (near left). Extracts of willow bark had long 
been known to have medicinal properties. The active 
compound was isolated, modified, and, beginning in 
1899, packaged for consumers (far left). [(Far left) Used 
with permission of Bayer Corporation. (Near left) Image 
Ideas/ Picture Quest.] 
Pharmacology

The science that deals with the discovery, 
chemistry, composition, identification, 
biological and physiological effects, uses, 
and manufacture of drugs. 
Figure 36.2 Ligand binding. The titration of
a receptor, R, with a ligand, L, results in the
formation of the complex RL. In uncomplicated
cases, the binding reaction follows a simple
saturation curve. Half of the receptors are
bound to ligand when the ligand concentration
equals the dissociation constant, $K_d$, for the RL
complex.
Figure 36.1 Two paths to drug discovery. (A) A compound is discovered to have a desirable
physiological effect. The molecular target can be identified in a separate step as needed. (B) A 
molecular target is selected first. Drug candidates that bind to the target are identified and then 
examined for their physiological effects. 


molecular target is identified. The mode of action of the substance is only 
later identified after substantial additional work. The second approach 
begins with a known molecular target. Compounds are sought, either by 
screening or by designing molecules with desired properties, that bind to 
the target molecule and modulate its properties. After such compounds are 
available, scientists can explore their effects on appropriate cells or organ- 
isms. Many unexpected results may be encountered in this process as the 
complexity of biological systems reveals itself. 

In this chapter, we explore the science of pharmacology. We examine a 
number of case histories that illustrate drug development—including many 
of its concepts, methods, and challenges. We then see how the concepts 
and tools from genomics are influencing approaches to drug development. 
The chapter ends with a summary of the stages along the way to developing
a drug. 


36.1 The Development of Drugs Presents Huge Challenges 


Many compounds have significant effects when taken into the body, but 
only a very small fraction of them have the potential to be useful drugs. A 
foreign compound, not adapted to its role in the cell through long evolution, 
must have a range of special properties to function effectively without caus- 
ing serious harm. We next consider some of the challenges faced by drug 
developers. 


Drug candidates must be potent modulators of their targets 


Most drugs bind to specific proteins, usually receptors or enzymes, within 
the body. To be effective, a drug needs to bind a sufficient number of its 
target proteins when taken at a reasonable dose. One factor in determining 
drug effectiveness is the strength of the interaction between the drug and its 
target. A molecule that binds to some target molecule is often referred to as 
a ligand. A ligand-binding curve is shown in Figure 36.2. Ligand molecules 
occupy progressively more target binding sites as ligand concentration 
increases until essentially all of the available sites are occupied. The tendency
of a ligand to bind to its target is measured by the dissociation constant,
$K_d$, defined by the expression


$$K_d = \frac{[R][L]}{[RL]}$$


where [R] is the concentration of the free receptor, [L] is the concentration 
of the free ligand, and [RL] is the concentration of the receptor-ligand
complex. The dissociation constant is a measure of the strength of the
interaction between the drug candidate and the target; the lower the value,
the stronger the interaction. The concentration of free ligand at which one-half
of the binding sites are occupied equals the dissociation constant, as long as
the concentration of binding sites is substantially less than the dissociation
constant.
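
The definition of the dissociation constant can be rearranged to give the fraction of binding sites occupied at a given free-ligand concentration: [RL]/([R] + [RL]) = [L]/($K_d$ + [L]). The Python sketch below evaluates this relation for the simple one-site case only. The function name and the 10 nM dissociation constant are illustrative assumptions, not values from the text.

```python
def fraction_bound(free_ligand: float, kd: float) -> float:
    """Fraction of binding sites occupied for simple one-site binding.

    Follows from Kd = [R][L]/[RL]:  [RL]/([R] + [RL]) = [L]/(Kd + [L]).
    Both arguments must be expressed in the same concentration units.
    """
    if free_ligand < 0 or kd <= 0:
        raise ValueError("free_ligand must be >= 0 and kd must be > 0")
    return free_ligand / (kd + free_ligand)


if __name__ == "__main__":
    kd = 10e-9  # hypothetical dissociation constant of 10 nM (an assumption)
    for multiple in (0.1, 1.0, 9.0, 99.0):
        occupancy = fraction_bound(multiple * kd, kd)
        print(f"[L] = {multiple:>5} x Kd -> {occupancy:.1%} of sites occupied")
```

When the free-ligand concentration equals the dissociation constant, the function returns 0.5, the half-saturation point of the binding curve.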

In many cases, biological assays in the context of living cells or tissues 
(rather than direct enzyme or binding assays) are used to examine the 
potency of drug candidates. For example, the fraction of bacteria killed by a 
drug might indicate the potency of a potential antibiotic. In these cases, 
values such as EC$_{50}$ are used. EC$_{50}$ is the concentration of the drug candidate
required to elicit 50% of the maximal biological response (Figure 36.3).
Similarly, EC$_{90}$ is the concentration required to achieve 90% of the maximal
response. In the example of an antibiotic, EC$_{90}$ would be the concentration
required to kill 90% of bacteria exposed to the drug. For drug candidates
that are inhibitors, the corresponding terms IC$_{50}$ and IC$_{90}$ are often used to
describe the concentrations of the inhibitor required to reduce a response to
50% or 10% of its value in the absence of inhibitor, respectively.

Values such as the IC$_{50}$ and EC$_{50}$ are measures of the potency of a drug
candidate in modulating the activity of the desired biological target. To 
prevent unwanted effects, often called side effects, ideal drug candidates 
should not bind biomolecules other than the target to any significant extent. 
Developing such a drug can be quite challenging, particularly if the drug 
target is a member of a large family of evolutionarily related proteins. The 
degree of specificity can be described in terms of the ratio of the $K_d$ values
for the binding of the drug candidate to any other molecules to the $K_d$ value
for the binding of the drug candidate to the desired target.

Many complicating factors are present under physiological conditions. 
Many drug targets also bind ligands normally present in tissues; often, the 
drug and these ligands will compete for binding sites on the target. We 
encountered this situation when we considered competitive inhibitors in 
Chapter 8. Suppose that the drug target is an enzyme and the drug candi- 
date is a competitive inhibitor. The concentration of the drug candidate 
necessary to inhibit the enzyme will depend on the physiological concentra- 
tion of the enzyme’s normal substrate (Figure 36.4). Biochemists Yung-Chi 
Cheng and William Prusoff described the relation between the IC$_{50}$ of an
enzyme inhibitor and its inhibition constant $K_i$ (analogous to the dissociation
constant, $K_d$, of a ligand):


$$\mathrm{IC}_{50} = K_i\left(1 + \frac{[S]}{K_M}\right)$$


This relation, referred to as the Cheng-Prusoff equation, demonstrates that
the IC$_{50}$ of a competitive inhibitor will depend on the concentration and the
Michaelis constant ($K_M$) for the substrate S. The higher the concentration
of the natural substrate, the higher the concentration of drug needed to 
inhibit the enzyme to a given extent. 
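
As a rough numerical illustration of the Cheng-Prusoff relation, the Python sketch below computes the IC$_{50}$ of a competitive inhibitor at several substrate concentrations. The function name and all numerical values (a 5 nM inhibition constant and a 20 μM Michaelis constant) are hypothetical and chosen only to show the trend.

```python
def ic50_competitive(ki: float, substrate: float, km: float) -> float:
    """IC50 of a competitive inhibitor from the Cheng-Prusoff relation.

    IC50 = Ki * (1 + [S]/KM).
    `substrate` and `km` must share one concentration unit; the result is
    returned in the units of `ki`.
    """
    if ki <= 0 or km <= 0 or substrate < 0:
        raise ValueError("ki and km must be > 0 and substrate must be >= 0")
    return ki * (1.0 + substrate / km)


if __name__ == "__main__":
    ki_nm = 5.0    # hypothetical inhibition constant, in nM
    km_um = 20.0   # hypothetical Michaelis constant, in uM
    for s_um in (0.0, 20.0, 200.0):  # substrate concentrations, in uM
        ic50_nm = ic50_competitive(ki_nm, s_um, km_um)
        print(f"[S] = {s_um:>5.0f} uM -> IC50 = {ic50_nm:.0f} nM")
```

With no substrate present the IC$_{50}$ equals $K_i$; raising the substrate concentration to ten times $K_M$ raises the IC$_{50}$ elevenfold.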


Drugs must have suitable properties to reach their targets 


Thus far, we have focused on the ability of molecules to act on specific tar- 
get molecules. However, an effective drug must also have other characteris- 
tics. It must be easily administered and must reach its target at sufficient 
concentration to be effective. A drug molecule encounters a variety of 
obstacles on its way to its target, related to its absorption, distribution, 
metabolism, and excretion after it has entered the body. These processes are 
interrelated, as summarized in Figure 36.5. Taken together, a drug’s ease of 
absorption, distribution, metabolism, and excretion are often referred to as 
ADME (pronounced “add-me”) properties.
Figure 36.3 Effective concentrations. The
concentration of a ligand required to elicit a
biological response can be quantified in terms
of EC$_{50}$, the concentration required to give
50% of the maximum response, and EC$_{90}$,
the concentration required to give 90% of the
maximum response.

Figure 36.4 Inhibitors compete with
substrates for enzyme active sites. The
measured IC$_{50}$ of a competitive inhibitor to its
target enzyme depends on the concentration
of substrate present.

Figure 36.5 Absorption, distribution,
metabolism, and excretion (ADME). The
concentration of a compound at its target site
(yellow) is affected by the extents and rates
of absorption, distribution, metabolism, and
excretion.


Figure 36.6 Lipinski’s rules applied to morphine.
[Structure of morphine (C$_{17}$H$_{19}$O$_3$N): two hydrogen-bond donors; four hydrogen-bond acceptors; molecular weight = 285; log(P) = 1.27.]
Administration and absorption. Ideally, a drug can be taken orally as a
small tablet. An orally administered active compound must be able to sur- 
vive the acidic conditions in the gut and then be absorbed through the intes- 
tinal epithelium. Thus, the compound must be able to pass through cell 
membranes at a significant rate. Larger molecules such as proteins cannot 
be administered orally, because they often cannot survive the acidic condi- 
tions in the stomach and, if they do, are not readily absorbed. Even many 
small molecules are not absorbed well; they may be too polar and not pass 
through cell membranes readily, for example. The ability to be absorbed is 
often quantified in terms of the oral bioavailability. This quantity is defined 
as the ratio of the peak concentration of a compound given orally to the 
peak concentration of the same dose injected directly into the bloodstream. 
Bioavailability can vary considerably from species to species, and so results 
from animal studies may be difficult to apply to human beings. Despite this 
variability, some useful generalizations have been made. One powerful set 
consists of Lipinski’s rules. 
Lipinski’s rules tell us that poor absorption is likely when

1. the molecular weight is greater than 500.
2. the number of hydrogen-bond donors is greater than 5.
3. the number of hydrogen-bond acceptors is greater than 10.
4. the partition coefficient [measured as log(P)] is greater than 5.



The partition coefficient is a way to measure the tendency of a mol- 
ecule to dissolve in membranes, which correlates with its ability to 
dissolve in organic solvents. It is determined by allowing a com- 
pound to equilibrate between water and an organic phase, n-octanol. 
The log(P) value is defined as log$_{10}$ of the ratio of the concentration
of a compound in n-octanol to the concentration of the compound 
in water. For example, if the concentration of the compound in the 
n-octanol phase is 100 times that in the aqueous phase, then log(P) 
is 2. Although the ability of a drug to partition in organic solvents is 
ideal, because it implies that the compound can penetrate mem- 
branes, a log(P) value that is too high suggests that the molecule may 
be poorly soluble in an aqueous environment. 
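
Lipinski's rules and the definition of log(P) lend themselves to a short check. The Python sketch below is a minimal illustration: the class and function names are assumptions, the morphine values are those quoted for Figure 36.6, and the second compound is invented purely to show what a rule violation looks like.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Compound:
    name: str
    molecular_weight: float
    h_bond_donors: int
    h_bond_acceptors: int
    log_p: float


def log_p(conc_octanol: float, conc_water: float) -> float:
    """log10 of the n-octanol/water concentration ratio at equilibrium."""
    if conc_octanol <= 0 or conc_water <= 0:
        raise ValueError("concentrations must be > 0")
    return math.log10(conc_octanol / conc_water)


def lipinski_violations(compound: Compound) -> list:
    """Return a description of each Lipinski rule the compound violates."""
    violations = []
    if compound.molecular_weight > 500:
        violations.append("molecular weight > 500")
    if compound.h_bond_donors > 5:
        violations.append("hydrogen-bond donors > 5")
    if compound.h_bond_acceptors > 10:
        violations.append("hydrogen-bond acceptors > 10")
    if compound.log_p > 5:
        violations.append("log(P) > 5")
    return violations


# Values quoted for morphine in Figure 36.6.
morphine = Compound("morphine", 285.0, 2, 4, 1.27)
# Invented compound, for illustration only.
hypothetical = Compound("hypothetical compound", 650.0, 6, 8, 3.0)

if __name__ == "__main__":
    for c in (morphine, hypothetical):
        print(c.name, "->", lipinski_violations(c) or "no violations")
    # A 100-fold preference for n-octanol corresponds to log(P) = 2.
    print("log(P) for a 100:1 ratio:", round(log_p(100.0, 1.0), 3))
```

An empty list means only that none of the four warning signs is present; as the text notes, the rules are guidelines rather than guarantees of good or poor absorption.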


Morphine, for example, satisfies all of Lipinski’s rules and has
moderate bioavailability (Figure 36.6). A drug that violates one or
more of these rules may still have satisfactory bioavailability. Nonetheless,
these rules serve as guiding principles for evaluating new drug candidates.

[Figure 36.6 caption, continued: Morphine satisfies all of Lipinski’s rules and has an oral
bioavailability in human beings of 33%.]

Figure 36.7 Structure of the drug
carrier human serum albumin.
Seven hydrophobic molecules (in red)
are shown bound to the molecule.
[Drawn from 1BKE.pdb.]


Distribution. Compounds taken up by intestinal epithelial cells can pass 
into the bloodstream. However, hydrophobic compounds and many others 
do not freely dissolve in the bloodstream. These compounds bind to pro- 
teins, such as albumin (Figure 36.7), that are abundant in the blood serum 
and by this means are carried everywhere that the bloodstream goes. 

When a compound has reached the bloodstream, it is distributed to 
different fluids and tissues, which are often referred to as compartments. 
Some compounds are highly concentrated in their target compartments, 
either by binding to the target molecules themselves or by other mechanisms. 
Other compounds are distributed more widely (Figure 36.8). An effective 
drug will reach the target compartment in sufficient quantity; the concen- 
tration of the compound in the target compartment is reduced whenever the 
compound is distributed into other compartments. 

Some target compartments are particularly hard to reach. Many compounds
are excluded from the central nervous system by the blood-brain
barrier, the tight junctions between endothelial cells that line blood vessels 
within the brain and spinal cord. 
Figure 36.8 Distribution of the drug
fluconazole. After having been taken in,
compounds distribute themselves to various
organs within the body. The distribution of the
antifungal agent fluconazole has been
monitored through the use of positron
emission tomography (PET) scanning. These
images were taken of a healthy human
volunteer 90 minutes after injection of a dose
of 5 mg kg$^{-1}$ of fluconazole containing trace
amounts of fluconazole labeled with the
positron-emitting isotope $^{18}$F. [From A. J.
Fischman et al., Antimicrob. Agents
Chemother. 37:1270-1277, 1993.]
Figure 36.9 P450 conversion of
ibuprofen. Cytochrome P450 isozymes,
primarily in the liver, catalyze xenobiotic
metabolic reactions such as hydroxylation. The
reaction introduces an oxygen atom derived
from molecular oxygen.


Metabolism and excretion. A final challenge to a potential drug molecule 
is to evade the body’s defenses against foreign compounds. Many such com- 
pounds (called xenobiotic compounds) are released from the body in the urine 
or stool, often after having been metabolized to aid in excretion. This drug 
metabolism poses a considerable threat to drug effectiveness because the 
concentration of the desired compound decreases as it is metabolized. Thus, 
a rapidly metabolized compound must be administered more frequently or 
at higher doses. 

Two of the most common pathways in xenobiotic metabolism are oxida- 
tion and conjugation. Oxidation reactions can aid excretion in at least two 
ways: by increasing water solubility, and thus ease of transport, and by 
introducing functional groups that participate in subsequent metabolic 
steps. These reactions are often promoted by cytochrome P450 enzymes in 
the liver (Section 26.4). The human genome encodes more than 50 different 
P450 isozymes, many of which participate in xenobiotic metabolism. A typi- 
cal reaction catalyzed by a P450 isozyme is the hydroxylation of ibuprofen 
(Figure 36.9). 


[Reaction scheme (Figure 36.9): ibuprofen + NADPH + H$^+$ + O$_2$ → hydroxylated ibuprofen + NADP$^+$ + H$_2$O]

Conjugation is the addition of particular groups to the xenobiotic com- 
pound. Common groups added are glutathione (Section 20.5), glucuronic 
acid, and sulfate (Figure 36.10). These additions often increase water solu- 
bility and provide labels that can be recognized to target excretion. Examples 
of conjugation include the addition of glutathione to the anticancer drug 
cyclophosphamide, the addition of glucuronidate to the analgesic mor- 
phine, and the addition of a sulfate group to the hair-growth stimulator 
minoxidil. 


[Structures shown in Figure 36.10: morphine glucuronidate; minoxidil sulfate.]


Interestingly, the sulfation of minoxidil produces a compound that is more 
active in stimulating hair growth than is the unmodified compound. Thus, 
the metabolic products of a drug, though usually less active than the drug, 
can sometimes be more active. 

Note that an oxidation reaction often precedes conjugation because the 
oxidation reaction can generate hydroxyl and other groups to which groups
such as glucuronic acid can be added. The oxidation reactions of xenobiotic
compounds are often referred to as phase I transformations, and the conjuga- 
tion reactions are referred to as phase II transformations. These reactions 
take place primarily in the liver. Because blood flows from the intestine 
directly to the liver through the portal vein, xenobiotic metabolism often 
alters drug compounds before they ever reach full circulation. This first-pass 
metabolism can substantially limit the availability of compounds taken orally. 

After compounds have entered the bloodstream, they can be removed 
from circulation and excreted from the body by two primary pathways. 
First, they can be absorbed through the kidneys and excreted in the urine. 
In this process, the blood passes through glomeruli, networks of fine capil- 
laries in the kidney that act as filters. Compounds with molecular weights 
less than approximately 60,000 pass through the glomeruli. Many of the water
molecules, glucose molecules, nucleotides, and other low-molecular-weight 
compounds that pass through the glomeruli are reabsorbed into the blood- 
stream, either by transporters that have broad specificities or by the passive 
transfer of hydrophobic molecules through membranes. Drugs and metab- 
olites that pass through the first filtration step and are not reabsorbed are 
excreted. 

Second, compounds can be actively transported into bile, a process 
that takes place in the liver. After concentration in the gall bladder, bile 
flows into the intestine. In the intestine, the drugs and metabolites can be 
excreted through the stool, reabsorbed into the bloodstream, or further 
degraded by digestive enzymes. Sometimes, compounds are recycled from 


Figure 36.10 Conjugation reactions. 
Compounds that have appropriate groups are 
often modified by conjugation reactions. Such 
reactions include the addition of glutathione 
(top), glucuronic acid (middle), or sulfate 
(bottom). The conjugated product is shown 
boxed. 


Figure 36.11 Enterohepatic cycling. Some 
drugs can move from the blood circulation to 
the liver, into the bile, into the intestine, to the 
liver, and back into circulation. This cycling 
decreases the rate of drug excretion. 
Figure 36.12 Half-life of drug excretion.
In the case shown, the concentration of a
drug in the bloodstream decreases to one-half
of its value in a period of time, $t_{1/2}$, referred
to as its half-life.


the bloodstream into the intestine and back into the bloodstream, a process 
referred to as enterohepatic cycling (Figure 36.11). This process can signifi- 
cantly decrease the rate of excretion of some compounds because they escape 
from an excretory pathway and reenter the circulation. 

The kinetics of compound excretion is often complex. In some cases, a 
fixed percentage of the remaining compound is excreted over a given period 
of time (Figure 36.12). This pattern of excretion results in exponential loss 
of the compound from the bloodstream that can be characterized by a half- 
life (t1/2). The half-life is the fixed period of time required to eliminate 50% 
of the remaining compound. It is a measure of how long an effective concen- 
tration of the compound remains in the system after administration. As 
such, the half-life is a major factor in determining how often a drug must be 
taken. A drug with a long half-life might need to be taken only once per day, 
whereas a drug with a short half-life might need to be taken three or four 
times per day. 
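
For the simple case in which a fixed percentage of the remaining compound is lost per unit time, the fraction remaining after a time t is (1/2) raised to the power t/t1/2. The Python sketch below works through this relation. The function names, the 6-hour half-life, and the concentrations are illustrative assumptions, and the model does not cover the more complex excretion kinetics mentioned in the text.

```python
import math


def fraction_remaining(elapsed: float, half_life: float) -> float:
    """Fraction of compound left after `elapsed` time (same units as half_life).

    Assumes simple exponential loss: one-half is eliminated per half-life.
    """
    if half_life <= 0 or elapsed < 0:
        raise ValueError("half_life must be > 0 and elapsed must be >= 0")
    return 0.5 ** (elapsed / half_life)


def time_to_reach(initial_conc: float, target_conc: float, half_life: float) -> float:
    """Time for the concentration to fall from initial_conc to target_conc."""
    if not 0 < target_conc <= initial_conc:
        raise ValueError("require 0 < target_conc <= initial_conc")
    if half_life <= 0:
        raise ValueError("half_life must be > 0")
    return half_life * math.log2(initial_conc / target_conc)


if __name__ == "__main__":
    half_life_h = 6.0  # hypothetical half-life in hours (an assumption)
    for t in (6.0, 12.0, 24.0):
        print(f"after {t:>4.0f} h: {fraction_remaining(t, half_life_h):.1%} remains")
    # Hours until a hypothetical 100-unit concentration falls to 25 units.
    print("time to fall to one-quarter:", time_to_reach(100.0, 25.0, half_life_h), "h")
```

Under these assumptions, two half-lives leave one-quarter of the compound, which is why a short half-life calls for more frequent dosing.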


Toxicity can limit drug effectiveness 


An effective drug must not be so toxic that it seriously harms the person 
who takes it. A drug may be toxic for any of several reasons. First, it may 
modulate the target molecule itself too effectively. For example, the presence 
of too much of the anticoagulant drug coumadin can result in dangerous, 
uncontrolled bleeding and death. Second, the compound may modulate the 
properties of proteins that are distinct from, but related to, the target mol- 
ecule itself. Compounds that are directed to one member of a family of 
enzymes or receptors often bind to other family members. For example, an 
antiviral drug directed against viral proteases may be toxic if it also inhibits 
proteases normally present in the body such as those that regulate blood 
pressure. 

A compound may also be toxic if it modulates the activity of a protein 
unrelated to its intended target. For example, many compounds block ion 
channels such as the potassium channel hERG (p. 393), causing potentially 
life-threatening disturbances of the heartbeat. To prevent cardiac side effects,
many compounds are screened for their ability to block such channels. 

Finally, even if a compound is not itself toxic, its metabolic by-products 
may be. Phase I metabolic processes can generate damaging reactive groups 
in products. An important example is liver toxicity observed with large 
doses of the common pain reliever acetaminophen (Figure 36.13). A par- 
ticular cytochrome P450 isozyme oxidizes acetaminophen to N-acetyl-p- 
benzoquinone imine. The resulting compound is conjugated to glutathione. 
With large doses, however, the liver concentration of glutathione drops 
dramatically, and the liver is no longer able to protect itself from this reac- 
tive compound and others. Initial symptoms of excessive acetaminophen 
include nausea and vomiting. Within 24 to 48 hours, symptoms of liver 
failure may appear. Acetaminophen poisoning accounts for about 35% of 
cases of severe liver failure in the United States. A liver transplant is often 
the only effective treatment. 

The toxicity of a drug candidate can be described in terms of the thera- 
peutic index. This measure of toxicity is determined through animal tests, 
usually with mice or rats. The therapeutic index is defined as the ratio of the 
dose of a compound that is required to kill one-half of the animals (referred 
to as the LD$_{50}$ for “lethal dose”) to a comparable measure of the effective
dose, usually the EC$_{50}$. Thus, if the therapeutic index is 1000, then lethality is
significant only when 1000 times the effective dose is administered. Analogous
indices can provide measures of toxicity less severe than lethality. 
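
The therapeutic index is a simple ratio, as the Python sketch below illustrates. The function name and the dose values are hypothetical; the only requirement is that both doses be expressed in the same units.

```python
def therapeutic_index(ld50: float, ec50: float) -> float:
    """Ratio of the lethal dose for one-half of the animals to the effective dose.

    Both doses must be in the same units (for example, mg per kg body weight).
    """
    if ld50 <= 0 or ec50 <= 0:
        raise ValueError("ld50 and ec50 must be > 0")
    return ld50 / ec50


if __name__ == "__main__":
    # Hypothetical doses in mg per kg, chosen only for illustration.
    print(therapeutic_index(ld50=500.0, ec50=0.5))
```

A larger ratio indicates a wider margin between the effective dose and the lethal dose.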

Many compounds have favorable properties in vitro, yet fail when 
administered to a living organism because of difficulties with ADME and 
Figure 36.13 Acetaminophen toxicity. A minor metabolic product of acetaminophen is
N-acetyl-p-benzoquinone imine. This metabolite is conjugated to glutathione. Large doses of acetaminophen
can deplete liver glutathione stores.


toxicity. Expensive and time-consuming animal studies are required to 
verify that a drug candidate is not toxic, yet differences between animal spe- 
cies in their response can confound decisions about moving forward with a 
compound toward human studies. One hope is that, with more understand- 
ing of the biochemistry of these processes, scientists can develop computer- 
based models to replace or augment animal tests. Such models would need 
to accurately predict the fate of a compound inside a living organism from 
its molecular structure or other properties that are easily measured in the 
laboratory without the use of animals. 


36.2 Drug Candidates Can Be Discovered by Serendipity, 
Screening, or Design 


Traditionally, many drugs were discovered by serendipity, or chance obser- 
vation. More recently, drugs have been discovered by screening collections 
of natural products or other compounds for compounds that have desired 
medicinal properties. Alternatively, scientists have designed specific drug 
candidates by using their knowledge about a preselected molecular target. 
We will examine several examples of each of these pathways to reveal com- 
mon principles. 


Serendipitous observations can drive drug development 


Perhaps the best-known observation in the history of drug development
is Alexander Fleming’s chance finding in 1928 that colonies of the
bacterium Staphylococcus aureus died when they were adjacent to colonies of 
the mold Penicillium notatum. Spores of the mold had landed accidentally on 
plates growing the bacteria. Fleming soon realized that the mold produced 
a substance that could kill disease-causing bacteria. This discovery led to a 
fundamentally new approach to the treatment of bacterial infections. 
Howard Florey and Ernest Chain developed a powdered form of the substance, 
termed penicillin, that became a widely used antibiotic in the 1940s. 

The structure of this antibiotic was elucidated in 1945. The most notable 
feature of this structure is the four-membered β-lactam ring. This unusual
feature is key to the antibacterial function of penicillin, as noted earlier 
(Section 8.5). 

Three steps were crucial to fully capitalize on Fleming’s discovery. First,
an industrial process was developed for the production of penicillin from 
Penicillium mold on a large scale. Second, penicillin and penicillin deriva- 
tives were chemically synthesized. The availability of synthetic penicillin 
derivatives opened the way for scientists to explore the relations between 
structure and function. Many such penicillin derivatives have found wide- 
spread use in medicine. Finally, in 1965, Jack Strominger and James Park 
independently determined that penicillin exerts its antibiotic activity by 
blocking a critical transpeptidase reaction in bacterial cell-wall biosynthesis 
(Figure 36.14), as introduced in Section 8.5. 

Many other drugs have been discovered by serendipitous observations. 
The antipsychotic drug chlorpromazine (Thorazine) was discovered in the
course of investigations directed toward the treatment of shock in surgical 
patients. In 1952, French surgeon Henri Laborit noticed that, after taking 
the compound, his patients were remarkably calm. This observation sug- 
gested that chlorpromazine could benefit psychiatric patients, and, indeed, 
the drug has been used for many years to treat patients with schizophrenia 
and other disorders. The drug does have significant side effects, and its use 
has been largely superseded by more recently developed drugs.

Chlorpromazine acts by binding to receptors for the neurotransmitter 
dopamine and blocking them (Figure 36.15). Dopamine D2 receptors are 
the targets of many other psychoactive drugs. In the search for drugs with 
more-limited side effects, studies are undertaken to correlate drug effects 
with biochemical parameters such as dissociation constants and binding 
and release rate constants. 
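
As a rough illustration of how these parameters relate to one another, the sketch below assumes simple reversible one-site binding, for which the dissociation constant is the release rate constant divided by the binding rate constant. The function names and the rate constants are hypothetical and are not measurements for any real drug.

```python
def dissociation_constant(k_off: float, k_on: float) -> float:
    """Kd (M) from the release rate constant (s^-1) and binding rate constant (M^-1 s^-1)."""
    return k_off / k_on


def fraction_bound(ligand_conc: float, kd: float) -> float:
    """Fraction of receptors occupied at a given free ligand concentration (M)."""
    return ligand_conc / (ligand_conc + kd)


# Hypothetical rate constants, chosen only for illustration (Kd is about 1e-8 M).
kd = dissociation_constant(k_off=0.01, k_on=1.0e6)

# When the free ligand concentration equals Kd, half of the receptors are occupied.
print(fraction_bound(kd, kd))  # 0.5
```

Under this simple model, a drug with a smaller dissociation constant occupies the same fraction of receptors at a lower concentration.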


A more-recent example of a drug discovered by chance observation is 
sildenafil (Viagra). This compound was developed as an inhibitor of phosphodiesterase 5,
an enzyme that catalyzes the hydrolysis of cGMP to GMP
(Figure 36.16). The compound was intended as a treatment for hyperten- 
sion and angina because cGMP plays a central role in the relaxation of 
smooth-muscle cells in blood vessels (Figure 36.17). The inhibition of phos- 
phodiesterase 5 was expected to increase the concentration of cGMP by 
blocking the pathway for its degradation. In the course of early clinical trials 
in Wales, some men reported unusual penile erections. Whether this chance 
observation by a few men was due to the compound or to other effects was 
unclear. However, the observation made some biochemical sense because 
smooth-muscle relaxation due to increased cGMP levels had been discov- 
ered to play a role in penile erection. Subsequent clinical trials directed 
toward the evaluation of sildenafil for erectile dysfunction were successful. 
This account testifies to the importance of collecting comprehensive infor- 
mation from clinical-trial participants. In this case, incidental observations 
led to a new treatment for erectile dysfunction and a multibillion-dollar- 
per-year drug market. 

Figure 36.16 Sildenafil, a mimic of cGMP. Sildenafil was designed to resemble cGMP, the
substrate of phosphodiesterase 5. 


Screening libraries of compounds can yield drugs or drug leads 


No drug is as widely used as aspirin. Observers at least as far back as
Hippocrates (~400 B.C.) have noted the use of extracts from the bark and
leaves of the willow tree for pain relief. In 1829, a mixture called salicin was
isolated from willow bark. Subsequent analysis identified salicylic acid as the
active component of this mixture. Salicylic acid was formerly used to treat
pain, but this compound often irritated the stomach. Several investigators
attempted to find a means to neutralize salicylic acid. Felix Hoffmann, a
chemist working at the German company Bayer, developed a less-irritating
derivative by treating salicylic acid with a base and acetyl chloride. This
derivative, acetylsalicylic acid, was named aspirin from “a” for acetyl chloride,
“spir” for Spiraea ulmaria (meadowsweet, a flowering plant that also contains
salicylic acid), and “in” (a common ending for drugs). Each year, approximately
35,000 tons of aspirin are taken worldwide, nearly the weight of the
Titanic.

[Structures: salicylic acid; aspirin (acetylsalicylic acid), with the acetyl group labeled;
compactin; lovastatin.]

Figure 36.15 Chlorpromazine targets.
This positron emission tomographic image
shows the distribution of dopamine D2
receptors in the brain. The colors on the
bar at the right represent the relative
distribution of the receptor: white regions
have the highest concentration; dark blue
regions have no receptors. These sites are
blocked by treatment with chlorpromazine.
[From C. Trichard et al. Am. J. Psychiatry
155:505-508, 1998; reprinted with
permission conveyed through Copyright
Clearance Center, Inc.]

Figure 36.17 Muscle-relaxation pathway.
Increases in nitric oxide levels stimulate
guanylate cyclase, which produces cGMP.
The increased cGMP concentration promotes
smooth-muscle relaxation. Phosphodiesterase
5 hydrolyzes cGMP, which lowers the
cGMP concentration. The inhibition of
phosphodiesterase 5 by sildenafil maintains
elevated levels of cGMP.

As discussed in Chapter 12, the acetyl group in aspirin is transferred to 
the side chain of a serine residue that lies along the path to the active site of the 
cyclooxygenase component of prostaglandin H2 synthase (see Figure 12.25).
In this position, the acetyl group blocks access to the active site. Thus, even 
though aspirin binds in the same pocket on the enzyme as salicylic acid, the 
acetyl group of aspirin dramatically increases its effectiveness as a drug. The 
account illustrates the value of screening extracts from plants and other 
materials that are believed to have medicinal properties for active compounds.
The many herbal and folk medicines are a treasure trove
of new drug leads.

More than 100 years ago, a fatty, yellowish material was discovered on 
the arterial walls of patients who had died of vascular disease. The presence 
of the material was termed atheroma from the Greek word for porridge. This 
material proved to be cholesterol. The Framingham heart study, initiated in 
1948, documented a correlation between high blood-cholesterol levels and 
high mortality rates from heart disease. This observation led to the notion 
that blocking cholesterol synthesis might lower blood-cholesterol levels 
and, in turn, lower the risk of heart disease. Drug developers had to aban- 
don an initial attempt at blocking the cholesterol-synthesis pathway at a late 
step because cataracts and other side effects developed, caused by the accu- 
mulation of the insoluble substrate for the inhibited enzyme. Investigators 
eventually identified a more-favorable target—namely, the enzyme HMG- 
CoA reductase (Section 26.2). This enzyme acts on a substrate, HMG-CoA 
(3-hydroxy-3-methylglutaryl coenzyme A), that can be used by other path- 
ways and is water soluble. 

A promising natural product, compactin, was discovered in a screen of 
compounds from a fermentation broth from Penicillium citrinum in a search 
for antibacterial agents. In some, but not all, animal studies, compactin was 
found to inhibit HMG-CoA reductase and to lower serum-cholesterol lev- 
els. In 1982, a new HMG-CoA reductase inhibitor was discovered in a 
fermentation broth from Aspergillus terreus. This compound, now called
lovastatin, was found to be structurally very similar to compactin, bearing 
one additional methyl group. 

In clinical trials, lovastatin significantly reduced serum-cholesterol 
levels with few side effects. Most side effects could be prevented by treat- 
ment with mevalonate (the product of HMG-CoA reductase), indicating 
that the side effects were likely due to the highly effective blocking of 
HMG-CoA reductase. One notable side effect is muscle pain or weakness 
(termed myopathy), although its cause remains to be fully established. After 
many studies, the Food and Drug Administration (FDA) approved lovasta- 
tin for treating high serum-cholesterol levels. 

A structurally related HMG-CoA reductase inhibitor was later shown 
to cause a statistically significant decrease in deaths due to coronary heart 
disease. This result validated the benefits of lowering serum-cholesterol 
levels. Further mechanistic analysis revealed that the HMG-CoA reductase 
inhibitor acts not only by lowering the rate of cholesterol biosynthesis, but 
also by inducing the expression of the low-density-lipoprotein (LDL) recep- 
tor (Section 26.3). Cells with such receptors remove LDL particles from the 
bloodstream, and so these particles cannot contribute to atheroma. 

Later work led to the development
of totally synthetic molecules that are more-potent inhibitors of HMG-CoA
reductase (Figure 36.18). These compounds are effective at lower dose
levels, reducing side effects.

The original HMG-CoA reductase inhibitors or their precursors were 
found by screening libraries of natural products. More recently, drug devel- 
opers have tried screening large libraries of both natural products and purely 
synthetic compounds prepared in the course of many drug-development 
programs. Under favorable circumstances, hundreds of thousands or even 
millions of compounds can be tested in this process, termed high-throughput 
screening. Compounds in these libraries can be synthesized one at a time for 
testing. An alternative approach is to synthesize a large number of structur- 
ally related compounds that differ from one another at only one or a few 
positions all at once. This approach is often termed combinatorial chemistry. 
Here, compounds are synthesized with the use of the same chemical reac- 
tions but a variable set of reactants. Suppose that a molecular scaffold is 
constructed with two reactive sites and that 20 reactants can be used in the 
first site and 40 reactants can be used in the second site. A total of 20 × 40 =
800 possible compounds can be produced.

A key method in combinatorial chemistry is split-pool synthesis 
(Figure 36.19). The method depends on solid-phase synthetic methods, 
first developed for the synthesis of peptides (Section 3.5). Compounds are 
synthesized on small beads. Beads containing an appropriate starting scaf- 
fold are produced and divided (split) into n sets, with n corresponding to the 
number of building blocks to be used at one site. Reactions adding the reac- 
tants at the first site are run, and the beads are isolated by filtration. The 
n sets of beads are then combined (pooled), mixed, and split again into m sets, 
with m corresponding to the number of reactants to be used at the second 
site. Reactions adding these m reactants are run, and the beads are again 
isolated. The important result is that each bead contains only one com- 
pound, even though the entire library of beads contains many. Furthermore, 
although only n + m reactions were run, n × m compounds are produced.
With the preceding values for n and m, 20 + 40 = 60 reactions produce
20 × 40 = 800 compounds. In some cases, assays can be performed directly
with the compounds still attached to the bead to find compounds with 
desired properties (Figure 36.20). Alternatively, each bead can be isolated 
and the compound can be cleaved from the bead to produce free compounds 
for analysis. After an interesting compound has been identified, analytical 
methods of various types must be used to identify which of the n × m
compounds is present.
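
The bookkeeping behind split-pool synthesis can be sketched as a small simulation. This is an illustration under stated assumptions, not a description of any laboratory protocol: the function name, the reactant labels, the number of beads, and the even division of beads among reaction vessels are all hypothetical.

```python
import random


def split_pool(n_beads, first_site, second_site, seed=0):
    """Simulate two rounds of split-pool synthesis; return (library, reactions_run)."""
    rng = random.Random(seed)
    beads = [[] for _ in range(n_beads)]   # each bead records the reactants it received
    reactions_run = 0
    for reactants in (first_site, second_site):
        rng.shuffle(beads)                 # pool and mix all of the beads
        for i, bead in enumerate(beads):   # split into one set per reactant
            bead.append(reactants[i % len(reactants)])
        reactions_run += len(reactants)    # one reaction vessel per reactant
    return [tuple(bead) for bead in beads], reactions_run


first = [f"A{i}" for i in range(1, 21)]    # 20 hypothetical reactants for the first site
second = [f"B{j}" for j in range(1, 41)]   # 40 hypothetical reactants for the second site
library, reactions = split_pool(20_000, first, second)

print(reactions)           # 60 reactions (20 + 40)
print(len(set(library)))   # number of distinct compounds; at most 20 × 40 = 800
```

Each bead ends up carrying exactly one pair of reactants, that is, one compound. With many more beads than possible compounds, nearly every one of the n × m combinations is expected to appear somewhere in the library.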


Figure 36.19 Split-pool synthesis. 
Reactions are performed on beads. Each of 
the reactions with the first set of reactants is 
performed on a separate set of beads. The 
beads are then pooled, mixed, and split into 
sets. The second set of reactants is then 
added. Many different compounds will be 
produced, but all of the compounds on a 
single bead will be identical. 


Figure 36.20 Screening a library of
synthesized carbohydrates. A small
combinatorial library of carbohydrates
synthesized on the surface of 130-µm beads
is screened for carbohydrates that are bound
tightly by a lectin from peanuts. Beads that
have such carbohydrates are darkly stained
through the action of an enzyme linked to the
lectin. [From R. Liang et al., Proc. Natl. Acad.
Sci. U.S.A. 94:10554–10559, 1997; ©
2004 National Academy of Sciences, U.S.A.]

Note that the “universe” of druglike compounds is vast. More than an
estimated 10^40 compounds are possible with molecular weights less than 750.
Thus, even with “large” libraries of millions of compounds, only a tiny frac- 
tion of the chemical possibilities are present for study. 


Drugs can be designed on the basis of three-dimensional structural 
information about their targets 


Many drugs bind to their targets in a manner reminiscent of Emil Fischer’s 
lock and key (see Figure 8.8). Therefore, we should be able to design a key, 
given enough knowledge about the shape and chemical composition of the 
lock. In the idealized case, we would like to design a small molecule that is 
complementary in shape and electronic structure to a target protein so that 
it binds effectively to the targeted site. Despite our ability to determine 
three-dimensional structures rapidly, the achievement of this goal remains 
in the future. Designing stable compounds from scratch that have the cor- 
rect shape and other properties to fit precisely into a binding site is difficult 
because predicting the structure that will best fit into a binding site is dif- 
ficult. Prediction of binding affinity requires a detailed understanding of 
the interactions between a compound and its binding partner and of the 
interactions between the compound and the solvent when the compound is 
free in solution. 


Figure 36.21 Initial design of
an HIV protease inhibitor. This
compound was designed by
combining part of one compound
with good inhibition activity but
poor solubility (shown in red) with
part of another compound with
better solubility (shown in blue).


Nonetheless, structure-based drug design has proved to be a powerful tool 
in drug development. Among its most prominent successes has been the 
development of drugs that inhibit the protease from HIV. Consider
the development of the protease inhibitor indinavir (Crixivan; p. 264). Two 
sets of promising inhibitors that had high potency but poor solubility and 
bioavailability were discovered. X-ray crystallographic analysis and molec- 
ular-modeling findings suggested that a hybrid molecule might have both 
high potency and improved bioavailability (Figure 36.21). The synthesized 
hybrid compound did show improvements but required further optimization. 
The structural data suggested one point where modifications could be toler- 
ated. A series of compounds were produced and examined (Figure 36.22). The 
most-active compound showed poor bioavailability, but one of the other 
compounds showed good bioavailability and acceptable activity. The maxi- 
mum serum concentration available through oral administration was sig- 
nificantly higher than the levels required to suppress replication of the 
virus. This drug, as well as other protease inhibitors developed at about the 
same time, has been used in combination with other drugs to treat AIDS 
with much more encouraging results than had been obtained previously 
(Figure 36.23). 

Aspirin targets the cyclooxygenase site in prostaglandin H2 synthase, as
discussed earlier. Animal studies suggested that mammals contain not one 
but two distinct cyclooxygenase enzymes, both of which are targeted by 

Figure 36.22 Compound optimization.
Four compounds are evaluated for
characteristics including the IC50, log(P), and
Cmax (the maximal concentration of
compound present) measured in the serum
of dogs. The compound shown at the bottom
has the weakest inhibitory power (measured
by IC50) but by far the best bioavailability
(measured by Cmax). This compound was
selected for further development, leading to
the drug indinavir (Crixivan).

Figure 36.23 The effect of anti-HIV drug development. Death rates from HIV infection (AIDS)
reveal the tremendous effect of HIV protease inhibitors and their use in combination with inhibitors 
of HIV reverse transcriptase. The death rates in this graph are from the leading causes of death 
among persons 24 to 44 years old in the United States. [From Centers for Disease Control.] 


aspirin. The more recently discovered enzyme, cyclooxygenase 2 (COX2), 
is expressed primarily as part of the inflammatory response, whereas 
cyclooxygenase 1 (COX1) is expressed more generally. These observations 
suggested that a cyclooxygenase inhibitor that was specific for COX2 might 
be able to reduce inflammation in conditions such as arthritis without pro- 
ducing the gastric and other side effects associated with aspirin. 

The amino acid sequences of COX1 and COX2 were deduced from 
cDNA cloning studies. These sequences are more than 60% identical, 
clearly indicating that the enzymes have the same overall structure. 
Nevertheless, there are some differences in the residues around the aspirin- 
binding site. X-ray crystallography revealed that an extension of the binding 
pocket was present in COX2, but absent in COX1. This structural difference 
suggested a strategy for constructing COX2-specific inhibitors—namely, to 
synthesize compounds that had a protuberance that would fit into the 
pocket in the COX2 enzyme. Such compounds were designed and syn- 
thesized and then further refined to produce effective drugs familiar as 
Celebrex and Vioxx (Figure 36.24). Vioxx was subsequently withdrawn 
from the market because some people experienced adverse effects. These 
effects appear to be due to the inhibition of COX2, the intended target. 
Thus, although the development of these drugs is a triumph for structure- 

Figure 36.24 COX2-specific inhibitors. These compounds have protuberances (shown in red)
that fit into a pocket in the COX2 isozyme but sterically clash with the COX1 isozyme. 


based drug design, these outcomes highlight the fact that the inhibition of 
important enzymes can lead to complex physiological responses. 


36.3 Analyses of Genomes Hold Great Promise for 
Drug Discovery 


The completion of the sequencing of the human and other genomes is a 
potentially powerful driving force for the development of new drugs. 
Genomic sequencing and analysis projects have vastly increased our knowl- 
edge of the proteins encoded by the human genome. This new source of 
knowledge may greatly accelerate early stages of the drug-development 
process or even allow drugs to be tailored to the individual patient. 


Potential targets can be identified in the human proteome 


The human genome encodes approximately 25,000 proteins, not counting 
the variation produced by alternative mRNA splicing and posttranslational 
modifications. Many of these proteins are potential drug targets, in particu- 
lar those that are enzymes or receptors and have significant biological effects 
when activated or inhibited. Several large protein families are particularly 
rich sources of targets. For example, the human genome includes genes for 
more than 500 protein kinases that can be recognized by comparing the 
deduced amino acid sequences. Many of these kinases are known to play a 
role in the progression of a variety of diseases. For example, Bcr-Abl kinase, 
a dysregulated kinase formed from a specific chromosomal defect, is known 
to contribute to certain leukemias and is the target of the drug imatinib 
mesylate (Gleevec; Section 14.5). Some of the other protein kinases 
undoubtedly play central roles in particular cancers as well. Similarly, the 
human genome encodes approximately 800 7TM receptors (Section 14.1) of 
which approximately 350 are odorant receptors. Many of the remaining 
7TM receptors are potential drug targets. Some of them are already targets 
for drugs, such as the β-blocker atenolol, which targets the β-adrenergic
receptor, and the antiulcer medication ranitidine (Zantac). The latter com- 
pound is an antagonist of the histamine H2 receptor, a 7TM receptor that 
participates in the control of gastric acid secretion. 

Novel proteins that are not part of large families already supplying drug
targets can be more readily identified through the use of genomic informa- 
tion. There are a number of ways to identify proteins that could serve as 
targets of drug-development programs. One way is to look for changes in 
expression patterns, protein localization, or posttranslational modifications 
in cells from disease-afflicted organisms. Another is to perform studies of 
tissues or cell types in which particular genes are expressed. Analysis of the 
human genome should increase the number of actively pursued drug targets
by an estimated factor of two or more.


Animal models can be developed to test the validity of 
potential drug targets 


The genomes of a number of model organisms have now been sequenced. 
The most important of these genomes for drug development is that of the 
mouse. Remarkably, the mouse and human genomes are approximately 
85% identical in sequence, and more than 98% of all human genes have rec- 
ognizable mouse counterparts. Mouse studies provide drug developers with 
a powerful tool—the ability to disrupt (“knock out”) specific genes in the 
mouse (p. 164). If disruption of a gene has a desirable effect, then the product 
of this gene is a promising drug target. The utility of this approach has been 
demonstrated retrospectively. For example, disruption of the gene for the α
subunit of the H+-K+ ATPase, the key protein for secreting acid into the
stomach, produces mice with less acid in their stomachs. The stomach pH
of such mice is 6.9 in circumstances that produce a stomach pH of 3.2 in their
wild-type counterparts. This protein is the target of the drugs omeprazole 
(Prilosec) and lansoprazole (Prevacid and Takepron), used for treating gas- 
troesophageal reflux disease. 
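
Because pH is a logarithmic scale, the difference between the two stomach pH values quoted above corresponds to a very large difference in acid concentration. The short calculation below is a sketch that uses only the definition pH = -log10[H+]; the function and variable names are illustrative.

```python
def proton_concentration(ph: float) -> float:
    """Hydrogen ion concentration in mol per liter for a given pH."""
    return 10.0 ** (-ph)


wild_type = proton_concentration(3.2)   # stomach pH of wild-type mice
knockout = proton_concentration(6.9)    # stomach pH of mice with the disrupted gene

fold_change = wild_type / knockout
print(f"{fold_change:.0f}-fold less acid")  # roughly 5000-fold
```

A pH difference of 3.7 units therefore means that the stomach contents of the knockout mice contain roughly one five-thousandth as much free acid as those of wild-type mice.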

Several large-scale efforts are underway to generate hundreds or thousands
of mouse strains, each having a different gene disrupted. The phenotypes
of these mice are a good indication of whether the protein encoded by
a disrupted gene is a promising drug target. This approach allows drug 
developers to evaluate potential targets without any preconceived notions 
regarding physiological function. 


Potential targets can be identified in the genomes of pathogens 


Human proteins are not the only important drug targets. Drugs such as 
penicillin and HIV protease inhibitors act by targeting proteins within a 
pathogen. The genomes of hundreds of pathogens have now been sequenced, 
and these genome sequences can be mined for potential targets. 

New antibiotics are needed to combat bacteria that are resistant to many 
existing antibiotics. One approach seeks proteins essential for cell survival 
that are conserved in a wide range of bacteria. Drugs that inactivate such pro- 
teins are expected to be broad-spectrum antibiotics, useful for treating infec- 
tions from any of a range of different bacteria. One such protein is peptide 
deformylase, the enzyme that removes formyl groups present at the amino 
termini of bacterial proteins immediately after translation (see Figure 30.19). 

Alternatively, a drug may be needed against a specific pathogen. A recent 
example of such a pathogen is the organism responsible for severe acute respi- 
ratory syndrome (SARS). Within one month of the recognition of this emerg- 
ing disease, investigators had isolated the virus that causes the syndrome, 
and, within weeks, its 29,751-base genome had been completely sequenced. 
This sequence revealed the presence of a gene encoding a viral protease, 
known to be essential for viral replication from studies of other members of 


Figure 36.25 Emerging drug target. The structure of a protease from the coronavirus that 
causes SARS (severe acute respiratory syndrome) is shown bound to an inhibitor. This structure was 
determined less than a year after the identification of the virus. [Drawn from 1P9S.pdb.]


the coronavirus family to which the SARS virus belongs. Drug developers are 
already at work seeking specific inhibitors of this protease (Figure 36.25). 


Genetic differences influence individual responses to drugs 


Many drugs are not effective in everyone, often because of genetic differences 
among people. Nonresponding persons may have slight differences in either 
a drug’s target molecule or the proteins taking part in drug transport and 
metabolism. The goal of the emerging fields of pharmacogenetics and phar- 
macogenomics is to design drugs that either act more consistently from per- 
son to person or are tailored to individual persons with particular genotypes. 

Drugs such as metoprolol that target the β1-adrenergic receptor are
popular treatments for hypertension. These drugs are often referred to as 
“beta-blockers.” 


Metoprolol 


But some people do not respond well. Two variants of the gene coding for 
the β1-adrenergic receptor are common in the U.S. population. The most
common allele has serine in position 49 and arginine in position 389. In 
some persons, however, glycine replaces one or the other of these residues. 
In studies, participants with two copies of the most common allele respond- 
ed well to metoprolol: their daytime diastolic blood pressure was reduced by 
14.7 ± 2.9 mm Hg on average. In contrast, participants with one variant
allele showed a smaller reduction in blood pressure, and the drug had no 
significant effect on participants with two variant alleles (Figure 36.26). 
These observations suggest the potential utility of genotyping individual 
patients at these positions. One could then predict whether treatment with 
metoprolol or other beta-blockers is likely to be effective. 
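
The kind of prediction described here can be sketched as a simple lookup keyed on the number of variant alleles. The code is a toy illustration, not a clinical tool: the data layout and function names are hypothetical, and the response categories merely restate the study outcome summarized above.

```python
COMMON_ALLELE = ("S", "R")  # serine at position 49, arginine at position 389

EXPECTED_RESPONSE = {
    0: "good response",
    1: "smaller reduction in blood pressure",
    2: "no significant effect",
}


def count_variant_alleles(genotype):
    """genotype: two alleles, each a (residue 49, residue 389) pair of one-letter codes."""
    if len(genotype) != 2:
        raise ValueError("a genotype consists of exactly two alleles")
    return sum(1 for allele in genotype if tuple(allele) != COMMON_ALLELE)


def predicted_response(genotype):
    """Map the number of variant alleles to the response category reported in the study."""
    return EXPECTED_RESPONSE[count_variant_alleles(genotype)]


print(predicted_response([("S", "R"), ("S", "R")]))  # good response
print(predicted_response([("G", "R"), ("S", "R")]))  # smaller reduction in blood pressure
print(predicted_response([("G", "R"), ("S", "G")]))  # no significant effect
```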

Given the importance of ADME and toxicity properties in determining
drug efficacy, it is not surprising that variations in proteins participating
in drug transport and metabolism can alter a drug’s effectiveness. An
important example is the use of thiopurine drugs such as 6-thioguanine,
6-mercaptopurine, and azathioprine to treat diseases including leukemia,
immune disorders, and inflammatory bowel disease.

Figure 36.26 Phenotype-genotype
correlation. Average changes in diastolic
blood pressure on treatment with metoprolol.
Persons with two copies of the most common
allele (S49R389) showed significant decreases
in blood pressure. Those with one variant
allele (GR or SG) showed more modest
decreases, and those with two variant alleles
(GR/SG) showed no decrease. [From J. A.
Johnson et al., Clin. Pharmacol. Ther.
74:44-52, 2003]

A minority of patients who are treated with these drugs show signs of toxicity
at doses that are well tolerated by most patients. These differences between 
patients are due to rare variations in the gene encoding the xenobiotic- 
metabolizing enzyme thiopurine methyltransferase, which adds a methyl 
group to sulfur atoms. 
The variant enzyme is less stable. Patients with these variant enzymes can
build up toxic levels of the drugs if appropriate care is not taken. Thus, 
genetic variability in an enzyme participating in drug metabolism plays a 
large role in determining the variation in the tolerance of different persons 
to particular drug levels. Many other drug-metabolism enzymes and drug- 
transport proteins have been implicated in controlling individual reactions 
to specific drugs. The identification of the genetic factors will allow a deep- 
er understanding of why some drugs work well in some persons but poorly 
in others. In the future, doctors may examine a patient’s genes to help plan 
drug-therapy programs. 


36.4 The Development of Drugs Proceeds Through 
Several Stages 


In the United States, the FDA requires that drug candidates be demonstrated
to be effective and safe before they may be used in human beings on a
large scale. This requirement is particularly true for drug candidates that are 
to be taken by people who are relatively healthy. More side effects are 
acceptable for drug candidates intended to treat significantly ill patients 
such as those with serious forms of cancer, where there are clear, unfavorable 
consequences for not having an effective treatment. 


Clinical trials are time consuming and expensive 


Clinical trials test the effectiveness and potential side effects of a candidate 
drug before it is approved by the FDA for general use. These trials proceed 
in at least three phases (Figure 36.27). In phase 1, a small number (usually
from 10 to 100) of healthy volunteers take the drug for an initial study of
safety. These volunteers are given a range of doses and are monitored
for signs of toxicity. The efficacy of the drug candidate is not specifically
evaluated.

Figure 36.27 Clinical-trial phases. Clinical trials proceed in phases examining safety and efficacy
in increasingly large groups. [Diagram: preclinical drug discovery, then phase 1 (safety), phase 2
(safety, efficacy, dosage), and phase 3 (safety, efficacy, side effects), then clinical use.]

In phase 2, the efficacy of the drug candidate is tested in a small number 
of persons who might benefit from the drug. Further data regarding the 
drug’s safety are obtained. Such trials are often controlled and double- 
blinded. In a controlled study, subjects are divided randomly into two 
groups. Subjects in the treatment group are given the treatment under 
investigation. Subjects in the control group are given either a placebo—that 
is, a treatment such as sugar pills known to not have intrinsic value—or the 
best standard treatment available, if withholding treatment altogether 
would be unethical. In a double-blinded study, neither the subjects nor the 
researchers know which subjects are in the treatment group and which are 
in the control group. A double-blinded study prevents bias in the course of 
the trial. When the trial has been completed, the assignments of the subjects 
into treatment and control groups are unsealed and the results for the two 
groups are compared. A variety of doses are often investigated in phase 2 
trials to determine which doses appear to be free of serious side effects and 
which doses appear to be effective. 
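
The assignment procedure described above can be sketched in a few lines of Python. This is only an illustration of random allocation with concealed labels, not a description of how any real trial is managed; the function names, the coded labels "A" and "B", and the equal split between arms are all assumptions made for the example.

```python
import random

def assign_double_blind(subject_ids, seed=None):
    """Randomly split subjects into two arms, hiding arm names behind codes."""
    rng = random.Random(seed)
    shuffled = list(subject_ids)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2

    # Which opaque code stands for "treatment" is itself chosen at random.
    codes = ["A", "B"]
    rng.shuffle(codes)
    sealed_key = {codes[0]: "treatment", codes[1]: "control"}

    # During the trial, subjects and researchers see only the coded label.
    blinded = {s: codes[0] for s in shuffled[:half]}
    blinded.update({s: codes[1] for s in shuffled[half:]})
    return blinded, sealed_key

def unseal(blinded, sealed_key):
    """After the trial ends, translate coded labels back into arm names."""
    return {s: sealed_key[code] for s, code in blinded.items()}

blinded, sealed_key = assign_double_blind(range(1, 21), seed=1)
arms = unseal(blinded, sealed_key)
print(sum(1 for arm in arms.values() if arm == "treatment"))  # size of treatment arm
```

Keeping `sealed_key` apart from the coded assignments mirrors the point made in the text: the results for the two groups can be compared only after the assignments are unsealed.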

The power of the placebo effect—that is, the tendency to perceive 
improvement in a subject who believes that he or she is receiving a poten- 
tially beneficial treatment—should not be underestimated. In a study of 
arthroscopic surgical treatment for knee pain, for example, subjects who 
were led to believe that they had received surgery through the use of video- 
tapes and other means showed the same level of improvement, on average, 
as subjects who were actually operated on. 

In phase 3, similar studies are performed on a larger population. This 
phase is intended to more firmly establish the efficacy of the drug candidate 
and to detect side effects that may develop in a small percentage of the sub- 
jects who receive treatment. Thousands of subjects may participate in a 
typical phase 3 study. 

Clinical trials can be extremely costly. Hundreds or thousands of patients 
must be recruited and monitored for the duration of the trial. Many physi- 
cians, nurses, clinical pharmacologists, statisticians, and others participate 
in the design and execution of the trial. Costs can run from tens of millions 
to hundreds of millions of dollars. Extensive records must be kept, includ- 
ing documentation of any adverse reactions. These data are compiled and 
submitted to the FDA. The full cost of developing a drug is currently esti- 
mated to be from $400 million to $800 million. 

Even after a drug has been approved and is in use, difficulties can arise. 
As mentioned earlier, rofecoxib (Vioxx), for example, was withdrawn from 
the market after significant cardiac side effects were detected in additional 
clinical trials. Such events highlight the necessity for users of any drug to 
balance beneficial effects against potential risks. 


The evolution of drug resistance can limit the utility of drugs for 
infectious agents and cancer 


Many drugs are used for long periods of time without any loss of effective- 
ness. However, in some cases, particularly for the treatment of cancer or 
infectious diseases, drug treatments that were initially effective become less 
effective. In other words, the disease becomes resistant to the drug therapy. 
Why does this resistance develop? Infectious diseases and cancer have a 
common feature—namely, that an affected person contains many cells (or 
viruses) that can mutate and reproduce. These conditions are necessary for 
evolution to take place. Thus, an individual microorganism or cancer cell 
may by chance have a genetic variation that makes it more suitable for 
growth and reproduction in the presence of the drug than is the population 
of microorganisms or cancer cells at large. These microorganisms or cells are 
more fit than others in their population, and they will tend to take over the 
population. As the selective pressure due to the drug is continually applied, 
the population of microorganisms or cancer cells will tend to become more 
and more resistant to the presence of the drug. Note that resistance can 
develop by a number of mechanisms. 

The HIV protease inhibitors discussed earlier provide an important 
example of the evolution of drug resistance. Retroviruses are very well 
suited to this sort of evolution because reverse transcriptase carries out rep- 
lication without a proofreading mechanism. In a genome of approximately 
9750 bases, each possible single point mutation is estimated to appear in a 
virus particle more than 1000 times per day in each infected person. Many 
multiple mutations also occur. Most of these mutations either have no effect 
or are detrimental to the virus. However, a few of the mutant virus particles 
encode proteases that are less susceptible to inhibition by the drug. In the 
presence of an HIV protease inhibitor, these virus particles will tend to rep- 
licate more effectively than does the population at large. With the passage of 
time, the less-susceptible viruses will come to dominate the population and 
the virus population will become resistant to the drug. 
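
The scale of this variation can be made concrete with a short calculation that uses only the two figures quoted above (a genome of about 9750 bases, and more than 1000 daily appearances of each single point mutation). The one added premise is that each position can change to any of the three other bases; the variable names are illustrative.

```python
GENOME_LENGTH = 9750            # bases, as quoted in the text
ALTERNATIVE_BASES = 3           # each position can mutate to 3 other bases
MIN_OCCURRENCES_PER_DAY = 1000  # lower bound per mutation, as quoted in the text

def single_point_mutations(genome_length):
    """Number of distinct single point mutations of a genome."""
    return genome_length * ALTERNATIVE_BASES

def min_mutant_appearances_per_day(genome_length, per_mutation=MIN_OCCURRENCES_PER_DAY):
    """Lower bound on single-point-mutant appearances per day in one person."""
    return single_point_mutations(genome_length) * per_mutation

print(single_point_mutations(GENOME_LENGTH))          # 29250
print(min_mutant_appearances_per_day(GENOME_LENGTH))  # 29250000
```

On these figures, every one of the roughly 29,000 possible single-base variants is sampled many times each day, so any single mutation that reduces inhibitor binding is likely to be present before treatment begins.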

Pathogens may become resistant to antibiotics by completely different 
mechanisms. Some pathogens contain enzymes that inactivate or degrade 
specific antibiotics. For example, many organisms are resistant to β-lactams 
such as penicillin because they contain β-lactamase enzymes. These 
enzymes hydrolyze the β-lactam ring and render the drugs inactive. 
Many of these enzymes are encoded in plasmids, small circular pieces 
of DNA often carried by bacteria. Many plasmids are readily transferred 
from one bacterial cell to another, transmitting the capability for antibiotic 
resistance. Plasmid transfer thus contributes to the spread of antibiotic 
resistance, a major health-care challenge. On the other hand, plasmids have 
been harnessed for use in recombinant DNA methods (Section 5.2). 

Drug resistance commonly emerges in the course of cancer treatment. 
Cancer cells are characterized by their ability to grow rapidly without the 
constraints that apply to normal cells. Many drugs used for cancer chemotherapy 
inhibit processes that are necessary for this rapid cell growth. 
However, individual cancer cells may accumulate genetic changes that 
mitigate the effects of such drugs. These altered cancer cells will tend to 
grow more rapidly than others and will become dominant within the cancer- 
cell population. This ability of cancer cells to mutate quickly has been a 
challenge to one of the major breakthroughs in cancer treatment: the devel- 
opment of inhibitors for proteins specific to cancer cells present in certain 
leukemias (Section 14.5). For example, tumors become undetectable in 
patients treated with imatinib mesylate, which is directed against the 
Bcr-Abl protein kinase. Unfortunately, the tumors of many of the patients 
treated with imatinib mesylate recur after a period of years. In many of these 
cases, mutations have altered the Bcr-Abl protein so that it is no longer 
inhibited by the concentrations of imatinib mesylate used in therapy. 

Cancer patients often take multiple drugs concurrently in the course of 
chemotherapy and, in many cases, cancer cells become simultaneously 
resistant to many or all of them. This multiple-drug resistance can be due to 
the proliferation of cancer cells that overexpress a number of ABC trans- 
porter proteins that pump drugs out of the cell (Section 13.2). Thus, cancer 
cells can evolve drug resistance by overexpressing normal human proteins 
or by modifying proteins responsible for the cancer phenotype. 


Summary 


36.1 The Development of Drugs Presents Huge Challenges 

Most drugs act by binding to enzymes or receptors and modulating 
their activities. To be effective, drugs must bind to these targets with 
high affinity and specificity. However, even most compounds with the 
desired affinity and specificity do not make suitable drugs. Most com- 
pounds are poorly absorbed or rapidly excreted from the body or they 
are modified by metabolic pathways that target foreign compounds. 
Consequently, when taken orally, these compounds do not reach their 
targets at appropriate concentrations for a sufficient period of time. A 
drug’s properties related to its absorption, distribution, metabolism, 
and excretion are called ADME properties. Oral bioavailability is a 
measure of a drug’s ability to be absorbed; it is the ratio of the peak 
concentration of a compound given orally to the peak concentration of 
the same dose directly injected. The structure of a compound can affect 
its bioavailability in complicated ways, but generalizations called 
Lipinski’s rules provide useful guidelines. Drug-metabolism pathways 
include oxidation by cytochrome P450 enzymes (phase I metabolism) 
and conjugation to glutathione, glucuronic acid, and sulfate (phase II 
metabolism). A compound may also not be a useful drug because it is 
toxic, either because it modulates the target molecule too effectively or 
because it also binds to proteins other than the target. The liver and 
kidneys play central roles in drug metabolism and excretion. 
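
As a rough illustration of two ideas in this summary, the sketch below checks a compound against Lipinski's rules (poor absorption is likely when molecular weight exceeds 500, there are more than 5 hydrogen-bond donors or more than 10 hydrogen-bond acceptors, or log(P) exceeds 5) and computes oral bioavailability as the ratio of peak concentrations defined above. The `Compound` class, the function names, and the example property values are hypothetical and are not data for any real drug.

```python
from dataclasses import dataclass

@dataclass
class Compound:
    name: str
    molecular_weight: float
    h_bond_donors: int
    h_bond_acceptors: int
    log_p: float

def lipinski_violations(c):
    """Return the list of Lipinski rules that the compound breaks."""
    violations = []
    if c.molecular_weight > 500:
        violations.append("molecular weight > 500")
    if c.h_bond_donors > 5:
        violations.append("more than 5 hydrogen-bond donors")
    if c.h_bond_acceptors > 10:
        violations.append("more than 10 hydrogen-bond acceptors")
    if c.log_p > 5:
        violations.append("log(P) > 5")
    return violations

def oral_bioavailability(peak_oral, peak_injected):
    """Ratio of peak concentration after oral dosing to that after injection."""
    return peak_oral / peak_injected

# Hypothetical property values, chosen only to exercise the checks.
candidate = Compound("hypothetical-1", 350.0, 2, 6, 2.5)
print(lipinski_violations(candidate))   # an empty list means no rule is broken
print(oral_bioavailability(2.0, 8.0))   # same dose, same concentration units
```

The rules are guidelines rather than a guarantee: a compound with no violations can still be absorbed poorly, which is why the function reports broken rules instead of a verdict.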


36.2 Drug Candidates Can Be Discovered by Serendipity, 
Screening, or Design 

Many drugs have been discovered by serendipity—that is, by chance 
observation. The antibiotic penicillin is produced by a mold that acci- 
dentally contaminated a culture dish, killing nearby bacteria. Drugs 
such as chlorpromazine and sildenafil were discovered to have benefi- 
cial effects on human physiology that were completely different from 
those expected. The cholesterol-lowering statin drugs were developed 
after large collections of compounds were screened for potentially 
interesting activities. Combinatorial chemistry methods have been 
developed to generate large collections of chemically related yet 
diverse compounds for screening. In some cases, the three-dimensional 
structure of a drug target is available and can be used to aid the design 
of potent and specific inhibitors. Examples of drugs designed in this 
manner are the HIV protease inhibitors indinavir and cyclooxygenase 
2 inhibitors such as celecoxib. 


36.3 Analyses of Genomes Hold Great Promise for Drug Discovery 


The human genome encodes approximately 25,000 proteins, and 
many more if derivatives due to alternative mRNA splicing and post- 
translational modification are included. The genome sequences can be 
examined for potential drug targets. Large families of proteins known 
to participate in key physiological processes such as the protein kinas- 
es and 7TM receptors have each yielded several targets for which 
drugs have been developed. The genomes of model organisms also are 
useful for drug-development studies. Strains of mice with particular 
genes disrupted have been useful in validating certain drug targets. 
The genomes of bacteria, viruses, and parasites encode many potential 
drug targets that can be exploited owing to their important functions 
and their differences from human proteins, minimizing the potential 
for side effects. Genetic differences between individuals can be exam- 
ined and correlated with differences in responses to drugs, potentially 
aiding both clinical treatments and drug development. 


36.4 The Development of Drugs Proceeds Through Several Stages 


Before compounds can be given to human beings as drugs, they must 
be extensively tested for safety and efficacy. Clinical trials are per- 
formed in stages: first testing safety, then safety and efficacy in a small 
population, and finally safety and efficacy in a larger population to 
detect rarer adverse effects. Largely due to the expenses associated 
with clinical trials, the cost of developing a new drug has been esti- 
mated to be as much as $800 million. Even when a drug has been 
approved for use, complications can arise. With regard to infectious 
diseases and cancer, patients often develop resistance to a drug after it 
has been administered for a period of time because variants of the dis- 
ease agent that are less susceptible to the drug arise and replicate, even 
when the drug is present. 


Key Terms 

ligand (p. 1030) 
dissociation constant (K_d) (p. 1030) 
side effect (p. 1031) 
inhibition constant (K_i) (p. 1031) 
Cheng–Prusoff equation (p. 1031) 
ADME (p. 1031) 
oral bioavailability (p. 1032) 
Lipinski’s rules (p. 1032) 
compartment (p. 1033) 
blood-brain barrier (p. 1033) 
xenobiotic compounds (p. 1034) 
drug metabolism (p. 1034) 
oxidation (p. 1034) 
conjugation (p. 1034) 
phase I transformation (p. 1035) 
phase II transformation (p. 1035) 
first-pass metabolism (p. 1035) 
glomerulus (p. 1035) 
enterohepatic cycling (p. 1036) 
therapeutic index (p. 1036) 
atheroma (p. 1040) 
myopathy (p. 1040) 
high-throughput screening (p. 1041) 
combinatorial chemistry (p. 1041) 
split-pool synthesis (p. 1041) 
structure-based drug design (p. 1043) 

Problems 

1. Routes to discovery. For each of the following drugs, indicate 
whether the physiological effects of the drug were 
known before or after the target was identified. 

(a) Penicillin 
(b) Sildenafil (Viagra) 
(c) Rofecoxib (Vioxx) 
(d) Atorvastatin (Lipitor) 
(e) Aspirin 
(f) Indinavir (Crixivan) 


2. Lipinski’s rules. Which of the following compounds satisfy 
all of Lipinski’s rules? [Log(P) values are given in 
parentheses.] 


(a) Atenolol (0.23) 
(b) Sildenafil (3.18) 
(c) Indinavir (2.78) 


3. Calculating log tables. Considerable effort has been 
expended to develop computer programs that can estimate 
log(P) values entirely on the basis of chemical structure. 
Why would such programs be useful? 


4. An ounce of prevention. Legislation has been proposed 
that would require that N-acetylcysteine be added to acet- 
aminophen tablets. Speculate about the role of this additive. 


5. Clinical-trial design. Distinguish between phase 1 and 
phase 2 clinical trials in regard to number of persons 
enrolled, the state of health of the subjects, and the goals of 
the study. 


6. Drug interactions. As noted in this chapter, coumadin 
can be a very dangerous drug because too much can cause 
uncontrolled bleeding. Persons taking coumadin must be 
careful about taking other drugs, particularly those that 
bind to albumin. Propose a mechanism for this drug-drug 
interaction. 


7. A bad combination. Explain why drugs that inhibit P450 
enzymes may be particularly dangerous when used in com- 
bination with other medications. 


8. Mechanistically speaking. Name one advantage of a non- 
competitive inhibitor as a potential drug compared with a 
competitive inhibitor. 


9. A helping hand. You have developed a drug that is capa- 
ble of inhibiting the ABC transporter MDR. Suggest a pos- 
sible application for this drug in cancer chemotherapy. 


10. Find the target. Trypanosomes are unicellular parasites 
that cause sleeping sickness. During one stage of their life 
cycle, these organisms live in the bloodstream and derive all 
of their energy from glycolysis, which takes place in a spe- 
cialized organelle called a glycosome inside the parasite. 
Propose potential targets for treating sleeping sickness. 
What are some potential difficulties with your approach? 


11. Knowledge is power. How might genomic information 
be helpful for the effective use of imatinib mesylate (Gleevec) 
in cancer chemotherapy? 


12. Multiple targets, same goal. Sildenafil induces its phys- 
iological effects by increasing the intracellular concentrations 
of cGMP, leading to muscle relaxation. On the basis of the 
scheme shown in Figure 36.17, identify another approach 
for increasing cGMP levels with a small molecule. 


Mechanism Problem 


13. Variations on a theme. The metabolism of amphetamine 
by cytochrome P450 enzymes results in the conversion 
shown here. Propose a mechanism and indicate any addi- 
tional products. 


[Reaction scheme: amphetamine and its cytochrome P450 oxidation product] 


Data Interpretation Problem 


14. HIV protease inhibitor design. Compound A is one of a 
series that were designed to be potent inhibitors of HIV 
protease. 


Compound A 


Compound A was tested by using two assays: (1) direct 
inhibition of HIV protease in vitro and (2) inhibition of viral 
RNA production in HIV-infected cells, a measure of viral 
replication. The results of these assays are shown below. The 
HIV protease activity is measured with a substrate peptide 
present at a concentration equal to its K_M value. 


Compound A (nM)    HIV protease activity (arbitrary units) 
0                  11.2 
0.2                9.9 
0.4                7.4 
0.6                5.6 
0.8                4.8 
1                  4.0 
2                  2.2 
10                 0.9 
100                0.2 

Compound A (nM)    Viral RNA production (arbitrary units) 
0                  760 
1.0                740 
2.0                380 
3.0                280 
4.0                180 
5.0                100 
10                 30 
50                 20 

Estimate the values for the K_i of compound A in the 
protease-activity assay and for its IC50 in the 
viral-RNA-production assay. 

Treating rats with the relatively high oral dose of 20 mg kg⁻¹ 
results in a maximum concentration of compound A of 
0.4 μM. On the basis of this value, do you expect compound A 
to be effective in preventing HIV replication when 
taken orally? 
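
One way to approach the estimates is sketched below; it is a rough numerical reading of the tables, not a fitted analysis. The sketch assumes that IC50 can be read off by linear interpolation at half of the no-inhibitor value, that compound A is a competitive inhibitor so that the Cheng–Prusoff relation IC50 = K_i(1 + [S]/K_M) applies, and that the 2 nM protease entry is 2.2. The function names are illustrative.

```python
def half_max_concentration(points):
    """Interpolate the dose giving half of the zero-dose response.

    points: (concentration, response) pairs sorted by concentration,
    with the response falling as the concentration rises.
    """
    half = points[0][1] / 2
    for (c0, r0), (c1, r1) in zip(points, points[1:]):
        if r0 >= half >= r1 and r0 != r1:
            return c0 + (r0 - half) / (r0 - r1) * (c1 - c0)
    raise ValueError("response never falls to half of its initial value")

def cheng_prusoff_ki(ic50, substrate_over_km):
    """K_i of a competitive inhibitor from IC50 and the ratio [S]/K_M."""
    return ic50 / (1 + substrate_over_km)

# Concentrations in nM; responses in arbitrary units, from the tables above.
protease = [(0, 11.2), (0.2, 9.9), (0.4, 7.4), (0.6, 5.6), (0.8, 4.8),
            (1, 4.0), (2, 2.2), (10, 0.9), (100, 0.2)]  # 2.2 at 2 nM is assumed
viral_rna = [(0, 760), (1.0, 740), (2.0, 380), (3.0, 280), (4.0, 180),
             (5.0, 100), (10, 30), (50, 20)]

ic50_protease = half_max_concentration(protease)
ki = cheng_prusoff_ki(ic50_protease, substrate_over_km=1.0)  # [S] = K_M
ic50_viral = half_max_concentration(viral_rna)
peak_nM = 0.4 * 1000  # 0.4 micromolar expressed in nM

print(ic50_protease, ki, ic50_viral, peak_nM / ic50_viral)
```

With the substrate at its K_M value, the Cheng–Prusoff factor is 2, so K_i comes out at half the protease IC50. Comparing the peak oral concentration with the cellular IC50 gives a first indication of whether oral dosing could be effective, although the table says nothing about how long that concentration is sustained.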


ANSWERS TO PROBLEMS 


Chapter 1 

1. The hydrogen-bond donors are the NH and NH₂ groups. 
The hydrogen-bond acceptors are the carbonyl oxygen atoms and 
those ring nitrogen atoms that are not bonded to hydrogen or to 
deoxyribose. 

2. Interchange the positions of the single and double bonds in the 
six-membered ring. 

3. (a) Electrostatic interactions; (b) van der Waals interactions. 

4. Processes a and b 


5. ΔS_system = −661 J mol⁻¹ K⁻¹ (−158 cal mol⁻¹ K⁻¹); 
ΔS_surroundings = +842 J mol⁻¹ K⁻¹ (+201 cal mol⁻¹ K⁻¹) 

6. (a) 1.0; (b) 13.0; (c) 1.3; (d) 12.7 

7. 2.88 

8. 1.96 

9. 11.83 

10. 447; 0.00050 

11. 0.00066 M 

12. 6.0 

13. 5253 

14. 6.48 

15. 7.8 

16. 100 

17. (a) 1.6; (b) 0.51; (c) 0.16. 

18. 0.1 M sodium acetate solution: 6.34; 6.03; 5.70; 4.75. 

0.01 M sodium acetate solution: 5.90; 4.75; 3.38; 1.40. 

19. 90 mM acetic acid; 160 mM sodium acetate, 0.18 moles acetic 
acid; 0.32 moles sodium acetate; 10.81 g acetic acid; 26.25 g sodium 
acetate. 

20. 0.50 moles of acetic acid; 0.32 moles of NaOH; 30.03 g of acetic 
acid; 12.80 g of NaOH. 

21. 250 mM; yes; no, it will also contain 90 mM NaCl. 

22. 8.63 g Na₂HPO₄; 4.71 g NaH₂PO₄ 

23. 7.0; this buffer will not be very useful, because the pH value is 
far from the pK_a value. 

24. 1.45 kJ mol⁻¹ (0.35 kcal mol⁻¹); 57.9 kJ mol⁻¹ (13.8 kcal mol⁻¹) 
25. There will be approximately 15 million differences. 


Chapter 2 

1. (A) Proline, Pro, P; (B) tyrosine, Tyr, Y; (C) leucine, Leu, L; 
(D) lysine, Lys, K. 

2. (a) C, B, A; (b) D; (c) D, B; (d) B, D; (e) B. 

3. (a) 6; (b) 2; (c) 3; (d) 1; (e) 4; (f) 5. 

4. (a) Ala; (b) Tyr; (c) Ser; (d) His. 

5. Ser, Glu, Tyr, Thr 

6. (a) Alanine-glycine-serine; (b) Alanine; (c and d): 
[Structure of the tripeptide with its peptide bonds marked] 

7. At pH 5.5, the net charge is +1. At pH 7.5, the net charge is 0. 

8. There are 20 choices for each of the 50 amino acids: 20⁵⁰, or 
about 1 × 10⁶⁵. 

9. [Structure: aspartame at pH 7] 


10. The (nitrogen–α-carbon–carbonyl carbon) repeating unit. 
11. Side chain is the functional group attached to the α-carbon 
atom of an amino acid. 
12. Amino acid composition refers simply to the amino acids 
that make up the protein. The order is not specified. Amino acid 
sequence is the same as the primary structure—the sequence of 
amino acids from the amino terminal to the carboxyl terminal 
of the protein. Different proteins may have the same amino acid 
composition, but amino acid sequence identifies a unique protein. 
13. (a) Each strand is 35 kd and hence has about 318 residues (the 
mean residue mass is 110 daltons). Because the rise per residue in an 
α helix is 1.5 Å, the length is 477 Å. More precisely, for an α-helical 
coiled coil, the rise per residue is 1.46 Å; so the length is 464 Å. 
(b) Eighteen residues in each strand (40 minus 4 divided by 2) are 
in a β-sheet conformation. Because the rise per residue is 3.5 Å, 
the length is 63 Å. 
14. The methyl group attached to the β-carbon atom of isoleucine 
sterically interferes with α-helix formation. In leucine, this methyl 
group is attached to the γ-carbon atom, which is farther from the 
main chain and hence does not interfere. 
15. The first mutation destroys activity because valine occupies 
more space than alanine does, and so the protein must take a dif- 
ferent shape, assuming that this residue lies in the closely packed 
interior. The second mutation restores activity because of a com- 
pensatory reduction of volume; glycine is smaller than isoleucine. 
16. The native conformation of insulin is not the thermodynami- 
cally most stable form, because it contains two separate chains 
linked by disulfide bonds. Insulin is formed from proinsulin, a 
single-chain precursor, that is cleaved to form insulin, a 51-residue 
molecule, after the disulfide bonds have formed. 
17. A segment of the main chain of the protease could hydrogen 
bond to the main chain of the substrate to form an extended parallel 
or antiparallel pair of β strands. 
18. Glycine has the smallest side chain of any amino acid. Its size 
is often critical in allowing polypeptide chains to make tight turns 
or to approach one another closely. 
19. Glutamate, aspartate, and the terminal carboxylate can form 
salt bridges with the guanidinium group of arginine. In addition, 
this group can be a hydrogen-bond donor to the side chains of 
glutamine, asparagine, serine, threonine, aspartate, tyrosine, and 
glutamate and to the main-chain carbonyl group. Histidine can 
form hydrogen bonds with arginine at pH 7. 
20. Disulfide bonds in hair are broken by adding a thiol-containing 
reagent and applying gentle heat. The hair is curled, and an oxidiz- 
ing agent is added to re-form disulfide bonds to stabilize the desired 
shape. 
21. Some proteins that span biological membranes are “the excep- 
tions that prove the rule” because they have the reverse distribution 
of hydrophobic and hydrophilic amino acids. For example, consider 
porins, proteins found in the outer membranes of many bacteria. 
Membranes are built largely of hydrophobic chains. Thus, porins 
are covered on the outside largely with hydrophobic residues that 
interact with the neighboring hydrophobic chains. In contrast, the 
center of the protein contains many charged and polar amino acids 
that surround a water-filled channel running through the middle of 
the protein. Thus, because porins function in hydrophobic environ- 
ments, they are “inside out” relative to proteins that function in 
aqueous solution. 

22. The amino acids would be hydrophobic in nature. An α helix 
is especially suited to crossing a membrane because all of the amide 
hydrogen atoms and carbonyl oxygen atoms of the peptide back- 
bone take part in intrachain hydrogen bonds, thus stabilizing these 
polar atoms in a hydrophobic environment. 

23. This example demonstrates that the pK_a values are affected 
by the environment. A given amino acid can have a variety of pK_a 
values, depending on the chemical environment inside the protein. 
24. A possible explanation is that the severity of the symptoms 
corresponds to the degree of structural disruption. Hence, substitution 
of alanine for glycine might result in mild symptoms, but 
substitution of the much larger tryptophan might allow little or 
no collagen triple-helix formation. 

25. The energy barrier that must be crossed to go from the poly- 
merized state to the hydrolyzed state is large even though the 
reaction is thermodynamically favorable. 

26. Using the Henderson–Hasselbalch equation, we find the ratio 
of alanine-COOH to alanine-COO⁻ at pH 7 to be 10⁻⁴. The ratio 
of alanine-NH₂ to alanine-NH₃⁺, determined in the same fashion, 
is 10⁻¹. Thus, the ratio of neutral alanine to the zwitterionic species 
is 10⁻⁴ × 10⁻¹ = 10⁻⁵. 

27. The assignment of absolute configuration requires the assign- 
ment of priorities to the four groups connected to a tetrahedral 
carbon atom. For all amino acids except cysteine, the priorities are: 
(1) amino group; (2) carbonyl group; (3) side chain; (4) hydrogen. 
For cysteine, because of the sulfur atom in its side chain, the side 
chain has a greater priority than does the carbonyl group, leading 
to the assignment of an R rather than S configuration. 

28. ELVISISLIVINGINLASVEGAS 

29. No, Pro-X would have the characteristics of any other peptide 
bond. The steric hindrance in X-Pro arises because the R group 
of Pro is bonded to the amino group. Hence, in X-Pro, the proline 
R group is near the R group of X, which would not be the case in 
Pro-X. 

30. A, c; B, e; C, d; D, a; E, b. 

31. The reason is that the wrong disulfides formed pairs in urea. 
There are 105 different ways of pairing eight cysteine molecules 
to form four disulfides; only one of these combinations is enzymatically 
active. The 104 wrong pairings have been picturesquely 
termed “scrambled” ribonuclease. 
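
The count of 105 in answer 31 follows from a simple argument: the first cysteine can pair with any of the 7 others, the next unpaired cysteine with any of the 5 that remain, and so on, giving 7 × 5 × 3 × 1. A minimal sketch of that calculation (the function name is illustrative):

```python
def count_pairings(n_cysteines):
    """Number of ways to pair an even number of cysteines into disulfides."""
    if n_cysteines % 2:
        raise ValueError("an odd number of cysteines cannot be fully paired")
    count = 1
    # The first unpaired cysteine has (n - 1) partners, the next (n - 3), ...
    for partners in range(n_cysteines - 1, 0, -2):
        count *= partners
    return count

total = count_pairings(8)
print(total, total - 1)  # 105 pairings in all, 104 of them wrong
```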


Chapter 3 

1. (a) Phenyl isothiocyanate; (b) urea; β-mercaptoethanol to 
reduce disulfides; (c) chymotrypsin; (d) CNBr; (e) trypsin. 

2. Each amino acid residue, except the carboxyl-terminal resi- 
due, gives rise to a hydrazide on reacting with hydrazine. The 
carboxyl-terminal residue can be identified because it yields a 
free amino acid. 

3. The S-aminoethylcysteine side chain resembles that of lysine. 
The only difference is a sulfur atom in place of a methylene group. 
4. A 1 mg ml⁻¹ solution of myoglobin (17.8 kd; Table 3.2) corresponds 
to 5.62 × 10⁻⁵ M. The absorbance of a 1-cm path length 
is 0.84, which corresponds to an I₀/I ratio of 6.96. Hence 14.4% of 
the incident light is transmitted. 


5. The sample was diluted 1000-fold. The concentration after 
dialysis is thus 0.001 M, or 1 mM. You could reduce the salt 
concentration by dialyzing your sample, now 1 mM, in more 
buffer free of (NH₄)₂SO₄. 

6. If the salt concentration becomes too high, the salt ions interact 
with the water molecules. Eventually, there will not be enough 
water molecules to interact with the protein, and the protein will 
precipitate. If there is lack of salt in a protein solution, the proteins 
may interact with one another—the positive charges on one protein 
with the negative charges on another or several others. Such an 
aggregate becomes too large to be solubilized by water alone. If salt 
is added, the salt neutralizes the charges on the proteins, preventing 
protein-protein interactions. 

7. Tropomyosin is rod shaped, whereas hemoglobin is approxi- 
mately spherical. 

8. The frictional coefficient, f, and the mass, m, determine s. 
Specifically, f is proportional to r (see equation 2 on p. 71). Hence, 
f is proportional to m^(1/3), and so s is proportional to m^(2/3) (see the 
equation on p. 76). An 80-kd spherical protein undergoes sedimentation 
1.59 times as rapidly as a 40-kd spherical protein. 

9. The long hydrophobic tail on the SDS molecule (see p. 72) dis- 
rupts the hydrophobic interactions in the interior of the protein. 
The protein unfolds, with the hydrophobic R groups now interact- 
ing with SDS rather than with one another. 

10. 50 kd. 

11. The protein may be modified. For instance, serine, threonine, 
and tyrosine may have phosphoryl groups attached. 

12. A fluorescence-labeled derivative of a bacterial degradation 
product (e.g., a formylmethionyl peptide) would bind to cells 
containing the receptor of interest. 

13. (a) Trypsin cleaves after arginine (R) and lysine (K), generating 
AVGWR, VK, and S. Because they differ in size, these products 
could be separated by molecular exclusion chromatography. 
(b) Chymotrypsin, which cleaves after large aliphatic or aromatic 
R groups, generates two peptides of equal size (AVGW) and (RVKS). 
Separation based on size would not be effective. The peptide 
RVKS has two positive charges (R and K), whereas the other peptide 
is neutral. Therefore, the two products could be separated by 
ion-exchange chromatography. 

14. Antibody molecules bound to a solid support can be used for 
affinity purification of proteins for which a ligand molecule is not 
known or unavailable. 

15. If the product of the enzyme-catalyzed reaction is highly 
antigenic, it may be possible to obtain antibodies to this particular 
molecule. These antibodies can be used to detect the presence of 
product by ELISA, providing an assay format suitable for the 
purification of this enzyme. 

16. An inhibitor of the enzyme being purified might have been 
present and subsequently removed by a purification step. This 
removal would lead to an apparent increase in the total amount of 
enzyme present. 

17. Many proteins have similar masses but different sequences and 
different patterns when digested with trypsin. The set of masses 
of tryptic peptides forms a detailed “fingerprint” of a protein that 
is very unlikely to appear at random in other proteins regardless 
of size. (A conceivable analogy is: “Just as similarly sized fingers 
will give different individual fingerprints, so also similarly sized 
proteins will give different digestion patterns with trypsin.”) 

18. Isoleucine and leucine are isomers and, hence, have identical 
masses. Peptide sequencing by mass spectrometry as described in 
this chapter is incapable of distinguishing these residues. Further 
analytical techniques are required to differentiate these residues. 
19. See the following table. 

Purification                     Total protein   Total activity   Specific activity   Purification   Yield 
procedure                        (mg)            (units)          (units mg⁻¹)        level          (%) 
Crude extract                    20,000          4,000,000        200                 1              100 
(NH₄)₂SO₄ precipitation          5,000           3,000,000        600                 3              75 
DEAE-cellulose chromatography    1,500           1,000,000        667                 3.3            25 
Gel-filtration chromatography    500             750,000          1,500               7.5            19 
Affinity chromatography          45              675,000          15,000              75             17 


20. Protein crystal formation requires the ordered arrange- 
ment of identically positioned molecules. Proteins with flex- 
ible linkers can introduce disorder into this arrangement and 
prevent the formation of suitable crystals. A ligand or binding 
partner may induce an ordered conformation to this linker and 
could be included in the solution to facilitate crystal growth. 
Alternatively, the individual domains separated by the linker 
may be expressed by recombinant methods and their crystal 
structures solved separately. 
21. Treatment with urea will disrupt noncovalent bonds. Thus the 
original 60-kd protein must be made of two 30-kd subunits. When 
these subunits are treated with urea and β-mercaptoethanol, a 
single 15-kd species results, suggesting that disulfide bonds link 
the 30-kd subunits. 
22. (a) Electrostatic repulsion between positively charged ε-amino 
groups hinders α-helix formation at pH 7. At pH 10, the side 
chains become deprotonated, allowing α-helix formation. 
(b) Poly-L-glutamate is a random coil at pH 7 and becomes α helical 
below pH 4.5 because the γ-carboxylate groups become protonated. 
23. The difference between the predicted and the observed masses 
for this fragment equals 28.0, exactly the mass shift that would be 
expected in a formylated peptide. This peptide is likely formylated 
at its amino terminus, and corresponds to the most N-terminal 
fragment of the protein. 
24. Light was used to direct the synthesis of these peptides. Each 
amino acid added to the solid support contained a photolabile 
protecting group instead of a t-Boc protecting group at its α-amino 
group. Illumination of selected regions of the solid support led 
to the release of the protecting group, which exposed the amino 
groups in these sites to make them reactive. The pattern of masks 
used in these illuminations and the sequence of reactants define 
the ultimate products and their locations. 
25. Mass spectrometry is highly sensitive and capable of detecting 
the mass difference between a protein and its deuterated counter- 
part. Fragmentation techniques can be used to identify the amino 
acids that retained the isotope label. Alternatively, NMR spectros- 
copy can be used to detect the isotopically labeled atoms because the 
deuteron and the proton have very different nuclear-spin properties. 
26. First amino acid: A
Last amino acid: R (not cleaved by carboxypeptidase)
Sequence of N-terminal tryptic peptide: AVR (tryptic peptide ends in R)
Sequence of N-terminal chymotryptic peptide: AVRY (chymotryptic peptide ends in Y)
Sequence: AVRYSR
27. First amino acid: S
Last amino acid: L
Cyanogen bromide cleavage: M is 10th position; C-terminal residues are (2S,L,W)
Amino-terminal residues: (G,K,S,Y), tryptic peptide, ends in K
Amino-terminal sequence: SYGK
Chymotryptic peptide order: (S,Y), (G,K,L), (F,I,S), (M,T), (S,W), (S,L)
Sequence: SYGKLSIFTMSWSL


28. If the protein did not contain any disulfide bonds, then the
electrophoretic mobility of the trypsin fragments would be the 
same before and after performic acid treatment: all the fragments 
would lie along the diagonal of the paper. If one disulfide bond 
were present, the disulfide-linked trypsin fragments would run as 
a single peak in the first direction, then would run as two separate 
peaks after performic acid treatment. The result would be two 
peaks appearing off the diagonal: 
These fragments could then be isolated from the chromatography
paper and analyzed by mass spectrometry to determine their amino 
acid composition and thus identify the cysteines participating in 
the disulfide bond. 


Chapter 4 

1. A nucleoside is a base attached to a ribose or deoxyribose sugar. A nucleotide is a
nucleoside with one or more phosphoryl groups attached to the sugar.
2. Hydrogen-bond pairing between the base A and the base T as 
well as hydrogen-bond pairing between the base G and the base C 
in DNA. 

3. T is always equal to A, and so these two nucleotides constitute 
40% of the bases. G is always equal to C, and so the remaining 60% 
must be 30% G and 30% C. 

4. Nothing, because the base-pair rules do not apply to single- 
stranded nucleic acids. 

5. (a) TTGATC; (b) GTTCGA; (c) ACGCGT; (d) ATGGTA.
6. (a) [T] + [C] = 0.46. (b) [T] = 0.30, [C] = 0.24, and [A] + 
[G] = 0.46. 

7. Stable hydrogen bonding occurs only between GC and AT pairs. 
Moreover, two purines are too large to fit inside the double helix, and 
two pyrimidines are too small to form base pairs with each other. 

8. The thermal energy causes the chains to wiggle about, which 
disrupts the hydrogen bonds between base pairs and the stacking 
forces between bases and thereby causes the strands to separate. 

9. The probability that any particular sequence will appear is 1/4ⁿ, where 4
is the number of nucleotides and n is the length of the sequence.
The probability of any 15-base sequence appearing is 1/4¹⁵, or
1/1,073,741,824. Thus, a 15-nucleotide sequence would be likely to
appear approximately three times (3 billion × probability of
appearance). The probability of a 16-base sequence appearing is 1/4¹⁶, which
is equal to 1/4,294,967,296. Such a sequence will be unlikely to
appear more than once.
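
The arithmetic in answer 9 can be checked with a short calculation. This is a minimal sketch, assuming a random genome of 3 billion bases in which every base is equally likely at each position; the function name `expected_occurrences` is illustrative only.

```python
# Expected number of times one specific n-base sequence appears in a
# random genome, assuming equal and independent base frequencies.
GENOME_SIZE = 3_000_000_000  # bases, the figure used in the answer


def expected_occurrences(n: int, genome_size: int = GENOME_SIZE) -> float:
    """Return genome_size * (1/4)**n, ignoring end effects."""
    return genome_size / 4**n


for n in (15, 16):
    # Prints the number of possible n-mers and the expected count.
    print(n, 4**n, round(expected_occurrences(n), 2))
```

A 15-mer is expected roughly three times, whereas a 16-mer is expected less than once.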
10. One end of a nucleic acid polymer ends with a free 5′-hydroxyl
group (or a phosphoryl group esterified to the hydroxyl group), and
the other end has a free 3′-hydroxyl group. Thus, the ends are
different. Two chains of DNA can form a double helix only if the chains
are running in different directions; that is, they have opposite polarity.
11. Although the individual bonds are weak, the population of 
thousands to millions of such bonds provides much stability. There 
is strength in numbers. 

12. There would be too much charge repulsion from the negative 
charges on the phosphoryl groups. These charges must be coun- 
tered by the addition of cations. 

13. The three forms are the A-DNA, the B-DNA and the 
Z-DNA, with B-DNA being the most common. There are many 
differences (see Table 4.2). Some key differences are: A-DNA 

and B-DNA are right-handed, whereas Z-DNA is left-handed. 
A-DNA forms in less-hydrated conditions than does B-DNA. The 
A form is shorter and wider than the B form. 

14. 5.88 × 10³ base pairs

15. In conservative replication, after 1.0 generation, half of the
molecules would be ¹⁵N-¹⁵N, the other half ¹⁴N-¹⁴N. After 2.0
generations, one-quarter of the molecules would be ¹⁵N-¹⁵N, the
other three-quarters ¹⁴N-¹⁴N. Hybrid ¹⁴N-¹⁵N molecules would
not be observed in conservative replication.

16. (a) Tritiated thymine or tritiated thymidine.
(b) dATP, dGTP, dCTP, and TTP labeled
with ³²P in the innermost (α) phosphorus atom.
17. Molecules in parts a and b would not lead 

to DNA synthesis, because they lack a 3’-OH 
group (a primer). The molecule in part d has a 
free 3'-OH group at one end of each strand but 
no template strand beyond. Only the molecule in 
part c would lead to DNA synthesis. 

18. A retrovirus is a virus that has RNA as its 
genetic material. However, for the information 
to be expressed, it must first be converted into 
DNA, a reaction catalyzed by the enzyme reverse 
transcriptase. Thus, at least initially, information 
flow is opposite that of a normal cell: RNA →
DNA rather than DNA → RNA.

19. A thymidylate oligonucleotide should
be used as the primer. The poly(A) template
specifies the incorporation of T; hence,
radioactive thymidine triphosphate (labeled in the α
phosphoryl group) should be used in the assay.
20. The ribonuclease serves to degrade the 
RNA strand, a necessary step in forming 
duplex DNA from the RNA-DNA hybrid. 

21. Treat one aliquot of the sample with ribonu- 
clease and another with deoxyribonuclease. Test 
these nuclease-treated samples for infectivity. 

22. Deamination changes the original G·C base
pair into a G·U pair. After one round of
replication, one daughter duplex will contain a G·C pair and the other
duplex will contain an A·U pair. After two rounds of replication,
there will be two G·C pairs, one A·U pair, and one A·T pair.

23. (a) 4⁸ = 65,536. In computer terminology, there are 64K
8-mers of DNA.

(b) A bit specifies two bases (say, A and C) and a second bit
specifies the other two (G and T). Hence, two bits are needed to specify
a single nucleotide (base pair) in DNA. For example, 00, 01, 10,
and 11 could encode A, C, G, and T. An 8-mer stores 16 bits
(2¹⁶ = 65,536), the E. coli genome (4.6 × 10⁶ bp) stores 9.2 × 10⁶ bits,
and the human genome (3.0 × 10⁹ bases) stores 6.0 × 10⁹ bits of genetic
information.

(c) A standard CD can hold about 700 megabytes, which is equal to
5.6 × 10⁹ bits. A large number of 8-mer sequences could be stored
on such a CD. The DNA sequence of E. coli could be written on a
single CD with room to spare for a lot of music. One CD would not
be quite enough to record the entire human genome.
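
The storage comparison in answer 23 can be reproduced in a few lines. This is a sketch under stated assumptions: two bits per base, a CD of 700 decimal megabytes, and the genome sizes quoted in the answer. The names `genome_bits` and `CD_BITS` are illustrative.

```python
# Two bits suffice to label one of four bases (log2(4) = 2).
BITS_PER_BASE = 2


def genome_bits(n_bases: int) -> int:
    """Bits needed to store a sequence of n_bases nucleotides."""
    return BITS_PER_BASE * n_bases


# Assumption: 700 megabytes means 700 x 10^6 bytes of 8 bits each.
CD_BITS = 700 * 10**6 * 8

E_COLI_BP = 4_600_000        # genome size quoted in the answer
HUMAN_BASES = 3_000_000_000  # genome size quoted in the answer

print(4**8 == 2**16)                         # number of 8-mers equals 2^16
print(genome_bits(E_COLI_BP) <= CD_BITS)     # E. coli fits on one CD
print(genome_bits(HUMAN_BASES) <= CD_BITS)   # the human genome does not
```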

24. (a) Deoxyribonucleoside triphosphates versus ribonucleoside 
triphosphates. 

(b) 5′ → 3′ for both.

(c) Semiconserved for DNA polymerase I; conserved for RNA 
polymerase. 

(d) DNA polymerase I needs a primer, whereas RNA polymerase 
does not. 

25. Messenger RNA encodes the information that, on transla- 
tion, yields a protein. Ribosomal RNA is the catalytic component 
of ribosomes, the molecular complexes that synthesize proteins. 
Transfer RNA is an adaptor molecule, capable of binding a spe- 
cific amino acid and recognizing a corresponding codon. Transfer 
RNAs with attached amino acids are substrates for the ribosome. 
26. (a) 5’-UAACGGUACGAU-3' 

(b) Leu-Pro-Ser-Asp-Trp-Met 

(c) Poly(Leu-Leu-Thr-Tyr) 

27. The 2'-OH group in RNA acts as an intramolecular nucleophile. 
In the alkaline hydrolysis of RNA, it forms a 2'-3’ cyclic intermediate. 
28. 


29. Gene expression is the process of expressing the information 
of a gene in its functional molecular form. For many genes, the 
functional information is a protein molecule. Thus, gene expres- 
sion includes transcription and translation. 

30. A nucleotide sequence whose bases represent the most-common, 
but not necessarily the only, members of the sequence. A consensus 
sequence can be thought of as the average of many similar sequences. 
31. Cordycepin terminates RNA synthesis. An RNA chain con- 
taining cordycepin lacks a 3’-OH group. 

32. Only single-stranded RNA can serve as a template for protein 
synthesis. 

33. Degeneracy of the code refers to the fact that most amino acids 
are encoded by more than one codon. 


34. If only 20 of the 64 possible codons encoded amino acids, then 
a mutation that changed a codon would likely result in a nonsense 
codon, leading to termination of protein synthesis. With degeneracy, 
a nucleotide change might yield a synonym or a codon for an amino 
acid with similar chemical properties. 

35. (a) 2, 4, 8; (b) 1, 6, 10; (c) 3, 5, 7,9. 

36. (a) 3; (b) 6; (c) 2; (d) 5; (e) 7; (f) 1; (g) 4.

37. Incubation with RNA polymerase and only UTP, ATP, and 
CTP led to the synthesis of only poly(UAC). Only poly(GUA) 
was formed when GTP was used in place of CTP. 

38. A peptide terminating with Lys (UGA is a stop codon), 
another containing -Asn-Glu-, and a third containing -Met-Arg-. 
39. Highly abundant amino acid residues have the most codons 
(e.g., Leu and Ser each have six), whereas the least-abundant amino 
acids have the fewest (Met and Trp each have only one). Degeneracy 
(1) allows variation in base composition and (2) decreases the 
likelihood that a substitution for a base will change the encoded 
amino acid. If the degeneracy were equally distributed, each of the 
20 amino acids would have three codons. Both benefits (1 and 2) are 
maximized by the assignment of more codons to prevalent amino 
acids than to less frequently used ones. 

40. Phe-Cys-His-Val-Ala-Ala 

41. Exon shuffling is a molecular process that can lead to the genera- 
tion of new proteins by the rearrangement of exons within genes. 
Because many exons encode functional protein domains, exon shuf- 
fling is a rapid and efficient means of generating new genes. 

42. It shows that the genetic code and the biochemical means of 
interpreting the code are common to even very distantly related life 
forms. It also testifies to the unity of life; that all life arose from a 
common ancestor. 

43. (a) A codon for lysine cannot be changed to one for aspartate 
by the mutation of a single nucleotide. (b) Arg, Asn, Gln, Glu, Ile, 
Met, or Thr. 

44. The genetic code is degenerate. Of the 20 amino acids, 18 are 
specified by more than one codon. Hence, many nucleotide changes 
(especially in the third base of a codon) do not alter the nature of 
the encoded amino acid. Mutations leading to an altered amino 
acid are usually more deleterious than those that do not and hence 
are subject to more stringent selection. 

45. GC base pairs have three hydrogen bonds compared with two 
for AT base pairs. Thus, the higher content of GC means more 
hydrogen bonds and greater helix stability. 

46. C₀t value essentially corresponds to the complexity of the DNA
sequence—in other words, how long it will take for a sequence of 
DNA to find its complementary strand to form a double helix. 

The more complex the DNA, the slower it reassociates to make the 
double-stranded form. 


Chapter 5 

1. (a) 5'-GGCATAC-3' 

(b) The Sanger dideoxy method of sequencing would give the gel 
pattern shown here. 


[Figure: sequencing gel with lanes for dideoxy ATP, dideoxy CTP, dideoxy TTP, and dideoxy GTP; an arrow marks the direction of electrophoresis.]

2. Ovalbumin cDNA should be used. E. coli lacks the machinery
to splice the primary transcript arising from genomic DNA. 

3. Consistent with its planar, aromatic structure, ethidium bro- 
mide is a DNA intercalator: it aligns itself between the paired 
bases in a DNA duplex. 

4. The presence of the AluI sequence would, on average, be (1/4)⁴,
or 1/256, because the likelihood of any base being at any position is
one-fourth and there are four positions. By the same reasoning, the
presence of the NotI sequence would be (1/4)⁸, or 1/65,536. Thus,
the average product of digestion by AluI would be 250 base pairs
(0.25 kb) in length, whereas that by NotI would be 66,000 base
pairs (66 kb) in length.
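
The fragment-length estimate in answer 4 follows directly from the length of each recognition site. A minimal sketch, assuming random DNA with equal base frequencies; the function name is illustrative, and the recognition sequences shown are the standard ones for these enzymes.

```python
def average_fragment_length(site: str) -> int:
    """Expected spacing between occurrences of a recognition site in
    random DNA: one site per 4**len(site) positions."""
    return 4 ** len(site)


ALUI_SITE = "AGCT"      # AluI, a 4-bp site
NOTI_SITE = "GCGGCCGC"  # NotI, an 8-bp site

print(average_fragment_length(ALUI_SITE))  # 256, about 0.25 kb
print(average_fragment_length(NOTI_SITE))  # 65536, about 66 kb
```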

5. No, because most human genes are much longer than 4 kb. A 
fragment would contain only a small part of a complete gene. 

6. Southern blotting of an MstII digest would distinguish between 
the normal and the mutant genes. The loss of a restriction site 
would lead to the replacement of two fragments on the Southern 
blot by a single longer fragment. Such a finding would not prove 
that GTG replaced GAG; other sequence changes at the restric- 
tion site could yield the same result. 

7. Although the two enzymes cleave the same recognition site, they 
each break different bonds within the 6-bp sequence. Cleavage 
by KpnI yields an overhang on the 3’ strand, whereas cleavage 
by Acc65I produces an overhang on the 5’ strand. These sticky 
ends do not overlap. 


[Figure: the duplex 5′-GGTACC-3′ / 3′-CCATGG-5′ cleaved with KpnI or with Acc65I; the two enzymes leave incompatible sticky ends.]


8. A simple strategy for generating many mutants is to synthesize
a degenerate set of cassettes by using a mixture of activated
nucleosides in particular rounds of oligonucleotide synthesis. Suppose that
the 30-bp coding region begins with GTT, which encodes valine.
If a mixture of all four nucleotides is used in the first and second
rounds of synthesis, the resulting oligonucleotides will begin with
the sequence XYT (where X and Y denote A, C, G, or T). These
16 different versions of the cassette will encode proteins containing
either Phe, Leu, Ile, Val, Ser, Pro, Thr, Ala, Tyr, His, Asn, Asp,
Cys, Arg, or Gly at the first position. Likewise, degenerate cassettes
can be made in which two or more codons are simultaneously varied.
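
The list of residues in answer 8 can be checked by translating all 16 XYT codons. A minimal sketch, assuming the standard genetic code; the compact table construction and the variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Every codon of the form XYT, where X and Y are any of the four bases.
cassette_codons = [x + y + "T" for x in "ACGT" for y in "ACGT"]
encoded = sorted({CODON_TABLE[codon] for codon in cassette_codons})

print(len(cassette_codons), len(encoded))
print("".join(encoded))
```

The 16 codons give 15 distinct amino acids because serine is encoded twice (TCT and AGT).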
9. Because PCR can amplify as little as one molecule of DNA, 
statements claiming the isolation of ancient DNA need to be greet- 
ed with some skepticism. The DNA would need to be sequenced. 
Is it similar to human, bacterial, or fungal DNA? If so, contamina- 
tion is the likely source of the amplified DNA. Is it similar to that 
of birds or crocodiles? This sequence similarity would strengthen 
the case that it is dinosaur DNA because these species are evolu- 
tionarily close to dinosaurs. 

10. PCR amplification is greatly hindered by the presence of
G–C-rich regions within the template. Owing to their high melting
temperatures, these templates do not denature easily, preventing
the initiation of an amplification cycle. In addition, rigid secondary
structures prevent the progress of DNA polymerase along the
template strand during elongation.

11. At high temperatures of hybridization, only very close matches 
between primer and target would be stable because all (or most) 
of the bases would need to find partners to stabilize the primer–target
helix. As the temperature is lowered, more mismatches
would be tolerated; so the amplification is likely to yield genes with 
less sequence similarity. In regard to the yeast gene, synthesize 
primers corresponding to the ends of the gene, and then use these 
primers and human DNA as the target. If nothing is amplified at 
54°C, the human gene differs from the yeast gene, but a counter- 
part may still be present. Repeat the experiment at a lower tem- 
perature of hybridization. 

12. Digest genomic DNA with a restriction enzyme, and select the 
fragment that contains the known sequence. Circularize this frag- 
ment. Then carry out PCR with the use of a pair of primers that 
serve as templates for the synthesis of DNA away from the known 
sequence. 

13. The encoded protein contains four repeats of a specific 
sequence. 

14. Use chemical synthesis or the polymerase chain reaction to 
prepare hybridization probes that are complementary to both ends 
of the known (previously isolated) DNA fragment. Challenge 
clones representing the library of DNA fragments with both of 
the hybridization probes. Select clones that hybridize to one of the 
probes but not the other; such clones are likely to represent DNA 
fragments that contain one end of the known fragment along with 
the adjacent region of the particular chromosome. 

15. The codon(s) for each amino acid can be used to determine the 
number of possible nucleotide sequences that encode each peptide 
sequence (see Table 4.5): 


Ala–Met–Ser–Leu–Pro–Trp:
4 × 1 × 6 × 6 × 4 × 1 = 576 total sequences

Gly–Trp–Asp–Met–His–Lys:
4 × 1 × 2 × 1 × 2 × 2 = 32 total sequences

Cys–Val–Trp–Asn–Lys–Ile:
2 × 4 × 1 × 2 × 2 × 3 = 96 total sequences

Arg–Ser–Met–Leu–Gln–Asn:
6 × 6 × 1 × 6 × 2 × 2 = 864 total sequences


The set of DNA sequences encoding the peptide Gly-Trp-Asp- 
Met-His-Lys would be most ideal for probe design because it 
encompasses only 32 total oligonucleotides. 
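
The products in answer 15 come from multiplying the number of codons for each residue. A minimal sketch, assuming the standard genetic code; `count_encodings` and the lookup tables are illustrative names.

```python
from collections import Counter
from math import prod

# Amino acids of the standard genetic code in TCAG codon order;
# counting letters gives the number of codons per amino acid.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS_PER_AA = Counter(AMINO_ACIDS)

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def count_encodings(peptide: str) -> int:
    """Number of distinct nucleotide sequences that encode a peptide
    written with hyphen-separated three-letter codes."""
    return prod(CODONS_PER_AA[THREE_TO_ONE[res]] for res in peptide.split("-"))


peptides = [
    "Ala-Met-Ser-Leu-Pro-Trp",
    "Gly-Trp-Asp-Met-His-Lys",
    "Cys-Val-Trp-Asn-Lys-Ile",
    "Arg-Ser-Met-Leu-Gln-Asn",
]
for peptide in peptides:
    print(peptide, count_encodings(peptide))

# The peptide with the fewest encodings is the best choice for probe design.
print(min(peptides, key=count_encodings))
```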

16. Within a single species, individual dogs show enormous varia- 
tion in body size and substantial diversity in other physical char- 
acteristics. Therefore, genomic analysis of individual dogs would 
provide valuable clues concerning the genes responsible for the 
diversity within the species. 

17. On the basis of the comparative genome map shown in 
Figure 5.27, the region of greatest overlap with human chromo- 
some 20 can be found on mouse chromosome 2. 

18. T_m is the melting temperature of a double-stranded nucleic
acid. If the melting temperatures of the primers are too different, 
the extent of hybridization with the target DNA will differ during 
the annealing phase, which would result in differential replications 
of the strands. 

19. Careful comparison of the sequences reveals that there is a 7-bp 
region of complementarity at the 3’ ends of these two primers: 


5'-GGATCGATGCTCGCGA-3'
            |||||||
         3'-GAGCGCTGGGCTAGGA-5'


In a PCR experiment, these primers would likely anneal to one 
another, preventing their interaction with the template DNA. 
During DNA synthesis by the polymerase, each primer would act 
as a template for the other primer, leading to the amplification of a 
25-bp sequence corresponding to the overlapped primers. 
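
The primer-dimer check in answer 19 can be automated. This is a minimal sketch that looks only for perfect complementarity at the two 3′ ends; the function names are illustrative, and real primer-design tools also consider mismatches and thermodynamics.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def three_prime_overlap(primer_a: str, primer_b: str) -> int:
    """Longest k such that the last k bases of primer_a can pair with
    the last k bases of primer_b (both primers written 5'->3')."""
    for k in range(min(len(primer_a), len(primer_b)), 0, -1):
        if primer_a[-k:] == reverse_complement(primer_b[-k:]):
            return k
    return 0


forward = "GGATCGATGCTCGCGA"
# The answer writes the second primer 3'->5', so reverse it to 5'->3'.
reverse = "GAGCGCTGGGCTAGGA"[::-1]

overlap = three_prime_overlap(forward, reverse)
dimer_length = len(forward) + len(reverse) - overlap
print(overlap, dimer_length)  # 7 25
```

The 7-bp overlap and the 25-bp extension product match the values given in the answer.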

20. A mutation in person B has altered one of the alleles for gene X, 
leaving the other intact. The fact that the mutated allele is smaller 
suggests that a deletion has occurred in one copy of the gene. The 
one functioning copy is transcribed and translated and apparently 
produces enough protein to render the person asymptomatic. 

Person C has only the smaller version of the gene. This gene is 
neither transcribed (negative northern blot) nor translated (negative 
western blot). 

Person D has a normal-size copy of the gene but no corre- 
sponding RNA or protein. There may be a mutation in the pro- 
moter region of the gene that prevents transcription. 

Person E has a normal-size copy of the gene that is tran- 
scribed, but no protein is made, which suggests that a mutation 
prevents translation. There are a number of possible explana- 
tions, including a mutation that introduced a premature stop 
codon in the mRNA. 

Person F has a normal amount of protein but still displays the 
metabolic problem. This finding suggests that the mutation affects 
the activity of the protein—for instance, a mutation that compro- 
mises the active site of enzyme Y. 

21. Chongqing: residue 2, L → R, CTG → CGG

Karachi: residue 5, A → P, GCC → CCC

Swan River: residue 6, D → G, GAC → GGC
22. This particular person is heterozygous for this particular 
mutation: one allele is wild type, whereas the other carries a point 
mutation at this position. Both alleles are PCR amplified in this 
experiment, yielding the “dual peak” appearance on the sequenc- 
ing chromatogram. 


Chapter 6 

1. There are 26 identities and two gaps for a score of 210. The two 
sequences are approximately 26% identical. This level of homology 
is likely to be statistically significant. 

2. They are likely related by divergent evolution, because three- 
dimensional structure is more conserved than is sequence identity. 
3. (a) Identity score = −25; Blosum score = 14; (b) identity
score = 15; Blosum score = 3.


4. U


5. There are 4⁴⁰, or 1.2 × 10²⁴, different molecules. Each molecule
has a mass of 2.2 × 10⁻²⁰ g, because 1 mol of polymer has a mass of
330 g mol⁻¹ × 40, and there are 6.02 × 10²³ molecules per mole.
Therefore, 26.4 kg of RNA would be required.
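
The estimate in answer 5 can be reproduced as follows. A minimal sketch using the values quoted in the answer (330 g mol⁻¹ per nucleotide, 40 nucleotides, one copy of every sequence); carrying unrounded intermediates gives about 26.5 kg, slightly above the answer's figure, which is computed from rounded intermediate values.

```python
AVOGADRO = 6.02e23            # molecules per mole, as quoted in the answer
NUCLEOTIDE_MASS = 330.0       # g per mole of nucleotide, as quoted
LENGTH = 40                   # nucleotides per RNA molecule

n_sequences = 4 ** LENGTH                              # distinct 40-mers
mass_per_molecule = NUCLEOTIDE_MASS * LENGTH / AVOGADRO  # grams
total_mass_kg = n_sequences * mass_per_molecule / 1000.0

print(f"{n_sequences:.2e} sequences")
print(f"{mass_per_molecule:.2e} g per molecule")
print(f"{total_mass_kg:.1f} kg for one copy of each sequence")
```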

6. Because three-dimensional structure is much more closely asso- 
ciated with function than is sequence, tertiary structure is more 
evolutionarily conserved than is primary structure. In other words, 
protein function is the most important characteristic, and protein 
function is determined by structure. Thus, the structure must be 
conserved but not necessarily a specific amino acid sequence. 


7. Alignment score of sequences (1) and (2) is 6 × 10 = 60. Many
answers are possible, depending on the randomly reordered
sequence. A possible result is

Shuffled sequence: (2) TKADKAGEYL

Alignment:
(1) ASNFLDKAGK
(2)   TKADKAGEYL

Alignment score is 4 × 10 = 40.
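
The score for the shuffled sequence in answer 7 can be found by sliding one sequence along the other and counting identities. A minimal sketch, assuming 10 points per identity and no gaps; `identity_score` and `best_offset` are illustrative names, not part of any alignment library.

```python
def identity_score(a: str, b: str, offset: int, per_identity: int = 10) -> int:
    """Score an ungapped alignment in which b is shifted right by
    `offset` positions relative to a (negative values shift left)."""
    matches = sum(
        1
        for i, residue in enumerate(b)
        if 0 <= i + offset < len(a) and a[i + offset] == residue
    )
    return per_identity * matches


def best_offset(a: str, b: str) -> int:
    """Return the shift of b that maximizes the identity score."""
    offsets = range(-(len(b) - 1), len(a))
    return max(offsets, key=lambda k: identity_score(a, b, k))


seq1 = "ASNFLDKAGK"
shuffled = "TKADKAGEYL"

shift = best_offset(seq1, shuffled)
print(shift, identity_score(seq1, shuffled, shift))  # 2 40
```

Shifting the shuffled sequence two positions to the right aligns the DKAG segments, giving four identities.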


8. (a) Almost certainly diverged from a common ancestor. 

(b) Almost certainly diverged from a common ancestor. (c) May 
have diverged from a common ancestor, but the sequence 
alignment may not provide supporting evidence. (d) May have 
diverged from a common ancestor, but the sequence alignment 
is unlikely to provide supporting evidence. 

9. Replacement of cysteine, glycine, and proline never yields a 
positive score. Each of these residues exhibits features unlike those 
of its other 19 counterparts: cysteine is the only amino acid capable 
of forming disulfide bonds, glycine is the only amino acid without 
a side-chain and is highly flexible, and proline is the only amino 
acid that is highly constrained through the bonding of its side 
chain to its amine nitrogen. 

10. Protein A is clearly homologous to protein B, given 65% 
sequence identity, and so A and B are expected to have quite simi- 
lar three-dimensional structures. Likewise, proteins B and C are 
clearly homologous, given 55% sequence identity, and so B and C 
are expected to have quite similar three-dimensional structures. 
Thus, proteins A and C are likely to have similar three-dimensional 
structures, even though they are only 15% identical in sequence. 
11. The likely secondary structure is 


12. To detect pairs of residues with correlated mutations, there 
must be variability in these sequences. If the alignment is over- 
represented by closely related organisms, there may not be enough 
changes in their sequences to allow the identification of potential 
base-pairing patterns. 

13. After RNA molecules have been selected and reverse transcribed, 
PCR is performed to introduce additional mutations into these 
strands. The use of this error-prone, thermostable polymerase 

in the amplification step would enhance the efficiency of this 
random mutagenesis. 

14. The initial pool of RNA molecules used in a molecular-evolution 
experiment is typically much smaller than the total number of possible 
sequences. Hence, the best possible RNA sequences will likely not be 
represented in the initial set of oligonucleotides. Mutagenesis of the 
initial selected RNA molecules allows for the iterative improvement 
of these sequences for the desired property. 

15. 107 or 108 identities (depending on which annotated human 
sequence is chosen). 


Chapter 7 

1. The whale swims long distances between breaths. A high con- 
centration of myoglobin in the whale muscle maintains a ready 
supply of oxygen for the muscle between breathing episodes. 

2. (a) 2.96 × 10⁻¹¹ g

(b) 2.74 × 10⁸ molecules

(c) No. There would be 3.17 × 10⁸ hemoglobin molecules in a
red cell if they were packed in a cubic crystalline array. Hence, the
actual packing density is about 84% of the maximum possible.

3. 2.65 g (or 4.75 × 10⁻² mol) of Fe

4. (a) In human beings, 1.44 × 10⁻² g (4.49 × 10⁻⁴ mol) of O₂
per kilogram of muscle. In sperm whale, 0.144 g (4.49 × 10⁻³ mol)
of O₂ per kilogram.

(b) 128

5. The pK_a is (a) lowered; (b) raised; and (c) raised.

6. Deoxy Hb A contains a complementary site, and so it can add 
on to a fiber of deoxy Hb S. The fiber cannot then grow further, 
because the terminal deoxy Hb A molecule lacks a sticky patch. 

7. 62.7% oxygen-carrying capacity 

8. A higher concentration of BPG would shift the oxygen-binding 
curve to the right, causing an increase in P₅₀. The larger value of
P₅₀ would promote dissociation of oxygen in the tissues and would
thereby increase the percentage of oxygen delivered to the tissues. 
9. Oxygen binding appears to cause the copper ions and their 
associated histidine ligands to move closer to one another, thereby 
also moving the helices to which the histidines are attached (in 
similar fashion to the conformational change in hemoglobin). 

10. The modified hemoglobin should not show cooperativity. 
Although the imidazole in solution will bind to the heme iron (in 
place of histidine) and will facilitate oxygen binding, the imidazole 
lacks the crucial connection to the particular α helix that must
move so as to transmit the change in conformation. 

11. Inositol pentaphosphate (part c) is highly anionic, much like 
2,3-bisphosphoglycerate. 

12. [Figure: plots with Y (fractional saturation) on the vertical axis.]

13. Release of acid will lower the pH. A lower pH promotes oxygen
dissociation in the tissues. However, the enhanced release of
oxygen in the tissues will increase the concentration of deoxy-Hb, 
thereby increasing the likelihood that the cells will sickle. 

14. (a) Y = 0.5 when pO₂ = 10 torr. The plot of Y versus pO₂
appears to indicate little or no cooperativity.

(b) The Hill plot shows slight cooperativity with n = 1.3 in the central
region.

(c) Deoxy dimers of lamprey hemoglobin could have lower affinity 
for oxygen than do the monomers. If the binding of the first oxygen 
atom to a dimer causes dissociation of the dimer to give two mono- 
mers, then the process would be cooperative. In this mechanism, 
oxygen binding to each monomer would be easier than binding the 
first oxygen atom to a deoxy dimer. 

15. (a) 2; (b) 4; (c) 2; (d) 1. 

16. The electrostatic interactions between BPG and hemoglobin 
would be weakened by competition with water molecules. The T 
state would not be stabilized. 
Chapter 8

1. Rate enhancement and substrate specificity
2. A cofactor
3. Coenzymes and metals
4. Vitamins are converted into coenzymes.

5. Enzymes facilitate the formation of the transition state.

6. The intricate three-dimensional structure of proteins allows
the construction of active sites that will recognize only specific
substrates.

7. The energy required to reach the transition state (the activa- 
tion energy) is returned when the transition state proceeds to 
product. 

8. Protein hydrolysis has a large activation energy. Protein synthe- 
sis must require energy to proceed. 

9. The enzymes help protect the fluid that surrounds eyes from 
bacterial infection. 

10. Transition states are very unstable. Consequently, molecules 
that resemble transition states are themselves likely to be unstable 
and, hence, difficult to synthesize. 

11. (a) 0; (b) 28.53; (c) −22.84; (d) −11.42; (e) 5.69.

12. (a) ΔG°′ = −RT ln K′_eq

+1.8 = −(1.98 × 10⁻³ kcal K⁻¹ mol⁻¹)(298 K)(ln [G1P]/[G6P])

−3.05 = ln [G1P]/[G6P]
+3.05 = ln [G6P]/[G1P]
K′_eq⁻¹ = 21 or K′_eq = 4.8 × 10⁻²


Because [G6P]/[G1P] = 21, there is 1 molecule of G1P for every 
21 molecules of G6P. Because we started with 0.1 M, the [G1P] 

is 1/22(0.1 M) = 0.0045 M and [G6P] must be 21/22(0.1 M) or 
0.096 M. Consequently, the reaction does not proceed as written to 
a significant extent. 

(b) Supply G6P at a high rate and remove G1P at a high rate by 
other reactions. In other words, make sure that the [G6P]/[G1P] 

is kept large. 
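
The equilibrium calculation in answer 12 can be sketched numerically. This assumes the values used in the answer (ΔG°′ = +1.8 kcal mol⁻¹ for G6P → G1P, R = 1.98 × 10⁻³ kcal K⁻¹ mol⁻¹, T = 298 K, 0.1 M total); the function name `equilibrium_constant` is illustrative. Without rounding the intermediate values, K′_eq comes out near 4.7 × 10⁻², close to the rounded figure in the answer.

```python
from math import exp

R = 1.98e-3   # kcal K^-1 mol^-1, as used in the answer
T = 298.0     # K


def equilibrium_constant(delta_g: float) -> float:
    """K'eq from the standard free-energy change (kcal/mol)."""
    return exp(-delta_g / (R * T))


k_eq = equilibrium_constant(1.8)   # [G1P]/[G6P] at equilibrium
ratio = 1.0 / k_eq                 # [G6P]/[G1P]

total = 0.1                        # M, starting concentration of G6P
g1p = total / (1.0 + ratio)
g6p = total - g1p

print(f"K'eq = {k_eq:.3f}, [G6P]/[G1P] = {ratio:.1f}")
print(f"[G1P] = {g1p:.4f} M, [G6P] = {g6p:.4f} M")
```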

13. K_eq = 19, ΔG°′ = −7.41 kJ mol⁻¹ (−1.77 kcal mol⁻¹)

14. The three-dimensional structure of an enzyme is stabi- 

lized by interactions with the substrate, reaction intermediates, 
and products. This stabilization minimizes thermal 

denaturation. 

15. At substrate concentrations near the K_M, the enzyme displays
significant catalysis yet is sensitive to changes in substrate
concentration.

16. At [S] = 10 K_M, V₀ = 0.91 V_max. At [S] = 20 K_M, V₀ =
0.95 V_max.
So any Michaelis-Menten curves showing that the enzyme actually
attains V_max are pernicious lies.
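
The point of answer 16 follows from the Michaelis-Menten equation, V₀ = V_max[S]/(K_M + [S]). A minimal sketch that expresses [S] in multiples of K_M; the function name `fraction_of_vmax` is illustrative.

```python
def fraction_of_vmax(s_over_km: float) -> float:
    """V0/Vmax for a substrate concentration given as a multiple of KM."""
    return s_over_km / (1.0 + s_over_km)


for multiple in (1, 10, 20, 100, 1000):
    print(multiple, round(fraction_of_vmax(multiple), 3))
```

Evaluating at 10 K_M and 20 K_M gives about 0.91 and 0.95. The fraction approaches 1 only asymptotically, so no finite substrate concentration reaches V_max.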

17. (a) 31.1 μmol; (b) 0.05 μmol; (c) 622 s⁻¹, a midrange value for
enzymes (see Table 8.5).

18. (a) Yes, K_M = 5.2 × 10⁻⁶ M; (b) V_max = 6.8 × 10⁻¹⁰ mol
minute⁻¹; (c) 337 s⁻¹

19. Penicillinase, like glycopeptide transpeptidase, forms an
acyl-enzyme intermediate with its substrate but transfers the
intermediate to water rather than to the terminal glycine residue of the
pentaglycine bridge.

20. (a) V_max is 9.5 μmol minute⁻¹. K_M is 1.1 × 10⁻⁵ M, the same
as without inhibitor.

(b) Noncompetitive

(c) 2.5 × 10⁻⁵ M

(d) f_ES = 0.73, in the presence or absence of this noncompetitive
inhibitor.


21. (a) V = V_max − (V/[S]) K_M
(b) Slope = −K_M, y-intercept = V_max, x-intercept = V_max/K_M.
(c) An Eadie–Hofstee plot

[Figure: Eadie–Hofstee plot of V versus V/[S] with three lines: 1, no inhibitor; 2, competitive inhibitor; 3, noncompetitive inhibitor.]
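
The slope and intercepts of an Eadie–Hofstee plot can be confirmed on synthetic data. This is a sketch under stated assumptions: the V_max and K_M values and the substrate concentrations below are hypothetical, chosen only for illustration, and the data are generated noise-free from the Michaelis-Menten equation.

```python
def michaelis_menten(s: float, vmax: float, km: float) -> float:
    """Initial velocity at substrate concentration s."""
    return vmax * s / (km + s)


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical kinetic constants and concentrations (arbitrary units).
VMAX, KM = 10.0, 2.0
substrate = [0.5, 1.0, 2.0, 5.0, 10.0, 50.0]
velocity = [michaelis_menten(s, VMAX, KM) for s in substrate]

# Eadie-Hofstee: plot V (y axis) against V/[S] (x axis).
v_over_s = [v / s for v, s in zip(velocity, substrate)]
slope, intercept = linear_fit(v_over_s, velocity)
x_intercept = -intercept / slope

print(round(slope, 6), round(intercept, 6), round(x_intercept, 6))
```

The fitted slope equals −K_M, the y-intercept equals V_max, and the x-intercept equals V_max/K_M, as stated in part (b).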


22. The rates of utilization of substrates A and B are given by

\[ V_A = \left(\frac{k_{cat}}{K_M}\right)_A [E][A] \]

\[ V_B = \left(\frac{k_{cat}}{K_M}\right)_B [E][B] \]

Hence, the ratio of these rates is

\[ \frac{V_A}{V_B} = \frac{\left(\frac{k_{cat}}{K_M}\right)_A [A]}{\left(\frac{k_{cat}}{K_M}\right)_B [B]} \]

Thus, an enzyme discriminates between competing substrates on
the basis of their values of k_cat/K_M rather than of K_M alone.

23. The mutation slows the reaction by a factor of 100 because the
activation free energy is increased by RT ln 100, about 11.4 kJ mol⁻¹
(2.7 kcal mol⁻¹) at 25°C. Strong binding of the substrate relative to
the transition state slows catalysis.

24. 1.1 μmol minute⁻¹

25. (a) This piece of information is necessary for determining the 
correct dosage of succinylcholine to administer. 

(b) The duration of the paralysis depends on the ability of the 
serum cholinesterase to clear the drug. If there were one-eighth the 
amount of enzyme activity, paralysis could last eight times as long, 
which is undesirable for two reasons. First, the respirator might 
break from extended use, which would not be good for the patient 
on the respirator; second, the doctors might miss their golf game. 
(c) K_M is the substrate concentration needed by the enzyme to reach ½ V_max.
Consequently, for a given concentration of substrate, the reaction
catalyzed by the enzyme with the lower K_M will have the higher
rate. The mutant patient with the higher K_M will clear the drug at a
much lower rate.

26. If the total amount of enzyme (E_T) is increased, V_max will
increase, because V_max = k₂[E_T]. But K_M = (k₋₁ + k₂)/k₁; that
is, it depends only on rate constants and is independent of enzyme
and substrate concentration. The middle graph describes this situation.


27. (a) [Figure: double-reciprocal plot of 1/V₀ versus 1/[S].]



(b) This behavior is substrate inhibition: at high concentrations,
the substrate forms unproductive complexes at the active site.
The adjoining drawing shows what might happen. Substrate
normally binds in a defined orientation, shown in the drawing
as red to red and blue to blue. At high concentrations, the substrate
may bind at the active site such that the proper orientation
is met for each end of the molecule, but two different substrate
molecules are binding.


[Figure: two drawings of the enzyme active site. Left: normal substrate
binding at the active site; substrate will be cleaved to red and blue
balls. Right: substrate inhibition.]


28. The first step will be the rate-limiting step. Enzymes $E_B$ and
$E_C$ are operating at ½$V_{max}$, whereas the $K_M$ for enzyme $E_A$ is
greater than the substrate concentration. $E_A$ would be operating at
approximately 10⁻² $V_{max}$.
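
The fractions of maximal velocity in this answer follow from the Michaelis-Menten equation. The sketch below is illustrative only: the substrate concentration and the $K_M$ values are assumptions chosen to be consistent with the stated result, since the problem's own numbers are not repeated in the answer.

```python
def fraction_of_vmax(substrate, km):
    """Michaelis-Menten fractional velocity: V0 / Vmax = [S] / ([S] + K_M)."""
    return substrate / (substrate + km)


# Assumed values (not given in the answer text): [S] = 1e-4 M throughout,
# K_M = 1e-4 M for E_B and E_C, and K_M = 1e-2 M for E_A.
s = 1e-4
print(fraction_of_vmax(s, 1e-4))  # E_B and E_C: [S] equals K_M
print(fraction_of_vmax(s, 1e-2))  # E_A: [S] is far below K_M
```

The enzyme whose $K_M$ lies well above the substrate concentration runs at a small fraction of its maximal rate, which is why its step limits the pathway.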

29. The fluorescence spectroscopy reveals the existence of an
enzyme-serine complex and of an enzyme-serine-indole complex.

30. (a) When [S⁺] is much greater than the value of $K_M$, pH will
have a negligible effect on the enzyme because S⁺ will interact with
E⁻ as soon as the enzyme becomes available.

[Figure: graph with $V_{max}$ indicated.]

(b) When [S⁺] is much less than the value of $K_M$, the plot of $V_0$
versus pH becomes essentially a titration curve for the ionizable
groups, with enzyme activity being the titration marker. At low
pH, the high concentration of H⁺ will keep the enzyme in the EH
form and inactive. As the pH rises, more and more of the enzyme
will be in the E⁻ form and active. At high pH (low H⁺), all of the
enzyme is E⁻.

[Figure: graph with $V_{max}$ indicated.]

(c) The midpoint on this curve will be the p$K_a$ of the ionizable
group, which is stated to be pH 6.

31. (a) Incubating the enzyme at 37°C leads to a denaturation 
of enzyme structure and a loss of activity. For this reason, most 
enzymes must be kept cool if they are not actively catalyzing 
their reactions. 

(b) The coenzyme apparently helps to stabilize enzyme struc- 
ture, because enzyme from PLP-deficient cells denatures faster. 
Cofactors often help stabilize enzyme structure. 


Chapter 9

1. For the amide substrate, the formation of the acyl-enzyme 
intermediate is slower than the hydrolysis of the acyl-enzyme 
intermediate, and so no burst is observed. A burst is observed for 
ester substrates; the formation of the acyl-enzyme intermediate is 
faster, leading to the observed burst. 

2. The histidine residue in the substrate can substitute to some 
extent for the missing histidine residue of the catalytic triad of the 
mutant enzyme. 

3. No. The catalytic triad works as a unit. After this unit has been 
made ineffective by the mutation of histidine to alanine, the further 
mutation of serine to alanine should have only a small effect. 

4. The substitution corresponds to one of the key differences 
between trypsin and chymotrypsin, and so trypsinlike specificity 
(cleavage after lysine and arginine) might be predicted. In fact, 
additional changes are required to effect this specificity change. 

5. Imidazole is apparently small enough to reach the active site
of carbonic anhydrase. Buffers with large molecular components
cannot do so, and the effects of the mutation are more evident.

6. No. The odds of such a sequence being present are approximately
1 in 4¹⁰ = 1,048,576. Because a typical viral genome has only
50,000 bp, the target sequence would be unlikely to be present.
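
The arithmetic behind this estimate can be sketched as follows. The function name is hypothetical, and the model is a simplifying assumption: bases are treated as equally likely and independent, and edge effects and the complementary strand are ignored.

```python
def expected_sites(site_length, genome_length, alphabet_size=4):
    """Expected number of occurrences of one specific sequence.

    Assumes each base is equally likely and independent, and ignores
    edge effects and the complementary strand.
    """
    odds = alphabet_size ** site_length  # number of possible sequences
    return genome_length / odds


print(4 ** 10)                      # number of distinct 10-bp sequences
print(expected_sites(10, 50_000))   # expected hits in a 50,000-bp genome
```

An expected count well below 1 is what makes the target sequence unlikely to be present.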

7. No, because the enzyme would destroy the host DNA before 
protective methylation could take place. 

8. No. The bacteria receiving the enzyme would have their own 
DNA destroyed because they would likely lack the appropriate 
protective methylase. 

9. EDTA will bind to Zn²⁺ and remove the ion, which is required
for enzyme activity, from the enzyme.

10. (a) The aldehyde reacts with the active-site serine. (b) A 
hemiacetal is formed. 

11. Trypsin 

12. The reaction is expected to be slower by a factor of 10 because the
rate depends on the p$K_a$ of the zinc-bound water. $k_{cat}$ = 60,000 s⁻¹.

13. EDTA binds the magnesium necessary for the reaction.

14. ATP hydrolysis is reversible within the active site. ATP hydrolysis
takes place within the active site with the incorporation of ¹⁸O,
ATP is re-formed, and the ATP is released back into solution.

15. If the aspartate is mutated, the protease is inactive and the 
virus will not be viable. 

16. Water substitutes for the hydroxyl group of serine 236 in
mediating proton transfer from the attacking water and the
γ-phosphoryl group.

17. (a) Cysteine protease: The same as Figure 9.8, except that 
cysteine replaces serine in the active site and no aspartate is present. 
(b) Aspartyl protease: 
(c) Metalloprotease:


Chapter 10

1. The enzyme catalyzes the first step in the synthesis of pyrimidines.
It facilitates the condensation of carbamoyl phosphate and
aspartate to form N-carbamoylaspartate and inorganic phosphate.
2. The protonated form of histidine probably stabilizes the nega- 
tively charged carbonyl oxygen atom of the scissile bond in the 
transition state. Deprotonation would lead to a loss of activity. 
Hence, the rate is expected to be half maximal at a pH of about 6.5 
(the pK of an unperturbed histidine side chain in a protein) and to 
decrease as the pH is raised. 

3. The inhibition of an allosteric enzyme by the end product of 
the pathway controlled by the enzyme. It prevents the production 
of too much end product and the consumption of substrates when 
product is not required. 

4. High concentrations of ATP might signal two overlapping situ- 
ations. The high levels of ATP might suggest that some nucleotides 
are available for nucleic acid synthesis, and consequently, CTP should 
be synthesized. The high levels of ATP indicate that energy is available 
for nucleic acid synthesis, and so CTP should be produced. 

5. All of the enzyme would be in the R form all of the time. There 
would be no cooperativity. The kinetics would look like that of a 
Michaelis-Menten enzyme. 

6. The enzyme would show simple Michaelis—Menten kinetics 
because it is essentially always in the R state. 

7. CTP is formed by the addition of an amino group to UTP.
Evidence indicates that UTP is also capable of inhibiting ATCase.

8. Homotropic effectors are the substrates of allosteric enzymes.
Heterotropic effectors are the regulators of allosteric enzymes.
Homotropic effectors account for the sigmoidal nature of the
velocity versus substrate concentration curve, whereas heterotropic
effectors alter the midpoint, or $K_M$, of the curve. Ultimately, both
types of effectors work by altering the T/R ratio.

9. The reconstitution shows that the complex quaternary structure 
and the resulting catalytic and regulatory properties are ultimately 
encoded in the primary structure of individual components. 

10. If substrates had been used, the enzyme would catalyze the 
reaction. Intermediates would not accumulate on the enzyme. 
Consequently, any enzyme that crystallized would have been free 
of substrates or products. 

11. (a) 100. The change in the [R]/[T] ratio on binding one
substrate molecule must be the same as the ratio of the substrate
affinities of the two forms.

(b) 10. The binding of four substrate molecules changes the [R]/[T]
by a factor of 100⁴ = 10⁸. The ratio in the absence of substrate is 10⁻⁷.
Hence, the ratio in the fully liganded molecule is 10⁸ × 10⁻⁷ = 10.

12. The fraction of molecules in the R form is 10⁻⁵, 0.004, 0.615,
0.998, and 1 when 0, 1, 2, 3, and 4 ligands, respectively, are bound.
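
The numbers in answers 11 and 12 can be reproduced with a short calculation based on the concerted model. This is a sketch under stated assumptions: the function names are hypothetical, and the per-ligand factor of 400 and the unliganded ratio of 1e-5 used for answer 12 are not stated in the answer; they were back-calculated from the quoted fractions.

```python
def r_to_t_ratio(ratio_unliganded, factor_per_ligand, n_bound):
    """[R]/[T] after n ligands bind, in the concerted (MWC) model.

    Each bound ligand multiplies [R]/[T] by the same factor, the ratio
    of the substrate affinities of the R and T forms.
    """
    return ratio_unliganded * factor_per_ligand ** n_bound


def fraction_r(ratio):
    """Fraction of molecules in the R form for a given [R]/[T]."""
    return ratio / (1.0 + ratio)


# Answer 11(b): ratio of 1e-7 without substrate, factor of 100 per ligand.
print(r_to_t_ratio(1e-7, 100, 4))

# Answer 12: unliganded ratio 1e-5 and factor 400 are assumptions.
for n in range(5):
    print(n, round(fraction_r(r_to_t_ratio(1e-5, 400, n)), 3))
```
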
13. The sequential model can account for negative cooperativity, 
whereas the concerted model cannot. 

14. The binding of PALA switches ATCase from the T to the
R state because PALA acts as a substrate analog. An enzyme
molecule containing bound PALA has fewer free catalytic
sites than does an unoccupied enzyme molecule. However, the
PALA-containing enzyme will be in the R state and, hence, have
a higher affinity for the substrates. The dependence of the degree
of activation on the concentration of PALA is a complex function
of the allosteric constant $L_0$ and of the binding affinities of the
R and T states for the analog and substrates.

15. The net outcome of the two reactions is the hydrolysis of ATP
to ADP and $P_i$, which has a ΔG of −50 kJ mol⁻¹ (−12 kcal mol⁻¹)
under cellular conditions.

16. Isozymes are homologous enzymes that catalyze the same reac- 
tion but have different kinetic or regulatory properties. 

17. Although the same reaction may be required in a variety of differ- 
ent tissues, the biochemical properties of tissues will differ according 
to their biological function. Isozymes allow the fine-tuning of catalytic 
and regulatory properties to meet the specific needs of the tissue. 

18. (a) 7; (b) 8; (c) 11; (d) 6; (e) 1; (f) 12; (g) 3; (h) 4; (i) 5; (j) 2;
(k) 10; (l) 9.

19. When phosphorylation takes place at the expense of ATP, suffi- 
cient energy is expended to dramatically alter the structure and hence 
activity of a protein. Moreover, because ATP is the cellular energy 
currency, protein modification is linked to the energy status of the cell. 
20. Covalent modification is reversible, whereas proteolytic 
cleavage is irreversible. 

21. Activation is independent of zymogen concentration because 
the reaction is intramolecular. 

22. Although quite rare, cases of enteropeptidase deficiency have 
been reported. The affected person has diarrhea and fails to thrive 
because digestion is inadequate. In particular, protein digestion is 
impaired. 

23. Add blood from the second patient to a sample from the first. 
If the mixture clots, the second patient has a defect different from 
that of the first. This type of assay is called a complementation test. 
24. Activated factor X remains bound to blood-platelet membranes, 
which accelerates its activation of prothrombin. 

25. Antithrombin III is a very slowly hydrolyzed substrate of 
thrombin. Hence, its interaction with thrombin requires a fully 
formed active site on the enzyme. 

26. Residues a and d are located in the interior of an α-helical
coiled coil, near the axis of the superhelix. Hydrophobic interactions
between these side chains contribute to the stability of the coiled coil.

27. Leucine would be a good choice. It is resistant to oxidation
and has nearly the same volume and degree of hydrophobicity as
methionine has.

28. Inappropriate clot formation could block arteries in the brain, 
causing a stroke, or the heart, causing a heart attack. 

29. Tissue-type plasminogen activator, or TPA, is a serine pro- 
tease that leads to the dissolution of blood clots. TPA activates 
plasminogen that is bound to a fibrin clot, converting it into active 
plasmin, which then hydrolyzes the fibrin of the clot. 

30. A mature clot is stabilized by amide linkages between the side 
chains of lysine and glutamine that are absent in a soft clot. The 
linkages are formed by transglutaminase. 

31. The simple sequential model predicts that the fraction of catalytic
chains in the R state, $f_R$, is equal to the fraction containing
bound substrate, Y. The concerted model, in contrast, predicts that
$f_R$ increases more rapidly than Y as the substrate concentration is
increased. The change in $f_R$ leads to the change in Y on addition of
substrate, as predicted by the concerted model.

32. The binding of succinate to the functional catalytic sites of
the native c₃ moiety changed the visible absorption spectrum of
nitrotyrosine residues in the other c₃ moiety of the hybrid enzyme.
Thus, the binding of substrate analog to the active sites of one
trimer altered the structure of the other trimer.


33. According to the concerted model, an allosteric activator
shifts the conformational equilibrium of all subunits toward the
R state, whereas an allosteric inhibitor shifts it toward the T state.
Thus, ATP (an allosteric activator) shifted the equilibrium to the
R form, resulting in an absorption change similar to that obtained
when substrate is bound. CTP had a different effect. Hence, this
allosteric inhibitor shifted the equilibrium to the T form. Thus, the
concerted model accounts for the ATP-induced and CTP-induced
(heterotropic), as well as for the substrate-induced (homotropic),
allosteric interactions of ATCase.

34. In the R state, ATCase expands and becomes less dense. This 
decrease in density results in a decrease in the sedimentation value 
(see the formula on p. 76). 

35. The interaction between trypsin and the inhibitor is so stable 
that the transition state is rarely formed. Recall that maximal bind- 
ing energy is released when an enzyme binds to the transition state. 
If the substrate-enzyme interaction is too stable, the transition 
state rarely forms. 


Chapter 11 

1. Carbohydrates were originally regarded as hydrates of carbon
because the empirical formula of many of them is (CH₂O)ₙ.

2. Three amino acids can be linked by peptide bonds in only six
different ways. However, three different monosaccharides can be
linked in a plethora of ways. The monosaccharides can be linked
in a linear or branched manner, with α or β linkages, with bonds
between C-1 and C-3, between C-1 and C-4, between C-1 and
C-6, and so forth. In fact, the three monosaccharides can form
12,288 different trisaccharides.

3. (a) aldose-ketose; (b) epimers; (c) aldose-ketose; (d) anomers;
(e) aldose-ketose; (f) epimers.

4. Erythrose: tetrose aldose; Ribose: pentose aldose; Glyceraldehyde:
triose aldose; Dihydroxyacetone: triose ketose; Erythrulose: tetrose
ketose; Ribulose: pentose ketose; Fructose: hexose ketose.


5.
        CHO             CHO             CHO
      H-C-OH         HO-C-H          HO-C-H
      H-C-OH          H-C-OH         HO-C-H
      H-C-OH          H-C-OH          H-C-OH
      H-C-OH          H-C-OH          H-C-OH
        CH₂OH           CH₂OH           CH₂OH
      D-Allose        D-Altrose       D-Mannose

        CHO             CHO             CHO
      H-C-OH         HO-C-H           H-C-OH
      H-C-OH          H-C-OH         HO-C-H
     HO-C-H          HO-C-H          HO-C-H
      H-C-OH          H-C-OH          H-C-OH
        CH₂OH           CH₂OH           CH₂OH
      D-Gulose        D-Idose         D-Galactose

        CHO
     HO-C-H
     HO-C-H
     HO-C-H
      H-C-OH
        CH₂OH
      D-Talose


6. The proportion of the α anomer is 0.36, and that of the β
anomer is 0.64.

7. Glucose is reactive because of the presence of an aldehyde group
in its open-chain form. The aldehyde group slowly condenses with
amino groups to form aldimine products of a type called Schiff-base
adducts.


8. A pyranoside reacts with two molecules of periodate; formate is 
one of the products. A furanoside reacts with only one molecule of 
periodate; formate is not formed. 

9. From methanol 

10. (a) β-D-Mannose; (b) β-D-galactose; (c) β-D-fructose;
(d) β-D-glucosamine.

11. The trisaccharide itself should be a competitive inhibitor of
cell adhesion if the trisaccharide unit of the glycoprotein is critical 
for the interaction. 
12. Reducing ends would form 1,2,3,6-tetramethylglucose. The 
branch points would yield 2,3-dimethylglucose. The remainder 
of the molecule would yield 2,3,6-trimethylglucose. 
13. (a) not a reducing sugar; no open-chain forms are possible. 
(b) D-Galactose, D-glucose, D-fructose. (c) D-Galactose and 
sucrose (glucose + fructose). 
14. The hemiketal linkage of the α anomer is broken to form the
open form. Rotation about the C-1 and C-2 bonds allows the
formation of the β anomer, and a mixture of isomers results.
15. Heating converts the very sweet pyranose form into the
more-stable but less-sweet furanose form. Consequently, the
sweetness of the preparation is difficult to accurately control,
which also accounts for why honey loses sweetness with time.
See Figure 11.5 for structures.
16. (a) Each glycogen molecule has one reducing end, whereas
the number of nonreducing ends is determined by the number
of branches, or α-1,6 linkages. (b) Because the number of
nonreducing ends greatly exceeds the number of reducing ends
in a collection of glycogen molecules, all of the degradation and
synthesis of glycogen takes place at the nonreducing ends, thus
maximizing the rate of degradation and synthesis.
17. No, sucrose is not a reducing sugar. The anomeric carbon 
atom acts as the reducing agent in both glucose and fructose but, 
in sucrose, the anomeric carbon atoms of fructose and glucose are 
joined by a covalent bond and are thus not available to react. 
18. Glycogen is a polymer of glucose linked by α-1,4-glycosidic bonds
with branches formed approximately every 10 glucose units by
α-1,6-glycosidic bonds. Starch consists of two polymers of glucose.
Amylose is a straight-chain polymer formed by α-1,4-glycosidic
bonds. Amylopectin is similar to glycogen but amylopectin has fewer
branches, one branch per 30 or so glucose units.
19. Cellulose is a linear polymer of glucose joined by β-1,4 linkages.
Glycogen is a branched polymer with the main chain being
formed by α-1,4-glycosidic bonds. The β-1,4 linkages allow the
formation of a linear polymer ideal for structural roles. The α-1,4
linkages of glycogen form a helical structure, which allows the
storage of many glucose moieties in a small space.
20. Simple glycoproteins are often secreted proteins and thus play 
a variety of roles. For example, the hormone EPO is a glycoprotein. 
Usually, the protein component constitutes the bulk of the glycopro- 
tein by mass. In contrast, proteoglycans and mucoproteins are pre- 
dominantly carbohydrates. Proteoglycans have glycosaminoglycans 
attached, and play structural roles as in cartilage and the extracellu- 
lar matrix. Mucoproteins often serve as lubricants and have multiple 
carbohydrates attached through an N-acetylgalactosamine moiety. 
21. The attachment of the carbohydrate allows the EPO to stay in 
circulation longer and thus to function for longer periods of time 
than would a carbohydrate-free EPO. 
22. The glycosaminoglycan, because it is heavily charged, binds 
many water molecules. When cartilage is stressed, such as when 
your heel hits the ground, the water is released, thus cushioning 
the impact. When you lift your heel, the water rebinds. 
23. The lectin that binds the mannose 6-phosphate might be 
defective and not recognize a correctly addressed protein. 
24. Different molecular forms of a glycoprotein that differ in the 
amount of carbohydrate attached or the location of attachment or 
both. 
25. The total collection of carbohydrates synthesized by a cell at 
particular times and under particular environmental conditions. 




26. The genome comprises all of the genes present in an organism. 
The proteome includes all of the possible protein products and modi- 
fied proteins that a cell expresses under any particular set of circum- 
stances. The glycome consists of all of the carbohydrates synthesized 
by the cell under any particular set of circumstances. Because the 
genome is static, but any given protein can be variously expressed 
and modified, the proteome is more complex than the genome. The 
glycome, which includes not only glycoforms of proteins, but also 
many possible carbohydrate structures, must be even more complex. 
27. It suggests that carbohydrates are on the cell surfaces of all 
organisms for the purpose of recognition by other organisms or the 
environment. 

28. A glycoprotein is a protein that is decorated with carbohy- 
drates. A lectin is a protein that specifically recognizes carbohy- 
drates. A lectin can also be a glycoprotein. 

29. Each site either is or is not glycosylated, and so there are 2⁶ = 64
possible proteins.
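
The count in this answer can be confirmed by enumerating every pattern explicitly. The following is a small illustrative sketch; the function name is hypothetical, and the only assumption is the one the answer itself makes, that each site is independently glycosylated or not.

```python
from itertools import product


def glycoforms(n_sites):
    """Enumerate every glycosylation pattern for n independent sites.

    Each site is either glycosylated (True) or not (False).
    """
    return list(product((False, True), repeat=n_sites))


patterns = glycoforms(6)
print(len(patterns))  # equals 2**6
```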

30. As discussed in Chapter 9, many enzymes display stereo- 
chemical specificity. Clearly, the enzymes of sucrose synthesis are 
able to distinguish between the isomers of the substrates and link 
only the correct pair. 

31. If the carbohydrate specificity of the lectin is known, an affinity 
column with the appropriate carbohydrate attached could be pre- 
pared. The protein preparation containing the lectin of interest could 
be passed over the column. The use of this method was indeed how 
the glucose-binding lectin concanavalin A was purified. 

32. (a) Aggrecan is heavily decorated with glycosaminoglycans. If 
glycosaminoglycans are released into the media, aggrecan must be 
undergoing degradation. 

(b) Another enzyme might be present that cleaves glycosamino- 
glycans from aggrecan without degrading aggrecan. Other experi- 
ments not shown established that glycosaminoglycan release is an 
accurate measure of aggrecan destruction. 

(c) The control provides a baseline of “background” degradation 
inherent in the assay. 

(d) Aggrecan degradation is greatly enhanced. 

(e) Aggrecan degradation is reduced to the background system. 
(f) It is an in vitro system in which not all the factors contributing 
to cartilage stabilization in vivo are present. 


Chapter 12 

1. 2.86 × 10⁶ molecules, because each leaflet of the bilayer contains
1.43 × 10⁶ molecules.
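
The count above is a unit conversion followed by a division. The sketch below assumes the inputs usually given for this kind of problem, a 1 μm² patch of bilayer and 70 Å² of surface per phospholipid; these values are assumptions here, because the answer does not restate them.

```python
ANGSTROM2_PER_UM2 = 1e8  # 1 um = 1e4 angstroms, so 1 um^2 = 1e8 angstrom^2


def lipids_in_bilayer(area_um2, area_per_lipid_a2):
    """Return (per_leaflet, total) phospholipid counts for a bilayer patch."""
    per_leaflet = area_um2 * ANGSTROM2_PER_UM2 / area_per_lipid_a2
    return per_leaflet, 2 * per_leaflet  # a bilayer has two leaflets


# Assumed inputs: a 1 um^2 patch and 70 angstrom^2 per phospholipid.
per_leaflet, total = lipids_in_bilayer(1.0, 70.0)
print(f"{per_leaflet:.2e} per leaflet, {total:.2e} in total")
```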

2. Essentially an “inside-out” membrane. The hydrophilic groups 
would come together on the interior of the structure, away from 
the solvent, whereas the hydrocarbon chains would interact with 
the solvent. 

3. 2 × 10⁻⁷ cm, 6 × 10⁻⁶ cm, and 2 × 10⁻⁴ cm.

4. The radius of this molecule is 3.1 × 10⁻⁷ cm, and its diffusion
coefficient is 7.4 × 10⁻⁹ cm² s⁻¹. The average distances traversed
are 1.7 × 10⁻⁷ cm in 1 μs, 5.4 × 10⁻⁶ cm in 1 ms, and 1.7 ×
10⁻⁴ cm in 1 s.
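
The distances in answers 3 and 4 follow from the relation between the mean distance traversed and the diffusion coefficient, $s = (4Dt)^{1/2}$. The sketch below is illustrative: the function name is hypothetical, and the diffusion coefficient of 1e-8 cm² s⁻¹ used for answer 3 is an assumption that reproduces the quoted distances.

```python
import math


def mean_distance_cm(diffusion_coeff_cm2_s, time_s):
    """Average distance traversed in two dimensions: s = sqrt(4 * D * t)."""
    return math.sqrt(4.0 * diffusion_coeff_cm2_s * time_s)


# D = 7.4e-9 cm^2/s is the coefficient in answer 4; D = 1e-8 cm^2/s for
# answer 3 is an assumption.
for d in (1e-8, 7.4e-9):
    for t in (1e-6, 1e-3, 1.0):
        print(f"D={d:.1e} t={t:.0e} s -> {mean_distance_cm(d, t):.1e} cm")
```

Because distance grows with the square root of time, a millionfold increase in time extends the distance only a thousandfold.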

5. The membrane underwent a phase transition from a highly fluid 
to a nearly frozen state when the temperature was lowered. A carrier 
can shuttle ions across a membrane only when the bilayer is highly 
fluid. A channel, in contrast, allows ions to traverse its pore even 
when the bilayer is quite rigid. 

6. The presence of a cis double bond introduces a kink in the
fatty acid chain that prevents tight packing and reduces the
number of atoms in van der Waals contact. The kink lowers the
melting point compared with that of a saturated fatty acid. Trans
fatty acids do not have the kink, and so their melting temperatures
are higher, more similar to those of saturated fatty acids.
Because trans fatty acids have no structural effect, they are rarely
observed.

7. Palmitic acid is shorter than stearic acid. Thus, when the chains 
pack together, there is less opportunity for van der Waals interaction 
and the melting point is thus lower than that of the longer stearic acid. 
8. Hibernators selectively feed on plants that have a high propor- 
tion of polyunsaturated fatty acids with lower melting temperature. 
9. The initial decrease in fluorescence with the first addition of 
sodium dithionite results from the quenching of NBD-PS molecules 
in the outer leaflet of the bilayer. Sodium dithionite does not traverse 
the membrane under these experimental conditions; hence, it does 
not quench the labeled phospholipids in the inner leaflet. A second 
addition of sodium dithionite has no effect, as the NBD-PS mole- 
cules in the outer leaflet remain quenched. However, after a 6.5 hour 
incubation, about half the NBD-PS has flipped over to the outer 
leaflet of the bilayer, resulting in the 50% decrease in fluorescence 
when sodium dithionite is added. 

10. The addition of the carbohydrate introduces a significant 
energy barrier to the flip-flop because a hydrophilic carbohydrate 
moiety would need to be moved through a hydrophobic environ- 
ment. This energetic barrier enhances membrane asymmetry. 

11. The C₁₆ alkyl chain is attached by an ether linkage. The
C-2 carbon atom of glycerol has only an acetyl group attached by
an ester linkage instead of a fatty acid, as is the case with most
phospholipids.

12. In a hydrophobic environment, the formation of intrachain
hydrogen bonds stabilizes the amide hydrogen atoms and carbonyl
oxygen atoms of the polypeptide chain, and so an α helix forms. In
an aqueous environment, these groups are stabilized by interaction
with water, and so there is no energetic reason to form an α helix.
Thus, the α helix would be more likely to form in a hydrophobic
environment.

13. The protein may contain an α helix that passes through the
hydrophobic core of the protein. This helix is likely to feature a
stretch of hydrophobic amino acids similar to those observed in
transmembrane helices.

14. The shift to the lower temperature would decrease fluidity by 
enhancing the packing of the hydrophobic chains by van der Waals 
interactions. To prevent this packing, new phospholipids having 
shorter chains and a greater number of cis double bonds would be 
synthesized. The shorter chains would reduce the number of van 
der Waals interactions, and the cis double bonds, which cause the 
kink in structure, would prevent the packing of the fatty acid tails 
of the phospholipids. 

15. Each of the 21 v-SNARE proteins could interact with each
of 7 t-SNARE partners. Multiplication gives the total number of
different interacting pairs: 7 × 21 = 147 different v-SNARE/t-SNARE
pairs.

16. (a) The graph shows that, as temperature increases, the
phospholipid bilayer becomes more fluid. $T_m$ is the temperature
of the transition from the predominantly less fluid state
to the predominantly more fluid state. Cholesterol broadens
the transition from the less-fluid to the more-fluid state. In
essence, cholesterol makes membrane fluidity less sensitive to
temperature changes.

(b) This effect is important because the presence of cholesterol 
tends to stabilize membrane fluidity by preventing sharp transi- 
tions. Because protein function depends on the proper fluidity of 
the membrane, cholesterol maintains the proper environment for 
membrane-protein function. 

17. The protein plotted in part c is a transmembrane protein from
C. elegans. It spans the membrane with four α helices that are
prominently displayed as hydrophobic peaks in the hydropathy
plot. Interestingly, the protein plotted in part a also is a membrane
protein, a porin. This protein is made primarily of β strands, which
lack the prominent hydrophobic window of membrane helices.
This example shows that, although hydropathy plots are useful,
they are not infallible.

18. To purify any protein, the protein must first be solubilized. For 
a membrane protein, solubilization usually requires a detergent— 
hydrophobic molecules that bind to the protein and thus replace the 
lipid environment of the membrane. If the detergent is removed, the 
protein aggregates and precipitates from solution. Often, the steps 
in purification, such as ion-exchange chromatography, are difficult 
to perform in the presence of sufficient detergent to solubilize the 
protein. Crystals of appropriate protein—detergent complexes must 
be generated. 


Chapter 13 

1. In simple diffusion, the substance in question can diffuse down 
its concentration gradient through the membrane. In facilitated 
diffusion, the substance is not lipophilic and cannot directly dif- 
fuse through the membrane. A channel or carrier is required to 
facilitate movement down the gradient. 

2. The two forms are (1) ATP hydrolysis and (2) the movement 
of one molecule down its concentration gradient coupled with the 
movement of another molecule up its concentration gradient. 

3. The three types of carriers are symporters, antiporters, and uni- 
porters. Symporters and antiporters can mediate secondary active 
transport. 

4. The free-energy cost is 32 kJ mol⁻¹ (7.6 kcal mol⁻¹). The
chemical work performed is 20.4 kJ mol⁻¹ (4.9 kcal mol⁻¹), and
the electrical work performed is 11.5 kJ mol⁻¹ (2.8 kcal mol⁻¹).

5. For chloride, z = −1; for calcium, z = +2. At the concentrations
given, the equilibrium potential for chloride is −97 mV and
the equilibrium potential for calcium is +122 mV.
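
Equilibrium potentials of this kind come from the Nernst equation, $V_{eq} = -(RT/zF)\ln(c_{in}/c_{out})$. The sketch below is a hedged illustration: the function name is hypothetical, and the chloride concentrations (4 mM inside, 150 mM outside) and the temperature of 37°C are assumptions, because the answer refers to "the concentrations given" without restating them.

```python
import math

R = 8.314    # gas constant, J mol^-1 K^-1
F = 96485.0  # Faraday constant, C mol^-1


def equilibrium_potential_mv(z, conc_in, conc_out, temp_k=310.0):
    """Nernst equilibrium potential in mV: -(RT / zF) * ln(c_in / c_out)."""
    return -1000.0 * R * temp_k / (z * F) * math.log(conc_in / conc_out)


# Assumed chloride concentrations (mM): 4 inside and 150 outside, at 37 C.
print(round(equilibrium_potential_mv(-1, 4.0, 150.0)))
```

The sign of the result depends on both the charge and the direction of the gradient: an anion that is more concentrated outside gives a negative potential, whereas a cation that is more concentrated outside gives a positive one.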

6. The concentration of glucose inside the cell is 66 times as great
as that outside the cell [(c₂/c₁) = 66] when the free-energy input is
10.8 kJ mol⁻¹ (2.6 kcal mol⁻¹).
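
For an uncharged solute such as glucose, the free energy needed to maintain a gradient is $\Delta G = RT\ln(c_2/c_1)$, so the ratio follows by exponentiation. The sketch below is illustrative; the function name is hypothetical and the temperature of 310 K (37°C) is an assumption.

```python
import math

R_KJ = 8.314e-3  # gas constant, kJ mol^-1 K^-1


def max_concentration_ratio(delta_g_kj, temp_k=310.0):
    """Largest c2/c1 that a free-energy input can sustain for an
    uncharged solute, from delta_G = RT ln(c2/c1)."""
    return math.exp(delta_g_kj / (R_KJ * temp_k))


# 10.8 kJ/mol is the input quoted in the answer; 310 K (37 C) is assumed.
print(round(max_concentration_ratio(10.8)))
```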

7. By analogy with the Ca²⁺ ATPase, with three Na⁺ ions binding
from inside the cell to the E₁ conformation and with two K⁺ ions
binding from outside the cell to the E₂ conformation, a plausible
mechanism is as follows:


(i) A catalytic cycle could begin with the enzyme in its unphosphorylated
state (E₁) with three sodium ions bound.

(ii) The E₁ conformation binds ATP. A conformational change
traps sodium ions inside the enzyme.

(iii) The phosphoryl group is transferred from ATP to an aspartyl
residue.

(iv) On ADP release, the enzyme changes its overall conformation,
including the membrane domain. This new conformation (E₂)
releases the sodium ions to the side of the membrane opposite that
at which they entered and binds two potassium ions from the side
where sodium ions are released.

(v) The phosphorylaspartate residue is hydrolyzed to release inorganic
phosphate. With the release of phosphate, the interactions
stabilizing E₂ are lost, and the enzyme everts to the E₁ conformation.
Potassium ions are released to the cytoplasmic side of the
membrane. The binding of three sodium ions from the cytoplasmic
side of the membrane completes the cycle.


8. Establish a lactose gradient across vesicle membranes that
contain properly oriented lactose permease. Initially, the pH
should be the same on both sides of the membrane and the lactose
concentration should be higher on the “exit” side of lactose
permease. As the lactose flows “in reverse” through the permease,
down its concentration gradient, it can be tested whether or not
a pH gradient becomes established as the lactose gradient is
dissipated.

9. Ligand-gated channels open in response to the binding of a 
molecule by the channel, whereas voltage-gated channels open in 
response to changes in the membrane potential. 

10. An ion channel must transport ions in either direction at the 
same rate. The net flow of ions is determined only by the composi- 
tion of the solutions on either side of the membrane. 

11. Uniporters act as enzymes do; their transport cycles include 
large conformational changes, and only a few molecules interact 
with the protein per transport cycle. In contrast, channels, after 
having opened, provide a pore in the membrane through which 
many ions may pass. As such, channels mediate transport at a 
much higher rate than do uniporters. 

12. FCCP effectively creates a pore in the bacterial membrane
through which protons can pass rapidly. Protons that are pumped
out of the bacteria will pass through this pore preferentially (the
“path of least resistance”), rather than participate in H⁺/lactose
symport.

13. Cardiac muscle must contract in a highly coordinated manner
in order to pump blood effectively. Gap junctions mediate
the orderly cell-to-cell propagation of the action potential
through the heart during each beat.

14. The positively charged guanidinium group resembles
Na⁺ and binds to negatively charged carboxylate groups in the
mouth of the channel.

15. SERCA, a P-type ATPase, uses a mechanism by which a
covalent phosphorylated intermediate (at an aspartate residue)
is formed. At steady state, a subset of the SERCA molecules
are trapped in the E₂-P state and, as a result, radiolabeled.
The MDR protein is an ABC transporter and does not operate
through a phosphorylated intermediate. Hence, a radiolabeled
band would not be observed for MDR.

16. The blockage of ion channels inhibits action potentials, 
leading to loss of nervous function. Like tetrodotoxin, these 
toxin molecules are useful for isolating and specifically inhibit- 
ing particular ion channels. 

17. After repolarization, the ball domains of the ion channels engage 
the channel pore, rendering them inactive for a short period of time. 
During this time, the channels cannot be reopened until the ball 
domains disengage and the channel returns to the “closed” state. 
18. Because sodium ions are charged and because sodium chan- 
nels carry only sodium ions (but not anions), the accumulation of 
excess positive charge on one side of the membrane dominates the 
chemical gradients. 

19. A mutation that impairs the ability of the sodium channel to 
inactivate would prolong the duration of the depolarizing sodium 
current, thus lengthening the cardiac action potential. 

20. No. Channels will likely open or close in response to an exter- 
nal stimulus, but the unit conductance of the open channel will be 
influenced very little.

21. The ratio of closed to open forms of the channel is 10⁵, 5000,
250, 12.5, and 0.625 when zero, one, two, three, and four ligands,
respectively, are bound. Hence, the fraction of open channels is
1.0 × 10⁻⁵, 2.0 × 10⁻⁴, 4.0 × 10⁻³, 7.4 × 10⁻², and 0.62.
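
The arithmetic behind answer 21 can be checked with a short sketch. If r is the closed-to-open ratio, the open fraction is 1/(1 + r). The function name `fraction_open` is ours, chosen only for illustration.

```python
# Closed-to-open ratios for 0, 1, 2, 3, and 4 bound ligands (from answer 21).
closed_to_open = [1e5, 5000, 250, 12.5, 0.625]


def fraction_open(ratio):
    """Return the fraction of channels that are open for a closed/open ratio."""
    return 1.0 / (1.0 + ratio)


fractions = [fraction_open(r) for r in closed_to_open]

for ligands, value in enumerate(fractions):
    print(f"{ligands} ligands bound: fraction open = {value:.2g}")
```

Each successive ligand lowers the closed-to-open ratio by the same factor of 20, which is why the open fraction climbs so steeply only after several ligands have bound.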

22. These organic phosphates inhibit acetylcholinesterase by 
reacting with the active-site serine residue to form a stable phos- 
phorylated derivative. They cause respiratory paralysis by blocking 
synaptic transmission at cholinergic synapses. 

23. (a) The binding of the first acetylcholine molecule increases
the open-to-closed ratio by a factor of 240, and the binding of
the second increases it by a factor of 11,700. (b) The free-energy
contributions are 14 kJ mol⁻¹ (3.3 kcal mol⁻¹) and 23 kJ mol⁻¹
(5.6 kcal mol⁻¹), respectively. (c) No; the MWC model predicts
that the binding of each ligand will have the same effect on the
open-to-closed ratio.
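
The free-energy values in part b are consistent with RT ln(factor), where the factor is the fold change in the open-to-closed ratio. The sketch below assumes 25 °C (298 K), which the answer does not state; the helper name is hypothetical.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ mol^-1 K^-1
T_K = 298.0      # assumed temperature (25 °C); not given in the answer


def free_energy_contribution(ratio_factor):
    """Free energy (kJ/mol) equivalent to a fold change in an equilibrium ratio."""
    return R_KJ * T_K * math.log(ratio_factor)


first = free_energy_contribution(240)
second = free_energy_contribution(11_700)
print(f"first ligand:  {first:.1f} kJ/mol")
print(f"second ligand: {second:.1f} kJ/mol")
```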

24. Batrachotoxin blocks the transition from the open to the 
closed state. 

25. (a) Chloride ions flow into the cell. (b) Chloride flux is inhibitory 
because it hyperpolarizes the membrane. (c) The channel consists of 
five subunits. 

26. After the addition of ATP and calcium, SERCA will pump 
Ca²⁺ ions into the vesicle. However, the accumulation of Ca²⁺ ions
inside the vesicle will rapidly lead to the formation of an electrical
gradient that cannot be overcome by ATP hydrolysis. The addition
of calcimycin will allow the pumped Ca²⁺ ions to flow back out of
the vesicle, dissipating the charge buildup, and enabling the pump 
to operate continuously. 

27. The catalytic prowess of acetylcholinesterase ensures that the 
duration of the nerve stimulus will be short. 

28. See reaction below. 

29. (a) Only ASIC 1a is inhibited by the toxin. (b) Yes; when the
toxin was removed, the activity of the acid-sensing channel began 
to be restored. (c) 0.9 nM. 

30. This mutation is one of a class of mutations that result in 
slow-channel syndrome (SCS). The results suggest a defect in 
channel closing; so the channel remains open for prolonged periods. 
Alternatively, the channel may have a higher affinity for acetyl- 
choline than does the control channel. 

31. The mutation reduces the affinity of acetylcholine for the 
receptor. The recordings would show the channel opening only 
infrequently. 

32. Glucose displays a transport curve that suggests the
participation of a carrier because the initial rate is high but then
levels off at higher concentrations, consistent with saturation of
the carrier, which is reminiscent of Michaelis-Menten enzymes
(Section 8.4). Indole shows no such saturation phenomenon, which
implies that the molecule is lipophilic and simply diffuses across
the membrane. Ouabain is a specific inhibitor of the Na⁺–K⁺ pump.
If ouabain were to inhibit glucose transport, then a Na⁺-glucose
cotransporter would be assisting in transport.


Chapter 14 

1. The negatively charged glutamate residues mimic the negatively 
charged phosphoserine or phosphothreonine residues and stabilize 
the active conformation of the enzyme. 

2. No. Phosphoserine and phosphothreonine are considerably 
shorter than phosphotyrosine. 

3. The GTPase activity terminates the signal. Without such activ- 
ity, after a pathway has been activated, it remains activated and is 
unresponsive to changes in the initial signal. If the GTPase activity 
were more efficient, the lifetime of the GTP-bound Gα subunit
would be too short to achieve downstream signaling. 

4. Two identical receptor molecules must recognize different 
aspects of the same signal molecule. 

5. Growth-factor receptors can be activated by dimerization. If 
an antibody causes a receptor to dimerize, the signal-transduction 
pathway in a cell will be activated. 

6. The mutated α subunit will always be in the GTP form and,
hence, in the active form, which would stimulate its signaling 
pathway. 

7. A G protein is a component of the signal-transduction pathway.
GTPγS is not hydrolyzed by the Gα subunit, leading to prolonged
activation. 

8. Calcium ions diffuse slowly because they bind to many protein 
surfaces within a cell, impeding their free motion. Cyclic AMP 
does not bind as frequently, and so it diffuses more rapidly. 

9. Fura-2 is a highly negatively charged molecule, with five car- 
boxylate groups. Its charge prevents it from effectively crossing the 
hydrophobic region of the plasma membrane. 

10. Gαs stimulates adenylate cyclase, leading to the generation
of cAMP. This signal then leads to glucose mobilization (see 
Chapter 21). If cAMP phosphodiesterase were inhibited, then 
cAMP levels would remain high even after the termination of the 
epinephrine signal, and glucose mobilization would continue. 

11. If the two kinase domains are forced to be within close
proximity of each other, the activation loop of one kinase, in its
inactivating conformation, can be displaced by the activation loop
of the neighboring kinase, which acts as a substrate for
phosphorylation.

12. The full network of pathways initiated by insulin includes a 
large number of proteins and is substantially more elaborate than 
indicated in Figure 14.25. Furthermore, many additional proteins 
take part in the termination of insulin signaling. A defect in any
of the proteins in the insulin signaling pathways or in the
subsequent termination of the insulin response could potentially cause
problems. Therefore, it is not surprising that many different gene 
defects can cause type 2 diabetes. 

13. The binding of growth hormone causes its monomeric receptor
to dimerize. The dimeric receptor can then activate a separate
tyrosine kinase to which the receptor binds. The signaling pathway
can then continue in similar fashion to the pathways that are
activated by the insulin receptor or other mammalian EGF
receptors.

14. The truncated receptor will dimerize with the full-length 
monomers on EGF-binding, but cross-phosphorylation cannot take 
place, because the truncated receptor possesses neither the substrate 
for the neighboring kinase domain nor its own kinase domain to 
phosphorylate the C-terminal tail of the other monomer. Hence,
these mutant receptors will block normal EGF signaling. 

15. Insulin would elicit the response that is normally caused
by EGF. Insulin binding will likely stimulate dimerization and
phosphorylation of the chimeric receptor and thereby signal the 
downstream events that are normally triggered by EGF binding. 
Exposure of these cells to EGF would have no effect. 

16. 10⁵

17. The formation of diacylglycerol implies the participation of
phospholipase C. A simple pathway would entail receptor
activation by cross-phosphorylation, followed by the binding of
phospholipase Cγ (through its SH2 domains). The participation of
phospholipase C indicates that IP₃ would be formed and, hence,
calcium concentrations would increase.

18. Other potential drug targets within the EGF signaling cascade 
include, but are not limited to, the kinase active sites of the EGF 
receptor, Raf, MEK, or ERK. 

19. In the reaction catalyzed by adenylate cyclase, the 3’-OH group 
nucleophilically attacks the α-phosphorus atom attached to the
5’-OH group, leading to displacement of pyrophosphate. The 
reaction catalyzed by DNA polymerase is similar except that the 
3’-OH group is on a different nucleotide. 

20. ATP-competitive inhibitors are likely to act on multiple kinases 
because every kinase domain contains an ATP-binding site. Hence, 
these drugs may not be selective for the desired kinase target. 

21. (a) X ≈ 10⁻⁷ M; Y ≈ 5 × 10⁻⁶ M; Z ≈ 10⁻³ M. (b) Because
much less X is required to fill half of the sites, X displays the highest
affinity. (c) The binding affinity almost perfectly matches the ability
to stimulate adenylate cyclase, suggesting that the hormone-receptor
complex leads to the stimulation of adenylate cyclase. (d) Try
performing the experiment in the presence of antibodies to Gαs.

22. (a) The total binding does not distinguish binding to a specific 
receptor from binding to different receptors or from nonspecific 
binding to the membrane. 

(b) The rationale is that the receptor will have a high affinity for 
the ligand. Thus, in the presence of excess nonradioactive ligand, 
the receptor will bind to nonradioactive ligand. Therefore, any 
binding of the radioactive ligand must be nonspecific. 

(c) The plateau suggests that the number of receptor-binding sites 
in the cell membrane is limited. 

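23. Number of receptors per cell =

$$\frac{10^{4}\ \text{cpm}}{\text{mg of membrane protein}} \times \frac{\text{mg of membrane protein}}{10^{10}\ \text{cells}} \times \frac{\text{mmol}}{10^{12}\ \text{cpm}} \times \frac{6.023 \times 10^{20}\ \text{molecules}}{\text{mmol}} = 600$$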
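
The unit conversion in answer 23 can be traced step by step in a few lines. This is only a sketch; the variable names are ours, and the numbers are those appearing in the calculation above.

```python
# Quantities taken from the calculation in answer 23.
bound_cpm_per_mg = 1e4            # specifically bound ligand, cpm per mg protein
cells_per_mg = 1e10               # cells corresponding to 1 mg of membrane protein
cpm_per_mmol = 1e12               # specific activity of the radioactive ligand
molecules_per_mmol = 6.023e20     # Avogadro's number divided by 1000

# cpm per cell -> mmol per cell -> molecules (receptors) per cell
cpm_per_cell = bound_cpm_per_mg / cells_per_mg
mmol_per_cell = cpm_per_cell / cpm_per_mmol
receptors_per_cell = mmol_per_cell * molecules_per_mmol

print(f"receptors per cell ≈ {receptors_per_cell:.0f}")
```

The calculation assumes one ligand bound per receptor, so the number of bound ligand molecules per cell equals the number of receptors.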


Chapter 15 

1. The highly integrated biochemical reactions that take place 
inside the cell. 

2. Anabolism is the set of biochemical reactions that use energy 
to build new molecules and ultimately new cells. Catabolism is the 
set of biochemical reactions that extract energy from fuel sources 
or break down biomolecules.

3. Cellular movements and the performance of mechanical work; 
active transport; biosynthetic reactions. 

4. 1. f; 2. h; 3. i; 4. a; 5. g; 6. b; 7. c; 8. e; 9. j; 10. d.

5. Charge repulsion, resonance stabilization, and stabilization by 
hydration. 

6. Trick question. The answer is not known. Adenine appears to 
form more readily under prebiotic conditions; so ATP may have 
predominated initially. 

7. Having only one nucleotide represent the available energy
allows the cell to better monitor its energy status. 

8. Increasing the concentration of ATP or decreasing the
concentration of cellular ADP or Pᵢ (by rapid removal by other
reactions, for instance) would make the reaction more exergonic.
Likewise, altering the Mg²⁺ concentration could raise or lower the
ΔG of the reaction.

9. The free-energy changes of the individual steps in a pathway 
are summed to determine the overall free-energy change of the 
entire pathway. Consequently, a reaction with a positive free- 
energy value can be powered to take place if coupled to a 
sufficiently exergonic reaction. 

10. Reactions in parts a and c, to the left; reactions in parts b and 
d, to the right. 

11. None whatsoever 

12. (a) ΔG°′ = 31.4 kJ mol⁻¹ (7.5 kcal mol⁻¹) and K′eq = 3.06 ×
10⁻⁶; (b) 3.28 × 10⁴.

13. ΔG°′ = 7.1 kJ mol⁻¹ (1.7 kcal mol⁻¹). The equilibrium ratio
is 17.8.

14. (a) Acetate + CoA + H⁺ goes to acetyl CoA + H₂O, ΔG°′ =
+31.4 kJ mol⁻¹ (+7.5 kcal mol⁻¹). ATP hydrolysis to AMP
and PPᵢ, ΔG°′ = −45.6 kJ mol⁻¹ (−10.9 kcal mol⁻¹). Overall
reaction, ΔG°′ = −14.2 kJ mol⁻¹ (−3.4 kcal mol⁻¹).

(b) With pyrophosphate hydrolysis, ΔG°′ = −33.4 kJ mol⁻¹
(−7.98 kcal mol⁻¹). Pyrophosphate hydrolysis makes the overall
reaction even more exergonic.
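
Because standard free-energy changes of coupled reactions are additive, the totals in answer 14 can be checked by simple summation. In this sketch the thioester-forming step is taken as endergonic (+31.4 kJ mol⁻¹), the sign needed for the two steps to sum to the stated overall value of −14.2 kJ mol⁻¹. The value for pyrophosphate hydrolysis is not stated in the answer; it is backed out from the two totals.

```python
# Standard free-energy changes (kJ/mol) for the two coupled steps in part a.
acetyl_coa_formation = +31.4   # acetate + CoA -> acetyl CoA + H2O (endergonic)
atp_to_amp_ppi = -45.6         # ATP + H2O -> AMP + PPi

overall = acetyl_coa_formation + atp_to_amp_ppi

# Part b gives the total once pyrophosphate is also hydrolyzed.
overall_with_ppi_hydrolysis = -33.4
implied_ppi_hydrolysis = overall_with_ppi_hydrolysis - overall

print(f"overall (part a): {overall:.1f} kJ/mol")
print(f"implied PPi hydrolysis: {implied_ppi_hydrolysis:.1f} kJ/mol")
```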


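15. (a) For an acid AH,

$$\mathrm{AH} \rightleftharpoons \mathrm{A^{-}} + \mathrm{H^{+}} \qquad K = \frac{[\mathrm{A^{-}}][\mathrm{H^{+}}]}{[\mathrm{AH}]}$$

The pK is defined as pK = −log₁₀ K. ΔG°′ is the standard free-
energy change at pH 7. Thus, ΔG°′ = −RT ln K = −2.303 RT
log₁₀ K = +2.303 RT pK.

(b) ΔG°′ = 27.32 kJ mol⁻¹ (6.53 kcal mol⁻¹).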

16. Arginine phosphate in invertebrate muscle, like creatine
phosphate in vertebrate muscle, serves as a reservoir of
high-potential phosphoryl groups. Arginine phosphate maintains a
high level of ATP in muscular exertion.

17. An ADP unit

18. (a) The rationale behind creatine supplementation is that it
would be converted into creatine phosphate and thus serve as a
rapid means of replenishing ATP after muscle contraction. (b) If
creatine supplementation is beneficial, it would affect activities
that depend on short bursts of activity; any sustained activity
would require ATP generation by fuel metabolism, which, as
Figure 15.7 shows, requires more time.

19. Under standard conditions, ΔG°′ = −RT ln [products]/
[reactants]. Substituting 23.8 kJ mol⁻¹ (5.7 kcal mol⁻¹) for ΔG°′
and solving for [products]/[reactants] yields 7 × 10⁻⁵. In other
words, the forward reaction does not take place to a significant
extent. Under intracellular conditions, ΔG is −1.3 kJ mol⁻¹
(−0.3 kcal mol⁻¹). Using the equation ΔG = ΔG°′ + RT ln
[products]/[reactants] and solving for [products]/[reactants] gives
a ratio of 3.7 × 10⁻⁵. Thus, a reaction that is endergonic under
standard conditions can be converted into an exergonic reaction by
maintaining the [products]/[reactants] ratio below the equilibrium
value. This conversion is usually attained by using the products in
another coupled reaction as soon as they are formed.
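
The two ratios in answer 19 can be reproduced approximately by rearranging the equations quoted there. The sketch assumes 25 °C (298 K), which the answer does not state, so the results differ slightly from the rounded values in the text.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ mol^-1 K^-1
T_K = 298.0      # assumed temperature (25 °C); not given in the answer

dg_standard = 23.8  # ΔG°′ in kJ/mol, from the answer
dg_cell = -1.3      # ΔG under intracellular conditions, from the answer

# At equilibrium ΔG = 0, so [products]/[reactants] = exp(-ΔG°′ / RT).
ratio_equilibrium = math.exp(-dg_standard / (R_KJ * T_K))

# In the cell, ΔG = ΔG°′ + RT ln(ratio), so ratio = exp((ΔG - ΔG°′) / RT).
ratio_cell = math.exp((dg_cell - dg_standard) / (R_KJ * T_K))

print(f"equilibrium ratio:   {ratio_equilibrium:.1e}")
print(f"intracellular ratio: {ratio_cell:.1e}")
```

The intracellular ratio comes out below the equilibrium ratio, which is exactly the condition under which the forward reaction is exergonic.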

20. Under standard conditions,

$$K'_{eq} = \frac{[\mathrm{B}]_{eq}}{[\mathrm{A}]_{eq}} \times \frac{[\mathrm{ADP}]_{eq}[\mathrm{P_i}]_{eq}}{[\mathrm{ATP}]_{eq}} = 2.67 \times 10^{2}$$

At equilibrium, the ratio of [B] to [A] is given by

$$\frac{[\mathrm{B}]_{eq}}{[\mathrm{A}]_{eq}} = K'_{eq}\,\frac{[\mathrm{ATP}]_{eq}}{[\mathrm{ADP}]_{eq}[\mathrm{P_i}]_{eq}}$$

The ATP-generating system of cells maintains the [ATP]/[ADP]
[Pᵢ] ratio at a high level, typically about 500 M⁻¹. For this ratio,

$$\frac{[\mathrm{B}]_{eq}}{[\mathrm{A}]_{eq}} = 2.67 \times 10^{2} \times 500 = 1.34 \times 10^{5}$$

This equilibrium ratio is strikingly different from the value of
1.15 × 10⁻³ for the reaction A → B in the absence of ATP
hydrolysis. In other words, coupling the hydrolysis of ATP with
the conversion of A into B has changed the equilibrium ratio of B
to A by a factor of about 10⁸.
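
The size of the shift described in answer 20 follows directly from the three numbers quoted there. A minimal sketch, with variable names of our own choosing:

```python
k_coupled = 2.67e2         # K'eq for A + ATP -> B + ADP + Pi (from the answer)
atp_ratio = 500.0          # [ATP]/([ADP][Pi]) maintained by the cell, in M^-1
k_uncoupled = 1.15e-3      # [B]/[A] at equilibrium without ATP hydrolysis

# Equilibrium ratio of B to A when the reaction is coupled to ATP hydrolysis.
ratio_coupled = k_coupled * atp_ratio

# Fold change in the equilibrium ratio brought about by coupling.
fold_change = ratio_coupled / k_uncoupled

print(f"[B]/[A] with coupling: {ratio_coupled:.3g}")
print(f"fold change: {fold_change:.2g}")
```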

21. Liver: −45.2 kJ mol⁻¹ (−10.8 kcal mol⁻¹); muscle: −48.1 kJ
mol⁻¹ (−11.5 kcal mol⁻¹); brain: −48.5 kJ mol⁻¹ (−11.6 kcal
mol⁻¹). The ΔG is most negative in brain cells.

22. (a) Ethanol; (b) lactate; (c) succinate; (d) isocitrate; (e) malate; 
(f) 2-phosphoglycerate. 

23. Recall that ΔG = ΔG°′ + RT ln [products]/[reactants].
Altering the ratio of products to reactants will cause ΔG to vary. In
glycolysis, the concentrations of the components of the pathway
result in a value of ΔG greater than that of ΔG°′.

24. Unless the ingested food is converted into molecules capable 
of being absorbed by the intestine, no energy can ever be extracted 
by the body. 

25. NADH and FADH₂ are electron carriers for catabolism;
NADPH is the carrier for anabolism. 

26. The electrons of the C=O bond cannot form resonance
structures with the C–S bond that are as stable as those that
they can form with the C–O bond. Thus, the thioester is not
stabilized by resonance to the same degree as an oxygen ester is 
stabilized. 

27. Oxidation–reduction reactions; ligation reactions;
isomerization reactions; group-transfer reactions; hydrolytic 
reactions; the addition of functional groups to double bonds to 
form single bonds or the removal of functional groups to form 
double bonds. 

28. Controlling the amount of enzymes; controlling enzyme 
activity; controlling the availability of substrates. 

29. Although the reaction is thermodynamically favorable, the 
reactants are kinetically stable because of the large activation 
energy. Enzymes lower the activation energy so that reactions take 
place on time scales required by the cell. 

30. The activated form of sulfate in most organisms is 
3'-phosphoadenosine-5’-phosphosulfate. 

31. (a) As the Mg²⁺ concentration falls, the ΔG of hydrolysis
rises. Note that pMg is a logarithmic plot, and so each number on
the x-axis represents a 10-fold change in [Mg²⁺].

(b) Mg²⁺ would bind to the phosphates of ATP and help to mitigate
charge repulsion. As the [Mg²⁺] falls, charge stabilization
of ATP would be less, leading to greater charge repulsion and an
increase in ΔG on hydrolysis.



Chapter 16 

1. Two molecules of ATP are produced per molecule of glycer- 
aldehyde 3-phosphate and, because two molecules of GAP are 
produced per molecule of glucose, the total ATP yield is four. 
However, two molecules of ATP are required to convert glucose 
into fructose 1,6-bisphosphate. Thus, the net yield is only two 
molecules of ATP. 


2. In both cases, the electron donor is glyceraldehyde 3-phosphate.
In lactic acid fermentation, the electron acceptor is pyruvate,
converting it into lactate. In alcoholic fermentation, acetaldehyde
is the electron acceptor, forming ethanol.

3. (a) 3 ATP; (b) 2 ATP; (c) 2 ATP; (d) 2 ATP; (e) 4 ATP.

4. Glucokinase enables the liver to remove glucose from the blood
when hexokinase is saturated, ensuring that glucose is captured for
later use.

5. Glycolysis is a component of alcoholic fermentation, the pathway
that produces alcohol for beer and wine. The belief was that
understanding the biochemical basis of alcohol production might
lead to a more-efficient means of producing beer.

6. The conversion of glyceraldehyde 3-phosphate into
1,3-bisphosphoglycerate would be impaired. Glycolysis would be less
effective.

7. Glucose 6-phosphate must have other fates. Indeed, it can be
converted into glycogen (Chapter 21) or be processed to yield
reducing power for biosynthesis (Chapter 20).

8. The energy needs of a muscle cell vary widely, from rest to
intense exercise. Consequently, the regulation of
phosphofructokinase by energy charge is vital. In other tissues, such
as the liver, ATP concentration is less likely to fluctuate and will
not be a key regulator of phosphofructokinase.

9. The ΔG°′ for the reverse of glycolysis is +96 kJ mol⁻¹
(+23 kcal mol⁻¹), far too endergonic to take place.

10. The conversion of glucose into glucose 6-phosphate by
hexokinase; the conversion of fructose 6-phosphate into fructose
1,6-bisphosphate by phosphofructokinase; the formation of
pyruvate from phosphoenolpyruvate by pyruvate kinase.

11. Lactic acid is a strong acid. If it remained in the cell, the pH of
the cell would fall, which could lead to the denaturation of muscle
protein and result in muscle damage.

12. GLUT2 transports glucose only when the blood concentration
of glucose is high, which is precisely the condition in which the β
cells of the pancreas secrete insulin.

13. Fructose + ATP → fructose 1-phosphate + ADP:
Fructokinase

Fructose 1-phosphate → dihydroxyacetone phosphate +
glyceraldehyde: Fructose 1-phosphate aldolase

Glyceraldehyde + ATP → glyceraldehyde 3-phosphate +
ADP: Triose kinase


The primary controlling step of glycolysis catalyzed by phospho- 
fructokinase is bypassed by the preceding reactions. Glycolysis 
will proceed in an unregulated fashion. 

14. Without triose isomerase, only one of the two three-carbon 
molecules generated by aldolase could be used to generate ATP. 
Only two molecules of ATP would result from the metabolism of 
each glucose. But two molecules of ATP would still be required to 
form fructose 1,6-bisphosphate, the substrate for aldolase. The net 
yield of ATP would be zero, a yield incompatible with life. 

15. Glucose is reactive because its open-chain form contains an 
aldehyde group. 

16. (a) The label is in the methyl carbon atom of pyruvate. 

(b) 5 mCi mM⁻¹. The specific activity is halved because the
number of moles of product (pyruvate) is twice that of the labeled 
substrate (glucose). 

17. (a) Glucose + 2 Pᵢ + 2 ADP → 2 lactate + 2 ATP.

(b) ΔG = −114 kJ mol⁻¹ (−27.2 kcal mol⁻¹).

18. 3.06 × 10⁻⁵

19. The equilibrium concentrations of fructose 1,6-bisphosphate, 
dihydroxyacetone phosphate, and glyceraldehyde 3-phosphate 
are 7.8 × 10⁻⁴ M, 2.2 × 10⁻⁴ M, and 2.2 × 10⁻⁴ M,
respectively. 
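
Concentrations like those in answer 19 arise from solving a quadratic for the aldolase equilibrium. The sketch below makes three assumptions that the answer itself does not state: a ΔG°′ of +23.8 kJ mol⁻¹ for the cleavage reaction, a starting fructose 1,6-bisphosphate concentration of 1 mM with no products present, and a temperature of 25 °C. Under these assumptions the results land close to, but not exactly on, the rounded values above.

```python
import math

R_KJ = 8.314e-3      # gas constant in kJ mol^-1 K^-1
T_K = 298.0          # assumed temperature (25 °C)
DG_STANDARD = 23.8   # assumed ΔG°′ for FBP -> DHAP + GAP, kJ/mol
FBP_INITIAL = 1.0e-3 # assumed starting [FBP] in M, with no products present

# Equilibrium constant (units of M) from the standard free-energy change.
k_eq = math.exp(-DG_STANDARD / (R_KJ * T_K))

# If x M of FBP is cleaved, [DHAP] = [GAP] = x and [FBP] = FBP_INITIAL - x:
#   x^2 / (FBP_INITIAL - x) = k_eq   ->   x^2 + k_eq*x - k_eq*FBP_INITIAL = 0
x = (-k_eq + math.sqrt(k_eq ** 2 + 4.0 * k_eq * FBP_INITIAL)) / 2.0
fbp_eq = FBP_INITIAL - x

print(f"[FBP]  = {fbp_eq:.2e} M")
print(f"[DHAP] = [GAP] = {x:.2e} M")
```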

20. All three carbon atoms of 2,3-BPG are ¹⁴C labeled. The
phosphorus atom attached to the C-2 hydroxyl group is ³²P
labeled.

21. Hexokinase has a low ATPase activity in the absence of a 
sugar because it is in a catalytically inactive conformation. The 
addition of xylose closes the cleft between the two lobes of the 
enzyme. However, xylose lacks a hydroxymethyl group, and so
it cannot be phosphorylated. Instead, a water molecule at the site
normally occupied by the C-6 hydroxymethyl group acts as the 
acceptor of the phosphoryl group from ATP. 

22. (a) The fructose 1-phosphate pathway forms glyceraldehyde 
3-phosphate. 

(b) Phosphofructokinase, a key control enzyme, is bypassed. 
Furthermore, fructose 1-phosphate stimulates pyruvate kinase. 
23. The reverse of glycolysis is highly endergonic under cellular 
conditions. The expenditure of six NTP molecules in gluconeo- 
genesis renders gluconeogenesis exergonic. 

24. Lactic acid is capable of being further oxidized and is thus 
useful energy. The conversion of this acid into glucose saves the 
carbon atoms for future combustion. 

25. In glycolysis, the formation of pyruvate and ATP by pyru- 
vate kinase is irreversible. This step is bypassed by two reac- 
tions in gluconeogenesis: (1) the formation of oxaloacetate from 
pyruvate and CO₂ by pyruvate carboxylase and (2) the formation
of phosphoenolpyruvate from oxaloacetate and GTP by
phosphoenolpyruvate carboxykinase. The formation of fructose 
1,6-bisphosphate by phosphofructokinase is bypassed by fruc- 
tose 1,6-bisphosphatase in gluconeogenesis, which catalyzes the 
conversion of fructose 1,6-bisphosphate into fructose 6-phosphate. 
Finally, the hexokinase-catalyzed formation of glucose 
6-phosphate in glycolysis is bypassed by glucose 6-phosphatase, 
but only in the liver. 

26. Reciprocal regulation at the key allosteric enzymes in the
two pathways. For instance, PFK is stimulated by fructose
2,6-bisphosphate and AMP. These signals have the opposite effect
on fructose 1,6-bisphosphatase. If both pathways were operating
simultaneously, a futile cycle would result. ATP would be
hydrolyzed, yielding only heat.

27. Muscle is likely to produce lactic acid during contraction. 
Lactic acid is a strong acid and cannot accumulate in muscle or 
blood. Liver removes the lactic acid from the blood and converts it 
into glucose. The glucose can be released into the blood or stored 
as glycogen for later use. 

28. Glucose produced by the liver could not be released into the 
blood. Tissues that rely on glucose as an energy source would not 
function as well unless glucose was provided in the diet. 

29. Glucose is an important energy source for both tissues and
is essentially the only energy source for the brain. Consequently,
these tissues should never release glucose. Glucose release is pre- 
vented by the absence of glucose 6-phosphatase. 

30. 6 NTP (4 ATP and 2 GTP); 2 NADH. 

31. (a) None; (b) none; (c) 4 (2 ATP and 2 GTP); (d) none. 

32. If the amino groups are removed from alanine and aspartate,
the ketoacids pyruvate and oxaloacetate are formed. Both of these 
molecules are components of the gluconeogenic pathway. 

33. (a) Increased; (b) increased; (c) increased; (d) decreased. 

34. Fructose 2,6-bisphosphate, present at high concentration 
when glucose is abundant, normally inhibits gluconeogenesis by 
blocking fructose 1,6-bisphosphatase. In this genetic disorder, the 
phosphatase is active irrespective of the glucose level. Hence, sub- 
strate cycling is increased. The level of fructose 1,6-bisphosphate is 
consequently lower than normal. Less pyruvate is formed and thus
less ATP is generated. 

35. Reactions in parts b and e would be blocked. 

36. There will be no labeled carbons. The CO₂ added to pyruvate
(formed from the lactate) to form oxaloacetate is lost with the con- 
version of oxaloacetate into phosphoenolpyruvate. 

37. The net reaction in the presence of arsenate is 


Glyceraldehyde 3-phosphate + NAD⁺ + H₂O →
3-phosphoglycerate + NADH + 2 H⁺


Glycolysis proceeds in the presence of arsenate, but the ATP
normally formed in the conversion of 1,3-bisphosphoglycerate
into 3-phosphoglycerate is lost. Thus, arsenate uncouples
oxidation and phosphorylation by forming a highly labile acyl
arsenate.

38. This example illustrates the difference between the
stoichiometric and the catalytic use of a molecule. If cells used NAD⁺
stoichiometrically, a new molecule of NAD⁺ would be required
each time a molecule of lactate was produced. As we will see,
the synthesis of NAD⁺ requires ATP. On the other hand, if
the NAD⁺ that is converted into NADH could be recycled and
reused, a small amount of the molecule could generate a vast
amount of lactate, which is the case in the cell. NAD⁺ is
regenerated by the oxidation of NADH and reused. NAD⁺ is thus used
catalytically.

39. Consider the equilibrium equation of adenylate kinase:

$$K_{eq} = \frac{[\mathrm{ATP}][\mathrm{AMP}]}{[\mathrm{ADP}]^{2}} \qquad (1)$$

or

$$[\mathrm{AMP}] = \frac{K_{eq}[\mathrm{ADP}]^{2}}{[\mathrm{ATP}]} \qquad (2)$$

Recall that [ATP] > [ADP] > [AMP] in the cell. As ATP is
utilized, a small decrease in its concentration will result in a
larger percentage increase in [ADP] because the concentration of
ATP is greater than that of ADP. This larger percentage increase
in [ADP] will result in an even greater percentage increase in
[AMP] because the concentration of AMP is related to the
square of [ADP]. In essence, equation 2 shows that monitoring
the energy status with AMP magnifies small changes in [ATP],
leading to tighter control.
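
The amplification described in answer 39 can be illustrated numerically from the adenylate kinase equilibrium, [AMP] = Keq [ADP]²/[ATP]. All concentrations and the value of Keq below are hypothetical, chosen only so that [ATP] > [ADP] > [AMP]. As a simplification, each ATP consumed is assumed to appear as ADP, ignoring the small amount that redistributes into AMP.

```python
def amp_from_equilibrium(k_eq, atp, adp):
    """[AMP] from the adenylate kinase equilibrium: K_eq * [ADP]^2 / [ATP]."""
    return k_eq * adp ** 2 / atp


def percent_change(before, after):
    """Percentage change from `before` to `after`."""
    return 100.0 * (after - before) / before


K_EQ = 1.0                          # hypothetical equilibrium constant
atp_before, adp_before = 5.0, 0.5   # hypothetical concentrations, mM
atp_after, adp_after = 4.9, 0.6     # 0.1 mM of ATP converted into ADP

amp_before = amp_from_equilibrium(K_EQ, atp_before, adp_before)
amp_after = amp_from_equilibrium(K_EQ, atp_after, adp_after)

atp_change = percent_change(atp_before, atp_after)
adp_change = percent_change(adp_before, adp_after)
amp_change = percent_change(amp_before, amp_after)

print(f"ATP: {atp_change:+.1f}%  ADP: {adp_change:+.1f}%  AMP: {amp_change:+.1f}%")
```

With these illustrative numbers, a drop of a few percent in ATP becomes a change of tens of percent in AMP, which is the magnification the answer describes.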

40. The synthesis of glucose during intense exercise provides
a good example of interorgan cooperation in higher organisms.
When muscle is actively contracting, lactate is produced from 
glucose by glycolysis. The lactate is released into the blood and 
absorbed by the liver, where it is converted by gluconeogenesis 
into glucose. The newly synthesized glucose is then released and 
taken up by the muscle for energy generation. 

41. The input of four additional high-phosphoryl-transfer-potential
molecules in gluconeogenesis changes the equilibrium constant
by a factor of 10³², which makes the conversion of pyruvate into
glucose thermodynamically feasible. Without this energetic input,
gluconeogenesis would not take place.

42. The mechanism is analogous to that for triose phosphate 
isomerase (Figure 16.5). It proceeds through an enediol interme- 
diate. The active site would be expected to have a general base 
(analogous to Glu 165 in TPI) and a general acid (analogous to 
His 95 in TPI). 

43. Galactose is a component of glycoproteins. Possibly, the 
absence of galactose leads to the improper formation or function of 
glycoproteins required in the central nervous system. More gener- 
ally, the fact that the symptoms arise in the absence of galactose 
suggests that galactose is required in some fashion. 

44. Fructose 2,6-bisphosphate stabilizes the R state of the enzyme. 


45. (a) Curiously, the enzyme uses ADP as the phosphoryl donor 
rather than ATP. 

(b) Both AMP and ATP behave as competitive inhibitors of ADP, 
the phosphoryl donor. Apparently, the P. furiosus enzyme is not 
allosterically inhibited by ATP. 

46. (a) If both enzymes operated simultaneously, the following 
reactions would take place: 


Fructose 6-phosphate + ATP → fructose 1,6-bisphosphate + ADP:
Phosphofructokinase

Fructose 1,6-bisphosphate + H₂O → fructose 6-phosphate + Pᵢ:
Fructose 1,6-bisphosphatase


The net result would be simply:

ATP + H₂O → ADP + Pᵢ


The energy of ATP hydrolysis would be released as heat. 

(b) Not really. For the cycle to generate heat, both enzymes must 
be functional at the same time in the same cell. 

(c) The species B. terrestris and B. rufocinctus might show some 
futile cycling because both enzymes are active to a substantial 
degree. 

(d) No. These results simply suggest that simultaneous activity of 
phosphofructokinase and fructose 1,6-bisphosphatase is unlikely 
to be employed to generate heat in the species shown. 


Chapter 17 

1. Pyruvate dehydrogenase catalyzes the decarboxylation of
pyruvate and the formation of acetyllipoamide. Dihydrolipoyl
transacetylase catalyzes the formation of acetyl CoA. Dihydrolipoyl
dehydrogenase catalyzes the reduction of the oxidized lipoic acid.
The kinase associated with the complex phosphorylates and
inactivates the complex, whereas the phosphatase dephosphorylates
and activates the complex.

2. Thiamine pyrophosphate plays a role in the decarboxylation
of pyruvate. Lipoic acid (as lipoamide) transfers the acetyl group.
Coenzyme A accepts the acetyl group from lipoic acid to form
acetyl CoA. FAD accepts the electrons and hydrogen ions when
reduced lipoic acid is oxidized. NAD⁺ accepts electrons from
FADH₂.

3. Catalytic coenzymes (TPP, lipoic acid, and FAD) are
modified but regenerated in each reaction cycle. Thus, they can
play a role in the processing of many molecules of pyruvate.
Stoichiometric coenzymes (coenzyme A and NAD⁺) are used in
only one reaction because they are the components of products of
the reaction.

4. The advantages are as follows:
The reaction is facilitated by having the active sites in proximity.
The reactants do not leave the enzyme until the final product
is formed.
Constraining the reactants minimizes loss due to diffusion
and minimizes side reactions.
All of the enzymes are present in the correct amounts.
Regulation is more efficient because the regulatory
enzymes (the kinase and phosphatase) are part of the
complex.

5. (a) After one round of the citric acid cycle, the label emerges in
C-2 and C-3 of oxaloacetate. (b) The label emerges in CO₂ in the
formation of acetyl CoA from pyruvate. (c) After one round of the
citric acid cycle, the label emerges in C-1 and C-4 of oxaloacetate.
(d) and (e) Same fate as that in part a.


6. (a) Isocitrate lyase and malate synthase are required in addition 
to the enzymes of the citric acid cycle. 

(b) 2 Acetyl CoA + 2 NAD⁺ + FAD + 3 H₂O → oxaloacetate +
2 CoA + 2 NADH + FADH₂ + 3 H⁺.

(c) No. Hence, mammals cannot carry out the net synthesis of 
oxaloacetate from acetyl CoA. 

7. −41.0 kJ mol⁻¹ (−9.8 kcal mol⁻¹)

8. Enzymes or enzyme complexes are biological catalysts. Recall 
that a catalyst facilitates a chemical reaction without the catalyst 
itself being permanently altered. Oxaloacetate can be thought of as 
a catalyst because it binds to an acetyl group, leads to the oxidative 
decarboxylation of the two carbon atoms, and is regenerated at the 
completion of a cycle. In essence, oxaloacetate (and any cycle inter- 
mediate) acts as a catalyst. 

9. Thiamine thiazolone pyrophosphate is a transition-state analog. 
The sulfur-containing ring of this analog is uncharged, and so it 
closely resembles the transition state of the normal coenzyme in 
thiamine-catalyzed reactions (e.g., the uncharged resonance form 
of hydroxyethyl-TPP). 

10. A decrease in the amount of O₂ will necessitate an increase in
anaerobic glycolysis for energy production, leading to the genera- 
tion of a large amount of lactic acid. Under conditions of shock, the 
kinase inhibitor is administered to ensure that pyruvate dehydro- 
genase is operating maximally. 

11. Acetyllipoamide and acetyl CoA 

12. In muscle, the acetyl CoA generated by the complex is used 
for energy generation. Consequently, signals that indicate an 
energy-rich state (high ratios of ATP/ADP and NADH/NAD⁺)
inhibit the complex, whereas the reverse conditions stimulate
the enzyme. Calcium as the signal for muscle contraction (and,
hence, energy need) also stimulates the enzyme. In liver, acetyl 
CoA derived from pyruvate is used for biosynthetic purposes, 
such as fatty acid synthesis. Insulin, the hormone denoting the 
fed state, stimulates the complex. 

13. (a) Enhanced kinase activity will result in a decrease in the 
activity of the PDH complex because phosphorylation by the 
kinase inhibits the complex. 

(b) Phosphatase activates the complex by removing a phosphate. 
If the phosphatase activity is diminished, the activity of the PDH 
complex also will decrease. 

14. She might have been ingesting, in some fashion, the arsenite 
from the peeling paint or the wallpaper. Also, she might have 
been breathing arsine gas from the wallpaper, which would be 
oxidized to arsenite in her body. In any of these circumstances, the 
arsenite inhibited enzymes that require lipoic acid—notably, the 
PDH complex. 

15. The TCA cycle depends on a steady supply of NAD+, which
is typically generated from NADH by reaction of the NADH with 
oxygen. If there is no oxygen to accept the electrons, the citric acid 
cycle will cease to operate. 

16. (a) The steady-state concentrations of the products are low 
compared with those of the substrates. (b) The ratio of malate to 
oxaloacetate must be greater than 1.57 × 10⁴ for oxaloacetate to
be formed. 


17.
Pyruvate + CoA + NAD+ → acetyl CoA + CO2 + NADH
(pyruvate dehydrogenase complex)

Pyruvate + CO2 + ATP + H2O → oxaloacetate + ADP + Pi + H+
(pyruvate carboxylase)

Oxaloacetate + acetyl CoA + H2O → citrate + CoA + H+
(citrate synthase)

Citrate → isocitrate
(aconitase)

Isocitrate + NAD+ → α-ketoglutarate + CO2 + NADH
(isocitrate dehydrogenase)

Net: 2 Pyruvate + 2 NAD+ + ATP + H2O →
α-ketoglutarate + CO2 + ADP + Pi + 2 NADH + 3 H+


18. Succinate will increase in concentration, followed by 
α-ketoglutarate and the other intermediates “upstream” of the site
of inhibition. Succinate has two methylene groups that are required 
for the dehydrogenation, whereas malonate has but one. 

19. Pyruvate carboxylase should be active only when the acetyl 
CoA concentration is high. Acetyl CoA might accumulate if the 
energy needs of the cell are not being met, because of a deficiency 
of oxaloacetate. Under these conditions the pyruvate carboxylase 
catalyzes an anaplerotic reaction. Alternatively, acetyl CoA might
accumulate because the energy needs of the cell have been met. In 
this circumstance, pyruvate will be converted back into glucose, and 
the first step in this conversion is the formation of oxaloacetate. 

20. The energy released when succinate is oxidized to fumarate is
not sufficient to power the synthesis of NADH but is sufficient to 
reduce FAD. 

21. Citrate is a tertiary alcohol that cannot be oxidized, because 
oxidation requires a hydrogen atom to be removed from the alco- 
hol and a hydrogen atom to be removed from the carbon atom 
bonded to the alcohol. No such hydrogen exists in citrate. The 
isomerization converts the tertiary alcohol into isocitrate, which is 
a secondary alcohol that can be oxidized. 

22. Because the enzyme nucleoside diphosphokinase transfers a 
phosphoryl group from GTP (or any nucleoside triphosphate) to 
ADP according to the reversible reaction: 


GTP + ADP ⇌ GDP + ATP


23. The reaction is powered by the hydrolysis of a thioester. 
Acetyl CoA provides the thioester that is converted into citryl 
CoA. When this thioester is hydrolyzed, citrate is formed in an 
irreversible reaction. 

24. We cannot get the net conversion of fats into glucose, because 
the only means to get the carbon atoms from fats into oxaloacetate, 
the precursor of glucose, is through the citric acid cycle. However, 
although two carbon atoms enter the cycle as acetyl CoA, two 
carbon atoms are lost as CO2 before oxaloacetate is formed. Thus,
although some carbon atoms from fats may end up as carbon atoms 
in glucose, we cannot obtain a net synthesis of glucose from fats. 

25. Acetyl CoA will inhibit the complex. Glucose metabolism to 
pyruvate will be slowed because acetyl CoA is being derived from 
an alternative source. 

26. The enol intermediate of acetyl CoA attacks the carbonyl 
carbon atom of glyoxylate to form a C—C bond. This reaction is 
like the condensation of oxaloacetate with the enol intermediate of 
acetyl CoA in the reaction catalyzed by citrate synthase. Glyoxylate 
contains a hydrogen atom in place of the -CH2COO− group of
oxaloacetate; the reactions are otherwise nearly identical. 

27. Citrate is a symmetric molecule. Consequently, the inves-
tigators assumed that the two -CH2COO− groups in it would
react identically. Thus, for every citrate molecule undergoing 
the reactions shown in path 1, they thought that another citrate
molecule would react as shown in path 2. If so, then only half the
label should have emerged in the CO2.


[Reaction schemes: Path 1 and Path 2 (does not occur).]


28. Call one hydrogen atom A and the other B. Now suppose that 
an enzyme binds three groups of this substrate—X, Y, and H—at 
three complementary sites. The adjoining diagram shows X, Y, and 
H_A bound to three points on the enzyme. In contrast, X, Y, and H_B
cannot be bound to this active site; two of these three groups can be
bound, but not all three. Thus, H_A and H_B will have different fates.


Sterically nonequivalent groups such as H_A and H_B will almost
always be distinguished in enzymatic reactions. The essence of the 
differentiation of these groups is that the enzyme holds the substrate 
in a specific orientation. Attachment at three points, as depicted in 
the diagram, is a readily visualized way of achieving a particular ori- 
entation of the substrate, but it is not the only means of doing so. 

29. (a) The complete oxidation of citrate requires 4.5 µmol of O2
for every micromole of citrate.


C6H8O7 + 4.5 O2 → 6 CO2 + 4 H2O


Thus, 13.5 µmol of O2 would be consumed by 3 µmol of citrate.
(b) Citrate led to the consumption of far more O2 than can be 
accounted for simply by the oxidation of citrate itself. Citrate thus 
facilitated O2 consumption. 
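
The arithmetic in part a can be checked with a short script. This is only a sketch: it assumes the molecular formula C6H8O7 used in the equation above, and the helper name `total_atoms` is hypothetical. It confirms that the equation is balanced and scales the O2 requirement to 3 µmol of citrate.

```python
from collections import Counter

# Atom counts for each species (citric acid written as C6H8O7).
CITRATE = Counter({"C": 6, "H": 8, "O": 7})
O2 = Counter({"O": 2})
CO2 = Counter({"C": 1, "O": 2})
H2O = Counter({"H": 2, "O": 1})


def total_atoms(side):
    """Sum atom counts over (coefficient, species) pairs on one side."""
    totals = Counter()
    for coeff, species in side:
        for atom, n in species.items():
            totals[atom] += coeff * n
    return totals


reactants = [(1, CITRATE), (4.5, O2)]
products = [(6, CO2), (4, H2O)]

# The equation is balanced if both sides carry the same atoms.
balanced = total_atoms(reactants) == total_atoms(products)

O2_PER_CITRATE = 4.5   # µmol of O2 per µmol of citrate, from the equation
citrate_umol = 3.0     # amount of citrate considered in part a
o2_needed_umol = O2_PER_CITRATE * citrate_umol

print(balanced, o2_needed_umol)
```

Any measured O2 uptake well above `o2_needed_umol` is the observation that part b interprets as citrate facilitating O2 consumption.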


30. (a) In the absence of arsenite, the amount of citrate remained 
constant. In its presence, the concentration of citrate fell, suggest- 
ing that it was being metabolized. 

(b) The action of arsenite is not altered. Citrate still disappears. 

(c) Arsenite is preventing the regeneration of citrate. Recall
(pp. 517–518) that arsenite inhibits the pyruvate dehydrogenase complex.

31. (a) The initial infection is unaffected by the absence of isoci- 
trate lyase, but the absence of this enzyme inhibits the latent phase 
of the infection. 

(b) Yes 

(c) A critic could say that, in the process of deleting the isocitrate 
lyase gene, some other gene was damaged, and it is the absence of 
this other gene that prevents latent infection. Reinserting the isoci- 
trate lyase gene into the bacteria from which it had been removed 
renders the criticism less valid. 

(d) Isocitrate lyase enables the bacteria to synthesize carbohydrates 
that are necessary for survival, including carbohydrate components 
of the cell membrane. 


Chapter 18 

1. In fermentations, organic compounds are both the donors and 
the acceptors of electrons. In respiration, the electron donor is 
usually an organic compound, whereas the electron acceptor is an 
inorganic molecule, such as oxygen. 

2. Biochemists use E′0, the value at pH 7, whereas chemists use
E0, the value in 1 M H+. The prime denotes that pH 7 is the
standard state. 

3. The reduction potential of FADH2 is less negative than that of NADH
(see Table 18.1). Consequently, when those electrons are passed
along to oxygen, less energy is released. The consequence of the
difference is that electron flow from FADH2 to O2 pumps fewer
protons than do the electrons from NADH.

4. The ΔG°′ for the reduction of oxygen by FADH2 is
−200 kJ mol⁻¹ (−48 kcal mol⁻¹).

5. ΔG°′ is +67 kJ mol⁻¹ (+16.1 kcal mol⁻¹) for oxidation by
NAD+ and −3.8 kJ mol⁻¹ (−0.92 kcal mol⁻¹) for oxidation by
FAD. The oxidation of succinate by NAD+ is not thermodynami-
cally feasible.

6. Pyruvate accepts electrons and is thus the oxidant. NADH 
gives up electrons and is the reductant. 

7. ΔG°′ = −nFΔE′0
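
The relation in answer 7 can be sketched as a small function. The Faraday constant is taken as 96.48 kJ mol⁻¹ V⁻¹; the function name and the ΔE′0 inputs of ±0.35 V are illustrative assumptions, not values given in these answers.

```python
FARADAY_KJ_PER_V_MOL = 96.48  # kJ mol^-1 V^-1


def delta_g_standard(n_electrons, delta_e0_prime_volts):
    """Return the standard free-energy change (kJ/mol) as -n * F * dE'0."""
    return -n_electrons * FARADAY_KJ_PER_V_MOL * delta_e0_prime_volts


# Illustrative two-electron transfers (assumed potentials differences):
favorable = delta_g_standard(2, +0.35)    # positive dE'0 gives negative dG
unfavorable = delta_g_standard(2, -0.35)  # negative dE'0 gives positive dG

print(round(favorable, 1), round(unfavorable, 1))
```

With the assumed ΔE′0 of −0.35 V, the result is about +67.5 kJ mol⁻¹, which is the same size as the unfavorable value quoted in answer 5; the sign convention is the point of the sketch.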

8. The ΔE′0 value of iron can be altered by changing the environment
of the ion. 

9. c,e,b,a,d. 

10. (a) 4; (b) 3; (c) 1; (d) 5; (e) 2. 

11. The 10 isoprene units render coenzyme Q soluble in the hydro- 
phobic environment of the inner mitochondrial membrane. The two 
oxygen atoms can reversibly bind two electrons and two protons as 
the molecule transitions between the quinone form and quinol form. 
12. Rotenone: NADH, NADH-Q oxidoreductase will be reduced. 
The remainder will be oxidized. Antimycin A: NADH, NADH-Q 
oxidoreductase and coenzyme Q will be reduced. The remainder 
will be oxidized. Cyanide: All will be reduced. 

13. Complex I would be reduced, whereas Complexes II, III, and
IV would be oxidized. The citric acid cycle would become reduced 
because it has no way to oxidize NADH. 

14. The respirasome is another example of the use of supra- 
molecular complexes in biochemistry. Having the three complexes 
that are proton pumps associated with one another will enhance 
the efficiency of electron flow from complex to complex, which in 
turn will cause more-efficient proton pumping. 

15. Hydroxyl radical (OH·), hydrogen peroxide (H2O2), super-
oxide ion (O2·−), and peroxide (O2²⁻). These small molecules react
with a host of macromolecules—including proteins, nucleotides, 
and membranes—to disrupt cell structure and function. 

16. The ATP is recycled by ATP-generating processes, most 
notably oxidative phosphorylation. 

17. (a) 12.5; (b) 14; (c) 32; (d) 13.5; (e) 30; (f) 16. 

18. (a) It blocks electron transport and proton pumping at Complex 
IV. (b) It blocks electron transport and ATP synthesis by inhibit- 
ing the exchange of ATP and ADP across the inner mitochondrial 
membrane. (c) It blocks electron transport and proton pumping at 
Complex I. (d) It blocks ATP synthesis without inhibiting electron 
transport by dissipating the proton gradient. (e) It blocks electron 
transport and proton pumping at Complex IV. (f) It blocks electron 
transport and proton pumping at Complex III.

19. If the proton gradient is not dissipated by the influx of protons
into a mitochondrion with the generation of ATP, eventually the 
outside of the mitochondrion develops such a large positive charge 
that the electron-transport chain can no longer pump protons 
against the gradient. 

20. The subunits are jostled by background thermal energy 
(Brownian motion). The proton gradient makes clockwise rotation 
more likely because that direction results in protons flowing down 
their concentration gradient. 

21. In the presence of poorly functioning mitochondria, the only 
means of generating ATP is by anaerobic glycolysis, which will 
lead to an accumulation of lactic acid in blood. 

22. If ADP cannot get into mitochondria, the electron-transport 
chain will cease to function because there will be no acceptor for 
the energy. NADH will build up in the matrix. Recall that NADH 
inhibits some citric acid cycle enzymes and that NAD+ is required
by several citric acid cycle enzymes. Glycolysis will stop function-
ing aerobically but will switch to anaerobic glycolysis so that the
NADH can be reoxidized to NAD+ by lactate dehydrogenase.

23. (a) No effect; mitochondria cannot metabolize glucose. 

(b) No effect; no fuel is present to power the synthesis of ATP. 

(c) The [O2] falls because citrate is a fuel and ATP can be formed
from ADP and Pi.

(d) Oxygen consumption stops because oligomycin inhibits ATP
synthesis, which is coupled to the activity of the electron-transport
chain.

(e) No effect, for the reasons given in part d.

(f) [O2] falls rapidly because the system is uncoupled and does not
require ATP synthesis to lower the proton-motive force.

(g) [O2] falls, though at a lower rate. Rotenone inhibits Complex I,
but the presence of succinate will enable electrons to enter at
Complex II.

(h) Oxygen consumption ceases because Complex IV is inhibited
and the entire chain backs up. 

24. (a) The P:O ratio is equal to the product of (H+/2 e−) and
(P/H+). Note that the P:O ratio is identical with the P:2 e− ratio.
(b) 2.5 and 1.5, respectively. 
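
The product in part a can be written out numerically. The sketch below assumes the commonly used textbook stoichiometries, which are not stated in this answer: 10 protons pumped per electron pair from NADH, 6 per pair from FADH2, and 4 protons consumed per ATP made and exported. The function name is hypothetical.

```python
def p_to_o_ratio(protons_per_electron_pair, protons_per_atp):
    """P:O ratio as (H+ per 2 e-) multiplied by (P per H+)."""
    return protons_per_electron_pair * (1.0 / protons_per_atp)


PROTONS_PER_ATP = 4        # assumed: protons needed per ATP synthesized and exported
PROTONS_FROM_NADH = 10     # assumed: H+ pumped per electron pair from NADH
PROTONS_FROM_FADH2 = 6     # assumed: H+ pumped per electron pair from FADH2

nadh_ratio = p_to_o_ratio(PROTONS_FROM_NADH, PROTONS_PER_ATP)
fadh2_ratio = p_to_o_ratio(PROTONS_FROM_FADH2, PROTONS_PER_ATP)

print(nadh_ratio, fadh2_ratio)
```

Under these assumptions the two ratios come out to the values given in part b.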

25. Cyanide can be lethal because it binds to the ferric form of 
cytochrome oxidase and thereby inhibits oxidative phosphoryla- 
tion. Nitrite converts ferrohemoglobin into ferrihemoglobin, 
which also binds cyanide. Thus, ferrihemoglobin competes with 
cytochrome oxidase for cyanide. This competition is therapeuti- 
cally effective because the amount of ferrihemoglobin that can be 
formed without impairing oxygen transport is much greater than 
the amount of cytochrome oxidase. 

26. Such a defect (called Luft syndrome) was found in a 38-year- 
old woman who was incapable of performing prolonged physical 
work. Her basal metabolic rate was more than twice normal, but 
her thyroid function was normal. A muscle biopsy showed that 
her mitochondria were highly variable and atypical in structure.
Biochemical studies then revealed that oxidation and phospho-
rylation were not tightly coupled in these mitochondria. In this
patient, much of the energy of fuel molecules was converted into 
heat rather than ATP. 

27. Triose phosphate isomerase converts dihydroxyacetone phos- 
phate (a potential dead end) into glyceraldehyde 3-phosphate 
(a mainstream glycolytic intermediate). 

28. This inhibitor (like antimycin A) blocks the reduction of 
cytochrome c1 by QH2, the crossover point.

29. If oxidative phosphorylation were uncoupled, no ATP could 
be produced. In a futile attempt to generate ATP, much fuel would 
be consumed. The danger lies in the dose. Too much uncoupling 
would lead to tissue damage in highly aerobic organs such as the 
brain and heart, which would have severe consequences for the 
organism as a whole. The energy that is normally transformed into 
ATP would be released as heat. To maintain body temperature, 
sweating might increase, although the very process of sweating 
itself depends on ATP. 

30. If ATP and ADP cannot exchange between the matrix and the
cytoplasm, ATP synthase will cease to function because its sub-
strate ADP is absent. The proton gradient will eventually become
so large that the energy released by the electron-transport chain 
will not be great enough to pump protons against the larger-than- 
normal gradient. 

31. Add the inhibitor with and without an uncoupler, and 
monitor the rate of O2 consumption. If the O2 consumption 
increases again in the presence of inhibitor and uncoupler, the 
inhibitor must be inhibiting ATP synthase. If the uncoupler 
has no effect on the inhibition, the inhibitor is inhibiting the 
electron-transport chain. 

32. Presumably, because the muscle has greater energy needs, espe- 
cially during exercise, it will require more ATP. This requirement 
means that more sites of oxidative phosphorylation are called for, and 
these sites can be provided by an increase in the amount of cristae. 
33. The arginine residue, with its positive charge, will facilitate 
proton release from aspartic acid by stabilizing the negatively 
charged aspartate. 

34. 4; 4.7 

35. The ATP synthase would pump protons at the expense of 
ATP hydrolysis, thus maintaining the proton-motive force. The 
synthase would function as an ATPase. There is some evidence 
that damaged mitochondria use this tactic to maintain, at least 
temporarily, the proton-motive force. 

36. It suggests that malfunctioning mitochondria may play a role 
in the development of Parkinson disease. Specifically, it implicates 
Complex I. 

37. The extra negative charge on ATP relative to that on ADP 
accounts for ATP’s more-rapid translocation out of the mitochon- 
drial matrix. If the charge differences between ATP and ADP were 
lessened by the binding of Mg2+, ADP might more readily com-
pete with ATP for transport to the cytoplasm.

38. When all of the available ADP has been converted into
ATP, ATP synthase can no longer function. The proton gradient
becomes large enough that the energy of the electron-transport
chain is not enough to pump against the gradient, and electron
transport and, hence, oxygen consumption fall.

39. The effect on the proton gradient is the same in each case. 

40. ATP export from the matrix. Phosphate import into the matrix. 
41. Recall from the discussion of enzyme-catalyzed reactions that the
direction of a reaction is determined by the ΔG difference between
substrate and products. An enzyme speeds up the rate of both the
forward and the backward reactions. The hydrolysis of ATP is exer-
gonic, and so ATP synthase will enhance the hydrolytic reaction.

42. The cytoplasmic kinases thereby obtain preferential access
to the exported ATP.

43. The organic acids in the blood are indications that the mice are 
deriving a large part of their energy needs through anaerobic gly- 
colysis. Lactate is the end product of anaerobic glycolysis. Alanine 
is an aminated transport form of lactate. Alanine formation plays a 
role in succinate formation, which is caused by the reduced state of 
the mitochondria. 


[Scheme: pathway from aspartate through malate and fumarate to
succinate, drawing on NADH and FADH2, with pyruvate converted
into alanine and lactate.]


The electron-transport chain is slowed because the inner mito- 
chondrial membrane is hyperpolarized. Without ADP to accept 
the energy of the proton-motive force, the membrane becomes 
polarized to such an extent that protons can no longer be pumped. 
The excess H2O2 is probably due to the fact that the superoxide
radical is present in higher concentration because the oxygen can 
no longer be effectively reduced. 


O2·− + O2·− + 2 H+ → O2 + H2O2


Indeed, these mice display evidence of such oxidative damage. 

44. (a) Vitamins C and E. 

(b) Exercise induces superoxide dismutase, which converts ROS into
hydrogen peroxide and oxygen.

(c) The answer to this question is not fully established. Two pos- 
sibilities are (1) the suppression of ROS by vitamins prevents the 
expression of more superoxide dismutase and (2) some ROS may be 
signal molecules required to stimulate insulin-sensitivity pathways. 
45. (a) Succinate is oxidized by Complex II, and the electrons are
used to establish a proton-motive force that powers ATP synthesis. 
(b) The ability to synthesize ATP is greatly reduced. 

(c) Because the goal was to measure ATP hydrolysis. If succinate 
had been added in the presence of ATP, no reaction would have 
taken place, because of respiratory control. 

(d) The mutation has little effect on the ability of the enzyme to 
catalyze the hydrolysis of ATP. 

(e) They suggest two things: (1) the mutation did not affect the 
catalytic site on the enzyme, because ATP synthase is still capable 
of catalyzing the reverse reaction, and (2) the mutation did not 
affect the amount of enzyme present, given that the controls and 
patients had similar amounts of activity. 

46. The absolute configuration of thiophosphate indicates that
inversion at phosphorus has taken place in the reaction catalyzed by
ATP synthase. This result is consistent with an in-line phosphoryl-
transfer reaction taking place in a single step. The retention of con-
figuration in the Ca2+-ATPase reaction points to two phosphoryl-
transfer reactions: inversion by the first and a return to the starting
configuration by the second. The Ca2+-ATPase reaction proceeds
by a phosphorylated enzyme intermediate.


Chapter 19 

1. Photosystem I generates ferredoxin, which reduces NADP+ to
NADPH, a biosynthetic reducing power. Photosystem II activates
the manganese complex, an oxidant capable of oxidizing water,
generating electrons for photosynthesis, and generating protons
to form a proton gradient and to reduce NADP+ and O2.


2. The light reactions take place on thylakoid membranes. 
Increasing the membrane surface increases the number of ATP-
and NADPH-generating sites.

3. These complexes absorb more light than can a reaction center 
alone. The light-harvesting complexes funnel light to the reaction 
centers. 

4. NADP+ is the acceptor. H2O is the donor. Light energy.

5. The charge gradient, a component of the proton-motive force in 
mitochondria, is neutralized by the influx of Mg2+ into the lumen
of the thylakoid membranes. 

6. Chlorophyll is readily inserted into the hydrophobic interior of 
the thylakoid membranes. 

7. Protons released by the oxidation of water; protons pumped 
into the lumen by the cytochrome bf complex; protons removed 
from the stroma by the reduction of NADP+ and plastoquinone.
8. 700-nm photons have an energy content of 172 kJ mol⁻¹. The
absorption of light by photosystem I results in a ΔE′0 of −1.0 V. Recall
that ΔG°′ = −nFΔE′0, where F = 96.48 kJ mol⁻¹ V⁻¹. Under stan-
dard conditions, the energy change for the electrons is 96.5 kJ. Thus,
the efficiency is 96.5/172 = 56%.
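
The numbers in answer 8 can be reproduced approximately from first principles. In this sketch the physical constants are standard SI values (an assumption about what the authors used), and the function names are hypothetical. It computes the energy of a mole of 700-nm photons and the fraction captured by a one-electron change of 1.0 V.

```python
PLANCK = 6.62607015e-34      # J s
LIGHT_SPEED = 2.99792458e8   # m s^-1
AVOGADRO = 6.02214076e23     # mol^-1
FARADAY = 96.48              # kJ mol^-1 V^-1, the value used in answer 8


def photon_energy_kj_per_mol(wavelength_nm):
    """Energy of one mole of photons of the given wavelength, in kJ."""
    joules_per_photon = PLANCK * LIGHT_SPEED / (wavelength_nm * 1e-9)
    return joules_per_photon * AVOGADRO / 1000.0


def electron_energy_kj_per_mol(n_electrons, delta_e_volts):
    """Magnitude of n * F * dE, the energy change of the electrons in kJ/mol."""
    return abs(n_electrons * FARADAY * delta_e_volts)


photon_kj = photon_energy_kj_per_mol(700)            # computed photon energy
electron_kj = electron_energy_kj_per_mol(1, -1.0)    # one electron, 1.0 V

efficiency_quoted = electron_kj / 172.0       # using the 172 kJ figure above
efficiency_computed = electron_kj / photon_kj # using the computed photon energy

print(round(photon_kj, 1), round(efficiency_quoted, 2), round(efficiency_computed, 2))
```

The computed photon energy is slightly below the 172 kJ mol⁻¹ quoted in the answer, presumably because of rounding, and the efficiency is about 56% either way.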

9. The electron flow from PS II to PS I is uphill, or endergonic. For
this uphill flow, ATP would need to be consumed, defeating the 
purpose of photosynthesis. 

10. ΔE′0 = +0.11 V, and ΔG°′ = −21.3 kJ mol⁻¹ (−5.1 kcal mol⁻¹).
11. (a) All ecosystems require an energy source from outside the 
system, because the chemical-energy sources will ultimately be 
limited. The photosynthetic conversion of sunlight is one example 
of such a conversion. 

(b) Not at all. Spock would point out that chemicals other than 
water can donate electrons and protons. 

12. DCMU inhibits electron transfer in the link between photosys-
tems II and I. O2 can evolve in the presence of DCMU if an artificial
electron acceptor such as ferricyanide can accept electrons from Q.

13. DCMU will have no effect, because it blocks photosystem II, 
and cyclic photophosphorylation uses photosystem I and the cyto- 
chrome bf complex. 

14. (a) 120 kJ einstein⁻¹ (28.7 kcal einstein⁻¹)

(b) 1.24 V

(c) One 1000-nm photon has the free energy content of 2.4 molecules 
of ATP. A minimum of 0.42 photon is needed to drive the synthesis of 
a molecule of ATP. 

15. At this distance, the expected rate is one electron per second. 
16. The distance doubles, and so the rate should decrease by a factor 
of 64 to 640 ps. 
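
The factor of 64 in answer 16 is what an inverse sixth-power dependence of the transfer rate on distance would give for a doubled distance. That dependence is an assumption here, inferred from the numbers in the answer, and the 10-ps starting time is implied by 640/64 rather than stated. The function name is hypothetical.

```python
def transfer_time_ps(base_time_ps, distance_ratio, exponent=6):
    """Scale a transfer time when the distance changes by distance_ratio.

    Assumes the rate falls off as distance ** -exponent, so the time
    per transfer grows as distance_ratio ** exponent.
    """
    return base_time_ps * distance_ratio ** exponent


slowdown = 2 ** 6                      # doubling the distance
implied_original_ps = 640 / slowdown   # starting time implied by the answer
new_time_ps = transfer_time_ps(implied_original_ps, 2)

print(slowdown, implied_original_ps, new_time_ps)
```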

17. The cristae. 

18. In eukaryotes, both processes take place in specialized organ- 
elles. Both depend on high-energy electrons to generate ATP. In 
oxidative phosphorylation, the high-energy electrons originate in 
fuels and are extracted as reducing power in the form of NADH. 
In photosynthesis, the high-energy electrons are generated by light 
and are captured as reducing power in the form of NADPH. Both 
processes use redox reactions to generate a proton gradient, and 
the enzymes that convert the proton gradient into ATP are very 
similar in both processes. In both systems, electron transport takes 
place in membranes inside organelles. 

19. We need to factor in the NADPH because it is an energy-rich 
molecule. Recall from Chapter 18 that NADH is worth 2.5 ATP if
oxidized by the electron-transport chain. 12 NADPH = 30 ATP. 
Eighteen molecules of ATP are used directly, and so the equivalent 
of 48 molecules of ATP is required for the synthesis of glucose. 
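
The bookkeeping in answer 19 is short enough to write out. The sketch uses only the figures given in the answer and treats NADPH as equivalent to NADH in ATP terms, as the answer does; the variable names are hypothetical.

```python
ATP_PER_NADPH = 2.5        # NADPH valued like NADH, as in the answer
NADPH_PER_GLUCOSE = 12     # NADPH consumed per glucose synthesized
ATP_USED_DIRECTLY = 18     # ATP consumed directly per glucose synthesized

nadph_as_atp = NADPH_PER_GLUCOSE * ATP_PER_NADPH
total_atp_equivalents = nadph_as_atp + ATP_USED_DIRECTLY

print(nadph_as_atp, total_atp_equivalents)
```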

20. Both photosynthesis and cellular respiration are powered by
high-energy electrons flowing toward a more-stable state. In cel-
lular respiration, the high-energy electrons are derived from the
oxidation of carbon fuels as NADH and FADH2. They release
their energy as they reduce oxygen. In photosynthesis, high-energy
electrons are generated by absorbing light energy, and they find
stability in photosystem I and ferredoxin.

21. The electrons flow through photosystem II directly to ferri- 
cyanide. No other steps are required. 


22. (a) Thioredoxin 


(b) The control enzyme is unaffected, but the mitochondrial 
enzyme with part of the chloroplast γ subunit increases activity as
the concentration of DTT increases. 

(c) The increase was even larger when thioredoxin was present. 
Thioredoxin is the natural reductant for the chloroplast enzyme, 
and so it presumably operates more efficiently than would DTT, 
which probably functions to keep the thioredoxin reduced. 


(d) They seem to have done so. 


(e) The enzyme is susceptible to control by the redox state. In 
plant cells, reduced thioredoxin is generated by photosystem I. 
Thus, the enzyme is active when photosynthesis is taking place. 


(f) Cysteine 


(g) Group-specific modification or site-specific mutagenesis. 


Chapter 20 


1. The Calvin cycle is the primary means of converting gaseous
CO2 into organic matter, that is, biomolecules. Essentially, every
carbon atom in your body passed through rubisco and the Calvin
cycle at some time in the past.
2.
Calvin cycle                              Krebs cycle
Stroma                                    Matrix
Carbon chemistry for photosynthesis       Carbon chemistry for oxidative phosphorylation
Fixes CO2                                 Releases CO2
Requires high-energy electrons (NADPH)    Generates high-energy electrons (NADH)
Regenerates starting compound             Regenerates starting compound
  (ribulose 1,5-bisphosphate)               (oxaloacetate)
Requires ATP                              Generates ATP or GTP
Complex stoichiometry                     Simple stoichiometry


3. (a) 3-Phosphoglycerate. (b) The other members of the Calvin cycle. 
4. Stage 1 is the fixation of CO2 with ribulose 1,5-bisphosphate
and the subsequent formation of 3-phosphoglycerate. Stage 2 is the 
conversion of some of the 3-phosphoglycerate into hexose. Stage 3 
is the regeneration of ribulose 1,5-bisphosphate. 

5. It catalyzes a crucial reaction, but it is highly inefficient.
Consequently, it is required in large amounts to overcome its
slow catalysis.


6. Because carbamate forms only in the presence of CO2, this
property prevents rubisco from catalyzing the oxygenase reaction
exclusively when CO2 is absent.


7. Because NADPH is generated in the chloroplasts by the light
reactions.


8. The concentration of 3-phosphoglycerate would increase, 
whereas that of ribulose 1,5-bisphosphate would decrease. 

9. The concentration of 3-phosphoglycerate would decrease, 
whereas that of ribulose 1,5-bisphosphate would increase. 

10. Aspartate + glyoxylate → oxaloacetate + glycine

11. The oxygenase activity of rubisco increases with temperature. 
Crabgrass is a C4 plant, whereas most grasses lack this capability. 
Consequently, the crabgrass will thrive at the hottest part of the 
summer because the C4 pathway provides an ample supply of CO2.

12. The C4 pathway allows the CO2 concentration to increase
at the site of carbon fixation. High concentrations of CO2
inhibit the oxygenase reaction of rubisco. This inhibition is
important for tropical plants because the oxygenase activ-
ity increases more rapidly with temperature than does the
carboxylase activity.

13. ATP is required to form phosphoenolpyruvate (PEP) from 
pyruvate. The PEP combines with CO2 to form oxaloacetate
and, subsequently, malate. Two ATP molecules are required 
because a second ATP molecule is required to phosphorylate 
AMP to ADP. 

14. Photorespiration is the consumption of oxygen by plants 
with the production of CO2, but it does not generate energy.
Photorespiration is due to the oxygenase activity of rubisco. It is
wasteful because, instead of fixing CO2 for conversion into hexo-
ses, rubisco is generating CO2.

15. As global warming progresses, C4 plants will invade the higher 
latitudes, and C3 plants will retreat to cooler regions. 

16. The light reactions lead to an increase in the stromal
concentrations of NADPH, reduced ferredoxin, and Mg2+,
as well as an increase in pH.

17. The enzymes catalyze the transformation of the five-carbon 
sugar formed by the oxidative phase of the pentose phosphate 
pathway into fructose 6-phosphate and glyceraldehyde 3-phosphate, 
intermediates in glycolysis (and gluconeogenesis). 

18. The label emerges at C-5 of ribulose 5-phosphate. 

19. Oxidative decarboxylation of isocitrate to α-ketoglutarate. A
β-ketoacid intermediate is formed in both reactions.

20. (a) 5 Glucose 6-phosphate + ATP →
6 ribose 5-phosphate + ADP + H+.

(b) Glucose 6-phosphate + 12 NADP+ + 7 H2O →
6 CO2 + 12 NADPH + 12 H+ + Pi.

21. The nonoxidative phase of the pentose phosphate pathway 
can be used to convert three molecules of ribose 5-phosphate 
into two molecules of fructose 6-phosphate and one molecule of 
glyceraldehyde 3-phosphate. These molecules are components of 
the glycolytic pathway. 

22. The conversion of fructose 6-phosphate into fructose 1,6- 
bisphosphate by phosphofructokinase requires ATP. 

23. When much NADPH is required. The oxidative phase of 
the pentose phosphate pathway is followed by the nonoxidative 
phase. The resulting fructose 6-phosphate and glyceraldehyde 
3-phosphate are used to generate glucose 6-phosphate through 
gluconeogenesis, and the cycle is repeated until the equivalent of 
one glucose molecule is oxidized to CO2.

24. Fava beans contain pamaquine, a purine glycoside that can 
lead to the generation of peroxides—reactive oxygen species that 
can damage membranes as well as other biomolecules. Glutathione 
is used to detoxify the ROS. The regeneration of glutathione 
depends on an adequate supply of NADPH, which is synthesized 
by the oxidative phase of the pentose phosphate pathway. People 
with low levels of the dehydrogenase are especially susceptible to 
pamaquine toxicity. 

25. Because red blood cells do not have mitochondria and the 
only means to obtain NADPH is through the pentose phosphate 
pathway. There are biochemical means to convert mitochondrial 
NADH into cytoplasmic NADPH. 

26. Reactive peroxides are a type of reactive oxygen species. The 
enzyme glutathione peroxidase uses reduced glutathione to neu- 
tralize peroxides by converting them into alcohols while generat- 
ing oxidized glutathione. Reduced glutathione is regenerated by 
glutathione reductase with the use of NADPH, the product of the 
oxidative phase of the pentose phosphate pathway.

27. ΔE′0 for the reduction of glutathione by NADPH is
+0.09 V. Hence, ΔG°′ is −17.4 kJ mol⁻¹ (−4.2 kcal mol⁻¹),
which corresponds to an equilibrium constant of 1126. The
required [NADPH]/[NADP+] ratio is 8.9 × 10⁻⁵.
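
The chain of reasoning in answer 27 (potential difference to free energy to equilibrium constant to concentration ratio) can be sketched as follows. Several inputs are assumptions not stated in the answer: a temperature of 298 K, a two-electron transfer, and concentrations of 10 mM reduced and 1 mM oxidized glutathione, which are chosen because they give a ratio close to the quoted one. The function names are hypothetical.

```python
import math

FARADAY = 96.48     # kJ mol^-1 V^-1
GAS_CONSTANT = 8.314e-3   # kJ mol^-1 K^-1
TEMPERATURE = 298.0       # K (assumed, 25 degrees C)


def delta_g_standard(n_electrons, delta_e_volts):
    """Standard free-energy change in kJ/mol from -n * F * dE."""
    return -n_electrons * FARADAY * delta_e_volts


def equilibrium_constant(delta_g_kj):
    """Equilibrium constant from dG = -RT ln K."""
    return math.exp(-delta_g_kj / (GAS_CONSTANT * TEMPERATURE))


dg = delta_g_standard(2, 0.09)      # two electrons, +0.09 V
k_eq = equilibrium_constant(dg)

# Assumed concentrations (hypothetical problem data, in mol/L).
gsh = 10e-3    # reduced glutathione
gssg = 1e-3    # oxidized glutathione

# K = [GSH]^2 [NADP+] / ([GSSG] [NADPH]) at pH 7, rearranged for the ratio.
nadph_to_nadp = gsh ** 2 / (gssg * k_eq)

print(round(dg, 1), round(k_eq), nadph_to_nadp)
```

With these inputs the equilibrium constant comes out near 1.1 × 10³ and the ratio near 9 × 10⁻⁵, close to the figures in the answer; small differences depend on the temperature and constants assumed.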

28.

30. Incubate an aliquot of a tissue homogenate with glucose
labeled with ¹⁴C at C-1, and incubate another with glucose labeled
with ¹⁴C at C-6. Compare the radioactivity of the CO2 produced
by the two samples. The rationale of this experiment is that only 
C-1 is decarboxylated by the pentose phosphate pathway, whereas 
C-1 and C-6 are decarboxylated equally when glucose is metabo- 
lized by the glycolytic pathway, the pyruvate dehydrogenase 
complex, and the citric acid cycle. The reason for the equivalence 
of C-1 and C-6 in the latter set of reactions is that glyceraldehyde 
3-phosphate and dihydroxyacetone phosphate are rapidly inter- 
converted by triose phosphate isomerase. 

31. The reduction of each mole of CO2 to the level of a hexose
requires two moles of NADPH. The reduction of NADP+ is
a two-electron process. Hence, the formation of two moles of
NADPH requires the pumping of four moles of electrons by
photosystem I. The electrons given up by photosystem I are
replenished by photosystem II, which needs to absorb an equal
number of photons. Hence, eight photons are needed to generate the
required NADPH. The energy input of eight moles of photons
is 1594 kJ (381 kcal). Thus, the overall efficiency of photosyn-
thesis under standard conditions is at least 477/1594, or 30%.
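
The photon count and efficiency in answer 31 follow from a few multiplications. The sketch below uses only the figures given in the answer; the variable names are hypothetical.

```python
NADPH_PER_CO2 = 2            # NADPH needed to reduce one CO2 to the hexose level
ELECTRONS_PER_NADPH = 2      # reduction of NADP+ is a two-electron process
PHOTOSYSTEMS_IN_SERIES = 2   # each electron is excited by PS II and by PS I

# One photon is absorbed per electron per photosystem.
photons_per_co2 = NADPH_PER_CO2 * ELECTRONS_PER_NADPH * PHOTOSYSTEMS_IN_SERIES

ENERGY_INPUT_KJ = 1594.0     # eight moles of photons, from the answer
ENERGY_STORED_KJ = 477.0     # energy stored per mole of CO2, from the answer

efficiency = ENERGY_STORED_KJ / ENERGY_INPUT_KJ

print(photons_per_co2, round(efficiency, 2))
```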
32. It is neither a violation nor a miracle. The equation on page 580 
requires not only 18 ATP, but also 12 NADPH. These electrons,
if transferred to NAD+ and used in the electron-transport chain,
would yield 30 ATP. Thus, the synthesis of glucose requires the 
equivalent of 48 ATP. 

33. (a) The curve on the right in graph A was generated by the 
C4 plant. Recall that the oxygenase activity of rubisco increases 
with temperature more rapidly than does the carboxylase activity. 
Consequently, at higher temperatures, the C3 plants would fix less 
carbon. Because C4 plants can maintain a higher CO₂ concentration,
the rise in temperature is less deleterious.

(b) The oxygenase activity will predominate. Additionally, when 
the temperature rise is very high, the evaporation of water might 
become a problem. The higher temperatures can begin to damage 
protein structures as well. 

(c) The C4 pathway is a very effective active-transport system for
concentrating CO₂, even when environmental concentrations are
very low.

(d) With the assumption that the plants have approximately the same
capability to fix CO₂, the C4 pathway is apparently the rate-limiting
step in C4 plants.


Chapter 21 

1. Glycogen is an important fuel reserve for several reasons. The 
controlled breakdown of glycogen and release of glucose increase 
the amount of glucose that is available between meals. Hence, 
glycogen serves as a buffer to maintain blood-glucose levels. 
Glycogen’s role in maintaining blood-glucose levels is especially 
important because glucose is virtually the only fuel used by the 
brain, except during prolonged starvation. Moreover, the glucose 
from glycogen is readily mobilized and is therefore a good source 
of energy for sudden, strenuous activity. Unlike fatty acids, the 
released glucose can provide energy in the absence of oxygen and 
can thus supply energy for anaerobic activity. 

2. As an unbranched polymer, α-amylose has only one nonreducing
end. Therefore, only one glycogen phosphorylase molecule
could degrade each α-amylose molecule. Because glycogen is
highly branched, there are many nonreducing ends per molecule. 
Consequently, many phosphorylase molecules can release many 
glucose molecules per glycogen molecule. 

3. The patient has a deficiency of the branching enzyme. 

4. In muscle, the b form of phosphorylase is activated by AMP.
In the liver, the a form is inhibited by glucose. The difference
corresponds to the difference in the metabolic role of glycogen in each
tissue. Muscle uses glycogen as a fuel for contraction, whereas the
liver uses glycogen to maintain blood-glucose levels.

5. Cells maintain the [Pᵢ]/[glucose 1-phosphate] ratio at greater
than 100, substantially favoring phosphorolysis. We see here an 
example of how the cell can alter the free-energy change to favor a 
reaction taking place by altering the ratio of substrate and product. 
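The effect of the concentration ratio on the free-energy change can be illustrated numerically. This is a sketch under stated assumptions: T = 298 K, and an equilibrium [Pᵢ]/[glucose 1-phosphate] ratio of 3.6, a value that does not appear in this answer and is used here only as an illustrative input. The glycogen terms (n and n − 1 residues) are treated as cancelling.

```python
import math

GAS_R = 8.314e-3   # kJ mol^-1 K^-1
TEMP_K = 298.0     # assumed temperature

def delta_g_phosphorolysis(pi_over_g1p_cell, pi_over_g1p_equilibrium):
    """Free-energy change (kJ/mol) for glycogen(n) + Pi -> glycogen(n-1) + G1P."""
    q = 1.0 / pi_over_g1p_cell               # mass-action ratio [G1P]/[Pi]
    k_eq = 1.0 / pi_over_g1p_equilibrium     # equilibrium constant [G1P]/[Pi]
    return GAS_R * TEMP_K * math.log(q / k_eq)

# 3.6 is an assumed equilibrium ratio; 100 is the cellular ratio from the answer.
dg_cell = delta_g_phosphorolysis(100.0, 3.6)
dg_at_equilibrium = delta_g_phosphorolysis(3.6, 3.6)
print(f"{dg_cell:.1f} kJ/mol in the cell, {dg_at_equilibrium:.1f} kJ/mol at equilibrium")
```

A negative value at the cellular ratio is what "substantially favoring phosphorolysis" means in thermodynamic terms.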
6. The high level of glucose 6-phosphate in von Gierke disease, 
resulting from the absence of glucose 6-phosphatase or the trans- 
porter, shifts the allosteric equilibrium of phosphorylated glycogen 
synthase toward the active form. 

7. The phosphoryl donor is glucose 1,6-bisphosphate, which is 
formed from glucose 1-phosphate and ATP in a reaction catalyzed 
by phosphoglucokinase. 

8. The different manifestations correspond to the different roles of 
the liver and muscle. Liver glycogen phosphorylase plays a crucial 
role in the maintenance of blood-glucose levels. Recall that glucose
is the primary fuel for the brain. Muscle glycogen phosphorylase
provides glucose only for the muscle and, even then, only when 
the energy needs of the muscle are high, as during exercise. The 
fact that there are two different diseases suggests that there are two 
different isozymic forms of the glycogen phosphorylase—a liver- 
specific isozyme and a muscle-specific isozyme. 

9. Water is excluded from the active site to prevent hydrolysis. 
The entry of water could lead to the formation of glucose rather 
than glucose 1-phosphate. A site-specific mutagenesis experi- 
ment is revealing in this regard. In phosphorylase, Tyr 573 is 
hydrogen bonded to the 2’-OH group of a glucose residue. The 
ratio of glucose 1-phosphate to glucose product is 9000 : 1 for 
the wild-type enzyme, and 500 : 1 for the Phe 573 mutant. 
Model building suggests that a water molecule occupies the
site normally filled by the phenolic OH group of tyrosine and
occasionally attacks the oxocarbonium ion intermediate to form
glucose.

10. The amylase activity was necessary to remove all of the glyco- 
gen from the glycogenin. Recall that glycogenin synthesizes oligo- 
saccharides of about eight glucose units, and then activity stops. 
Consequently, if the glucose residues are not removed by extensive 
amylase treatment, glycogenin will not function. 

11. The substrate can be handed directly from the transferase site 
to the debranching site. 

12. During exercise, [ATP] falls and [AMP] rises. Recall that 
AMP is an allosteric activator of glycogen phosphorylase b. Thus, 
even in the absence of covalent modification by phosphorylase 
kinase, glycogen is degraded. 

13. Although glucose 1-phosphate is the actual product of the 
phosphorylase reaction, glucose 6-phosphate is a more versatile 
molecule with respect to metabolism. Among other fates, glucose- 
6-phosphate can be processed to yield energy or building blocks. 
In the liver, glucose 6-phosphate can be converted into glucose and 
released into the blood. 

14. Epinephrine binds to its G-protein-coupled receptor. The
resulting structural changes activate a Gs protein, which in turn
activates adenylate cyclase. Adenylate cyclase synthesizes cAMP, which
activates protein kinase A. Protein kinase A partly activates
phosphorylase kinase, which phosphorylates and activates glycogen
phosphorylase. The calcium released during muscle contraction further
activates the phosphorylase kinase, leading to further stimulation
of glycogen phosphorylase.

15. First, the signal-transduction pathway is shut down when the 
initiating hormone is no longer present. Second, the inherent GTPase 
activity of the G protein converts the bound GTP into inactive GDP. 
Third, phosphodiesterases convert cyclic AMP into AMP. Fourth, 
PP1 removes the phosphoryl group from glycogen phosphorylase, 
converting the enzyme into the usually inactive b form. 

16. It prevents both from operating simultaneously, which would lead 
to a useless expenditure of energy. See the answer to Problem 24. 

17. All these symptoms suggest central nervous system problems. 
If exercise is exhaustive enough or the athlete has not prepared 
well enough or both, liver glycogen also can be depleted. The brain 
depends on glucose derived from liver glycogen. The symptoms 
suggest that the brain is not getting enough fuel. 

18. Liver phosphorylase a is inhibited by glucose, which facilitates
the R → T transition. This transition releases PP1, which
inactivates glycogen breakdown and stimulates glycogen synthesis. 
Muscle phosphorylase is insensitive to glucose. 

19. The presence of high concentrations of glucose 6-phosphate 
indicates that glucose is abundant and that it is not being used by 
glycolysis. Therefore, this valuable resource is saved by incorpora- 
tion into glycogen. 
20. Free glucose must be phosphorylated at the expense of a
molecule of ATP. Glucose 6-phosphate derived from glycogen is 
formed by phosphorolytic cleavage, thus sparing one molecule of 
ATP. Thus, the net yield of ATP when glycogen-derived glucose 
is processed to pyruvate is three molecules of ATP compared with 
two molecules of ATP from free glucose. 

21. Breakdown: Phosphoglucomutase converts glucose 1-phosphate,
liberated from glycogen breakdown, into glucose 6-phosphate,
which can be released as free glucose (liver) or processed
in glycolysis (muscle and liver). Synthesis: Phosphoglucomutase
converts glucose 6-phosphate into glucose 1-phosphate, which reacts
with UTP to form UDP-glucose, the substrate for glycogen synthase.

22. Glycogenₙ + Pᵢ → glycogenₙ₋₁ + glucose 6-phosphate
Glucose 6-phosphate → glucose 1-phosphate
UTP + glucose 1-phosphate → UDP-glucose + 2 Pᵢ
Glycogenₙ₋₁ + UDP-glucose → glycogenₙ + UDP
Sum: Glycogenₙ + UTP → glycogenₙ + UDP + Pᵢ


23. In principle, having glycogen be the only primer for the further 
synthesis of glycogen should be a successful strategy. However, if 
the glycogen granules were not evenly divided between daughter 
cells, glycogen stores for future generations of cells might be com- 
promised. Glycogenin synthesizes the primer for glycogen synthase. 
24. Insulin binds to its receptor and activates the tyrosine kinase 
activity of the receptor, which in turn triggers a pathway that acti- 
vates protein kinases. The kinases phosphorylate and inactivate gly- 
cogen synthase kinase. Protein phosphatase 1 then removes the phos- 
phate from glycogen synthase and thereby activates the synthase. 

25. 
26. Galactose + ATP + UTP + H₂O + glycogenₙ →
glycogenₙ₊₁ + ADP + UDP + 2 Pᵢ + H⁺.

27. Phosphorylase, transferase, glucosidase, phosphoglucomutase, 
and glucose 6-phosphatase. 

28. Glucose is an allosteric inhibitor of phosphorylase a. Hence, 
crystals grown in its presence are in the T state. The addition of 
glucose 1-phosphate, a substrate, shifts the R-to-T equilibrium 
toward the R state. The conformational differences between these 
states are sufficiently large that the crystal shatters unless it is sta- 
bilized by chemical cross-links. 

29. Galactose is converted into UDP-galactose to eventually form 
glucose 6-phosphate. 

30. This disease can also be produced by a mutation in the gene 
that encodes the glucose 6-phosphate transporter. Recall that glu- 
cose 6-phosphate must be transported into the lumen of the endo- 
plasmic reticulum to be hydrolyzed by phosphatase. Mutations in 
the other three essential proteins of this system can likewise lead to 
von Gierke disease. 

31. (a) Glycogen was too large to enter the gel and, because analysis 
was by western blot with the use of an antibody specific to glycogenin, 
we would not expect to see background proteins. 

(b) α-Amylase degrades glycogen, releasing the protein glycogenin,
which can be visualized by a western blot. 

(c) Glycogen phosphorylase, glycogen synthase, and protein 
phosphatase 1. These proteins might be visible if the gel were 
stained for protein, but a western analysis reveals the presence of 
glycogenin only. 

32. (a) The smear was due to molecules of glycogenin with 
increasingly large amounts of glycogen attached to them. 

(b) In the absence of glucose in the medium, glycogen is metabolized, 
resulting in a loss of the high-molecular-weight material. 

(c) Glycogen could have been resynthesized and added to the 
glycogenin when the cells were fed glucose again. 

(d) No difference between lanes 3 and 4 suggests that, by 1 hour, 
the glycogen molecules had attained maximum size in this cell line. 
Prolonged incubation does not apparently increase the amount of 
glycogen. 

(e) α-Amylase removes essentially all of the glycogen, and so only
the glycogenin remains. 


Chapter 22 
1. Glycerol + 2 NAD⁺ + Pᵢ + ADP →
pyruvate + ATP + H₂O + 2 NADH + H⁺


Glycerol kinase and glycerol phosphate dehydrogenase 

2. The ready reversibility is due to the high-energy nature of the 
thioester in the acyl CoA. 

3. To return the AMP to a form that can be phosphorylated by 
oxidative phosphorylation or substrate-level phosphorylation, 
another molecule of ATP must be expended in the reaction: 


ATP + AMP ⇌ 2 ADP


4. b, c, a, g, h, d, e, f.

5. The citric acid cycle. The reactions that take succinate to 
oxaloacetate, or the reverse, are similar to those of fatty acid 
metabolism (Section 17.2). 

6. The next-to-last degradation product, acetoacetyl CoA, yields 
two molecules of acetyl CoA with the thiolysis by only one molecule 
of CoA. 

7. Palmitic acid yields 106 molecules of ATP. Palmitoleic acid
has a double bond between carbons C-9 and C-10. When palmitoleic
acid is processed in β oxidation, one of the oxidation steps
(to introduce a double bond before the addition of water) will not
take place, because a double bond already exists. Thus, FADH₂
will not be generated, and palmitoleic acid will yield 1.5 fewer
molecules of ATP than palmitic acid, for a total of 104.5 molecules
of ATP.
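The comparison in answer 7 can be expressed as a small function. This is a sketch under stated assumptions: it uses the yields quoted elsewhere in these answers (10 ATP per acetyl CoA, 2.5 per NADH, 1.5 per FADH₂, and an activation cost of 2 ATP), handles only even-chain fatty acids, and treats each pre-existing double bond simply as one skipped FADH₂-generating step, which matches the reasoning given for palmitoleic acid but is not a general treatment of unsaturated fatty acids. The function name is illustrative.

```python
ATP_PER_ACETYL_COA = 10.0
ATP_PER_NADH = 2.5
ATP_PER_FADH2 = 1.5
ACTIVATION_COST = 2.0   # ATP equivalents spent forming the acyl CoA

def beta_oxidation_atp(n_carbons, preexisting_double_bonds=0):
    """Net ATP from complete oxidation of an even-chain fatty acid."""
    if n_carbons < 4 or n_carbons % 2:
        raise ValueError("sketch handles even-chain fatty acids with >= 4 carbons")
    rounds = n_carbons // 2 - 1
    acetyl_coa = n_carbons // 2
    nadh = rounds
    # Each pre-existing double bond skips one FADH2-generating oxidation step.
    fadh2 = rounds - preexisting_double_bonds
    return (acetyl_coa * ATP_PER_ACETYL_COA
            + nadh * ATP_PER_NADH
            + fadh2 * ATP_PER_FADH2
            - ACTIVATION_COST)

palmitate = beta_oxidation_atp(16)
palmitoleate = beta_oxidation_atp(16, preexisting_double_bonds=1)
print(palmitate, palmitoleate)
```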

8.

Activation fee to form the acyl CoA                 −2 ATP
Seven rounds of β oxidation yield:
  7 acetyl CoA at 10 ATP/acetyl CoA                 +70 ATP
  7 NADH at 2.5 ATP/NADH                            +17.5 ATP
  7 FADH₂ at 1.5 ATP/FADH₂                          +10.5 ATP
Propionyl CoA, which requires an ATP
  to be converted into succinyl CoA                 −1 ATP
Succinyl CoA → succinate                            +1 ATP (GTP)
Succinate → fumarate + FADH₂
  FADH₂ at 1.5 ATP/FADH₂                            +1.5 ATP
Fumarate → malate
Malate → oxaloacetate + NADH
  NADH at 2.5 ATP/NADH                              +2.5 ATP
Total                                               100 ATP
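The itemized entries of the table can be tallied directly. This is a minimal sketch; the per-molecule ATP values are those quoted in the table, and the name `ledger` is just an illustrative choice. Tallied this way, the listed entries sum to 100 ATP.

```python
# (description, ATP contribution); signs follow the table's entries
ledger = [
    ("activation to acyl CoA", -2.0),
    ("7 acetyl CoA x 10 ATP", 7 * 10.0),
    ("7 NADH x 2.5 ATP", 7 * 2.5),
    ("7 FADH2 x 1.5 ATP", 7 * 1.5),
    ("propionyl CoA -> succinyl CoA", -1.0),
    ("succinyl CoA -> succinate (GTP)", 1.0),
    ("succinate -> fumarate (FADH2)", 1.5),
    ("malate -> oxaloacetate (NADH)", 2.5),
]

total_atp = sum(value for _, value in ledger)
for description, value in ledger:
    print(f"{description:35s} {value:+6.1f}")
print(f"{'total':35s} {total_atp:6.1f}")
```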


9. You might hate yourself in the morning, but at least you won’t 
have to worry about energy. To form stearoyl CoA requires the 
equivalent of 2 molecules of ATP. 


Stearoyl CoA + 8 FAD + 8 NAD⁺ + 8 CoA + 8 H₂O →
9 acetyl CoA + 8 FADH₂ + 8 NADH + 8 H⁺


9 acetyl CoA at 10 ATP/acetyl CoA       +90 ATP
8 NADH at 2.5 ATP/NADH                  +20 ATP
8 FADH₂ at 1.5 ATP/FADH₂                +12 ATP
Activation fee                          −2 ATP

Total                                   120 ATP


10. Keep in mind that, in the citric acid cycle, 1 molecule of FADH₂
yields 1.5 ATP, 1 molecule of NADH yields 2.5 ATP, and 1 molecule
of acetyl CoA yields 10 ATP. Two molecules of ATP are produced
when glucose is degraded to 2 molecules of pyruvate. Two molecules
of NADH also are produced, but the electrons are transferred to
FADH₂ to enter the mitochondria. Each molecule of FADH₂ can
generate 1.5 ATP. Each molecule of pyruvate will produce 1 molecule
of NADH. Each molecule of acetyl CoA generates 3 molecules
of NADH, 1 molecule of FADH₂, and 1 molecule of ATP. So, we
have a total of 10 ATP per acetyl CoA, or 20 for the 2 molecules
of acetyl CoA. The total for glucose is 30 ATP. Now, what about
hexanoic acid? Hexanoic (caproic) acid is activated to caproyl CoA
at the expense of 2 ATP, and so we are 2 ATP in the hole. The first
cycle of β oxidation generates 1 FADH₂, 1 NADH, and 1 acetyl CoA.
After the acetyl CoA has been run through the citric acid cycle,
this step will have generated a total of 14 ATP. The second cycle of
β oxidation generates 1 FADH₂ and 1 NADH but 2 acetyl CoA.
After the acetyl CoA has been run through the citric acid cycle,
this step will have generated a total of 24 ATP. The two cycles
together yield 38 ATP; subtracting the 2 ATP spent on activation
leaves 36 ATP. Thus, the foul-smelling caproic acid has a net yield
of 36 ATP. So on a per carbon basis, this fat yields 20% more ATP
than does glucose, a manifestation of the fact that fats are more
reduced than carbohydrates.
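The glucose and hexanoic acid tallies in answer 10, and the per-carbon comparison, can be reproduced as follows. This is a sketch using only the yields stated in the answer; the variable names are illustrative.

```python
ATP_PER_ACETYL_COA = 10.0
ATP_PER_NADH = 2.5
ATP_PER_FADH2 = 1.5

# Glucose (6 carbons)
glucose_atp = (
    2.0                        # substrate-level ATP from glycolysis
    + 2 * ATP_PER_FADH2        # 2 cytoplasmic NADH, electrons delivered as FADH2
    + 2 * ATP_PER_NADH         # 2 NADH from pyruvate -> acetyl CoA
    + 2 * ATP_PER_ACETYL_COA   # 2 acetyl CoA through the citric acid cycle
)

# Hexanoic (caproic) acid (6 carbons)
first_cycle = ATP_PER_FADH2 + ATP_PER_NADH + 1 * ATP_PER_ACETYL_COA
second_cycle = ATP_PER_FADH2 + ATP_PER_NADH + 2 * ATP_PER_ACETYL_COA
hexanoate_atp = first_cycle + second_cycle - 2.0   # minus the activation cost

per_carbon_gain = (hexanoate_atp / 6) / (glucose_atp / 6) - 1
print(glucose_atp, hexanoate_atp, f"{per_carbon_gain:.0%}")
```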

11. Stearate + ATP + 13.5 H₂O + 8 FAD + 8 NAD⁺ →
4.5 acetoacetate + 14.5 H⁺ + 8 FADH₂ + 8 NADH + AMP + 2 Pᵢ

12. Palmitate is activated and then processed by β oxidation
according to the following reactions.

Palmitate + CoA + ATP → palmitoyl CoA + AMP + 2 Pᵢ

Palmitoyl CoA + 7 FAD + 7 NAD⁺ + 7 CoASH + 7 H₂O →
8 acetyl CoA + 7 FADH₂ + 7 NADH + 7 H⁺

The eight molecules of acetyl CoA combine to form four molecules
of acetoacetate for release into the blood, and so they do not
contribute to the energy yield in the liver. However, the FADH₂ and
NADH generated in the preparation of acetyl CoA can be processed
by oxidative phosphorylation to yield ATP.

1.5 ATP/FADH₂ × 7 = 10.5 ATP
2.5 ATP/NADH × 7 = 17.5 ATP


The equivalent of 2 ATP were used to form palmitoyl CoA. Thus, 
26 ATP were generated for use by the liver. 

13. NADH produced with the oxidation to acetoacetate = 2.5 ATP.
Acetoacetate is converted into acetoacetyl CoA.
Two molecules of acetyl CoA result from the thiolysis of
acetoacetyl CoA, each worth 10 ATP when processed by the citric
acid cycle. Total ATP yield is 22.5.

14. Because a molecule of succinyl CoA is used to form acetoacetyl 
CoA. Succinyl CoA could be used to generate one molecule of ATP 
(GTP), and so someone could argue that the yield is 21.5. 

15. For fats to be combusted, not only must they be converted into 
acetyl CoA, but the acetyl CoA must be processed by the citric acid 
cycle. In order for acetyl CoA to enter the citric acid cycle, there 
must be a supply of oxaloacetate. Oxaloacetate can be formed by the 
metabolism of glucose to pyruvate and the subsequent carboxylation 
of pyruvate to form oxaloacetate. 
[Reaction scheme for α oxidation]

One round of α oxidation rather than β oxidation converts phytanic
acid into a β-oxidation substrate.

17. The first oxidation removes two tritium atoms. The hydration 
adds nonradioactive H and OH. The second oxidation removes 
another tritium atom from the β-carbon atom. Thiolysis removes
an acetyl CoA with only one tritium atom; so the tritium-to-carbon
ratio is 1/2. This ratio will be the same for two of the acetates. The 
last one, however, does not undergo oxidation, and so all tritium 
remains. The ratio for this acetate is 3/2. The ratio for the entire 
molecule is then 5/6. 

18. In the absence of insulin, lipid mobilization will take place to 
an extent that it overwhelms the ability of the liver to convert the 
lipids into ketone bodies. 

19. (a) Oxidation in mitochondria; synthesis in the cytoplasm. 

(b) Coenzyme A in oxidation; acyl carrier protein for synthesis. 

(c) FAD and NAD⁺ in oxidation; NADPH for synthesis.

(d) The L isomer of 3-hydroxyacyl CoA in oxidation; the D isomer
in synthesis.

(e) From carboxyl to methyl in oxidation; from methyl to carboxyl
in synthesis.

(f) The enzymes of fatty acid synthesis, but not those of
oxidation, are organized in a multienzyme complex.

20. 7 acetyl CoA + 6 ATP + 12 NADPH + 12 H⁺ →
myristate + 7 CoA + 6 ADP + 6 Pᵢ + 12 NADP⁺ + 5 H₂O.


16. (a) [Structure of phytanic acid]

The problem with phytanic acid is that, as it undergoes β oxidation,
we encounter the dreaded pentavalent carbon atom. Because
the pentavalent carbon atom doesn’t exist, β oxidation cannot take
place and phytanic acid accumulates.

[Reaction scheme: the steps of β oxidation applied to phytanic acid,
ending at the dreaded pentavalent carbon atom]

(b) Removing methyl groups, though theoretically possible, would
be time consuming and lacking in elegance. What would we do with
the methyl groups? Our livers solve the problem by inventing
α oxidation.

21. We will need six acetyl CoA units. One acetyl CoA
unit will be used directly to become the two carbon
atoms farthest from the acid end. The other five units must be
converted into malonyl CoA. The synthesis of each malonyl CoA
molecule costs a molecule of ATP; so 5 molecules of ATP will
be required. Each round of elongation requires 2 molecules of
NADPH, 1 molecule to reduce the keto group to an alcohol and
1 molecule to reduce the double bond. As a result, 10 molecules
of NADPH will be required. Therefore, 5 molecules of ATP and
10 molecules of NADPH are required to synthesize lauric acid.
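The bookkeeping in answer 21 generalizes to any even-chain saturated fatty acid built from acetyl CoA. The following is a minimal sketch; the function name and the dictionary keys are illustrative choices, and the rule encoded is the one stated in the answer (one ATP per malonyl CoA, two NADPH per round of elongation).

```python
def synthesis_requirements(n_carbons):
    """Acetyl CoA, ATP, and NADPH needed to build an even-chain saturated fatty acid."""
    if n_carbons < 4 or n_carbons % 2:
        raise ValueError("sketch handles even-chain fatty acids with >= 4 carbons")
    acetyl_coa = n_carbons // 2
    malonyl_coa = acetyl_coa - 1          # every unit except the primer
    return {
        "acetyl_coa": acetyl_coa,
        "atp": malonyl_coa,               # one ATP per malonyl CoA formed
        "nadph": 2 * malonyl_coa,         # two reductions per elongation round
    }

laurate = synthesis_requirements(12)
myristate = synthesis_requirements(14)
print(laurate, myristate)
```

The values for the 14-carbon case agree with the coefficients in the balanced equation for myristate given in answer 20.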
22. e, b, d, a, c.
23. Such a mutation would inhibit fatty acid synthesis because the 
enzyme cleaves cytoplasmic citrate to yield acetyl CoA for fatty 
acid synthesis. 
24. (a) False. Biotin is required for acetyl CoA carboxylase activity. 


cleaved to yield its substrate acetyl CoA. 

25. Fatty acids with odd numbers of carbon atoms are synthesized 
starting with propionyl ACP (instead of acetyl ACP), which is 
formed from propionyl CoA by acetyl transacetylase. 

26. All of the labeled carbon atoms will be retained. Because we need 
8 acetyl CoA molecules and only 1 carbon atom is labeled in the acetyl 
group, we will have 8 labeled carbon atoms. The only acetyl CoA used 
directly will retain 3 tritium atoms. The 7 acetyl CoA molecules used
to make malonyl CoA will lose 1 tritium atom on addition of the CO₂
and another one at the dehydration step. Each of the 7 malonyl CoA
molecules will retain 1 tritium atom. Therefore, the total retained
tritium is 10 atoms. The ratio of tritium to carbon is 1.25.
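The label count in answer 26 can be written out explicitly. This is a sketch that simply encodes the losses described in the answer; the variable names are illustrative.

```python
ACETYL_UNITS = 8                 # acetyl CoA molecules needed, per the answer
labeled_carbons = ACETYL_UNITS   # one labeled carbon per acetyl group, all retained

tritium_from_primer = 3          # the acetyl CoA used directly keeps all three
malonyl_units = ACETYL_UNITS - 1
# Each malonyl-derived unit loses one tritium on carboxylation and one on dehydration.
tritium_per_malonyl_unit = 3 - 1 - 1

total_tritium = tritium_from_primer + malonyl_units * tritium_per_malonyl_unit
tritium_to_carbon = total_tritium / labeled_carbons
print(total_tritium, tritium_to_carbon)
```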

27. With a diet rich in raw eggs, avidin will inhibit fatty acid 
synthesis by reducing the amount of biotin required by acetyl CoA 
carboxylase. Cooking the eggs will denature avidin, and so it will 
no longer bind biotin. 

28. The only acetyl CoA used directly, not in the form of malonyl 
CoA, provides the two carbon atoms at the ω end of the fatty acid
chain. Because palmitic acid is a C₁₆ fatty acid, acetyl CoA will
have provided carbons 15 and 16. 

29. HCO₃⁻ is attached to acetyl CoA to form malonyl CoA. When
malonyl CoA condenses with acetyl CoA to form the four-carbon
keto acyl CoA, the HCO₃⁻ is lost as CO₂.

30. Phosphofructokinase controls the flux down the glycolytic 
pathway. Glycolysis functions to generate ATP or building blocks 
for biosynthesis, depending on the tissue. The presence of citrate 
in the cytoplasm indicates that those needs are met, and there is no 
need to metabolize glucose. 

31. C-1 is more radioactive. 

32. The mutant enzyme will be persistently active because it can- 
not be inhibited by phosphorylation. Fatty acid synthesis will be 
abnormally active. Such a mutation might lead to obesity. 

33. (a) Palmitoleate; (b) linoleate; (c) linoleate; (d) oleate; (e) oleate;
(f) linolenate.

34. Decarboxylation drives the condensation of malonyl ACP
and acetyl ACP. In contrast, the condensation of two molecules
of acetyl ACP is energetically unfavorable. In gluconeogenesis,
decarboxylation drives the formation of phosphoenolpyruvate
from oxaloacetate.

35. Fat mobilization in adipocytes is activated by phosphorylation.
Hence, overproduction of the cAMP-activated kinase will
lead to an accelerated breakdown of triacylglycerols and a depletion
of fat stores.

36. Carnitine translocase deficiency and glucose 6-phosphate 
transporter deficiency. 

37. In the fifth round of β oxidation, cis-Δ²-enoyl CoA is formed.
Hydration by the classic hydratase yields D-3-hydroxyacyl CoA,
the wrong isomer for the next enzyme in β oxidation. This dead
end is circumvented by a second hydratase that removes water
to give trans-Δ²-enoyl CoA. The addition of water by the classic
hydratase then yields L-3-hydroxyacyl CoA, the appropriate
isomer. Thus, hydratases of opposite stereospecificities serve to
epimerize (invert the configuration of) the 3-hydroxyl group of the
acyl CoA intermediate.

38. The probability of synthesizing an error-free polypeptide 
chain decreases as the length of the chain increases. A single mis- 
take can make the entire polypeptide ineffective. In contrast, a 
defective subunit can be spurned in the formation of a noncovalent 
multienzyme complex; the good subunits are not wasted. 

39. The absence of ketone bodies is due to the fact that the liver, 
the source of ketone bodies in the blood, cannot oxidize fatty acids 
to produce acetyl CoA. Moreover, because of the impaired fatty 
acid oxidation, the liver becomes more dependent on glucose as an 
energy source. This dependency results in a decrease in gluconeo- 
genesis and a drop in blood-glucose levels, which is exacerbated by 
the lack of fatty acid oxidation in muscle and a subsequent increase 
in glucose uptake from the blood. 

40. Peroxisomes enhance the degradation of fatty acids.
Consequently, increasing the activity of peroxisomes could help
to lower levels of blood triglycerides. In fact, clofibrate is rarely
used because of serious side effects.

41. Citrate works by facilitating the formation of active filaments
from inactive monomers. In essence, it increases the
number of active sites available, or the concentration of enzyme.
Consequently, its effect is visible as an increase in the value of
Vmax. Allosteric enzymes that alter their Vmax values in response
to regulators are sometimes called V-class enzymes. The more
common type of allosteric enzyme, in which KM is altered,
comprises K-class enzymes. Palmitoyl CoA causes depolymerization
and thus inactivation.

42. The thiolate anion of CoA attacks the 3-keto group to form a 
tetrahedral intermediate. This intermediate collapses to form acyl 
CoA and the enolate anion of acetyl CoA. Protonation of the
enolate yields acetyl CoA.
43. 
Acetoacetyl-ACP


44. (a) Fats burn in the flame of carbohydrates. Without carbohydrates,
there would be no anaplerotic reactions to replenish the
components of the citric acid cycle. With a diet of fats only, the 
acetyl CoA from fatty acid degradation would build up. 

(b) Acetone from ketone bodies 

(c) Yes. Odd-chain fatty acids would lead to the production of 
propionyl CoA, which can be converted into succinyl CoA, a citric 
acid cycle component. It would serve to replenish the citric acid 
cycle and mitigate the halitosis. 

45. A labeled fat can enter the citric acid cycle as acetyl CoA and 
yield labeled oxaloacetate, but only after two carbon atoms have 
been lost as CO₂. Consequently, even though oxaloacetate may be
labeled, there can be no net synthesis of oxaloacetate and hence no 
net synthesis of glucose or glycogen. 

46. (a) The Vmax is decreased and the KM is increased. Vmax (wild
type) = 13 nmol minute⁻¹ mg⁻¹; KM (wild type) = 45 μM; Vmax
(mutant) = 8.3 nmol minute⁻¹ mg⁻¹; KM (mutant) = 74 μM.

(b) Both the Vmax and the KM are decreased. Vmax (wild type) =
41 nmol minute⁻¹ mg⁻¹; KM (wild type) = 104 μM; Vmax
(mutant) = 23 nmol minute⁻¹ mg⁻¹; KM (mutant) = 69 μM.

(c) The wild type is significantly more sensitive to malonyl CoA.

(d) With respect to carnitine, the mutant displays approximately
65% of the activity of the wild type; with respect to palmitoyl CoA,
approximately 50% activity. On the other hand, 10 μM of malonyl
CoA inhibits approximately 80% of the wild type but has essentially
no effect on the mutant enzyme.


(e) The glutamate appears to play a more prominent role in regu- 
lation by malonyl CoA than in catalysis. 


Chapter 23 

1. When the proteins are denatured, all of the peptide bonds are 
accessible to proteolytic enzymes. If the three-dimensional struc- 
ture of a protein is maintained, access to many peptide bonds is 
denied to the proteolytic enzymes. 

2. First, the ubiquitin-activating enzyme (E1) links ubiquitin to 
a sulfhydryl group on E1 itself. Next, the ubiquitin is transferred 
to a cysteine residue on the ubiquitin-conjugating enzyme (E2) by 
E2. The ubiquitin–protein ligase (E3), using the ubiquitinated E2
as a substrate, transfers the ubiquitin to the target protein. 

3. (a) 7; (b) 4; (c) 2; (d) 10; (e) 5; (f) 3; (g) 9; (h) 1; (i) 6; (j) 8.

4. (a) The ATPase activity of the 26S proteasome resides in the 
19S subunit. The energy of ATP hydrolysis could be used to 
unfold the substrate, which is too large to enter the catalytic barrel. 
ATP may also be required for translocation of the substrate into 
the barrel. 

(b) Substantiates the answer in part a. Because they are small, the 
peptides do not need to be unfolded. Moreover, small peptides 
could probably enter all at once and not require translocation. 

5. (a) Pyruvate; (b) oxaloacetate; (c) α-ketoglutarate;
(d) α-ketoisocaproate; (e) phenylpyruvate; (f) hydroxyphenylpyruvate.

6. (a) Aspartate + α-ketoglutarate + GTP + ATP + 2 H₂O
+ NADH + H⁺ → ½ glucose + glutamate + CO₂ + ADP +
GDP + NAD⁺ + 2 Pᵢ.

The required coenzymes are pyridoxal phosphate in the transamination
reaction and NAD⁺/NADH in the redox reactions.

(b) Aspartate + CO₂ + NH₄⁺ + 3 ATP + NAD⁺ + 4 H₂O →
oxaloacetate + urea + 2 ADP + 4 Pᵢ + AMP + NADH + H⁺.

7. In the eukaryotic proteasome, the distinct β subunits have
different substrate specificities, allowing proteins to be more 
thoroughly degraded. 

8. The six subunits probably exist as a heterohexamer. Cross- 
linking experiments could test the model and help determine 
which subunits are adjacent to one another. 

9. Thiamine pyrophosphate 

10. Aminotransferases transfer the α-amino group to α-ketoglutarate
to form glutamate. Glutamate is oxidatively deaminated to
form an ammonium ion. 

11. Aspartate (oxaloacetate), glutamate (α-ketoglutarate), alanine
(pyruvate). 

12. Serine and threonine 

13. They are either fuels for the citric acid cycle, components of 
the citric acid cycle, or molecules that can be converted into a fuel 
for the citric acid cycle in one step. 

14. It acts as an electron sink. 

15. Carbamoyl phosphate and aspartate

16. (a) 4; (b) 5; (c) 1; (d) 6; (e) 7; (f) 3; (g) 2.

17. A, arginine; B, citrulline; C, ornithine; D, argininosuccinate.
The order of appearance: C, B, D, A.

18. CO₂ + NH₄⁺ + 3 ATP + NAD⁺ + aspartate + 3 H₂O →
urea + 2 ADP + 2 Pᵢ + AMP + PPᵢ + NADH + H⁺ +
oxaloacetate.

Four high-transfer-potential phosphoryl groups are spent. Note, 
however, that an NADH is generated if fumarate is converted into 
oxaloacetate. NADH can generate 2.5 ATP in the electron-transport 
chain. Taking these ATP into account, only 1.5 high-transfer- 
potential phosphoryl groups are spent. 
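The accounting in answer 18 can be laid out as a short calculation. This is a sketch based on the stoichiometry of the equation in the answer (two ATP converted into ADP and one into AMP + PPᵢ) and the quoted yield of 2.5 ATP per NADH; the variable names are illustrative.

```python
ATP_PER_NADH = 2.5

atp_to_adp = 2     # each costs one high-transfer-potential phosphoryl group
atp_to_amp = 1     # costs two, because the PPi released is also hydrolyzed

phosphoryl_groups_spent = atp_to_adp * 1 + atp_to_amp * 2
nadh_generated = 1  # from the conversion of fumarate into oxaloacetate

net_cost = phosphoryl_groups_spent - nadh_generated * ATP_PER_NADH
print(phosphoryl_groups_spent, net_cost)
```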

19. The synthesis of fumarate by the urea cycle is important because 
it links the urea cycle and the citric acid cycle. Fumarate is hydrated 
to malate, which, in turn, is oxidized to oxaloacetate. Oxaloacetate has
several possible fates: (1) transamination to aspartate, (2) conversion 
into glucose by the gluconeogenic pathway, (3) condensation with 
acetyl CoA to form citrate, or (4) conversion into pyruvate. You can 
collect. 

20. Ornithine transcarbamoylase (analogous to PALA; see 
Chapter 10). 

21. Ammonia could lead to the amination of α-ketoglutarate,
producing a high concentration of glutamate in an unregulated
fashion. α-Ketoglutarate for glutamate synthesis could be removed
from the citric acid cycle, thereby diminishing the cell’s respiration 
capacity. 

22. The mass spectrometric analysis strongly suggests that three
enzymes (pyruvate dehydrogenase, α-ketoglutarate dehydrogenase,
and the branched-chain α-ketoacid dehydrogenase)
are deficient. Most likely, the common E3 component of these
enzymes is missing or defective. This proposal could be tested by 
purifying these three enzymes and assaying their ability to cata- 
lyze the regeneration of lipoamide. 

23. Benzoate, phenylacetate, and arginine would be given to sup- 
ply a protein-restricted diet. Nitrogen would emerge in hippurate, 
phenylacetylglutamine, and citrulline. 

24. The liver is the primary tissue for capturing nitrogen as 
urea. If the liver is damaged (for instance, by hepatitis or the 
excessive consumption of alcohol), free ammonia is released 
into the blood. 

25. This defect can be partly bypassed by providing a surplus 
of arginine in the diet and restricting the total protein intake.
In the liver, arginine is split into urea and ornithine, which
then reacts with carbamoyl phosphate to form citrulline. This 
urea-cycle intermediate condenses with aspartate to yield 
argininosuccinate, which is then excreted. Note that two nitro- 
gen atoms—one from carbamoyl phosphate and the other from 
aspartate—are eliminated from the body per molecule of arginine 
provided in the diet. In essence, argininosuccinate substitutes 
for urea in carrying nitrogen out of the body. The formation of 
argininosuccinate removes the nitrogen, and the restriction on 
protein intake relieves the aciduria. 

26. Aspartame, a dipeptide ester (L-aspartyl-L-phenylalanine 
methyl ester), is hydrolyzed to L-aspartate and L-phenylalanine. 
High levels of phenylalanine are harmful in phenylketonurics. 

27. N-Acetylglutamate is synthesized from acetyl CoA and 
glutamate. Once again, acetyl CoA serves as an activated
acetyl donor. This reaction is catalyzed by N-acetylglutamate
synthase. 

28. Not all proteins are created equal: some are more important 
than others. Some proteins would be degraded to provide the miss- 
ing amino acid. The nitrogen from the other amino acids would be 
excreted as urea. Consequently, more nitrogen would be excreted 
than ingested. 

29. The carbon skeletons of ketogenic amino acids can be con- 
verted into ketone bodies or fatty acids. Only leucine and lysine are 
purely ketogenic. Glucogenic amino acids are those whose carbon 
skeletons can be converted into glucose. 

30. The branched-chain amino acids leucine, isoleucine, and 
valine. The required enzyme is the branched-chain α-ketoacid
dehydrogenase complex. 

31. Pyruvate (glycolysis and gluconeogenesis), acetyl CoA (citric 
acid cycle and fatty acid synthesis), acetoacetyl CoA (ketone-body 
formation), α-ketoglutarate (citric acid cycle), succinyl CoA (citric
acid cycle), fumarate (citric acid cycle), and oxaloacetate (citric acid 
cycle and gluconeogenesis). 
The equilibrium constant for the interconversion of L-serine and
D-serine is exactly 1.

34. Exposure of such a domain suggests that a component
of a multiprotein complex has failed to form properly or that
one component has been synthesized in excess. This exposure 
leads to rapid degradation and the restoration of appropriate 
stoichiometries. 

35. (a) Depletion of glycogen stores. When they are gone, 
proteins must be degraded to meet the glucose needs of the 
brain. The resulting amino acids are deaminated, and the 
nitrogen atoms are excreted as urea. 

(b) The brain has adapted to the use of ketone bodies, which 
are derived from fatty acid catabolism. In other words, the 
brain is being powered by fatty acid breakdown. 

(c) When the glycogen and lipid stores are gone, the only 
available energy source is protein. 

36. Deamination to α-keto-β-methylvalerate; oxidative
decarboxylation to α-methylbutyryl CoA; oxidation to tiglyl
CoA; hydration, oxidation, and thiolysis yield acetyl CoA
and propionyl CoA; propionyl CoA to succinyl CoA.

37. Glycogen phosphorylase. The coenzyme serves as an 
acid-base catalyst. 

38. In the Cori cycle, the carbon atoms are transferred 
from muscle to liver as lactate. For lactate to be of any use, 
it must be oxidized to pyruvate. This oxidation requires NAD+ as
the electron acceptor and generates NADH. When the
carbon atoms are transferred as alanine, transamination 
yields pyruvate directly. 

39. (a) Virtually no digestion in the absence of nucleotides. 
(b) Protein digestion is greatly stimulated by the presence 
of ATP. (c) AMP-PNP, a nonhydrolyzable analog of ATP, 
is no more effective than ADP. (d) The proteasome requires 
neither ATP nor PAN to digest small substrates. (e) PAN 
and ATP hydrolysis may be required to unfold the peptide and
translocate it into the proteasome. (f) Although Thermoplasma
PAN is not as effective with the other proteasomes, it nonetheless
results in threefold to fourfold stimulation of digestion. (g) In
light of the fact that the archaea and eukarya diverged several
billion years ago, the fact that Thermoplasma PAN can stimulate
rabbit muscle proteasomes suggests homology not only between the
proteasomes, but also between PAN and the 19S subunit (most likely
the ATPases) of the mammalian 26S proteasome.


Chapter 24 

1. Nitrogen fixation is the conversion of atmospheric N2 into 
NH4+. Diazotrophic (nitrogen-fixing) microorganisms are able
to fix nitrogen. 

2. Oxaloacetate, pyruvate, ribose-5-phosphate,
phosphoenolpyruvate, erythrose-4-phosphate, α-ketoglutarate,
and 3-phosphoglycerate.

3. Human beings do not have the biochemical pathways to synthesize 
certain amino acids from simpler precursors. Consequently, these 
amino acids are “essential” and must be obtained from the diet. 

4. Glucose + 2 ADP + 2 Pi + 2 NAD+ + 2 glutamate →
2 alanine + 2 α-ketoglutarate + 2 ATP + 2 NADH +
2 H2O + 2 H+.

5. N2 → NH4+ → glutamate → serine → glycine →
δ-aminolevulinate → porphobilinogen → heme.

6. False. Nitrogen fixation is thermodynamically favorable. 
Nitrogenase is required because the process is kinetically disfa- 
vored. 

7. Pyridoxal phosphate (PLP) 

8. S-Adenosylmethionine, tetrahydrofolate, and methylcobalamin.

9. (a) N5,N10-Methylenetetrahydrofolate;
(b) N5-methyltetrahydrofolate.

10. γ-Glutamyl phosphate is a likely reaction intermediate.

11. The synthesis of asparagine from aspartate passes through
an acyl-adenylate intermediate. One of the products of
the reaction will be 18O-labeled AMP.

12. The administration of glycine leads to the forma- 
tion of isovalerylglycine. This water-soluble conjugate, 
in contrast with isovaleric acid, is excreted very rapidly 
by the kidneys. 

13. The nitrogen atom shaded red is derived from 
glutamine. The carbon atom shaded blue is derived 
from serine. 

14. They carry out nitrogen fixation. The absence of
photosystem II provides an environment in which O2
is not produced. Recall that the nitrogenase is very
rapidly inactivated by O2.

15. The cytoplasm is a reducing environment, whereas
the extracellular milieu is an oxidizing environment.

16. (a) None; (b) D-glutamate and oxaloacetate.

17. Succinyl CoA is formed in the mitochondrial
matrix.

18. Alanine from pyruvate; aspartate from oxaloacetate; 
glutamate from α-ketoglutarate.

19. Lysine cyclodeaminase converts L-lysine into the 
six-membered ring analog of proline, also referred to as 
L-homoproline or L-pipecolate: 


[Structure: pipecolate]


20. Y could inhibit the C → D step, Z could inhibit the C → F
step, and C could inhibit A → B. This scheme is an example of
sequential feedback inhibition. Alternatively, Y could inhibit the
C → D step, Z could inhibit the C → F step, and the A → B
step would be inhibited only in the presence of both Y and Z. This
scheme is called concerted feedback inhibition.

21. The rate of the A → B step in the presence of high levels of
Y and Z would be 24 s⁻¹ (0.6 × 0.4 × 100 s⁻¹).
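
The arithmetic in this answer can be sketched in a few lines of Python. The sketch assumes, as the numbers in the answer imply, that Y alone leaves 60% of the uninhibited rate, that Z alone leaves 40%, and that the two effects multiply. The function name and argument names are illustrative only.

```python
def residual_rate(uninhibited_rate, remaining_fractions):
    """Multiply an uninhibited rate by the fraction of activity that
    each inhibitor leaves behind (assumed to act independently)."""
    rate = uninhibited_rate
    for fraction in remaining_fractions:
        rate *= fraction
    return rate

# 100 s^-1 uninhibited; Y leaves 0.6 of the activity, Z leaves 0.4.
rate_with_y_and_z = residual_rate(100.0, [0.6, 0.4])
print(f"A -> B rate with Y and Z: {rate_with_y_and_z:.0f} s^-1")
```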

22. Lysine 258 is absolutely essential for the activity of aspartate 
aminotransferase, as it is responsible both for the formation of the 
internal aldimine with the pyridoxal phosphate cofactor and for 
transferring the proton between the ketimine and quinonoid inter- 
mediates. Mutation of this residue to cysteine would be expected 
to dramatically impair catalysis, as cysteine cannot occupy the 
same space as lysine and also exhibits differing pKa properties.
Upon treatment with 2-bromoethylamine, however, the resulting
thioether now has a shape and pKa similar to the original lysine
side chain. Hence, some catalytic activity is restored. 

23. An external aldimine forms with SAM, which is deprotonated
to form the quinonoid intermediate. The deprotonated carbon atom
attacks the carbon atom adjacent to the sulfur atom to form the
cyclopropane ring and release methylthioadenosine, the other product.

24. An external aldimine forms with L-serine, which is deprotonat- 
ed to form the quinonoid intermediate. This intermediate is repro- 
tonated on its opposite face to form an aldimine with D-serine. This 
compound is cleaved to release D-serine. The equilibrium constant 
for a racemization reaction is 1 because the reactant and product are 
exact mirror images of each other. 

25. (a) In the first step, histidine attacks the methylene group from 
the methionine subgroup of SAM (rather than the usual methyl 
substituent), resulting in the transfer of an aminocarboxypropyl 
group. Three subsequent conventional SAM-mediated methylations
of the primary amine yield diphthine.

(b) In this chapter, we have observed two examples of an
ATP-dependent conversion of a carboxylate into an amide: glutamine
synthetase, which uses an acyl-phosphate intermediate, and
asparagine synthetase, which uses an acyl-adenylate intermediate. Either
mechanism is possible in formation of diphthamide from diphthine.

26. Synthesis from oxaloacetate and α-ketoglutarate would deplete
the citric acid cycle, which would decrease ATP production.
Anaplerotic reactions would be required to replenish the citric
acid cycle.

27. SAM is the donor for DNA methylation reactions that protect 
a host from digestion by its own restriction enzymes. A lack of 
SAM would render the bacterial DNA susceptible to digestion by 
the cell’s own restriction enzymes. 

28. Acetate → acetyl-CoA → citrate → isocitrate →
α-ketoglutarate → succinyl-CoA.

29. (a) Asparagine is much more abundant in the dark. More 
glutamine is present in the light. These amino acids show the most 
dramatic effects. Glycine also is more abundant in the light. 

(b) Glutamine is a more metabolically reactive amino acid, used
in the synthesis of many other compounds. Consequently, when
energy is available as light, glutamine will be preferentially synthe- 
sized. Asparagine, which carries more nitrogen per carbon atom 
and is thus a more-efficient means of storing nitrogen when energy 
is short, is synthesized in the dark. Glycine is more prevalent in the 
light because of photorespiration. 

(c) White asparagus has an especially high concentration of aspar- 
agine, which accounts for its intense taste. All asparagus has a large 
amount of asparagine. In fact, as suggested by its name, asparagine 
was first isolated from asparagus. 


Chapter 25 

1. In de novo synthesis, the nucleotides are synthesized from 
simpler precursor compounds, in essence from scratch. In sal- 
vage pathways, preformed bases are recovered and attached to 
riboses. 

2. Carbon 2 and nitrogen 3 come from carbamoyl phosphate. 
Nitrogen 1 and carbons 4, 5, and 6 are derived from aspartate. 

3. Nitrogen 1: aspartate; carbon 2: N10-formyltetrahydrofolate;
nitrogen 3: glutamine; carbons 4 and 5 and nitrogen 7: glycine;
carbon 6: CO2; carbon 8: N10-formyltetrahydrofolate; nitrogen 9:
glutamine.

4. Energy currency: ATP; signal transduction: ATP and GTP; 
RNA synthesis: ATP, GTP, CTP, and UTP; DNA synthesis: 
dATP, dCTP, dGTP, and TTP; components of coenzymes: ATP in 
CoA, FAD, and NAD(P)+; carbohydrate synthesis: UDP-glucose.
These are just some of the uses.

5. A nucleoside is a base attached to ribose. A nucleotide is a 
nucleoside with the ribose bearing one or more phosphates. 

6. (a) 9; (b) 7; (c) 6; (d) 10; (e) 2; (f) 4; (g) 1; (h) 11; (i) 8; (j) 3; (k) 5.

7. Substrate channeling is the process whereby the product of
one active site moves to become a substrate at another active site 
without ever leaving the enzyme. A channel connects the active 
sites. Substrate channeling greatly enhances enzyme efficiency and 
minimizes the diffusion of a substrate to an active site. 

8. Glucose + 2 ATP + 2 NADP+ + H2O → PRPP + CO2 +
ADP + AMP + 2 NADPH + 3 H+.

9. Glutamine + aspartate + CO2 + 2 ATP + NAD+ → orotate
+ 2 ADP + 2 Pi + glutamate + NADH + H+.

10. (a, c, and d) PRPP; (b) carbamoyl phosphate. 

11. PRPP and formylglycinamide ribonucleotide 

12. dUMP + serine + NADPH + H+ → dTMP +
NADP+ + glycine.


13. There is a deficiency of N10-formyltetrahydrofolate.
Sulfanilamide inhibits the synthesis of folate by acting as an analog 
of p-aminobenzoate, one of the precursors of folate. 

14. (a) Cell A cannot grow in a HAT medium, because it cannot
synthesize TMP either from thymidine or from dUMP. Cell B
cannot grow in this medium, because it cannot synthesize purines
by either the de novo pathway or the salvage pathway. Cell C
can grow in a HAT medium because it contains active thymidine
kinase from cell B (enabling it to phosphorylate thymidine
to TMP) and hypoxanthine guanine phosphoribosyltransferase
from cell A (enabling it to synthesize purines from hypoxanthine
by the salvage pathway).

(b) Transform cell A with a plasmid containing foreign genes of 
interest and a functional thymidine kinase gene. The only cells that 
will grow in a HAT medium are those that have acquired a
thymidine kinase gene; nearly all of these transformed cells will also
contain the other genes on the plasmid. 

15. The reciprocal substrate relation refers to the fact that 
AMP synthesis requires GTP, whereas GMP synthesis requires 
ATP. These requirements tend to balance the synthesis of ATP 
and GTP. 

16. Ring carbon 6 in cytosine will be labeled. In guanine, only 
carbon 5 will be labeled with 13C.

17. The enzyme that uses ammonia synthesizes carbamoyl phos- 
phate for a reaction with ornithine, the first step of the urea cycle. 
The enzyme that uses glutamine synthesizes carbamoyl phos- 
phate for use in the first step of pyrimidine biosynthesis. 

18. These patients have a high level of urate because of the break- 
down of nucleic acids. Allopurinol prevents the formation of kidney 
stones and blocks other deleterious consequences of hyperuricemia 
by preventing the formation of urate. 

19. The free energies of binding are −57.7 (wild type),
−49.8 (Asn 27), and −38.1 (Ser 27) kJ mol⁻¹ (−13.8, −11.9,
and −9.1 kcal mol⁻¹, respectively). The loss in binding energy is
7.9 kJ mol⁻¹ (1.9 kcal mol⁻¹) and 19.7 kJ mol⁻¹ (4.7 kcal mol⁻¹).

20. Inosine or hypoxanthine could be administered.

21. N-1 in both cases, and the amine group linked to C-6 in ATP.

22. Nitrogen atoms 3 and 9 in the purine ring.

23. Allopurinol, an analog of hypoxanthine, is a suicide inhibitor 
of xanthine oxidase. 

24. An oxygen atom is added to allopurinol to form alloxanthine.

25.
The synthesis of carbamoyl phosphate requires 2 ATP      2 ATP
The formation of PRPP from ribose 5-phosphate
  yields an AMP*                                         2 ATP
The conversion of UMP to UTP requires 2 ATP              2 ATP
The conversion of UTP to CTP requires 1 ATP              1 ATP
Total                                                    7 ATP


*Remember that AMP is the equivalent of 2 ATP because an 
ATP must be expended to generate ADP, the substrate for ATP 
synthesis. 
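
The tally above can be checked with a short Python sketch. The dictionary keys are just descriptive labels, and the values are the ATP equivalents listed in the table, including the convention from the footnote that one AMP counts as 2 ATP.

```python
# ATP equivalents per step, copied from the table in the answer.
atp_cost_per_step = {
    "carbamoyl phosphate synthesis": 2,
    "PRPP formation (ATP -> AMP, counted as 2 ATP)": 2,
    "UMP -> UTP": 2,
    "UTP -> CTP": 1,
}

total_atp = sum(atp_cost_per_step.values())
print(f"Total ATP equivalents per CTP: {total_atp}")
```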

26. (a) Carboxyaminoimidazole ribonucleotide; (b) glycinamide 
ribonucleotide; (c) phosphoribosyl amine; (d) formylglycinamide 
ribonucleotide. 

27. The first reaction proceeds by phosphorylation of glycine to 
form an acyl phosphate followed by nucleophilic attack by the 
amine of phosphoribosylamine to displace orthophosphate. The 
second reaction consists of adenylation of the carbonyl group
of xanthylate followed by nucleophilic attack by ammonia to
displace AMP.


28. The -NH2 group attacks the carbonyl carbon atom to form
a tetrahedral intermediate. Removal of a proton leads to the 
elimination of water to form inosinate. 

29. PRPP is the activated intermediate in the synthesis of
phosphoribosylamine in the de novo pathway of purine formation;
of purine nucleotides from free bases by the salvage pathway; of
orotidylate in the formation of pyrimidines; of nicotinate ribo- 
nucleotide; of phosphoribosyl ATP in the pathway leading to 
histidine; and of phosphoribosylanthranilate in the pathway 
leading to tryptophan. 

30. (a) cAMP; (b) ATP; (c) UDP-glucose; (d) acetyl CoA;
(e) NAD+, FAD; (f) dideoxynucleotides; (g) fluorouracil; (h) CTP
inhibits ATCase. 

31. In vitamin B12 deficiency, methyltetrahydrofolate cannot
donate its methyl group to homocysteine to regenerate methionine. 
Because the synthesis of methyltetrahydrofolate is irreversible, the 
cell’s tetrahydrofolate will ultimately be converted into this form. 
No formyl or methylene tetrahydrofolate will be left for nucleotide 
synthesis. Vitamin B12 is also required to metabolize propionyl
CoA generated in the oxidation of odd-chain fatty acids and in the 
degradation of methionine. 

32. Because folate is required for nucleotide synthesis, cells that 
are dividing rapidly would be most readily affected. They would 
include cells of the intestine, which are constantly replaced, and 
precursors to blood cells. A lack of intestinal cells and blood cells 
would account for the symptoms often observed. 

33. The cytoplasmic level of ATP in the liver falls and that of 
AMP rises above normal in all three conditions. The excess AMP 
is degraded to urate. 

34. Succinate —> malate —> oxaloacetate by the citric acid cycle. 
Oxaloacetate —> aspartate by transamination, followed by pyrimi- 
dine synthesis. Carbons 4, 5, and 6 are labeled. 

35. Glucose will most likely be converted into two molecules of 
pyruvate, one of which will be labeled in the 2 position: 


Now consider two common fates of pyruvate—conversion into 
acetyl CoA and subsequent processing by the citric acid cycle or 
carboxylation by pyruvate carboxylase to form oxaloacetate. 
Formation of citrate by condensing the labeled pyruvate with 
oxaloacetate will yield labeled citrate: 

The labeled carbon will be retained through one round of the citric
acid cycle but, on the formation of the symmetric succinate, the
label will appear in two different positions. Thus, when succinate
is metabolized to oxaloacetate, which may be aminated to form
aspartate, two carbons will be labeled:


When this aspartate is used to form uracil, the labeled COO−
attached to the α-carbon is lost and the other COO− becomes
incorporated into uracil as carbon 4.

Suppose, instead, that labeled 2-[14C]pyruvate is carboxylated
to form oxaloacetate and processed to form aspartate. In this case,
the α-carbon of aspartate bears the label.

When this aspartate is used to synthesize uracil, carbon 6 bears the
label.

36. (a) Some ATP can be salvaged from the ADP that is being 
generated. (b) There are equal numbers of high-phosphoryl-transfer- 
potential groups on each side of the equation. (c) Because the adeny- 
late kinase reaction is at equilibrium, the removal of AMP would 
lead to the formation of more ATP. (d) Essentially, the cycle serves 
as an anaplerotic reaction for the generation of the citric acid cycle 
intermediate fumarate. 

37. (i) The formation of 5-aminoimidazole-4-carboxamide
ribonucleotide from 5-aminoimidazole-4-(N-succinylcarboxamide)
ribonucleotide in the synthesis of IMP. (ii) The formation of
AMP from adenylosuccinate. (iii) The formation of arginine 
from argininosuccinate in the urea cycle. 

38. Allopurinol is an inhibitor of xanthine oxidase, which is on the 
pathway for urate synthesis. In your pet duck, this pathway is the 
means by which excess nitrogen is excreted. If xanthine oxidase 
were inhibited in your duck, nitrogen could not be excreted, with 
severe consequences such as the formation of a dead duck. 


Chapter 26 

1. Glycerol 3-phosphate is the foundation for both triacylglycerol 
and phospholipid synthesis. Glycerol 3-phosphate is acylated 
twice to form phosphatidate. In triacylglycerol synthesis, the
phosphoryl group is removed from phosphatidate to form
diacylglycerol, which is then acylated to form triacylglycerol. In
phospholipid synthesis, phosphatidate commonly reacts with CTP 
to form CDP-diacylglycerol, which then reacts with an alcohol to 
form a phospholipid. Alternatively, diacylglycerol may react with a 
CDP-alcohol to form a phospholipid. 

2. Glycerol 3-phosphate is formed primarily by the reduction of 
dihydroxyacetone phosphate, a gluconeogenic intermediate, and to 
a lesser extent by the phosphorylation of glycerol. 

3. Glycerol + 4 ATP + 3 fatty acids + 4 H2O →
triacylglycerol + ADP + 3 AMP + 7 Pi + 4 H+.

4. Glycerol + 3 ATP + 2 fatty acids + 2 H2O + CTP +
ethanolamine → phosphatidylethanolamine + CMP + ADP +
2 AMP + 6 Pi + 3 H+.

5. Three. One molecule of ATP to form phosphorylethanolamine 
and two molecules of ATP to regenerate CTP from CMP. 

6. All are synthesized from ceramide. In sphingomyelin, the 
terminal hydroxyl group of ceramide is modified with phospho- 
rylcholine. In a cerebroside, the hydroxyl group has a glucose or 
galactose attached. In a ganglioside, oligosaccharide chains are 
attached to the hydroxyl group. 

7. (i) Activate the diacylglycerol as CDP-DAG. (ii) Activate the 
alcohol as CDP-alcohol. (iii) Use the base-exchange reaction. 

8. (a) CDP-diacylglycerol; (b) CDP-ethanolamine; (c) acyl CoA;
(d) phosphatidylcholine; (e) UDP-glucose or UDP-galactose;
(f) UDP-galactose; (g) geranyl pyrophosphate.

9. Such mutations are seen in mice. The amount of adipose tis- 
sue would decrease severely because diacylglycerol could not be 
formed. Normally, diacylglycerol is acylated to form triacylglycer- 
ols. If there were deficient phosphatidic acid phosphatase activity, 
no triacylglycerols would form. 

10. (i) The synthesis of activated isoprene units (isopentenyl
pyrophosphate), (ii) the condensation of six of the activated isoprene
units to form squalene, and (iii) cyclization of the squalene to form
cholesterol.

11. The amount of reductase and its activity control the regulation
of cholesterol biosynthesis. Transcriptional control is mediated
by SREBP. Translation of the reductase mRNA also is controlled.
The reductase itself may undergo regulated proteolysis. Finally, 
the activity of the reductase is inhibited by phosphorylation by 
AMP kinase when ATP levels are low. 

12. (a and b) None, because the label is lost as CO2.

13. The hallmark of this genetic disease is elevated cholesterol levels 
in the blood of even young children. The excess cholesterol is taken 
up by macrophages, which eventually results in the formation of
plaques and heart disease. There are many mutations that cause the 
disease, but all result in malfunctioning of the LDL receptor. 

14. The categories of mutations are: (i) no receptor is synthe- 
sized; (ii) receptors are synthesized but do not reach the plasma 
membrane, because they lack signals for intracellular transport 
or do not fold properly; (iii) receptors reach the cell surface, but 
they fail to bind LDL normally because of a defect in the LDL- 
binding domain; (iv) receptors reach the cell surface and bind 
LDL, but they fail to cluster in coated pits because of a defect in 
their carboxyl-terminal regions. 

15. “None of your business” and “I don’t talk biochemistry
until after breakfast” are appropriate but rude and uninformative
answers. A better answer might be: “Although it is true that cho- 
lesterol is a precursor to steroid hormones, the rest of the statement 
is oversimplified. Cholesterol is a component of membranes, and 
membranes literally define cells, and cells make up tissues. But to 
say that cholesterol ‘makes’ cells and tissues is wrong.” 

16. Statins are competitive inhibitors of HMG-CoA reductase. 
They are used as drugs to inhibit cholesterol synthesis in patients 
with high levels of cholesterol. 

17. No. Cholesterol is essential for membrane function and as a 
precursor for bile salts and steroid hormones. The complete lack of 
cholesterol would be lethal. 

18. Deamination of cytidine to uridine changes CAA (Gln) into 
UAA (stop). 

19. The LDL contains apolipoprotein B-100, which binds to an 
LDL receptor on the cell surface in a region known as a coated 
pit. On binding, the complex is internalized by endocytosis to
form an internal vesicle. The vesicle is separated into two
components. One, with the receptor, is transported back to the cell
surface and fuses with the membrane, allowing continued use of 
the receptor. The other vesicle fuses with lysosomes inside the 
cell. The cholesteryl esters are hydrolyzed, and free cholesterol is 
made available for cellular use. The LDL protein is hydrolyzed to 
free amino acids. 

20. Benign prostatic hypertrophy can be treated by inhibiting 
5α-reductase. Finasteride, the 4-azasteroid analog of
dihydrotestosterone, competitively inhibits the reductase but does not act on
androgen receptors. Patients taking finasteride have a markedly 
lower plasma level of dihydrotestosterone and a nearly normal 
level of testosterone. The prostate gland becomes smaller, whereas 
testosterone-dependent processes such as fertility, libido, and 
muscle strength appear to be unaffected. 

21. Patients who are most sensitive to debrisoquine have a
deficiency of a liver P450 enzyme encoded by a member of the
CYP2 subfamily. This characteristic is inherited as an autosomal
recessive trait. The capacity to degrade other drugs may be
impaired in people who hydroxylate debrisoquine at a slow rate, 
because a single P450 enzyme usually handles a broad range of 
substrates. 

22. Many hydrophobic odorants are deactivated by hydroxy- 
lation. Molecular oxygen is activated by a cytochrome P450 
monooxygenase. NADPH serves as the reductant. One oxygen 
atom of O2 goes into the odorant substrate, whereas the other is 
reduced to water. 

23. Recall that dihydrotestosterone is crucial for the development 
of male characteristics in the embryo. If a pregnant woman were 
to be exposed to Propecia, the 5α-reductase of the male embryo
would be inhibited, which could result in severe developmental 
abnormalities. 

24. The oxygenation reactions catalyzed by the cytochrome P450 
family permit greater flexibility in biosynthesis. Because plants are 
not mobile, they must rely on physical defenses, such as thorns, 
and chemical defenses, such as toxic alkaloids. The larger P450 
array might permit greater biosynthetic versatility. 

25. This knowledge would enable clinicians to characterize the 
likelihood of a patient’s having an adverse drug reaction or being 
susceptible to chemical-induced illnesses. It would also permit a 
personalized and especially effective drug-treatment regime for 
diseases such as cancer. 

26. The honey bees may be especially sensitive to environmen- 
tal toxins, including pesticides, because these chemicals are not 
readily detoxified, owing to the minimal P450 system. 

27. The core structure of a steroid is four fused rings: three 
cyclohexane rings and one cyclopentane ring. In vitamin D, the
B ring is split by ultraviolet light.

28. The negatively charged phosphoserine residue interacts with 
the positively charged protonated histidine residue and decreases 
its ability to transfer a proton to the thiolate. 

29. The methyl group is first hydroxylated. The hydroxymethylamine
eliminates formaldehyde to form methylamine.

30. Note that a cytidine nucleotide plays the same role in the 
synthesis of these phosphoglycerides as a uridine nucleotide 
does in the formation of glycogen (Section 21.4). In all of
these biosyntheses, an activated intermediate (UDP-glucose,
CDP-diacylglycerol, or CDP-alcohol) is formed from a phos- 
phorylated substrate (glucose 1-phosphate, phosphatidate, or 
a phosphorylalcohol) and a nucleoside triphosphate (UTP or 
CTP). The activated intermediate then reacts with a hydroxyl 
group (the terminus of glycogen, the side chain of serine, or a 
diacylglycerol). 

31. The attachment of isoprenoid side chains confers hydropho- 
bic character. Proteins having such a modification are targeted to 
membranes. 

32. 3-Hydroxy-3-methylglutaryl CoA is also a precursor for 
ketone-body synthesis. If fuel is needed elsewhere in the body, as 
might be the case during a fast, 3-hydroxy-3-methylglutaryl CoA 
is converted into the ketone acetoacetate. If energy needs are met, 
the liver will synthesize cholesterol. 

33. One way in which phosphatidylcholine can be synthesized
is by the addition of three methyl groups to
phosphatidylethanolamine. The methyl donor is a modified form of methionine,
S-adenosylmethionine or SAM (Section 24.2). 

34. Citrate is transported out of the mitochondria in times of 
plenty. ATP-citrate lyase yields acetyl CoA and oxaloacetate. The 
acetyl CoA can then be used to synthesize cholesterol. 

35. (a) There is no effect. (b) Because actin is not controlled
by cholesterol, the amount isolated should be the same in both
experimental groups; a difference would suggest a problem in the 
RNA isolation. (c) The presence of cholesterol in the diet dra- 
matically reduces the amount of HMG-CoA reductase protein. 
(d) A common means of regulating the amount of a protein pres- 
ent is to regulate transcription, which is clearly not the case here. 
(e) The translation of mRNA could be inhibited, and the protein 
could be rapidly degraded. 


Chapter 27 

1. Adipose tissue is now known to be an active endocrine organ, 
secreting signal molecules called adipokines. 

2. Caloric homeostasis is the condition in which the energy expen- 
diture of an organism is equal to the energy intake. 

3. Leptin and insulin 

4. CCK produces a feeling of satiety and stimulates the secretion 
of digestive enzymes by the pancreas and the secretion of bile salts 
by the gall bladder. GLP-1 also produces a feeling of satiety; in 
addition, it potentiates the glucose-induced secretion of insulin by 
the β cells of the pancreas.

5. Obviously, something is amiss. Although the answer is not 
known, the leptin-signaling pathway appears to be inhibited by 
suppressors of cytokine signaling, the regulatory proteins. 

6. 1: a, b; 2: f; 3: c, d, f; 4: c, d; 5: c; 6: f; 7: e; 8: e; 9: e.

7. Phosphorylation of dietary glucose after it enters the liver; glu- 
coneogenesis; glycogen breakdown. 

8. Type 1 diabetes is due to autoimmune destruction of the
insulin-producing cells of the pancreas. Type 1 diabetes is also called
insulin-dependent diabetes because affected people require insulin 
to survive. Type 2 diabetes is characterized by insulin resistance. 
Insulin is produced, but the tissues that should respond to insulin, 
such as muscle, do not. 

9. Leptin stimulates processes impaired in diabetes. For instance, 
leptin stimulates fatty acid oxidation, inhibits triacylglycerol 
synthesis, and increases the sensitivity of muscle and the liver to 
insulin. 

10. (a) A watt is equal to 1 joule (J) per second (0.239 calorie per
second). Hence, 70 W is equivalent to 0.07 kJ s⁻¹ (0.017 kcal s⁻¹).

(b) A watt is a current of 1 ampere (A) across a potential of 1 volt
(V). For simplicity, let us assume that all the electron flow is from
NADH to O2 (a potential drop of 1.14 V). Hence, the current is
61.4 A, which corresponds to 3.86 × 10²⁰ electrons per second
(1 A = 1 coulomb s⁻¹ = 6.28 × 10¹⁸ charges s⁻¹).

(c) About 2.5 molecules of ATP are formed per molecule of
NADH oxidized (two electrons). Hence, 1 molecule of ATP is
formed per 0.8 electron transferred. A flow of 3.86 × 10²⁰
electrons per second therefore leads to the generation of 4.83 × 10²⁰
molecules of ATP per second, or 0.80 mmol s⁻¹.

(d) The molecular weight of ATP is 507. The total body content 
of ATP of 50 g is equal to 0.099 mol. Hence, ATP turns over 
about once in 125 seconds when the body is at rest. 
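
The chain of conversions in parts (b) through (d) can be followed step by step in Python. This is a sketch under the answer's own assumptions: all electron flow runs from NADH to O2 across 1.14 V, 2.5 ATP form per NADH, and one coulomb is taken as 6.28 × 10¹⁸ charges as printed in the answer (the accepted value is closer to 6.24 × 10¹⁸). Variable names are illustrative. Because the script does not round intermediate values, the turnover time comes out near 123 s, consistent with the answer's "about 125 seconds".

```python
AVOGADRO = 6.022e23             # molecules per mole
CHARGES_PER_COULOMB = 6.28e18   # value used in the answer text

power_w = 70.0                  # resting power output, J/s
potential_v = 1.14              # NADH -> O2 potential drop, V

# (b) current and electron flow
current_a = power_w / potential_v
electrons_per_s = current_a * CHARGES_PER_COULOMB

# (c) 2.5 ATP per NADH (2 electrons), i.e. 1 ATP per 0.8 electron
atp_per_s = electrons_per_s / 0.8
atp_mmol_per_s = atp_per_s / AVOGADRO * 1e3

# (d) turnover time of a 50 g ATP pool (molecular weight 507)
atp_pool_mol = 50.0 / 507.0
turnover_s = atp_pool_mol / (atp_mmol_per_s / 1e3)

print(f"current: {current_a:.1f} A")
print(f"electron flow: {electrons_per_s:.2e} per second")
print(f"ATP formation: {atp_mmol_per_s:.2f} mmol per second")
print(f"ATP pool turnover: {turnover_s:.0f} s")
```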

11. (a) The stoichiometry of the complete oxidation of glucose is 


C6H12O6 + 6 O2 → 6 CO2 + 6 H2O
and that of tripalmitoylglycerol is
C51H98O6 + 72.5 O2 → 51 CO2 + 49 H2O


Hence, the RQ values are 1.0 and 0.703, respectively. 

(b) An RQ value reveals the relative use of carbohydrates and fats 
as fuels. The RQ of a marathon runner typically decreases from 
0.97 to 0.77 in the course of a race. The lowering of the RQ indi- 
cates the shift in fuel from carbohydrates to fat. 
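
The two RQ values follow directly from the stoichiometric coefficients of CO2 and O2 in the balanced oxidation equations. A minimal Python sketch (the function name is illustrative):

```python
def respiratory_quotient(co2_produced, o2_consumed):
    """RQ = moles of CO2 produced per mole of O2 consumed."""
    return co2_produced / o2_consumed

# Coefficients from the balanced equations for complete oxidation.
rq_glucose = respiratory_quotient(6, 6)
rq_tripalmitoylglycerol = respiratory_quotient(51, 72.5)

print(f"glucose RQ: {rq_glucose:.1f}")
print(f"tripalmitoylglycerol RQ: {rq_tripalmitoylglycerol:.3f}")
```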

12. One gram of glucose (molecular weight 180.2) is equal to 
5.55 mmol, and one gram of tripalmitoylglycerol (molecular weight 
807.3) is equal to 1.24 mmol. The reaction stoichiometries (see
Problem 11) indicate that 6 mol of H2O is produced per mole
of glucose oxidized, and 49 mol of H2O is produced per mole of
tripalmitoylglycerol oxidized. Hence, the H2O yields per gram
of fuel are 33.3 mmol (0.6 g) for glucose and 60.8 mmol (1.09 g) for
tripalmitoylglycerol. Thus, complete oxidation of this fat gives 
1.82 times as much water as does glucose. Another advantage of 
triacylglycerols is that they can be stored in essentially anhydrous 
form, whereas glucose is stored as glycogen, a highly hydrated 
polymer. A hump consisting mainly of glycogen would be an 
intolerable burden—far more than the straw that broke the 
camel’s back. 
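A brief sketch of the water-yield comparison follows. It assumes a molecular weight of 18.015 for water (not stated in the answer) and uses the molecular weights and stoichiometries given above; `water_yield_g_per_g` is a hypothetical helper name.

```python
WATER_MW = 18.015  # g per mol (assumed value)

def water_yield_g_per_g(fuel_mw, water_per_fuel):
    """Grams of metabolic water formed per gram of fuel completely oxidized."""
    return water_per_fuel / fuel_mw * WATER_MW

glucose_yield = water_yield_g_per_g(180.2, 6)
fat_yield = water_yield_g_per_g(807.3, 49)
print(f"glucose {glucose_yield:.2f} g, tripalmitoylglycerol {fat_yield:.2f} g, "
      f"ratio {fat_yield / glucose_yield:.2f}")
```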

13. The starved—fed cycle is the nightly hormonal cycle that 
humans experience during sleep and on eating. The cycle main- 
tains adequate amounts of blood glucose. The starved part— 
sleep—is characterized by increased glucagon secretion and 
decreased insulin secretion. After a meal, glucagon concentration 
falls and insulin concentration rises. 

14. Ethanol is oxidized to acetaldehyde by alcohol dehydrogenase,
and the acetaldehyde is subsequently oxidized to acetate by aldehyde
dehydrogenase. Ethanol is also metabolized to acetaldehyde by the
microsomal ethanol-oxidizing system (MEOS), with the subsequent
depletion of NADPH.

15. First, fatty liver develops owing to the increased amounts of
NADH that inhibit fatty acid oxidation and stimulate fatty acid
synthesis. Second, alcoholic hepatitis begins owing to oxidative
damage and damage due to excess acetaldehyde that results in
cell death. Finally, fibrous tissues form, creating scars that impair
blood flow and biochemical function. Ammonia cannot be converted
into urea, and its toxicity leads to coma and death.

16. A typical macadamia nut has a mass of about 2 g. Because
it consists mainly of fats (~37 kJ g⁻¹, ~9 kcal g⁻¹), a nut has a
value of about 75 kJ (18 kcal). The ingestion of 10 nuts results in
an intake of about 753 kJ (180 kcal). As stated in the answer to
Problem 10, a power consumption of 1 W corresponds to 1 J s⁻¹
(0.239 cal s⁻¹), and so 400-W running requires 0.4 kJ s⁻¹ (0.0956
kcal s⁻¹). Hence, a person would have to run 1882 s, or about 31
minutes, to spend the calories provided by 10 nuts.
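The running time follows from dividing the energy intake by the power output. A minimal sketch, using the 753 kJ intake and 400 W output from the answer; the function name is illustrative.

```python
def seconds_to_burn(energy_kj, power_w):
    """Time needed to expend energy_kj at a steady power output of power_w."""
    return energy_kj / (power_w / 1000.0)   # W = J/s, so W/1000 = kJ/s

run_s = seconds_to_burn(753.0, 400.0)
print(f"{run_s:.0f} s, or about {run_s / 60:.0f} minutes")
```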

17. A high blood-glucose level triggers the secretion of insulin, 
which stimulates the synthesis of glycogen and triacylglycerols. A 
high insulin level would impede the mobilization of fuel reserves 
during the marathon. 

18. A lack of adipose tissue leads to an accumulation of fats in the 
muscle, with the generation of insulin resistance. The experiment 
shows that adipokines secreted by the adipose tissue, here leptin, 
facilitate in some fashion the action of insulin in muscle. 

19. Such a mutation would increase the phosphorylation of the insulin
receptor and IRS in muscle and would improve insulin sensitivity.
Indeed, PTP1B is an attractive therapeutic target for type 2 diabetes.

20. Lipid mobilization can be so rapid that it exceeds the ability of
the liver to oxidize the lipids or convert them into ketone bodies.
The excess is reesterified and released into the blood as VLDLs.

21. A role of the liver is to provide glucose for other tissues. In the
liver, glycolysis is used not for energy production but for biosynthetic
purposes. Consequently, in the presence of glucagon, liver
glycolysis stops so that the glucose can be released into the blood.

22. The urea cycle and gluconeogenesis.

23. (a) Insulin inhibits lipid utilization. 

(b) Insulin stimulates protein synthesis, but there are no amino 
acids in the children’s diet. Moreover, insulin inhibits protein 
breakdown. Consequently, muscle proteins cannot be broken down 
and used for the synthesis of essential proteins. 

(c) Because proteins cannot be synthesized, blood osmolarity is too 
low. Consequently, fluid leaves the blood. An especially important 
protein for maintaining blood osmolarity is albumin. 

24. During strenuous exercise, muscle converts glucose into
pyruvate through glycolysis. Some of the pyruvate is processed
by cellular respiration. However, some of it is converted into
lactate and released into the blood. The liver takes up the lactate
and converts it into glucose through gluconeogenesis. Muscle
may process the carbon skeletons of branched-chain amino acids
aerobically. The nitrogens of these amino acids are transferred
to pyruvate to form alanine, which is released into the blood and
taken up by the liver. After the transamination of the amino group
to α-ketoglutarate, the resulting pyruvate is converted into glucose.
Finally, muscle glycogen may be mobilized, and the released
glucose can be used by muscle.

25. This conversion allows muscle to function anaerobically.
NAD⁺ is regenerated when pyruvate is reduced to lactate, and so
energy can continue to be extracted from glucose during strenuous 
exercise. The liver converts the lactate into glucose. 

26. Fatty acids and glucose, respectively. 

27. This practice is called carbo-loading. Depleting the glycogen 
stores will initially cause the muscles to synthesize a large amount of
glycogen when dietary carbohydrates are provided and will lead to 
the supercompensation of glycogen stores. 

28. The oxygen consumption at the end of exercise is used to replen- 
ish ATP and creatine phosphate and to oxidize any lactate produced. 


29. Oxygen is used in oxidative phosphorylation to resynthesize 
ATP and creatine phosphate. The liver converts lactate released 
by the muscle into glucose. Blood must be circulated to return the 
body temperature to normal, and so the heart cannot return to
its resting rate immediately. Hemoglobin must be reoxygenated
to replace the oxygen used in exercise. The muscles that power 
breathing must continue working at the same time as the exercised 
muscles are returning to resting states. In essence, all the biochem- 
ical systems activated in intense exercise need increased oxygen to 
return to the resting state. 

30. Ethanol may replace water that is hydrogen bonded to proteins 
and membrane surfaces. This alteration of the hydration state of the 
protein would alter its conformation and hence function. Ethanol 
may also alter phospholipid packing in membranes. The two effects 
suggest that integral membrane proteins would be most sensitive to 
ethanol, as indeed seems to be the case. 

31. Cells from the type I fiber would be rich in mitochondria, 
whereas those of the type II fiber would have few mitochondria. 
32. (a) The ATP expended during this race amounts to about 
8380 kg, or 18,400 pounds. (b) The cyclist would need about 
$1,260,000,000 to complete the race. 

33. 55 pounds = 25 kg = 25,000 g = total weight gain

40 years × 365 days year⁻¹ = 14,600 days
25,000 g/14,600 days = 1.7 g day⁻¹


which is equivalent to an extra pat of butter per day. Her BMI is 
26.5, and she would be considered overweight but not obese. 

34. Exercise greatly enhances the ATP needs of muscle cells. To more 
efficiently meet these needs, more mitochondria are synthesized. 

35. The inability of muscle mitochondria to process all of the fatty 
acids produced by overnutrition leads to excessive levels of diacylglyc- 
erol and ceramide in the muscle cytoplasm. These second-messenger 
molecules activate enzymes that impair insulin signaling. 

36. Both are due to a lack of thiamine (vitamin B1). Thiamine,
which is sometimes called aneurin, is required most notably for the 
proper functioning of pyruvate dehydrogenase. 

37. (a) Red blood cells always produce lactate, and fast-twitch 
muscle fibers (see Problem 31) also produce a large amount of lactate. 
(b) At that point, the athlete is beginning to move into anaerobic 
exercise, in which most energy is produced by anaerobic glycolysis. 
(c) The lactate threshold is essentially the point at which the ath- 
lete switches from aerobic exercise, which can be done for extended 
periods, to anaerobic exercise, essentially sprinting, which can be 
done for only short periods. The idea is to race at the extreme of his 
or her aerobic capacity until the finish line is in sight and then to 
switch to anaerobic. 

(d) Training increases the amount of blood vessels and the num- 
ber of muscle mitochondria. Together, they increase the ability to 
process glucose aerobically. Consequently, a greater effort can be 
expended before the switch to anaerobic energy production. 


Chapter 28 

1. DNA polymerase I uses deoxyribonucleoside triphosphates;
pyrophosphate is the leaving group. DNA ligase uses DNA-adenylate
(AMP joined to the 5′-phosphoryl group) as a reaction
partner; AMP is the leaving group. Topoisomerase I uses a
DNA-tyrosyl intermediate (5′-phosphoryl group linked to the
phenolic OH group); the tyrosine residue of the enzyme is the
leaving group.

2. Positive supercoiling resists the unwinding of DNA. The melting 
temperature of DNA increases in proceeding from negatively super- 
coiled to relaxed to positively supercoiled DNA. Positive supercoil- 
ing is probably an adaptation to high temperature. 


3. The nucleotides used for DNA synthesis have the triphosphate 
attached to the 5’-hydroxyl group with free 3’-hydroxyl groups. 
Such nucleotides can be utilized only for 5’-to-3’ DNA synthesis. 
4. DNA replication requires RNA primers. Without appropriate 
ribonucleotides, such primers cannot be synthesized. 

5. This close contact prevents the incorporation of ribonucleotides 
rather than 2'-deoxyribonucleotides. 

6. (a) 96.2 revolutions per second (1000 nucleotides per second
divided by 10.4 nucleotides per turn for B-DNA gives 96.2 rps).
(b) 0.34 μm s⁻¹ (1000 nucleotides per second corresponds to
3400 Å s⁻¹ because the axial distance between nucleotides in
B-DNA is 3.4 Å).
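Both figures come from the B-DNA helix parameters quoted in the answer. A small sketch, with an illustrative function name:

```python
def fork_rotation_and_speed(nt_per_s=1000.0, nt_per_turn=10.4, rise_angstrom=3.4):
    """Return (revolutions per second, linear speed in micrometers per second)."""
    revolutions_per_s = nt_per_s / nt_per_turn
    speed_um_per_s = nt_per_s * rise_angstrom * 1e-4   # 1 angstrom = 1e-4 micrometer
    return revolutions_per_s, speed_um_per_s

rps, speed = fork_rotation_and_speed()
print(f"{rps:.1f} revolutions per second, {speed:.2f} micrometers per second")
```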

7. Eventually, the DNA would become so tightly wound that movement
of the replication complex would be energetically impossible.

8. Linking number Lk = Tw + Wr = 48 + 3 = 51. If Tw = 50,
then Wr = 1.

9. A hallmark of most cancer cells is prolific cell division, which 
requires DNA replication. If the telomerase were not activated, 
the chromosomes would shorten until they became nonfunctional, 
leading to cell death. 

10. No. 

11. Treat the DNA briefly with endonuclease to occasionally nick 
each strand. Add the polymerase with the radioactive dNTPs. At 
the broken bond, or nick, the polymerase will degrade the existing 
strand with its 5′ → 3′ exonuclease activity and replace it with a
radioactive complementary copy by using its polymerase activity. 
This reaction scheme is referred to as nick translation because the 
nick is moved, or translated, along the DNA molecule without 
ever becoming sealed. 

12. If replication were unidirectional, tracks with a low grain 
density at one end and a high grain density at the other end 
would be seen. On the other hand, if replication were bidirec- 
tional, the middle of a track would have a low density, as shown 
in the diagram below. For E. coli, the grain tracks are denser 
on both ends than in the middle, indicating that replication is 
bidirectional. 


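[Diagram: in unidirectional synthesis, a single track extends from the origin in one direction; in bidirectional synthesis, tracks extend from the origin in both directions.]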


13. (a) Pro (CCC), Ser (UCC), Leu (CUC), and 
Phe (UUC). Alternatively, the last base of each of 
these codons could be U. 

(b) Nitrous acid. 

14. Potentially deleterious side reactions are prevented. The 
enzyme itself might be damaged by light if it could be activated 
by light in the absence of bound DNA harboring a pyrimidine 
dimer. 

15. The free DNA ends that appear in the absence of telomeres are 
repaired by DNA fusion. 

16. The free energy of ATP hydrolysis under standard conditions
is −30.5 kJ mol⁻¹ (−7.3 kcal mol⁻¹). In principle, it could be used
to break three base pairs. 

17. The oxidation of guanine could lead to DNA repair: DNA 
strand cleavage could allow looping out of the triplet repeat regions 
and triplet expansion.

18. The release of DNA topoisomerase II after the enzyme has
acted on its DNA substrate requires ATP hydrolysis. Negative
supercoiling requires only the binding of ATP, not its hydrolysis.

19. (a) Size; the top is relaxed and the bottom is supercoiled DNA.
(b) Topoisomers. (c) The DNA is becoming progressively more
unwound, or relaxed, and thus slower moving.

20. (a) It was used to determine the number of spontaneous 
revertants—that is, the background mutation rate. 

(b) To firmly establish that the system was working. A known 
mutagen’s failure to produce revertants would indicate that 
something was wrong with the experimental system. 

(c) The chemical itself has little mutagenic ability but is apparently 
activated into a mutagen by the liver homogenate. 

(d) Cytochrome P450 system. 


Chapter 29 


1. The sequence of the coding (+, sense) strand is 
5'-ATGGGGAACAGCAAGAGTGGGGCCCTGTCCAAGGAG-3' 

and the sequence of template (—, antisense) strand is 
3'-TACCCCTTGTCGTTCTCACCCCGGGACAGGTTCCTC-5' 
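The template strand can be derived mechanically from the coding strand by base complementation. A minimal sketch; the helper names are illustrative, and the mRNA line simply restates that the transcript has the coding-strand sequence with U in place of T.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def template_3_to_5(coding_5_to_3):
    """Complement each base; the result reads 3' to 5' under the coding strand."""
    return coding_5_to_3.translate(COMPLEMENT)

def mrna_5_to_3(coding_5_to_3):
    """The transcript matches the coding strand with U in place of T."""
    return coding_5_to_3.replace("T", "U")

coding = "ATGGGGAACAGCAAGAGTGGGGCCCTGTCCAAGGAG"
print("5'-" + coding + "-3'")
print("3'-" + template_3_to_5(coding) + "-5'")
print("5'-" + mrna_5_to_3(coding) + "-3'")
```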


2. An error will affect only one molecule of mRNA of many 
synthesized from a gene. In addition, the errors do not become a 
permanent part of the genomic information. 

3. At any given instant, only a fraction of the genome (total DNA) 
is being transcribed. Consequently, speed is not essential. 

4. The active sites are related by convergent evolution.

5. Heparin, a glycosaminoglycan, is highly anionic. Its negative 
charges, like the phosphodiester bridges of DNA templates, allow 
it to bind to lysine and arginine residues of RNA polymerase. 

6. This mutant σ will competitively inhibit the binding of
holoenzyme and prevent the specific initiation of RNA chains at
promoter sites.

7. The core enzyme without σ binds more tightly to the DNA
template than does the holoenzyme. The retention of σ after
chain initiation would make the mutant RNA polymerase less
processive. Hence, RNA synthesis would be much slower than
normal.

8. A 100-kd protein contains about 910 residues, which are encoded 
by 2730 nucleotides. At a maximal transcription rate of 50 nucleotides 
per second, the mRNA would be synthesized in 54.6 s. 
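The estimate can be sketched as follows, assuming an average residue mass of 110 daltons (the value implied by 910 residues in a 100-kd protein). The function name is illustrative. The answer rounds to 910 residues before multiplying, so the unrounded result differs slightly in the last digit.

```python
def transcription_time(protein_mass_d=100_000, residue_mass_d=110, nt_per_s=50):
    """Return (residues, coding nucleotides, seconds to transcribe the mRNA)."""
    residues = protein_mass_d / residue_mass_d   # assumed average residue mass
    nucleotides = 3 * residues                   # three nucleotides per codon
    return residues, nucleotides, nucleotides / nt_per_s

residues, nucleotides, seconds = transcription_time()
print(f"{residues:.0f} residues, {nucleotides:.0f} nucleotides, {seconds:.1f} s")
```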

9. The RNA polymerase slides along the DNA rapidly rather than 
simply diffusing through three-dimensional space. 

10. The start site is in red type: 


5’-GCCGTTGACACCGTTCGGCGATCGATCCGCTATAATGTGTGGATCCGCTT-3’ 


11. Initiation at strong promoters takes place every 2 s. In this 
interval, 100 nucleotides are transcribed. Hence, centers of 
transcription bubbles are 34 nm (340 Å) apart.

12. (a) The lowest band on the gel will be that of strand 3 alone (i),
whereas the highest will be that of strands 1, 2, and 3 and core
polymerase (v). Band ii will be at the same position as band i because
the RNA is not complementary to the nontemplate strand, whereas
band iii will be higher because a complex is formed between
RNA and the template strand. Band iv will be higher than the others
because strand 1 is complexed to 2, and strand 2 is complexed
to 3. Band v is the highest because core polymerase associates with
the three strands.

(b) None, because rifampicin acts before the formation of the open
complex.

(c) RNA polymerase is processive. When the template is bound,
heparin cannot enter the DNA-binding site.

(d) When GTP is absent, synthesis stops when the first cytosine 
residue downstream of the bubble is encountered in the template 
strand. In contrast, with all four nucleoside triphosphates present, 
synthesis will continue to the end of the template. 

13. RNA polymerase must backtrack before cleavage, leading to 
dinucleotide products. 

14. The base-pairing energy of the di- and trinucleotide DNA–RNA
hybrids formed at the very beginning of transcription is not
sufficient to prevent strand separation and loss of product. 

15. (a) Because cordycepin lacks a 3′-OH group, it cannot participate
in 3′ → 5′ bond formation. (b) Because the poly(A)
tail is a long stretch of adenosine nucleotides, the likelihood 
that a molecule of cordycepin would become incorporated is 
higher than with most RNA. (c) Yes, it must be converted into 
cordycepin 5'-triphosphate. 

16. There are 2⁸ = 256 possible products.

17. The relation between the —10 and —35 sequences could be 
affected by torsional strain. The fact that topoisomerase II introduces
negative supercoils in DNA prevents this enzyme from
overstimulating the expression of its own gene. 

18. Ser-Ile-Phe-His-Pro-Stop 

19. A mutation that disrupted the normal AAUAAA recognition
sequence for the endonuclease could account for this finding. In
fact, a change from U to C in this sequence caused this defect in
a thalassemic patient. Cleavage was at the AAUAAA 900 nucleotides
downstream of this mutant AACAAA site.

20. One possibility is that the 3’ end of the poly(U) donor strand 
cleaves the phosphodiester bond on the 5’ side of the insertion site. 
The newly formed 3’ terminus of the acceptor strand then cleaves 
the poly(U) strand on the 5’ side of the nucleotide that initiated 
the attack. In other words, a uridine residue could be added by two 
transesterification reactions. This postulated mechanism is similar 
to the one in RNA splicing. 

21. Alternative splicing and RNA editing. Covalent modification of 
the proteins subsequent to synthesis further enhances the complexity. 
22. Attach an oligo(dT) or oligo(U) sequence to an inert support 
to create an affinity column. When RNA is passed through the 
column, only poly(A)-containing RNA will be retained. 

23. (a) Different amounts of RNA are present for the various genes. 
(b) Although all of the tissues have the same genes, the genes are 
expressed to different extents in different tissues. 

(c) These genes are called housekeeping genes—genes that most 
tissues express. They might include genes for glycolysis or citric 
acid cycle enzymes. 

(d) The point of the experiment is to determine which genes are 
initiated in vivo. The initiation inhibitor is added to prevent initia- 
tion at start sites that may have been activated during the isolation 
of the nuclei. 

24. DNA is the single strand that forms the trunk of the tree. 
Strands of increasing length are RNA molecules; the beginning of 
transcription is where growing chains are the smallest; the end of 
transcription is where chain growth stops. Direction is left to right. 
Many enzymes are actively transcribing each gene. 


Chapter 30 

1. The Oxford English Dictionary defines translation as the action 
or process of turning from one language into another. Protein syn- 
thesis converts nucleic acid sequence information into amino acid 
sequence information. 

2. An error frequency of 1 incorrect amino acid every 10⁴ incorporations
allows for the rapid and accurate synthesis of proteins as
large as 1000 amino acids. Higher error rates would result in too
many defective proteins. Lower error rates would likely slow the
rate of protein synthesis without a significant gain in accuracy.

3. (i) Each is a single chain. (ii) They contain unusual bases.
(iii) Approximately half of the bases are base-paired to form
double helices. (iv) The 5′ end is phosphorylated and is usually
pG. (v) The amino acid is attached to the hydroxyl group of the A
residue of the CCA sequence at the 3′ end of the tRNA. (vi) The
anticodon is located in a loop near the center of the tRNA sequence.
(vii) The molecules are L-shaped.

4. First is the formation of the aminoacyl adenylate, which then 
reacts with the tRNA to form the aminoacyl-tRNA. Both steps are 
catalyzed by aminoacyl-tRNA synthetase. 

5. Unique features are required so that the aminoacyl-tRNA syn- 
thetases can distinguish among the tRNAs and attach the correct 
amino acid to the proper tRNA. Common features are required 
because all tRNAs must interact with the same protein-synthesizing 
machinery. 

6. An activated amino acid is one linked to the appropriate tRNA.

7. (a) No; (b) no; (c) yes.

8. The ATP is cleaved to AMP and PPi. Consequently, a second
ATP is required to convert AMP into ADP, the substrate for
oxidative phosphorylation.

9. Amino acids larger than the correct amino acid cannot fit into
the active site of the aminoacyl-tRNA synthetase. Smaller but
incorrect amino acids that become attached to the tRNA fit into
the editing site and are cleaved from the tRNA.

10. Recognition sites on both faces of the tRNAs may be required 
to uniquely identify the 20 different tRNAs. 

11. The first two bases in a codon form Watson–Crick base
pairs that are checked for fidelity by bases of the 16S rRNA. The
third base is not inspected for accuracy, and so some variation is
tolerated.

12. Four bands: light, heavy, a hybrid of light 30S and heavy 50S,
and a hybrid of heavy 30S and light 50S.

13. Two hundred molecules of ATP are converted into 200 AMP
+ 400 Pi to activate the 200 amino acids, which is equivalent to
400 molecules of ATP. One molecule of GTP is required for 
initiation, and 398 molecules of GTP are needed to form 199 
peptide bonds. 
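The bookkeeping generalizes to a protein of any length. A sketch under the same accounting as the answer above: two ATP equivalents per amino acid activated, one GTP for initiation, and two GTP per peptide bond. The function name is illustrative.

```python
def translation_energy_cost(n_residues):
    """Return (ATP equivalents for activation, GTP for initiation, GTP for elongation)."""
    atp_equivalents = 2 * n_residues        # ATP -> AMP + 2 Pi counts as two ATP
    gtp_initiation = 1
    gtp_elongation = 2 * (n_residues - 1)   # two GTP per peptide bond formed
    return atp_equivalents, gtp_initiation, gtp_elongation

atp, gtp_init, gtp_elong = translation_energy_cost(200)
print(atp, gtp_init, gtp_elong)
```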

14. (a, d, and e) Type 2; (b, c, and f) type 1. 

15. The reading frame is a set of contiguous, nonoverlapping 
three-nucleotide codons that begins with a start codon and ends 
with a stop codon. 

16. A mutation caused by the insertion of an extra base can be
suppressed by a tRNA that contains a fourth base in its anticodon.
For example, UUUC rather than UUU is read as the
codon for phenylalanine by a tRNA that contains 3′-AAAG-5′
as its anticodon.

17. One approach is to synthesize a tRNA that is acylated with a 
reactive amino acid analog. For example, bromoacetyl-phenyla- 
lanyl-tRNA is an affinity-labeling reagent for the P site of E. coli 
ribosomes. 

18. The sequence GAGGU is complementary to a sequence of 
five bases at the 3’ end of 16S rRNA and is located several bases 
upstream of an AUG start codon. Hence, this region is a start 
signal for protein synthesis. The replacement of G by A would be 
expected to weaken the interaction of this mRNA with the 16S 
rRNA and thereby diminish its effectiveness as an initiation signal. 
In fact, this mutation results in a 10-fold decrease in the rate of 
synthesis of the protein specified by this mRNA. 

19. The peptide would be Phe-Cys-His-Val-Ala-Ala. The
codons UGC and UGU encode cysteine but, because the
cysteine was modified to alanine, alanine is incorporated in
place of cysteine.

20. Proteins are synthesized from the amino to the carboxyl end 
on ribosomes, whereas they are synthesized in the reverse direction 
in the solid-phase method. The activated intermediate in ribosom- 
al synthesis is an aminoacyl-tRNA; in the solid-phase method, it is 
the adduct of the amino acid and dicyclohexylcarbodiimide. 

21. The error rates of DNA, RNA, and protein synthesis are of
the order of 10⁻¹⁰, 10⁻⁵, and 10⁻⁴, respectively, per nucleotide (or
amino acid) incorporated. The fidelity of all three processes depends
on the precision of base-pairing to the DNA or mRNA template.
Few errors are corrected in RNA synthesis. In contrast, the fidelity
of DNA synthesis is markedly increased by the 3′ → 5′ proofreading
nuclease activity and by postreplicative repair. In protein synthesis, 
the mischarging of some tRNAs is corrected by the hydrolytic action 
of aminoacyl-tRNA synthetase. Proofreading also takes place when 
aminoacyl-tRNA occupies the A site on the ribosome; the GTPase 
activity of EF-Tu sets the pace of this final stage of editing. 

22. GTP is not hydrolyzed until aminoacyl-tRNA is delivered to the 
A site of the ribosome. An earlier hydrolysis of GTP would be waste- 
ful because EF-Tu-GDP has little affinity for aminoacyl-tRNA. 

23. The translation of an mRNA molecule can be blocked by anti- 
sense RNA, an RNA molecule with the complementary sequence. 
The antisense-sense RNA duplex cannot serve as a template for 
translation; single-stranded mRNA is required. Furthermore, 

the antisense—sense duplex is degraded by nucleases. Antisense 
RNA added to the external medium is spontaneously taken up by 
many cells. A precise quantity can be delivered by microinjection. 
Alternatively, a plasmid encoding the antisense RNA can be intro- 
duced into target cells. 

24. (a) As. (b) As > Aq > Az > Ag. (c) Synthesis is from the 
amino terminus to the carboxyl terminus.

25. These enzymes convert nucleic acid information into protein 
information by interpreting the tRNA and linking it to the proper 
amino acid. 

26. The rate would fall because the elongation step requires that the 
GTP be hydrolyzed before any further elongation can take place. 

27. Protein factors modulate the initiation of protein synthesis.
The role of IF1 and IF3 is to prevent premature binding of the
30S and 50S ribosomal subunits, whereas IF2 delivers Met-tRNAf
to the ribosome. Protein factors are also required for elongation
(EF-G and EF-Tu), for termination (release factors, RFs), and for
ribosome dissociation (ribosome release factors, RRFs).

28. The signal sequence, signal-recognition particle (SRP), the 
SRP receptor, and the translocon. 

29. The formation of peptide bonds, which in turn are powered by 
the hydrolysis of the aminoacyl-tRNAs. 

30. The Shine-Dalgarno sequence of the mRNA base-pairs with 
a part of the 16S rRNA of the 30S subunit, which positions the 
subunit so that the initiator AUG is recognized. 


31.
                           Prokaryote                  Eukaryote
Ribosome size              70S                         80S
mRNA                       Polycistronic               Not polycistronic
Initiation                 Shine-Dalgarno is           First AUG is used
                           required
Protein factors            Required                    Many more required
Relation to transcription  Translation can start       Transcription and
                           before transcription        translation are
                           is completed                spatially separated
First amino acid           fMet                        Met

32. The SRP binds to the signal sequence and inhibits further
translation. The SRP ushers the inhibited ribosome to the ER,
where it interacts with the SRP receptor (SR). The SRP-SR complex
binds the translocon and simultaneously hydrolyzes GTP. On
GTP hydrolysis, SRP and SR dissociate from each other and from
the ribosome. Protein synthesis resumes and the nascent protein is
channeled through the translocon.

33. The alternative would be to have a single ribosome translating 
a single mRNA molecule. The use of polysomes allows more pro- 
tein synthesis per mRNA molecule in a given period of time and 
thus the production of more protein. 

34. (a) 1, 2, 3, 5, 6, 10; (b) 1, 2, 7, 8; (c) 1, 4, 8, 9.

35. Transfer RNAs have roles in several recognition processes. A 
tRNA must be recognized by the appropriate aminoacyl-tRNA 
synthetase, and the tRNA must interact with the ribosome and, in 
particular, with the peptidyl transferase. 

36. The nucleophile is the amino group of the aminoacyl-tRNA. This 
amino group attacks the carbonyl group of the ester of peptidyl-tRNA 
to form a tetrahedral intermediate, which eliminates the tRNA alcohol 
to form a new peptide bond. 

37. The aminoacyl-tRNA can be initially synthesized. However, 
the side-chain amino group attacks the ester linkage to form a six- 
membered amide, releasing the tRNA. 

38. EF-Ts catalyzes the exchange of GTP for GDP bound to
EF-Tu. In G-protein cascades, an activated 7TM receptor catalyzes
GTP–GDP exchange in a G protein.

39. The α subunits of G proteins are inhibited by a similar mechanism
in cholera and whooping cough (Section 14.5).

40. Glu-tRNAᴳˡⁿ is formed by misacylation. The activated glutamate
is subsequently amidated to form Gln-tRNAᴳˡⁿ. Ways in
which glutamine is formed from glutamate were discussed in Section
24.2. In regard to H. pylori, a specific enzyme, Glu-tRNAᴳˡⁿ
amidotransferase, catalyzes the following reaction:

Gln + Glu-tRNAᴳˡⁿ + ATP → Gln-tRNAᴳˡⁿ + Glu + ADP + Pi

Glu-tRNAᴳˡᵘ is not a substrate for the enzyme; so the transferase
must also recognize aspects of the structure of tRNAᴳˡⁿ.

41. The primary structure determines the three-dimensional 
structure of the protein. Thus, the final phase of information trans- 
fer from DNA to RNA to protein synthesis is the folding of the 
protein into its functional state. 

42. (a) eIF-4H has two effects: (1) the extent of unwinding is
increased and (2) the rate of unwinding is increased, as indicated
by the increased rise in activity at early reaction times.

(b) To firmly establish that the effect of eIF-4H was not due to any
inherent helicase activity.

(c) Half-maximal activity was achieved at 0.11 μM of eIF-4H.
Therefore, maximal stimulation would be achieved at a ratio of 1:1.

(d) eIF-4H enhances the rate of unwinding of all helices, but the
effect is greater as the helices increase in stability.

(e) The results in graph C suggest that eIF-4H increases the
processivity.

43. (a) The three peaks represent, from left to right, the 40S ribosomal
subunit, the 60S ribosomal subunit, and the 80S ribosome.
(b) Not only are ribosomal subunits and the 80S ribosome present,
but polysomes of various lengths also are apparent. The
individual peaks in the polysome region represent polysomes of
discrete length.

(c) The treatment significantly reduced the number of polysomes
while increasing the number of free ribosomal subunits. This
outcome could be due to inhibited protein-synthesis initiation or
inhibited transcription.


Chapter 31

1. (a) Cells will express β-galactosidase, lac permease, and thiogalactoside
transacetylase even in the absence of lactose. (b) Cells will express
β-galactosidase, lac permease, and thiogalactoside transacetylase even
in the absence of lactose. (c) The levels of catabolic enzymes such as
β-galactosidase and arabinose isomerase will remain low even at low
levels of glucose.

2. The concentration is 1/(6 × 10²³) moles per 10⁻¹⁵ liter = 1.7 ×
10⁻⁹ M. Because Kd = 10⁻¹³ M, the single molecule should be
bound to its specific binding site.

3. The number of possible 8-bp sites is 4⁸ = 65,536. In a genome
of 4.6 × 10⁶ base pairs, the average site should appear
(4.6 × 10⁶)/65,536 = 70 times. Each 10-bp site should appear 4
times. Each 12-bp site should appear 0.27 times (many 12-bp
sites will not appear at all).
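The expected counts follow from dividing the genome size by the number of possible sites of a given length. The sketch assumes the four bases occur randomly and with equal frequency, which is the simplification implicit in the answer; the function name is illustrative.

```python
def expected_occurrences(site_length_bp, genome_bp=4.6e6):
    """Expected number of times one specific site appears in a random genome."""
    possible_sites = 4 ** site_length_bp
    return genome_bp / possible_sites

for n in (8, 10, 12):
    print(f"{n}-bp site: {expected_occurrences(n):.2f} expected occurrences")
```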
4. The lac repressor does not bind DNA when the repressor is bound
to a small molecule (the inducer), whereas the pur repressor binds
DNA only when the repressor is bound to a small molecule (the
corepressor). The E. coli genome contains only a single lac repressor-
binding region, whereas it has many sites for the pur repressor.

5. Anti-inducers bind to the conformation of repressors, such as
the lac repressor, that are capable of binding DNA. They occupy
a site that overlaps that for the inducer and, therefore, compete for
binding to the repressor.

6. The inverted repeat may be a binding site for a dimeric DNA-
binding protein or it may correspond to a stem-loop structure in
the encoded RNA.

7. Bacteriophage λ would be more likely to enter the lytic phase
because the cooperative binding of the λ repressor to OR2 and
OR1, which supports the lysogenic pathway, would be disrupted.

8. λ repressor gene: −10 region GATTTA, −35 region TAGATA
Cro gene: −10 region TAATGG, −35 region TTGACT
There are four differences in the −10 region and three differences
in the −35 region.
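The difference counts can be confirmed by comparing the two promoters position by position. A minimal sketch with an illustrative helper name:

```python
def mismatches(seq_a, seq_b):
    """Count the positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))

minus_10 = mismatches("GATTTA", "TAATGG")   # repressor gene vs. Cro gene
minus_35 = mismatches("TAGATA", "TTGACT")
print(minus_10, minus_35)
```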
9. Increased Cro concentration reduces the expression of the A repres- 
sor gene. Increased \ repressor concentration reduces the expression 
of the Cro gene. At low J repressor concentration, increased J repres- 
sor concentration increases the expression of the \ repressor gene. At 
higher \ repressor concentrations, increased \ repressor concentration 
decreases the expression of the \ repressor gene. 
10. Normally, bacterial mRNAs have a leader sequence in which a 
Shine—Delgarno sequence precedes the AUG start codon. The absence 
of a leader would be expected to lead to inefficient translation. 
11. Add each compound to a culture of V. fischeri at low density 
and look for the development of luminescence. 
12. ACC, 7; ACA, 1, ACU, 0; ACG, 0. 
13. The reaction takes place with overall retention of configura- 
tion. Each step likely takes place with inversion of configuration, 
which suggests that the reaction consists of two (or some other 
even number of) steps. A possible mechanism is nucleophilic 
attack by the carboxylate group of Glu 537 on the C-1 carbon 
atom of the galactose moiety within lactose, releasing glucose and 
forming an intermediate with the galactose linked to the enzyme 
through an ester linkage. Water then attacks this carbon atom, 
displacing the glutamate carboxylate and releasing galactose. 
14. The binding appears to be half complete at a concentration of 
repressor near 3.7 nM. Thus, Kd is approximately 3.7 nM and 
ΔG° = −48 kJ/mol (−11 kcal/mol) at 298 K. 
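
The free-energy value in answer 14 follows from the relation ΔG° = RT ln Kd. The sketch below reproduces the arithmetic; the function name and the 4.184 kJ/kcal conversion factor are choices made for this illustration, not part of the original answer.

```python
import math

R_KJ = 8.314e-3  # gas constant in kJ mol^-1 K^-1


def binding_free_energy(kd_molar: float, temperature_k: float = 298.0) -> float:
    """Standard free energy of binding (kJ/mol) from a dissociation constant."""
    return R_KJ * temperature_k * math.log(kd_molar)


dg_kj = binding_free_energy(3.7e-9)  # Kd of about 3.7 nM, as in answer 14
dg_kcal = dg_kj / 4.184              # convert kJ/mol to kcal/mol
print(f"{dg_kj:.1f} kJ/mol, {dg_kcal:.1f} kcal/mol")
```
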


Chapter 32 

1. The distribution of charged amino acids is H2A (13 K, 13 R, 
2 D, 7 E, charge = +17), H2B (20 K, 8 R, 3 D, 7 E, charge = 
+18), H3 (13 K, 18 R, 4 D, 7 E, charge = +20), H4 (11 K, 14 R, 
3 D, 4 E, charge = +18). The total charge of the histone octamer 
is estimated to be 2 × (17 + 18 + 20 + 18) = +146. The total 
charge on 150 base pairs of DNA is −300. Thus, the histone 
octamer neutralizes approximately one-half of the charge. 
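
The charge bookkeeping in answer 1 can be tallied as follows. This is a sketch that assumes, as the answer does, +1 for each Lys and Arg, −1 for each Asp and Glu, and two negative charges per base pair of DNA; the variable names are illustrative.

```python
# (Lys, Arg, Asp, Glu) counts per histone, as listed in answer 1.
HISTONES = {
    "H2A": (13, 13, 2, 7),
    "H2B": (20, 8, 3, 7),
    "H3": (13, 18, 4, 7),
    "H4": (11, 14, 3, 4),
}


def net_charge(lys: int, arg: int, asp: int, glu: int) -> int:
    """Net charge counting +1 per Lys/Arg and -1 per Asp/Glu."""
    return (lys + arg) - (asp + glu)


charges = {name: net_charge(*counts) for name, counts in HISTONES.items()}
octamer_charge = 2 * sum(charges.values())  # two copies of each histone
dna_charge = -2 * 150                       # two phosphates per base pair
fraction_neutralized = octamer_charge / -dna_charge
print(charges, octamer_charge, round(fraction_neutralized, 2))
```
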

2. The presence of a particular DNA fragment could be detected 
by hybridization, by PCR, or by direct sequencing. 

3. The total length of the DNA is estimated to be 145 bp × 
3.4 Å/bp = 493 Å, which represents 1.75 turns or 1.75 × 2πr = 11.0r. 
Thus, the radius is estimated to be r = 493 Å/11.0 = 44.8 Å. 
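
The geometry in answer 3 amounts to dividing the DNA contour length by the circumference factor for 1.75 turns. A minimal sketch of that arithmetic, with illustrative variable names:

```python
import math

length_angstrom = 145 * 3.4  # 145 bp at 3.4 angstroms per base pair
turns = 1.75                 # superhelical turns around the histone core

# length = turns * 2 * pi * r, so r = length / (turns * 2 * pi)
radius_angstrom = length_angstrom / (turns * 2 * math.pi)
print(f"{length_angstrom:.0f} A total, radius about {radius_angstrom:.1f} A")
```
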

4. 5-Azacytidine cannot be methylated. Some genes, normally 
repressed by methylation, will be active. 

5. Proteins containing these domains will be targeted to methylated 
DNA in repressed promoter regions. They would likely 
bind in the major groove because that is where the methyl group 
is located. 

6. Gene expression is not expected to respond to the presence of 
estrogen. However, genes for which expression normally responds 
to estrogen will respond to the presence of progesterone. 

7. The acetylation of lysine will reduce the charge from +1 to 0. 
The methylation of lysine will not reduce the charge. 

8. On the basis of the pattern of cysteine and histidine residues, 
this region appears to contain three zinc-finger domains. 

9. 10/4000 = 0.25%. 0.25% of 12 Mb = 30 kilobase pairs. 

10. The addition of an IRE to the 5’ end of the mRNA is expected 
to block translation in the absence of iron. The addition of an IRE 
to the 3’ end of the mRNA is not expected to block translation, but 
it might affect mRNA stability. 

11. The sequences of all of the mRNAs would be searched for 
sequences that are fully or nearly complementary to the sequence 
of the miRNA. These sequences would be candidates for regulation 
by this miRNA. 
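
The search described in answer 11 can be sketched as a sliding-window comparison against the reverse complement of the miRNA. Everything here is an assumption made for illustration: the two sequences are invented, the mismatch threshold is arbitrary, and real target-prediction methods apply more elaborate criteria than this naive scan.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def find_candidate_sites(mrna: str, mirna: str, max_mismatches: int = 2):
    """List (start, mismatches) for mRNA windows nearly complementary to the miRNA."""
    target = reverse_complement(mirna)
    hits = []
    for start in range(len(mrna) - len(target) + 1):
        window = mrna[start:start + len(target)]
        mismatches = sum(a != b for a, b in zip(window, target))
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


# Hypothetical sequences, for illustration only.
mirna = "UGAGGUAGUAGG"
mrna = "AAACCUACUACCUCAGGG"
hits = find_candidate_sites(mrna, mirna)
print(hits)
```
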

12. The amino group of the lysine residue, formed from the proto- 
nated form by a base, attacks the carbonyl group of acetyl CoA to 
generate a tetrahedral intermediate. This intermediate collapses to 
form the amide bond and release CoA. 

13. In mouse DNA, most of the HpaII sites are methylated and 
therefore not cut by the enzyme, resulting in large fragments. Some 
small fragments are produced from CpG islands that are unmethyl- 
ated. For Drosophila and E. coli DNA, there is no methylation and 
all sites are cut. 


Chapter 33 

1. The transgenic nematode would avoid the compound. The 
identity of the ligand is determined by the receptor, whereas the 
behavioral response is dictated by the neuron in which the receptor 
is expressed. 

2. Only a mixture of compounds C5-COOH and HOOC-C7-COOH 
is predicted to yield this pattern. 

3. Bitter and sweet sensations are mediated by G proteins coupled 
to 7TM receptors, leading to millisecond time resolution. Salty and 
sour sensations are mediated directly by ion channels, which may 
lead to faster time resolution. 

4. Sound travels 0.15 m in 428 μs. The human hearing system 
is capable of sensing time differences of close to a microsecond, 
and so the difference in arrival times at the two ears is substantial. 
A system based on G proteins is unlikely to be able to reliably 
distinguish between signals arriving at the two ears, because G proteins 
typically respond in milliseconds. 
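
The travel time in answer 4 is distance divided by the speed of sound. The sketch below assumes a speed of about 350 m/s, which is the value that reproduces the quoted figure; that speed and the 1 ms G-protein response time are assumptions for illustration.

```python
SPEED_OF_SOUND_M_PER_S = 350.0  # assumed value; reproduces the quoted delay
EAR_SEPARATION_M = 0.15         # distance used in answer 4

delay_us = EAR_SEPARATION_M / SPEED_OF_SOUND_M_PER_S * 1e6  # microseconds
g_protein_response_us = 1_000.0  # "milliseconds" taken as about 1 ms

print(f"interaural delay is about {delay_us:.1f} microseconds")
print(f"G-protein response is about {g_protein_response_us / delay_us:.1f} times slower")
```
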

5. If a plant tastes bitter, animals will avoid eating it even if it is 
nontoxic, which may provide a selective advantage to the plant. 

6. Using mice in which either the gene for T1R1 or the gene for 
T1R3 has been disrupted, test the taste responses of these mice to 
glutamate, aspartate, and a wide variety of other amino acids. 


7. These women have four functional color receptors: blue, red, 
green, and a red-green hybrid. The additional color receptor allows 
some colors that appear identical to most people to be distinguished. 
8. 380 (one for each receptor); there are (380 × 379)/2! = 72,010 
combinations of two receptors; (380 × 379 × 378)/3! = 9,073,260 
combinations of three receptors. 
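
The counts in answer 8 are binomial coefficients, which Python's standard library computes directly. A brief check, with illustrative variable names:

```python
import math

n_receptors = 380

singles = math.comb(n_receptors, 1)  # one receptor at a time
pairs = math.comb(n_receptors, 2)    # (380 * 379) / 2!
triples = math.comb(n_receptors, 3)  # (380 * 379 * 378) / 3!
print(singles, pairs, triples)
```
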

9. The absorption of light converts 11-cis-retinal into all-trans-retinal. 
10. These compounds are enantiomers and must bind to protein 
receptors to elicit a smell. Even these subtle structural differences 
can affect relative receptor binding affinities and, hence, the elic- 
ited odor. 

11. Vision: cGMP-gated channel; taste: amiloride-sensitive 
sodium channel; hearing: tip-link channel. 

12. For all senses, ATP hydrolysis is required to generate and 
maintain ion gradients and membrane potential. Olfaction: ATP 
is required for the synthesis of cAMP. Gustation: ATP is required 
for the synthesis of cyclic nucleotides, and GTP is required for 
the action of gustducin in the detection of bitter and sweet tastes. 
Vision: GTP is required for the synthesis of cGMP and for the 
action of transducin. Hearing and touch: ATP hydrolysis is 
required to generate and maintain ion gradients and membrane 
potential and may be required for other roles as well. 


Chapter 34 

1. The innate immune system responds rapidly to common fea- 
tures present in many pathogens. The genes for the innate immune 
system’s key molecules are expressed without substantial modifica- 
tion. In contrast, the adaptive immune system responds to specific 
features present only in a given pathogen. Its genes undergo 
significant rearrangement and mutation to enable specific recognition 
of a vast number of potential binding surfaces. 

2. VJ and V(D)J recombination; variability in segment joining by the 
action of terminal deoxyribonucleotidyl transferase; somatic mutation. 
3. Affinity refers to the strength of a single interaction; avidity 
refers to the cumulative strength of multiple independent binding 
interactions. Avidity may play a significant role in the interaction 
between IgM and antigen because this immunoglobulin class 
features 10 binding sites. 

4. The intracellular signaling domain common to each of the 
TLRs is responsible for docking other proteins and reporting that 
a targeted pathogen-associated molecular pattern (PAMP), such 
as LPS, has been detected. If a mutation within this domain 
interfered with the intracellular docking and signal transduction, then 
TLR-4 would not respond to LPS. 

5. Viruses that contain dsRNA genomes would be expected to 
stimulate a TLR-3-mediated immune response. 

6. (a) ΔG°′ = −37 kJ mol⁻¹ (−8.9 kcal mol⁻¹) 

(b) Ka = 3.3 × 10⁶ M⁻¹ 

(c) kon = 4 × 10⁸ M⁻¹ s⁻¹. This value is close to the diffusion- 
controlled limit for the combination of a small molecule with a 
protein (see p. 245). Hence, the extent of structural change is likely 
to be small; extensive conformational transitions take time. 

7. The fluorescence enhancement and the shift to blue indicate 
that water is largely excluded from the combining site when the 
hapten is bound. Hydrophobic interactions contribute significantly 
to the formation of most antigen–antibody complexes. 

8. (a) An antibody combining site is formed by CDRs from both 
the H and the L chains. The VH and VL domains are essential. A 
small proportion of Fab fragments can be further digested to produce 
Fv, a fragment that contains just these two domains. CH1 and CL 
contribute to the stability of Fab but not to antigen binding. 

(b) A synthetic Fv analog 248 residues long was prepared by 
expressing a synthetic gene consisting of a VH gene joined to a VL 
gene through a linker. See J. S. Huston et al., Proc. Natl. Acad. Sci. 
U.S.A. 85:5879-5883, 1988. 

9. (a) Multivalent antigens lead to the dimerization or oligomer- 
ization of transmembrane immunoglobulins, an essential step in 
their activation. This mode of activation is reminiscent of that of 
receptor tyrosine kinases (Section 14.2). 

(b) An antibody specific for a transmembrane immunoglobulin 
will activate a B cell by cross-linking these receptors. This experi- 
ment can be carried out by using, for example, a goat antibody to 
cross-link receptors on a mouse B cell. 

10. B cells do not express T-cell receptors. The hybridization 
of T-cell cDNAs with B-cell mRNAs removes cDNAs that are 
expressed in both cells. Hence, the mixture of cDNAs subsequent 
to this hybridization is enriched in those encoding T-cell receptors. 
This procedure, called subtractive hybridization, is generally useful 
in isolating low-abundance cDNAs. Hybridization should be carried 
out by using mRNAs from a closely related cell that does not 
express the gene of interest. See S. M. Hedrick, D. I. Cohen, 
E. A. Nielsen, and M. M. Davis, Nature 308:149-153, 1984, for an 
interesting account of how this method was used to obtain genes 
for T-cell receptors. 

11. TLR-4 is the receptor for LPS, a toxin found specifically in 
the walls of Gram-negative bacteria. Mutations that inhibit the 
function of TLR-4 impair an affected person’s defenses against this 
class of bacteria. 

12. If the HLA alleles are not matched, then the recipient’s T-cell 
receptors will identify the MHC proteins of the transplanted tissue 
as nonself and transplant rejection is likely. 

13. Purify an antibody with a specificity to one antigen. Unfold the 
antibody and allow it to re-fold either in the presence of the antigen 
or in the absence of the antigen. Test the re-folded antibodies for 
antigen-binding ability. 

14. In some cases, V-D-J rearrangement will result in combining 
V, D, and J segments out of frame. mRNA molecules produced 
from such rearranged genes will produce truncated molecules if 
translated. This possibility is excluded by degrading the mRNA. 
15. The mutant bacteria may still stimulate an immune response 
without causing disease. Hence, they may be valuable starting 
points for the design of a live attenuated vaccine for the original 
pathogenic strain. 

16. The peptide is LLQATYSAV (L in second position, V in last). 
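
The anchor-residue criterion stated in answer 16 (L in the second position, V in the last position of a nine-residue peptide) can be applied by scanning a sequence window by window. In this sketch the protein sequence is invented for illustration and the function name is hypothetical; only the peptide and the anchor pattern come from the answer.

```python
def find_anchor_peptides(protein: str, length: int = 9,
                         second: str = "L", last: str = "V") -> list[str]:
    """Return every window of the given length with the required anchor residues."""
    matches = []
    for start in range(len(protein) - length + 1):
        peptide = protein[start:start + length]
        if peptide[1] == second and peptide[-1] == last:
            matches.append(peptide)
    return matches


# Hypothetical protein fragment containing the peptide from answer 16.
protein = "MKRLLQATYSAVGG"
peptides = find_anchor_peptides(protein)
print(peptides)
```
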
17. Catalysis is likely to require a base for removing a proton from 
a water molecule. A histidine, glutamate, or aspartate residue is 
most likely. In addition, a potential hydrogen-bond donor may be 
present and will interact with the negatively charged oxygen atom 
that forms in the transition state. 

18. A phosphotyrosine residue in the carboxyl terminus of Src and 
related protein tyrosine kinases binds to its own SH2 domain to 
generate the inhibited form of Src (Section 14.5). Removal of the 
phosphoryl group from this residue will activate the kinase. 
19. (a) Kd = 10⁻⁷ M; (b) Kd = 10⁻⁹ M. The gene was probably 
generated by a point mutation in the gene for antibody A rather 
than by de novo rearrangement. 


Chapter 35 

1. (a) Skeletal muscle and eukaryotic cilia derive their free energy 
from ATP hydrolysis; the bacterial flagellar motor uses a proton- 
motive force. 

(b) Skeletal muscle requires myosin and actin. Eukaryotic cilia require 
microtubules and dynein. The bacterial flagellar motor requires 
MotA, MotB, and FliG, as well as many ancillary components. 

2. 6400 Å/80 Å = 80 body lengths per second. For a 10-foot 
automobile, this body-length speed corresponds to a speed of 80 × 
10 feet = 800 feet per second, or 545 miles per hour. 
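
The scaling argument in answer 2 is a pair of unit conversions. A short sketch of the arithmetic, using the standard 5280 feet per mile; the variable names are illustrative.

```python
speed_angstrom_per_s = 6400   # kinesin speed from answer 2
body_length_angstrom = 80     # kinesin "body length" from answer 2

body_lengths_per_s = speed_angstrom_per_s / body_length_angstrom

car_length_ft = 10
feet_per_s = body_lengths_per_s * car_length_ft
miles_per_hour = feet_per_s * 3600 / 5280  # 3600 s per hour, 5280 ft per mile
print(body_lengths_per_s, feet_per_s, round(miles_per_hour))
```
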

3. 4 pN = 8.8 × 10⁻¹³ pounds. The weight of a single motor 
domain is 100,000 g mol⁻¹/(6.023 × 10²³ molecules mol⁻¹) = 
1.7 × 10⁻¹⁹ g = 3.7 × 10⁻²² pounds. Thus, a motor domain can lift 
(8.8 × 10⁻¹³/3.7 × 10⁻²²) = 2.4 × 10⁹ times its weight. 
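
The comparison in answer 3 sets a 4 pN force against the weight of a 100,000 g/mol motor domain. The sketch below repeats the calculation with conversion factors chosen for this illustration (1 N ≈ 0.2248 pound-force, 1 pound ≈ 453.6 g); because the factors are rounded differently, the result agrees with the answer only to within rounding, at roughly 2.4 × 10⁹.

```python
AVOGADRO = 6.022e23          # molecules per mole
NEWTON_TO_POUND = 0.2248     # pound-force per newton (assumed conversion)
GRAMS_PER_POUND = 453.6      # grams per pound (assumed conversion)

force_lb = 4e-12 * NEWTON_TO_POUND    # 4 pN expressed in pounds
mass_g = 100_000 / AVOGADRO           # mass of one motor domain in grams
weight_lb = mass_g / GRAMS_PER_POUND  # the same mass expressed in pounds

ratio = force_lb / weight_lb          # force as a multiple of the domain's weight
print(f"{force_lb:.2e} lb, {mass_g:.2e} g, {weight_lb:.2e} lb, ratio {ratio:.1e}")
```
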

4. Both actin filaments and microtubules are built from subunits 
and these subunits bind and hydrolyze nucleoside triphosphates. 
Actin filaments are built of a single type of subunit and these sub- 
units bind ATP. Microtubules are built of two different types of 
subunits and these subunits bind GTP. 

5. The light chains in myosin stiffen the lever arm. The light 
chains in kinesin bind cargo to be transported. 

6. After death, the ratio of ADP to ATP increases rapidly. In 
the ADP form, myosin motor domains bind tightly to actin. 
Myosin–actin interactions are possible because the drop in ATP 
concentration also allows the calcium concentration to rise, clearing 
the blockage of actin by tropomyosin through the action of the 
troponin complex. 

7. Above its critical concentration, ATP-actin will polymerize. 
The ATP will hydrolyze through time to form ADP-actin, which 
has a higher critical concentration. Thus, if the initial subunit 
concentration is between the critical concentrations of ATP-actin 
and ADP-actin, filaments will form initially and then disappear on 
ATP hydrolysis. 

8. A one-base step is approximately 3.4 Å = 3.4 × 10⁻⁴ μm. If 
a stoichiometry of one molecule of ATP per step is assumed, this 
distance corresponds to a velocity of 0.017 μm s⁻¹. Kinesin moves 
at a velocity of 6400 Å per second, or 0.64 μm s⁻¹. 

9. A proton-motive force across the plasma membrane is 
necessary to drive the flagellar motor. Under conditions of 
starvation, this proton-motive force is depleted. In acidic solu- 
tion, the pH difference across the membrane is sufficient to 
power the motor. 

10. The mean distance between tumbles would be longer when the 
bacterium is moving up a gradient of a chemoattractant. 

11. (a) 1.13 × 10⁻⁹ dyne 

(b) 6.8 × 10⁻¹⁴ erg 

(c) 6.6 × 10⁻¹¹ erg per 80 molecules of ATP. A single kinesin motor 
provides more than enough free energy to power the transport of 
micrometer-size cargoes at micrometer-per-second velocities. 

12. The spacing between identical subunits on microtubules 
is 8 nm. Thus, a kinesin molecule with a step size that is not a 
multiple of 8 nm would have to be able to bind at more than one 
type of site on the microtubule surface. 

13. KIF1A must be tethered to an additional microtubule-binding 
element that retains an attachment to the microtubule when the 
motor domain releases. 

14. Filaments built from subunits can be arbitrarily long, can be 
dynamically assembled and disassembled, and require only a small 
amount of genetic information to encode. 


15. Protons still flow from outside to inside the cell. Each proton 
might pass into the outer half-channel of one MotA–MotB complex, 
bind to the MS ring, rotate clockwise, and pass into the inner 
half-channel of the neighboring MotA–MotB complex. 

16. At a high concentration of calcium ion, Ca²⁺ binds to calmodulin. 
In turn, calmodulin binds to a protein kinase that phosphorylates 
myosin light chains and activates it. At low calcium ion 
concentration, the light chains are dephosphorylated by a 
Ca²⁺-independent phosphatase. 

17. (a) The value of kcat is approximately 13 molecules per second, 
whereas the KM value for ATP is approximately 12 μM. 

(b) The step size is approximately (380 — 120)/7 = 37 nm. 

(c) The step size is very large, which is consistent with the presence 
of six light-chain-binding sites and, hence, very long lever arms. 
The rate of ADP release is essentially identical with the overall 
kcat; so ADP release is rate limiting, which suggests that both 
motor domains can bind to sites 37 nm apart simultaneously. ADP 
release from the hindmost domain allows ATP to bind, leading to 
actin release and lever-arm motion. 


Chapter 36 

1. (a) Before; (b) after; (c) after; (d) after; (e) before; (f) after. 

2. (a) Yes; (b) yes; (c) no (MW > 600). 

3. If computer programs could estimate log(P) values on the basis 
of chemical structure, then the required laboratory time for drug 
development could be shortened. The determination of the relative 
solubilities of pharmaceutical candidates by allowing each com- 
pound to equilibrate between water and an organic phase would no 
longer be necessary. 

4. Perhaps N-acetylcysteine would conjugate to some of the 
N-acetyl-p-benzoquinone imine that is produced by the metabo- 
lism of acetaminophen, thereby preventing the depletion of the 
liver’s supply of glutathione. 

5. In phase 1 clinical trials, approximately 10 to 100 healthy vol- 
unteers are typically enrolled in a study designed to assess safety. In 
contrast, a larger number of subjects are enrolled in a typical phase 2 
trial. Moreover, these persons may benefit from the drug adminis- 
tered. In a phase 2 trial, efficacy, dosage, and safety can be assessed. 
6. The binding of other drugs to albumin could cause extra 
coumadin to be released. (Albumin is a general carrier for 
hydrophobic molecules.) 

7. A drug that inhibits a P450 enzyme may dramatically affect 
the disposition of another drug that is metabolized by that same 
enzyme. If this inhibited metabolism is not accounted for when 
dosing, the second drug may reach very high, and sometimes toxic, 
levels in the blood. 

8. Unlike competitive inhibition, noncompetitive inhibition 
cannot be overcome with additional substrate. Hence, a drug 
that acts by a noncompetitive mechanism will be unaffected by 
changing levels of the physiological substrate. 

9. An inhibitor of MDR could prevent the efflux of a chemo- 
therapeutic drug from tumor cells. Hence, this type of an inhibitor 
could be useful in averting resistance to cancer chemotherapy. 

10. Agents that inhibit one or more enzymes of the glycolytic 
pathway could act to deprive trypanosomes of energy and thus be 
useful for treating sleeping sickness. A difficulty is that glycolysis 
in the host cells also would be inhibited. 

11. Imatinib is an inhibitor of the Bcr-Abl kinase, a mutant kinase 
present only in tumor cells that have undergone a translocation 
between chromosomes 9 and 22 (see Figure 14.33). Before initiating 
treatment with imatinib, we could sequence the DNA of the 
tumor cells and determine (a) whether this translocation has taken 
place and (b) whether the sequence of bcr-abl carries any mutations 
that would render the kinase resistant to imatinib. If the translocation 
has not taken place or if the gene carries resistance mutations, 
then imatinib would likely not be an effective treatment for the 
patient carrying this particular tumor. 

12. Sildenafil increases cGMP levels by inhibiting the phospho- 
diesterase-mediated breakdown of cGMP to GMP. Intracellular 
cGMP levels can also be increased by activating its synthesis. This 
activation can be achieved with the use of NO donors (such as 
sodium nitroprusside and nitroglycerin) or compounds that acti- 
vate guanylate cyclase activity. Drugs that act by the latter mecha- 
nism are currently in clinical trials. 

13. A reasonable mechanism would be an oxidative deamination 
following an overall mechanism similar to that in Figure 36.9, with 
release of ammonia. 
[Reaction scheme, structures lost in extraction: NADPH + H⁺ + O₂ + substrate → NADP⁺ + H₂O + NH₃ + product] 


14. Ki ≈ 0.3 nM. IC50 ≈ 2.0 nM. Yes, compound A should be 
effective when taken orally because 400 nM is much greater than 
the estimated values of Ki and IC50. 
liver in, 680 
in muscle, 685 
one-carbon units in, 716 
overview of, 673-674 
oxaloacetate in, 692 
oxygenation in, 695-696 
propionyl CoA in, 693, 693, 694 
proteasomes in, 677, 677f, 678f 
in protein turnover, 675 
pyruvate in, 684-685, 685f, 691, 691f 
rate of, 675, 676t 
regulatory functions of, 678-679, 679t 
Schiff-base intermediates in, 681-682 
serine dehydratase in, 684, 691 
in starvation, 809 
succinyl coenzyme A in, 693 
tetrahydrofolate in, 716 
threonine dehydratase in, 684 
transamination in, 680-685 
ubiquitination in, 298t, 675-678 
urea cycle in, 685-690 
valine in, 694-695 
Amino acid residues, 34 
definition of, 34 
molecular weight of, 34 
in secondary structures, 38-45, 50-52, 51t 
Amino acid sequences, 18-19, 19f, 25, 
27-38 
alignment of, 175-181, 176f-182f 
alternative conformations of, 51-52, 51f 
of collagen, 44, 44f 
databases of, in sequence homology 
analysis, 181, 182f 
determination of, 79-84 
DNA sequencing in, 79-84 
Edman degradation in, 81-83, 81f 
genetic, 35 
protein cleavage for, 82-83, 82f, 83f 
uses of, 79-80 


directionality of, 34, 34f 

in disease, 35-36 

DNA-derived, 84 

evolution of, 36, 79, 174-181 

in genetic code, 129-130, 129t. See also 
Genetic code 

glycosylation sites in, 330 

homologous, 175-181. See also Sequence 
homologies 

identification of, 65-66 

internal repeats in, 79 

protein structure and, 18-19, 19f, 25, 
35-36, 49-59 

shuffled, 177-178, 177f 

substitutions in, 178-181, 179f-181f 

Amino acid side chains, 27, 28-32 

acidic, 31-33, 32f 

aliphatic, 28, 29f 

aromatic, 28 

basic, 30-31, 31f 

carboxamide-containing, 30, 30f 

charge of, 30-33, 31f, 32f 

definition of, 27 

hydrophilic, 30f, 46, 46f, 47, 47f 

hydrophobic, 28, 45-46, 46f, 47, 47f 

hydroxyl-containing, 30, 30f 

ionizable, 32, 32t 

sulfhydryl-containing, 30, 30f 

thiol-containing, 30, 30f 

Amino acid synthesis, 705-732 

activated methyl cycle in, 717-718, 717f 

adenylation in, 725-726, 725f 

ammonia in, 706-710 

branched pathways in, 723 

chorismate in, 719-720, 720f 

committed step in, 723 

enzymatic cascades in, 725-726, 
725f, 726f 

enzyme multiplicity in, 724, 724f 

of essential amino acids, 711-712, 
719-723 

feedback inhibition/activation in, 
723-724, 726f 

hydroxyethyl-TPP in, 723, 723f 

metabolic precursors in, 711, 711f 

methylation in, 716-718, 717f 

nitrogen fixation in, 706-710 

of nonessential amino acids, 712-719 

one-carbon units in, 715-716, 715t, 716f 

overview of, 705-706 

phosphorylation in, 724 

rate of, 723 

regulation of, 723-726 

regulatory domains in, 724, 724f 

reversible covalent modification in, 
725-726, 725f 

S-adenosylmethionine in, 716-718 

shikimate in, 719, 720f 

in starvation, 809 

substrate channeling in, 722-723, 722f 

tetrahydrofolate in, 715, 715-716, 716f 

threonine deaminase in, 723-724, 723f 

vitamin B12 in, 650-652, 650f, 651f, 717 
Amino acid-binding regulatory domains, 
724, 724f 
Amino sugars, 330 
2-Amino-3-carboxymuconate-6- 
semialdehyde, 696, 696 
Aminoacrylate, Schiff base of, 722, 
722, 722 
Aminoacyl adenylate, in translation, 893, 
893-895, 894f 
Aminoacyl-AMP, 893-895, 894f 
Aminoacyl-tRNA, 128, 128f, 893, 893-894 
Aminoacyl-tRNA synthetases, 128, 888, 
893-897 
activation sites of, 894, 894-895 
classes of, 897, 897t 
editing sites of, 895, 895 
proofreading by, 895 
specificity of, 894-895 
tRNA recognition by, 896-897 
Aminoglycosides, 909, 909-910, 909t 
5-Aminoimidazole ribonucleotide, 742, 742f 
5-Aminoimidazole-4-carboxamide 
ribonucleotide, 741t, 742f 
5-Aminoimidazole-4-(N- 
succinylcarboxamide) ribonucleotide, 
742, 742f 
δ-Aminolevulinate synthase, in porphyrin 
synthesis, 728, 729f 
Aminopterin, 750 
Aminotransferases 
in amino acid degradation, 680-682 
evolution of, 713 
pyridoxal phosphate-dependent, 712-713 
Ammonia 
in amino acid degradation, 680-681, 
685-690 
in amino acid synthesis, 706-710 
conversion to urea, 685-690 
formation of, 680-681, 684, 713 
in glucose-alanine cycle, 684-685, 685f 
in glutamate synthesis, 709 
in glutamine synthesis, 709-710 
neurotoxicity of, 689 
in purine synthesis, 741 
in pyrimidine synthesis, 737, 738 
transport of, 684-685 
in urea cycle defects, 689 
AMP (adenosine monophosphate) 
in adenylation, 725, 725f 
degradation of, 752-753, 753f 
energy charge and, 446-447, 446f 
in fatty acid metabolism, 644-645, 646f 
in glycogen metabolism, 626 
in reversible covalent modification, 725 
structure of, 431 
synthesis of, 743-744, 743f, 751, 751f 
AMP-dependent protein kinase, 666-667 
Amphibolic pathways, 429 
Amphipathic molecules, 351 
α-Amylase, 328, 454 
in glycogen branching, 629 
Amyloid fibers, in neurodegenerative 
diseases, 55-56, 56f, 57 
Amyloid plaques, in Alzheimer’s disease, 
55-56, 57 
Amyloid precursor protein (APP), 57 
Amyloidoses, 55-57 
Amylopectin, 328 
Amylose, 328 
Amyotrophic lateral sclerosis, 147-148, 157 
animal models of, 164 
mutations in, 157, 163 
Amytal, 558 
Anabolic steroids, 785, 948 
Anabolism, 428-429 
definition of, 706b 
electron carriers in, 439 
Anaerobes, obligate, 468-469 
pathogenic, 468, 469t 
Anaplerotic reaction, 516-517 
Anchor residues, 993, 993f 
Ancient DNA, amplification and 
sequencing of, 147, 188-189, 189f 
Andersen disease, 634t 
Androgen(s). See also Sex hormones; Steroid 
hormones 
functions of, 784-785 
reduction of, 785 
synthesis of, 780, 780f, 784, 784 
Androstenedione, 784, 784, 948 
Anemia 
Cooley, 210 
erythropoietin for, 331 
hemolytic, G6PD deficiency and, 
609-610, 611 
sickle-cell, 209-210, 209f, 210f 
Anfinsen, Christian, 49-50 
Angiogenin, sequence comparison for, 173, 174 
Angstrom (Å), 7 
Animal testing, in drug development, 1037 
Anion exchange, 70 
Ankyrin repeats, 972-973, 972f 
Annealing, of DNA strands, 120-121 
Anomers, 321f, 323 
Anosmia, 958 
ANT (adenine nucleotide translocase). See 
ATP-ADP translocase 
Antagonists, 948 
Anthranilate, 720, 721 
Anthrax, 831 
Antibiotics. See also Drug(s) and specific 
antibiotics 
protein synthesis inhibition by, 909-910 
transcription termination by, 862-863 
translocation inhibition by, 909-910, 909t 
Antibodies, 979-980. See also 
Immunoglobulin(s) 
antigen binding by, 984-986, 985f, 986f 
antigen interaction with, 83-86, 85f 
antigen-binding units of, 981-983 
binding sites of, 984-985, 984f, 985f 
catalytic, production of, 243-244, 243f 
class switching in, 990-991, 991 
constant domains of, 984, 984f 
definition of, 84-85, 979 
diversity of, 980-981 


effector units of, 981-983 
formation of, 988-989, 990-991 
gene recombination in, 987-991, 988f 
class switching and, 990-991, 991f 
hypervariable loops of, 984, 984f 
monoclonal 
for cancer treatment, 420-421 
preparation of, 86-88, 87f 
number of, 980 
oligomerization of, 989-990 
polyclonal, 86, 86f 
production of, 86-88, 87f 
secretion of, 950, 989-990 
structure of, 85f, 981-982 
variable domains of, 983, 984, 984f, 985f 
Antibody diversity, 987-991 
Anticipation, in genetic disease, 842 
Anticoagulants, 310, 312 
Anticodon loop, 890f, 891 
Anticodons, 128, 889, 890f, 891-893, 891f, 
892f, 896. See also Codons 
Antidiuretic hormone, 96 
Antielastase, 307 
Antigen(s), 85-86 
ABO blood group, 335-336, 335f 
antibody binding of, 85, 85f, 981-986, 
985f, 986f 
cross-linking of, 982, 982f 
definition of, 979 
self, 1001 
Antigen–antibody interactions, 83-86, 85f, 
981-986, 985f, 986f 
Antigenic determinants, 85, 85f, 980 
Antigen-presenting cells, 996-998, 996f 
Antihemophilic factor, 311, 311f 
Antimalarial drugs, hemolytic anemia and, 
609-610 
Antimycin A, 558 
Antioxidants 
vitamin C as, 813 
vitamin E as, 542 
Antiporter, 380, 380f 
Antisense strand, 854 
Antiserum, 86 
Antithrombin III, 312 
Antitransporters, 380-381, 380f, 381f 
α1-Antitrypsin, 304-307 
abnormalities of, 312 
Antituberculosis drugs, 679 
AP endonuclease, 840 
Apoenzymes, 221 
Apolipoprotein B-48, 642, 773, 773t, 774, 
872-873 
Apolipoprotein B-100, 773t, 774, 774f, 775, 
872-873 
Apolipoprotein E, 773t, 774 
Apolipoproteins, 642, 773, 773t, 774, 774f, 
775. See also Lipoprotein(s) 
Apoproteins, 773 
Apoptosis 
mitochondria in, 559 
T cell, 1000 
zymogens in, 303 


Apoptosome, 559 
Appetite regulation, 794-797, 795f-797f, 
795t. See also Caloric homeostasis 
Approximation, catalysis by, 254 
Aquaporins, 394, 395f 
Aqueous solutions. See also Water 
hydrogen ions in, 13-17 
Arabidopsis, 783 
Arachidonate, 664-665, 665f 
Arachidonic acid, 358 
Arber, Werner, 141 
Archaea, 3, 3f 
membranes of, 350-351 
proteasomes of, 677-678, 678f 
transcription in, 869 
Arginase, 687 
Arginine, 687 
nitric oxide from, 727-728 
structure of, 30-31, 31f, 687 
synthesis of, 712, 714, 714f 
in urea cycle, 687 
Argininosuccinase, 687 
deficiency of, 689, 689f 
Argininosuccinate, 686, 686, 689, 689 
Argininosuccinate synthetase, 686 
Argonaute-miRNA, 953, 953f 
Aromatase, 785 
Aromatic amino acids, 28. See also 
Amino acid(s) 
degradation of, 695-696 
synthesis of, 719-722, 720f, 721f 
Arrestin, 408, 969 
Arsenite poisoning, 517, 518f 
Arteriosclerosis. See Atherosclerosis 
Arthritis, degenerative, 332 
Artificial chromosomes, 151 
Ascorbic acid (vitamin C), 442, 442t, 
812-813, 813 
deficiency of, alcohol-related, 
812-813 
forms of, 813, 813 
Asparaginase 
in amino acid degradation, 692 
in glycoproteins, 330, 331 
structure of, 30, 30f 
synthesis of, 713 
Asparagine 
in amino acid degradation, 692 
carbohydrate units of, 57 
synthesis of, 713, 713f 
Aspartate 
asparagine formation from, 713, 713f 
in catalytic triads, 258-260. See also 
Catalytic triads 
conversion to fumarate, 687, 692 
conversion to oxaloacetate, 692 
in double-displacement reactions, 237 
in purine synthesis, 743 
structure of, 31, 32f, 237, 290 
synthesis of, 712-713 
taste of, 962, 965 
Aspartate aminotransferase, 683 
in amino acid degradation, 680, 682 


Aspartate transcarbamoylase (ATCase), 
290-296 
active sites of, 293-294 
allosteric interactions in, 292-295 
cytidine triphosphate inhibition of, 291, 
291f, 295-296, 296f 
kinetics of, 291, 291f, 294-296, 
294f-296f 
in pyrimidine synthesis, 290-295, 290f, 
738, 751 
structure of, 290, 291-292, 292 
subunits of 
catalytic, 291-292 
interaction of, 292-295, 294f-296f 
regulatory, 291-292 
T-to-R state transition in, 294-296, 
294f, 295f 
ultracentrifugation studies of, 
291-292, 292f 
Aspartokinases, 724, 724f 
Aspartyl proteases, in peptide bond 
cleavage, 263, 264, 264f 
Aspirin 
discovery and development of, 
1038-1040 
as prostaglandin inhibitor, 358, 
665, 1040 
Assays 
definition of, 67 
of enzyme activity, 67 
in protein purification, 67 
ATCase (aspartate transcarbamoylase), 289 
Atenolol, 1045, 1045 
Atheroma, 1040 
Atherosclerosis, 759, 774-775, 
776-779, 777f 
drug therapy for, 1040 
HMG-CoA reductase inhibitors for, 779, 
779, 1040, 1040-1041, 1041 
homocysteine in, 719 
Atkinson, Daniel, 446 
Atorvastatin, 1041, 1041 
ATP (adenosine triphosphate), 112, 112, 
279, 279 
as activated carrier of phosphoryl 
groups, 438 
ADP units in, 447, 447f 
as allosteric effector of ATCase, 
296, 296f 
binding of 
conformational changes and, 280-282, 
282f, 284 
by RNA, 190-191, 190f 
body requirements for, 525, 554-560 
in citric acid cycle, 498, 512, 
514-515, 514f 
electrostatic repulsion and, 433 
energy charge and, 446-447, 446f 
during exercise, 805-806, 805t 
in fatty acid metabolism, 644-645, 646f 
formation of, 430-431 
from acetyl CoA oxidation, 437-438 
from ADP, 430-431, 434, 435, 435f 


binding-change mechanism in, 547, 547f 
from carbon oxidation, 434, 435-438, 
435f, 437f 
chemiosmotic hypothesis for, 544-545, 
544f, 545f 
creatine phosphate in, 434-435, 805 
during exercise, 434-435, 435f, 
805-806, 805t 
in fatty acid metabolism, 648 
from glycolysis, 463-466 
ion gradients in, 437-438, 437f 
in mitochondria, 526 
NADH in, in ATP synthesis, 439 
from oxidative phosphorylation, 
436-437, 437f, 525, 543-550. See 
also Oxidative phosphorylation 
in photosynthesis, 565, 566, 578-581 
proton-motive force in, 525-526, 
544-545, 544f, 545f 
rotational catalysis in, 547-550, 
547f-549f 
yield in, 554-555, 555t 
in gluconeogenesis, 484 
in glycolysis 
as phosphoryl-group donor, 455, 458 
as product, 463-466 
in GMP synthesis, 743-744, 743f, 751 
high-energy bonds of, 433-434 
hydration stabilization in, 433 
hydrolysis of 
in coupled reactions, 431-433 
in DNA replication, 824-825 
in DNA supercoiling, 830-831, 830f 
free energy from, 429-435 
heat generation in, 489 
myosin and, 279-282, 281f-283f 
in kinesin movement, 1021-1022, 
1021f 
in muscle contraction, 1009-1012, 
1011f, 1012f, 1015, 1016f 
rotational motion from, 547-550,
548f, 549f 
in splicing, 876-877 
in transcription, 876-877 
in membrane transport, 374-381. See also 
Ion pumps 
in mitochondria, 526 
in muscle contraction, 1009-1012, 1011f, 
1012f, 1015, 1016f 
in myosin movement, 1009-1012, 1011f, 
1012f, 1015, 1016f 
in nitrogen fixation, 707, 708, 709-710 
in oxidative phosphorylation, 525-526 
in pentose phosphate pathway, 607f, 
608-609 
in phosphorylation, 298-301, 432-434, 
437-438 
phosphoryl-group-transfer and, 434-435,
434t. See also Phosphorylation 
phosphoryl-transfer and, 434f 
in purine synthesis, 742f 
as reaction energy source, 429-435 
resonance structures of, 433 
in signal transduction, 405
structure of, 430, 431 
in substrate cycles, 489 
triphosphate moiety of, 430 
in ubiquitination, 675-676, 675f 
utilization rates for, 435 
ATP synthase (Complex V), 578-579, 579f 
a subunit of, 546, 548, 549 
α subunit of, 545, 545
ATP-driven rotation in, 547-550,
547f-549f
b subunit of, 546
β subunit of, 545, 545-547, 547f
binding-change mechanism and,
547, 547f
c ring of, 548-549, 549f 
catalytic action of, 546-548 
in chloroplasts, 578-579, 583, 583f 
Fo subunit of, 545, 546 
F1 subunit of, 545, 545-546
G proteins and, 550
γ subunit of, 545, 545-546, 547
rotation of, 547-550, 547f-549f, 1023
inhibition of, 558
in mitochondria, 544-550, 558
in oxidative phosphorylation, 
544-550, 558 
in photosynthesis, 578-579, 583, 583f 
P-loop NTPase domains in, 284, 284f 
proton flow through, 546-547 
rotational, 547-550, 547f-549f
respiratory chain linkage to, 544-545, 
544f, 545f 
structure of, 545, 545-546 
in thylakoid membrane, 583 
ATP synthasome, 554 
ATP-ADP cycle, energy exchange in, 
430-431, 435, 435f 
ATP-ADP translocase, 552-553, 
553f, 554f 
inhibition of, 558 
ATPase(s) 
AAA 
in amino acid degradation, 677 
DnaA as, 834 
in dynein, 1010, 1011f 
mitochondrial. See ATP synthase 
in motor proteins, 1008-1010, 1011f 
P-type, 374-378 
evolution of, 374, 378 
functions of, 378 
in membrane transport, 374-378. See 
also Ion pumps
ATPase domains, of myosin, 279-280, 280f 
ATP-binding cassettes, 378-380, 379f 
ATP-citrate lyase, 662 
ATP-driven pumps, 371 
ATP-grasp domain, 482, 482f 
ATP-grasp fold, 737 
ATP-magnesium complex, 280
ATP-myosin complex, 279-280, 280f 
Atrazine, 584, 584 
Attenuation, 931-932, 932f 
Auditory processing, 971-973

Autoimmune diseases, 1001 

Autoinducers, 930, 930f 

Autonomously replicating sequence, 
151, 151f 

Autoradiography, 73, 143 

Autotrophs, 565, 590 

Axel, Richard, 959 

Axonemes, 1019 

Azathioprine, 1048, 1048 


B cell, 989-990 
memory, 1002 
B cell receptor, 989, 989f 
B vitamins. See Vitamin(s) 
Backbone models, 61-62, 62f
Backbones, in nucleic acids, 110, 110f, 111 
Bacteria, 3, 3f. See also Escherichia coli 
chemotaxis in, 1024-1025, 1024f 
DNA replication in, 831-834. See also 
DNA replication 
flagellar movement of, 1022-1025 
glyoxylate cycle in, 518-519, 519f 
nitrogen fixation in, 706-710 
photosynthetic, 584, 584t 
reaction center of, 569-572, 570f 
proteasomes of, 677-678, 678f 
Bacterial artificial chromosomes, 151 
Bacterial photosynthetic reaction center, 
569-572, 570f
Bacteriochlorophyll, 570, 570 
Bacteriochlorophyll b, 570-571 
Bacteriopheophytin b, 570, 570, 570-571 
Bacteriorhodopsin, 356, 356f 
Baculovirus vectors, 163-164 
BAL (British anti-lewisite), 517, 518f 
Ball-and-chain model, of ion channel 
inactivation, 388-389, 389f 
Ball-and-stick models, 22, 23f, 61, 61f 
Band centrifugation, 77-78, 77f 
Basal transcription complex, 941 
Base-excision repair, 840 
Base-pair substitutions, tests for, 843 
Base-paired hairpin. See Hairpin turns 
Bases 
abbreviations for, 112, 112f 
in acid-base reactions, 13-17. See also
Acid-base reactions
nucleic acid. See Bases/base pairs 
sequence of, notation for, 112, 112f 
Bases/base pairs, 110-113, 111, 114t. See 
also Double helix 
adenine, 4, 111, 111 
adenine-thymine, 5, 5 
complementarity of, 822, 822f, 823f 
cross-linkage of, 838, 839f 
cytosine, 4, 111, 111 
damage to, 838-839 
deamination of, 680-684, 838, 841
errors in base pairing and, 117-118, 
837-838 
repair of, 839-843. See also DNA
repair
formation of, 891-893, 901. See also 
Translation 
codon-anticodon interactions in,
891-893 
in genetic code, 128-131, 129t. See also 
Genetic code 
guanine, 4, 111, 111 
guanine-cytosine, 5, 5
hydrogen bonds in, 5, 5f, 10. See also 
Hydrogen bonds 
methylated, in tRNA, 889-890 
mismatched, 117-118, 118f, 837-838 
nomenclature of, 736t 
nonstandard, 117-118, 118f, 837-838 
non-Watson-Crick, 837-838
in nucleosides, 111, 111. See also 
Nucleoside(s) 
in nucleotides, 111-112, 112. See also 
Nucleotide(s) 
oxidation of, 838 
purine, 111, 111 
synthesis of, 740-745, 740f 
pyrimidine, 111, 111 
synthesis of, 736-740, 736f 
recognition sites and, 141, 141f, 272, 
275-277 
in recombinant DNA technology, 139. 
See also Recombinant DNA 
technology 
RNA, homologous, 186-187, 187f 
rules for, 114 
stacking of, 10, 10f, 114-115, 119f 
hypochromism and, 120, 120f 
thymine, 4, 111, 111 
in translation, 891-893, 901 
uracil, 111, 111 


Watson-Crick, 5, 113-115, 114f, 186, 187f


wobble in, 891-893 
Basic-leucine zipper, 942 
Bathorhodopsin, 967-968 
Bcr-Abl, in chronic myelogenous leukemia, 
421, 421f 
B-DNA helix, 115-116, 116t 
Bennett, Claude, 987 
Benzaldehyde, 958, 958 
Benzoate, 689 
Berg, Howard, 548, 1023 
Berg, Paul, 148, 644 
Beriberi, 517 
β cells, in diabetes, 802
β chains, of hemoglobin, 199, 199f, 211
β sheet, 40-42, 40f-42f
amino acid residues in, 50-52, 51t
β strands, 41-42, 41f
amino acid residues in, 50-51, 51t
of membrane proteins, 357, 357f
β turns, 42, 42f
amino acid residues in, 50-51, 51t
β2 subunit, of E. coli, in DNA replication,
831-832, 831f
β-adrenergic receptor, 404-405, 404f
β-adrenergic receptor kinase, 408
β-arrestin, 408


β-globin gene. See also Hemoglobin
transcription and processing of,
131-132, 132f
β-hemoglobin, 199, 199f, 211. See also
Hemoglobin
β-lactam ring, of penicillin, 244, 244f
β2-microglobulin, 992-993, 993f
β-oxidation pathway, in fatty acid
metabolism, 646-647
β-thalassemia, 210-211, 211
β-tubulin, 1018-1019, 1018f, 1019f
Bicarbonate 
dehydration of, carbonic anhydrase in, 
266-271 
in pyrimidine synthesis, 737 
Bicarbonate ion, in oxygen transport, 
207-208, 208f 
Bifunctional enzyme, 487 
Bile, in drug metabolism, 1035-1036 
Bile salts, 642, 642f 
synthesis of, 779-780, 780f 
Bilirubin, 730, 730 
Biliverdin, 730, 730 
Biliverdin reductase, 730, 730 
Binding energy, 229 
definition of, 254 
enzyme specificity and, 275-277 
Binding-change mechanism, 547, 547f 
Biochemistry 
definition of, 1 
unifying concepts in, 1-4, 2f, 3f 
Bioenergetics. See also Energy 
of coupled reactions, 429-430, 431-433 
enzymes and, 222-225 
of metabolism, 429-435 
thermodynamic principles and, 11-13 
Biofilms, 930-931 
Bioinformatics, 173-192. See also Sequence 
homologies 
Biological diversity, biochemical unity 
underlying, 1-4, 2f 
Biological macromolecules, 2 
Biopterin, 695 
Biotin, 441t 
in fatty acid synthesis, 657 
in gluconeogenesis, 482-483, 483f 
1,3-Bisphosphoglycerate, 434 
in glycolysis, 464, 484 
oxidation of, 436-437 
phosphoryl-transfer potential of, 434, 434t 
2,3-Bisphosphoglycerate 
in glycolysis, 464 
oxygen affinity of hemoglobin and, 
204-205 
Bisubstrate reactions, 235-236 
Bitter taste, 962, 963-964 
Bjorkman, Pamela, 992 
Blackburn, Elizabeth, 837 
BLAST search, 181, 182f 
Bleach, urine as, 687b 
Bleeding disorders, 311-312, 312 
The Blind Watchmaker (Dawkins), 53 
Blood clotting. See Clotting 


Blood groups, glycosyltransferases in, 
335-336 
Blood-brain barrier, 1033
Blow, David, 257 
Blue photoreceptors, 969-970, 970f 
Boat form, 324, 325f 
Body mass index (BMI), 792, 793f 
Body weight, regulation of, 791-798. See 
also Caloric homeostasis 
Bohr, Christian, 206 
Bohr effect, 206-208, 207f 
Bombardment-mediated transformation, 167 
Bonds 
covalent, 7 
cleavage of, 13 
disulfide, 35, 35f 
cleavage of, 49 
reduction of, 83 
electrostatic interactions, 7-8
in antigen-antibody binding, 984, 
985, 985f 
ATP and, 433-434 
in enzyme-substrate complex, 228, 229f 
glycosidic 
in disaccharides, 327 
in monosaccharide alcohol/amine 
complexes, 326, 326f 
high-energy, 433-434 
hydrogen. See Hydrogen bonds 
hydrophobic effect and, 9-10, 9f 
hydrophobic interactions and, 10 
isopeptide, 675, 675f 
noncovalent, 7-11. See also Noncovalent 
bonds 
peptide. See Peptide bonds 
phosphodiester, 110, 110, 111 
hydrolysis of, 272-274 
in resonance structures, 7 
surface complementarity and, 11 
units of measure for, 7 
van der Waals interactions. See Van der 
Waals interactions 
Bonitus, Jacob, 517b 
Bordetella pertussis, 422 
Bortezomib (Velcade), 678, 679 
Bovine spongiform encephalopathy, 
56-57, 56f 
Boyer, Herbert, 148 
Brain 
glucose metabolism in, in starvation, 
809-810, 810t 
sensory processing in. See Sensory 
systems 
vanishing white matter in, 908-909, 909f 
Branched-chain amino acids. See also 
Amino acid(s) 
degradation of, 693-695 
Branched-chain ketoaciduria, 697 
Branching enzyme, in glycogen 
metabolism, 629 
Brenner, Sydney, 128 
Briggs, George, 231 
British anti-lewisite (BAL), 517 


3-Bromoacetol phosphate, 242, 242 
Bromodomains, 949-950, 950f 
Brown fat, 556-557 
Brown, Michael, 767b, 775, 776 
Bubble boy disease, 753 
Buchner, Eduard, 454 
Buchner, Hans, 454 
Buck, Linda, 959 
Buffers, 15-17, 15f, 16f 

in carbon dioxide hydration, 270, 270f 
Burkitt lymphoma, 869 
t-Butyloxycarbonyl amino acid, 97, 97 
Butyryl ACP, 658t, 659, 659 
bZip, 942 


C genes, 987 
in antibody switching, 991 
C3 plants, 600 
C4 pathway, in photosynthesis, 
599-600, 600f 
Ca2+ pump, 374-378, 375f, 376f
CAAT box, 127, 867, 867f
CAD, 688 
Caenorhabditis elegans 
genome of, 159 
RNA interference in, 166 
Caffeine, metabolism of, 783 
Calciferol (vitamin D), 442, 442t 
deficiency of, 786 
sources of, 786 
synthesis of, 785, 785f 
Calcineurin, 990, 990 
Calcitriol, 785, 785 
Calcium ATPase, in phosphoinositide 
cascade, 409 
Calcium ATPase, 375-377 
Calcium ion(s) 
in calmodulin activation, 410-411 
in insulin secretion, 802 
in phosphorylase kinase activation, 
624, 624f 
prothrombin binding of, 310-311 
in pyruvate dehydrogenase complex 
activation, 514 
in signal transduction, 409-411, 
409f-411f, 802 
structure of, 403 
in vision, 969 
Calcium ion channels, 367, 384f. See also 
Ion channels 
sequence homologies of, 367, 384, 
384f, 386 
Calcium ion pump, 374-378, 375f, 376f 
evolution of, 378 
Calmodulin, 79, 79, 624 
in glycogen metabolism, 624 
in signal transduction, 410-411 
Calmodulin-dependent protein kinases, 411 
Calnexin, 339 
Caloric homeostasis, 791-798
definition of, 791 
in diabetes, 799-802 
dieting and, 797-798 
evolution and, 792-793
exercise in, 803-806 
fasting and, 796 
food intake and, 793-794 
insulin in, 796 
melanocyte-stimulating hormone in, 796 
obesity and, 792-803. See also Obesity 
satiation signals in, 795-797, 796f 
signaling in, 794-797, 794t, 795f-797f
suppressors of cytokine signaling in,
797, 797f
Calorie (cal), 223b 
Calreticulin, 339 
Calvin cycle, 590-601 
C4 pathway and, 599-600 
energy expenditure in, 597 
evolution of, 600 
hexose sugar formation in, 594-597, 594f 
oxygenase reaction in, 593, 593f 
pentose phosphate pathway and, 609 
3-phosphoglycerate formation in, 
591-592, 592f 
reactions in, 590-597, 590f, 596f 
regulation of, 598-600 
ribulose 1,5-bisphosphate regeneration in, 
594-597, 596f 
salvage pathway in, 593-594, 593f 
stages of, 590-591, 590f, 596f 
thioredoxin in, 598-599, 598t, 599 
CaM kinase, 411 
cAMP 
in glycogen metabolism, 625-626, 
625f, 626 
in protein kinase A activation, 301, 301f 
as second messenger, 402-403
in signal transduction, 405, 406-407 
structure of, 301, 403 
cAMP receptor protein (CRP), 
927-928, 927f 
Camptothecin, 831 
Cancer 
aerobic glycolysis in, 478-479 
breast, taxol for, 1019-1020 
carcinogen testing and, 843-844 
citric acid cycle defects in, 515 
colorectal, 842-843 
cytochrome P450 and, 783 
defective DNA repair in, 842-843 
drug resistance in, 1050-1051 
enhancers in, 869 
fatty acid synthase in, 663 
glycolysis in, 478-479, 479f 
hypoxia inducible factor 1 in, 515 
immune response in, 1001 
leukemia, 421 
in Li-Fraumeni syndrome, 843 
metabolic derangements in, 515 
mutagens and, 838 
myc gene in, 869 
oncogenes and, 420 
protein kinase inhibitors for, 421 
proto-oncogenes and, 420 
signal transduction defects in, 420 
skin, in xeroderma pigmentosum, 842 
telomerase inhibitors for, 837 
thymidylate inhibitors for, 749-750 
tumor hypoxia in, 478-479 
tumor-suppressor genes and, 420 
Warburg effect in, 478 
CAP (catabolite activator protein), 
927-928, 927f
CAP-cAMP complex, 927-928 
Capsaicin, 973-974, 973f 
Captopril, 264 
Carbamate, 208, 208 
in Calvin cycle, 592-593 
Carbamic acid, 737, 737, 738, 738f 
Carbamoyl phosphate, 290, 686 
in pyrimidine synthesis, 737 
in urea cycle, 685-687, 685f
Carbamoyl phosphate synthetase, 689f
active sites on, 737-738, 738f
deficiency of, 689, 689f 
isozymes of, 688 
in pyrimidine synthesis, 737, 751 
structure of, 737, 737-738, 738 
synthesis of, 737 
in urea cycle, 685-686 
Carbamoylaspartate, 290, 738, 738. See 
also Aspartate transcarbamoylase 
(ATCase) 
biosynthesis of, 290-296 
Carbohydrate-asparagine adduct, 57
Carbohydrates, 319-340 
abbreviations for, 330b 
disaccharides, 327-329. See also 
Disaccharides 
in glycoproteins, 57, 329-337 
lectins and, 337-339, 339f 
metabolism of, 454 
monosaccharide, 320-327. See also 
Monosaccharides 
N-linked, 330, 330f, 334, 336 
oligosaccharides, 327. See also 
Oligosaccharides 
O-linked, 330, 330f, 334 
overview of, 319-320 
peptide bond cleavage in, 337 
polysaccharides, 328, 328f, 329f. See also 
Polysaccharides 
in red blood cells, 335 
sequencing of, 336-337 
starch, 328-329, 329f 
in viral infections, 339-340 
Carbo-loading, 806 
Carbon dioxide 
activated, biotin carrier for, 483, 483f 
in Calvin cycle, 591-597 
from carbon oxidation, 435 
hydration of, 206 
carbonic anhydrase in, 266-271 
in oxygen release by hemoglobin, 207-208, 
207f, 208f
transport of, C4 pathway for, 
599-600, 600f 


Carbon fuels 
body reserves of, 809 
definition of, 498 
for exercise, 805-806, 805t 
for glucose metabolism, 808, 809 
oxidation of. See also Catabolism 
in citric acid cycle, 498. See also 
Citric acid cycle 
free energy of, 434, 435-438, 435f 
Carbon monoxide, in oxidative 
phosphorylation inhibition, 558 
Carbon monoxide poisoning, 213 
Carbon skeletons, of amino acids, metabolic 
fates of, 690-696, 691f 
Carbonic acid, biosynthesis of, 266-271 
Carbonic anhydrases, 207, 207f, 266-271 
alpha-type, 271 
beta-type, 271 
catalytic activity of, 266-271 
evolution of, 271 
gamma-type, 271, 271 
structure of, 267, 267-268, 271, 271
zinc site in, 267-269, 267f-269f
Carboxamides, 30, 30f 
2-Carboxy-3-keto-D-arabinitol 1,5- 
bisphosphate, 592f 
Carboxyaminoimidazole ribonucleotide, 
742, 742f
Carboxybiotin, 483, 483 
Carboxyglutamate, 57, 58, 310, 310 
biosynthesis of, 310-311 
in prothrombin, 310-311 
in vitamin K deficiency, 57, 310 
Carboxyhemoglobin, 213-214 
γ-Carboxylation, 298t
Carboxyl-terminal domain (CTD), 865,
868, 877, 877f
1-Carboxyphenylamino-1-deoxyribulose 
5-phosphate, 721, 722 
Carcinoembryonic antigen, 1001 
Carcinogens, tests for, 843-844 
Cardiac muscle, H and M isozymes in, 
297, 297f
Cardiolipin, 349, 349 
functions of, 762 
synthesis of, 762-763 
Cardiotonic steroids, 377, 377f
Cardiovascular disease, 744-745, 759,
776-779 
homocysteine in, 719 
Cargo receptors, 913-914 
Carnitine, 645, 645 
deficiency of, 646 
Carnitine acyltransferase I, 645, 666 
Carnitine acyltransferase II, 646 
Carotenoids, 582-583
Carriers, 371 
Cartilage, 332, 332f 
Carvone, 958, 958 
Caspases, 303, 559 
Cassette mutagenesis, 157, 157f 
Catabolism, 428 
definition of, 706b 


electron carriers in, 438-439, 438f, 439f 
energy from, 435-438 
free energy of, 434, 435-438, 435f, 437f 
stages of, 437-438, 437f 
Catabolite activator protein (CAP), 
927-928, 927f 
Catabolite repression, 927-928 
Catalase, 542 
Catalysis 
by approximation, 254 
in carbon dioxide hydration, 266-271 
Circe effect in, 235b 
covalent, 254, 255-260. See also Covalent 
catalysis 
in DNA cleavage, 272 
in double-displacement (ping-pong) 
reactions, 237 
efficiency of, 235, 235t 
electrophilic, 683 
enzyme activity in 
assays for, 67. See also Protein 
purification 
specific, 67 
enzyme-substrate complex formation in, 
226-227 
general acid-base, 254 
in metabolic regulation, 446 
metal ion, 221, 221t, 254-255 
in carbon dioxide hydrolysis, 267-269, 
267f-269f 
in DNA cleavage, 274, 277 
Michaelis constant (KM) for, 231-235
in peptide bond cleavage, 258-260, 259f 
rate of, 231-235. See also Enzyme 
kinetics; Reaction rates 
reaction equilibrium and, 224-225
reaction rate acceleration and, 206, 220t, 
225-229, 226f 
by RNA, 879-881 
in selective binding of transition state, 
225-226, 225f, 227-228, 243-244 
in sequential reactions, 236-237 
site-directed mutagenesis and,
262-263, 262f 
Catalytic antibodies, production of, 
243-244, 243f 
Catalytic groups, 227 
Catalytic RNA, 879-881, 900 
Catalytic strategies, 254-286 
of carbonic anhydrases, 253, 266-271 
covalent, in chymotrypsin, 255-260 
of myosins, 253-254 
overview of, 253-254 
of restriction endonucleases, 253, 
271-279 
of serine proteases, 253, 255-266 
Catalytic triads, 257-260 
in chymotrypsin, 257-260, 261f 
in elastase, 261, 261f 
site-directed mutagenesis and, 
262-263, 262f 
in subtilisin, 261, 261f 
in trypsin, 260-261, 261f 


Catalytically perfect enzymes, 234, 235, 460 
Cataracts, 472 
Cation exchange, 70 
CCA terminal region, 891 
CCA terminus, of tRNA, 890f, 891 
CD3, 995, 995 
CD4, 997 
CD8, 994-995, 995f 
cDNA (complementary DNA), 154-155, 
154f, 162f 
in quantitative polymerase chain reaction, 
162, 162f 
cDNA library, 154 
CDP, hydrolysis of, 431 
CDP-diacylglycerol, 761, 761 
CDP-ethanolamine, synthesis of, 763 
CDRs (complementarity-determining 
regions), 984, 984f, 985-986 
Cech, Thomas, 852, 879 
Celecoxib, 1044, 1044 
Celecoxib (Celebrex), 1044 
Cell cycle, 836, 836f 
Cell types, in eukaryotes, 938, 939t 
Cell-mediated immunity, 991 
Cell-to-cell ion channels, 372, 393-394, 
393f, 394f
Cellular energy. See Energy 
Cellular immune response, 980 
Cellular respiration, 498, 499f, 593f, 594. 
See also Respiratory chain 
definition of, 526b 
in photosynthesis, 593f, 594 
regulation of, 554-560 
Cellulase, 313 
Cellulose, 328-329 
Centrifugation 
band, 77-78, 77f
density-gradient equilibrium, 
119-120, 119f 
differential, 67-68, 68f 
gradient, 77-78, 77f
homogenate in, 67, 68f
sedimentation coefficients in, 76, 76t, 77f
supernatant in, 67, 68f
zonal, 77-78, 77f
Ceramide, 764 
sphingolipids from, 763-764 
synthesis of, 764, 764f 
Cerebroside, 350, 350 
synthesis of, 764 
Cetuximab, 421 
CF1-CF0 complex, 578-579. See also ATP
synthase (Complex V) 
cGMP 
structure of, 403 
in vision, 968f, 969 
cGMP phosphodiesterase, in vision, 968 
cGMP-gated Ca2+ channel, in vision, 968
Chain, Ernest, 1037 
Chain-terminated fragments, in DNA 
sequencing, 143 
Chair form, 324, 325f 
Changeux, Jean-Pierre, 202 


Channels 
ion. See Ion channels 
water, 394 
Charcot-Marie-Tooth disease, 1018
Chargaff, Erwin, 114 
Charge separation, photoinduced, 
569, 569f 
Charged tRNA, 891f, 893 
Chemical modification reaction, 256 
Chemical protons, 540f 
Chemical reactions. See Reaction(s) 
Chemical shifts, in NMR spectroscopy, 
101, 101f 
Chemiosmotic hypothesis, 544-545, 
544f, 545f 
Chemistry, combinatorial 
in drug development, 1041. See also Drug 
development 
in evolution studies, 188-189 
Chemoattractants, 1024 
Chemorepellants, 1024 
Chemotaxis, 1024-1025, 1025f 
Chemotherapy, cancer, resistance in, 
1050-1051 
Chemotrophs, 428 
Cheng-Prusoff equation, 1031
CheY, 1024 
ChIP (chromatin immunoprecipitation), 945 
Chirality, 27 
of amino acids, 28, 29f, 713 
Chitin, 333, 333f 
Chloramphenicol, 909t, 910 
Chlorobium thiosulfatophilum, 584, 584t 
Chlorophyll a, 568, 568-572, 568f, 569f 
in photosystem II, 573, 573f 
Chlorophyll b, 582, 582-583, 582f 
Chloroplasts, 567-568, 567f 
ATP synthesis in, 578-581, 578f 
evolution of, 568 
genome of, 568 
starch in, 597 
structure of, 567, 567f
Chlorpromazine, 1038 
discovery of, 1038 
mechanism of action of, 
1038, 1039f 
Cholecalciferol, 785, 785 
Cholecystokinin, in caloric homeostasis, 
794-795, 795f
Cholera, 421-422, 765 
Cholesterol, 350, 350 
“bad,” 744 
bile salts from, 779-780 
elevated levels of, 744-745, 759, 
776-779, 777f
“good,” 744-745, 776 
labeling of, 767f 
in lipid membrane, 363 
metabolic fates of, 775-776
metabolism of, 775-779
properties of, 767, 767b 
receptor-mediated endocytosis and, 385f 
steroid hormones from, 780-782 
transport of, 773-774, 774-775
reverse, 778-779 
Cholesterol synthesis, 767-779 
condensation mechanism in, 769, 769f 
hepatic, 770, 774f
isopentenyl pyrophosphate in, 768
mevalonate in, 767-768
rate of, 770-773
regulation of, 770-779
site of, 773, 774f
squalene in, 768-770 
stages of, 767 
sterol regulatory element binding protein 
in, 770-771, 771f
Choline, in phospholipids, 349 
Chondroitin 6-sulfate, 331, 332 
Chorismate, in amino acid synthesis, 
719-720, 720f
Chromatin, 937-941, 938-941 
definition of, 938-939 
structure of, 938f 
DNA packing in, 941 
in gene regulation, 944-951 
remodeling of, 944-951, 949-951, 950f 
structure of, 939 
transcription factors and, 946-947 
Chromatin immunoprecipitation 
(ChIP), 945 
Chromatin-remodeling engine, 950, 950f 
Chromatography 
affinity, 70-71, 70f, 75, 78 
gel-filtration, 69, 69f, 75 
high-pressure liquid, 71, 71f 
ion-exchange, 69-70, 69f, 75, 75t 
in amino acid identification, 80—81, 80f 
Chromogenic substrate, 256 
Chromophores, 967 
Chromosomes 
bacterial artificial, 151 
yeast, 937, 938f 
yeast artificial, 151, 151f 
Chronic myelogenous leukemia, treatment 
of, 421 
Chylomicron remnants, 773 
Chylomicrons, 642, 642f, 773, 773t 
Chymotrypsin, 186, 255-260 
active sites of, 185-186, 186f, 257-258 
catalytic triad in, 257-260 
covalent catalysis in, 255-260 
acyl-enzyme intermediate in, 257, 259f 
tetrahedral intermediate in, 
258-259, 259f 
as two-stage process, 256-257, 257f, 259f 
homologs of, 260-261, 261f 
inhibition of, 53f, 241-242, 241f 
in peptide bond cleavage, 259f 
serine residue of, 255-256, 256f 
specificity of, 256f, 260, 260f 
structure of, 256, 257-258, 258, 260, 261f 
substrate preferences of, 234-235, 235t 
trypsin and, 260-261, 261f 
Chymotrypsin inhibitor, structure of, 53f 
Chymotrypsinogen, 257-258, 303-305 
Cilia
cochlear, 971-973, 971f, 972f 
microtubules in, 1019 
Ciprofloxacin, 831, 831 
Circadian rhythms, 974 
Circe effect, 235, 235b 
Circular DNA, 117, 117f
Cirrhosis, 811-812
Cis configuration, of peptide bonds, 36-37,
36f, 37f
Citrate 
in fatty acid metabolism, 662, 662f, 
666, 667f
isomerization of, in citric acid cycle, 
506, 506f 
Citrate synthase, in citric acid cycle, 
504, 505f
Citric acid cycle, 497-520 
acetyl coenzyme A in, 497, 500-501, 501, 
503-505, 514-515, 653, 656 
aconitase in, 506, 506 
ATP in, 498, 512, 514-515, 514f, 
554-555, 555t 
in cancer, 515 
in cellular respiration, 499f 
citrate isomerization in, 506, 506f 
citrate synthase in, 503-505, 505f 
citryl coenzyme A in, 505 
definition of, 497 
electron-transport chain in, 498, 512 
enol intermediate in, 504 
enzyme complexes in, 512 
evolution of, 518 
function of, 498 
glycolysis and, 499-503, 499f 
intermediates in, from amino acid 
degradation, 690-696 
isocitrate dehydrogenase in, 506, 514, 514f 
isocitrate in, 506-507, 507f
α-ketoglutarate dehydrogenase complex
in, 499, 507, 514-515, 514f
α-ketoglutarate in, 506-507, 507f, 692
ketone bodies in, 810, 810f 
malate in, 510 
in mitochondria, 498, 498f 
net reaction of, 510 
nucleoside diphosphokinase in, 508 
overview of, 497-499 
oxaloacetate in, 498, 498f, 504, 505f, 
509-510, 515, 653 
oxalosuccinate in, 506, 507f 
pyruvate carboxylase in, 516 
pyruvate dehydrogenase complex in, 
499-503, 499f, 499t, 502f, 503f
pyruvate dehydrogenase in, 500 
rate of, 556f 
reaction rate in, 556 
reactions of, 510-512, 511f, 511t 
regulation of, 512-516, 555-556, 556f 
as source of biosynthetic precursors, 
516-518 
stoichiometry of, 510-512, 511f, 511t 
substrate channeling in, 512 


succinate dehydrogenase in, 509-510 
succinyl coenzyme A in, 507-508, 
507f, 514 
succinyl coenzyme A synthetase in, 
507-508 
Citrulline, 686, 686, 689 
Citrullinemia, 698t 
Citryl coenzyme A, in citric acid cycle, 505 
Clamp loaders, 832 
Class I MHC proteins, 991-996, 993f, 
998-999. See also Major 
histocompatibility complex proteins 
Class II MHC proteins, 996-999, 996f
Class switching, 990-991, 991f
Clathrin, 775, 776f
Cleavage
DNA. See DNA, cleavage of
protein, 82-83, 82f
Cleland notation, 236, 237
Cleland, W. Wallace, 236
Clinical trials, 1048-1049, 1049f
Clones, DNA, 154-155
Cloning, 87-88
expression, 154-155, 155f 
plasmid vectors in, 149-151, 149f, 150f 
Cloning vectors, 148, 149 
Clotting, 307-313 
extrinsic pathway of, 308, 308f 
impaired, 310-313, 311-313 
in hemophilia, 311 
intrinsic pathway of, 308, 308f 
regulation of, 311-312 
zymogen activation in, 307-312 
Clotting factors, 312 
CMP (cytidine monophosphate), 762 
Coactivators, 947-948, 948f, 949-951 
Coagulation. See Clotting 
Coat proteins, 914 
Coated pits, 775, 776f 
Cobalamin (vitamin B12), 441t, 649
in amino acid synthesis, 650-652, 650f, 
651f, 717 
as coenzyme, 650-652, 650f, 651f, 717 
in fatty acid metabolism, 649-652 
structure of, 650, 650-651 
Cobratoxin, 390 
Cochlear hair cells, 971-973, 971f, 972f
Coding strand, 854 
Codons, 19, 109-110, 128. See also 
Anticodons 
definition of, 128 
in genetic code, 129-130, 129t 
initiation, 900-901 
in translation, 889, 891-893, 892f 
Coenzyme(s), 221, 221t 
vitamin, 441t 
Coenzyme A, 440, 440, 441t 
as acyl group carrier, 440 
ADP units in, 447, 447f
in fatty acid metabolism, 644 
Coenzyme B12, 650-652, 650f, 651f
Coenzyme Q (ubiquinone), 532-533, 533f 
in fatty acid metabolism, 646 
Cofactors, enzyme. See Coenzyme(s)
Cognate DNA, 272 
cleavage of, 275-277 
Cohen, Stanley, 148 
Cohesive-end method, 148-149, 149f 
Coiled-coil proteins, 43, 44f 
Collagen, 303 
amino acid sequences of, 44, 44f 
ascorbate and, 813 
in cartilage, 332 
helix of, 44-45, 44f
Collagenase, 303 
Color blindness, 970-971 
Color vision, 966, 969-971. See also Vision 
in animals, 970, 970f 
defective, 970-971 
evolution of, 970, 970f 
Colorectal cancer, 842-843 
Combinatorial association, antibody 
diversity and, 988-989 
Combinatorial chemistry, in evolution 
studies, 188-189 
in drug development, 1041. See also Drug 
development 
Combinatorial control, 943 
Committed step, 474, 723 
Compactin, 1040, 1040 
Comparative genomics, 160-161, 160f 
Compartments, drug target, 1033, 1033f 
Competitive inhibition, 238-240, 238f,
239f, 240f 
Complement cascade, 981 
Complementarity-determining regions 
(CDRs), 984, 984f, 985-986 
Complementary DNA (cDNA), 
154-155, 154f 
in quantitative polymerase chain reaction, 
162, 162f 
Complementary single-stranded ends, 148 
Computer databases, for amino acids, 
65-66, 181, 182f 
Concentration gradient, 373-374 
in ATP synthesis, 437-438, 437f 
Concerted mechanism, in substrate binding, 
294-296 
Concerted (MWC) model 
of allosteric enzyme kinetics, 296, 296f 
of hemoglobin oxygen binding, 202-203, 
203f, 214-216, 216f 
Cones, 966, 969-970, 969f 
Congenital erythropoietic porphyria, 
730-731 
Congestive heart failure, digitalis for, 377 
Conjugation, in drug metabolism, 1034-1035 
Connexin, 394 
Connexon, 394, 394f 
Consensus sequences, 126b, 126f, 299 
in phosphorylation, 299 
in promoters, 857, 857f 
in splicing, 132f, 133, 873-874, 873f 
Conservative substitutions, 178 
Constant (C) genes, 987 
in class switching, 991 


Constant regions, 984, 984f 
Constitutional isomers, 321, 321f 
Continuous genes, 131, 132 
Controlled termination of replication, 
143-144, 143f 
Convergent evolution, 185-186 
Cooley anemia, 210-211 
Cooperative binding, 200-201, 201f, 
203f, 294 
Cooperative transition, in protein folding, 52 
Cooperativity, 289 
Coproporphyrinogen III, 729, 729 
Cordycepin, 872 
Core enzyme, 856 
Core promoter, 857 
Corepressors, 927 
Corey, Robert, 38 
Cori, Carl, 454 
Cori cycle, 490, 490f, 685 
Cori disease, 634t 
Cori, Gerty, 454, 634 
Coronary artery disease. See Atherosclerosis 
Corrin ring, 650-651, 650f 
Corticosteroids. See also Steroid hormones 
synthesis of, 783 
Corticosterone, 784 
Cortisol, synthesis of, 780, 780f, 783, 784 
Corynebacterium diphtheriae, 910 
Cotransporters, 380, 380f 
Coulomb’s law, 7-8
Covalent bonds, 7. See also Bonds 
cleavage of, 13 
Covalent catalysis, 254, 255-260 
acyl-enzyme intermediate in, 257, 259f 
chymotrypsin and, 255-260 
tetrahedral intermediate in, 258-259, 259f 
as two-stage process, 256-257, 257f, 259f 
Covalent modification, of proteins, 57-58,
57f, 297-302, 298t
mechanisms of, 298t. See also 
Phosphorylation 
COX2 inhibitors, 1044 
development of, 1043-1045 
CpG islands, 946 
CpG, methylation of, 945-946 
Crassulacean acid metabolism, 600-601, 600f 
Cre recombinase, 845-846 
Creatine kinase, 236-237 
in sequential reactions, 236-237, 236f 
Creatine phosphate, 434 
during exercise, 434-435, 435f, 805, 805t 
phosphoryl-transfer potential of, 
434, 434t 
Creutzfeldt-Jakob disease, 56-57, 56f
Crick, Francis, 5, 114, 127, 128, 891 
Cristae, 526, 527f 
Critical concentration, 1014 
Crixivan (indinavir), development of,
1043, 1043f 
Cro, λ repressor and, 929
Cross-links, 838, 839f 
Crotonyl ACP, 658t, 659, 659 
Crown gall, 166-167, 166f 


CRP (cAMP receptor protein), 
927-928, 927f
Crystallography, X-ray, 98-100, 98f-100f 
of enzyme-substrate complexes, 
227, 227 
c-Src, in cancer, 420 
CTD (carboxyl-terminal domain), 865, 868, 
877, 877f
CTP. See Cytidine triphosphate (CTP) 
Cumulative feedback inhibition, 725 
Cupric ion, 325 
Cyanide, 558 
Cyanobacteria, 568, 568f 
Cyanogen bromide, in protein cleavage, 
82, 82f 
Cyclic adenosine monophosphate. 
See cAMP 
Cyclic hemiacetals, 322 
Cyclic photophosphorylation, 
580-581, 580f 
Cyclin, 836 
ubiquitination of, 298, 298t 
Cyclin B, degradation of, 673f 
Cyclin destruction boxes, 676 
Cyclin-dependent protein kinases, 836 
Cycloheximide, 909t, 910 
in taste, 963-964 
Cyclooxygenase 2, 1044 
Cyclooxygenase inhibitors, development of, 
1044-1045 
Cyclophosphamide-glutathione conjugate, 
1034, 1034 
Cyclosporin, 990, 990 
Cystathionine, 718-719, 719 
Cysteine 
molecular models of, 23f 
pyruvate formation from, 691, 691f 
structure of, 30, 30f 
synthesis of, 714-715, 718-719 
Cysteine proteases, in peptide bond 
cleavage, 263, 263-264, 264f 
Cystine, 35, 35f 
Cytidine, 111 
synthesis of, 739-740 
Cytidine diphosphodiacylglycerol (CDP- 
diacylglycerol), 761, 761 
Cytidine monophosphate (CMP), 762 
Cytidine triphosphate (CTP), 761 
ATCase inhibition by, 291, 291f, 
295-296, 296f 
hydrolysis of, 431 
in pyrimidine synthesis, 291, 291f,
295-296, 296f 
structure of, 290, 291 
synthesis of, 739-740 
Cytidine triphosphate synthetase, 739 
Cytochrome 
definition of, 535 
in photosynthetic reaction center, 
572, 572f
Cytochrome b5, 664, 664f
Cytochrome bf complex, 575, 575f 
location of, 583, 583f 
Cytochrome c, 533, 535
in apoptosis, 559 
evolution of, 543, 543f 
in oxidative phosphorylation, 531, 
532f, 532t 
Cytochrome c oxidase (Complex IV), 531, 
532f, 532t 
in oxidative phosphorylation, 531 
in respiratory chain, 537-540, 
538f-540f 
Cytochrome c oxidoreductase, 535-536 
Cytochrome P450, 782-783, 782f 
in drug metabolism, 1034 
Cytochrome P450 monooxygenases, 781 
Cytochrome reductase, 531, 532f, 532t, 
535-536, 536f 
Cytoglobin, 212 
Cytokines, 998, 998f 
Cytoplasm 
fatty acid synthesis in, 657 
glycolysis in, 498 
Cytosine, 4, 4, 111, 111
deamination of, 838, 841 
Cytoskeleton, 1007, 1008f 
Cytotoxic T cells, 980, 994-996, 997-998, 
997f. See also T cell(s)


D amino acids, 27, 27f 
D genes, 988 
in antibody switching, 991 
D stereoisomers 
monosaccharide, 320 
of monosaccharides, 321f, 322f 
DAG. See Diacylglycerol (DAG) 
Dalton, 34 
Dark reactions, of photosynthesis, 566, 589, 
590-601. See also Calvin cycle; 
Photosynthesis 
Darst, Seth, 853 
Databases, of amino acid sequences, 65-66, 
181, 182f 
Dawkins, Richard, 53 
DCC (dicyclohexylcarbodiimide), 97, 97 
Deamination, 838, 838f, 841 
in amino acid degradation, 680-684 
Decarboxylation, 484 
in citric acid cycle, 500, 507 
in fatty acid synthesis, 659 
in gluconeogenesis, 484 
in pentose phosphate pathway, 601 
Decorated actin, 1014 
Degenerative arthritis, 332 
Degron, 676 
Dehydration, in amino acid 
degradation, 684 
Dehydroascorbic acid, 813, 813 
7-Dehydrocholesterol, 785, 785 
Dehydrogenases, NAD* binding sites in, 
469, 469f 
3-Dehydroquinate, in amino acid synthesis, 
719, 720 
3-Dehydroshikimate, 720 
Deletions, production of, 156 
Denatured proteins, 49-50, 49f, 50f
Density-gradient equilibrium 
sedimentation, 119-120, 119f 
Deoxyadenosine, 111, 872 
5'-Deoxyadenosyl radical, 651, 651f
5’-Deoxyadenosylcobalamin, 650 
Deoxyadenylate, 111 
3-Deoxyarabinoheptulosanate 7-phosphate,
in amino acid synthesis, 719, 720 
Deoxycycline, 240 
Deoxycytidine, 111 
Deoxycytidylate, 111 
Deoxyguanosine, 111 
Deoxyguanosine 3'-monophosphate
(3’-dGMP), 112, 112 
Deoxyguanylate, 111 
Deoxymyoglobin, 196, 197. See also 
Myoglobin 
Deoxyribonucleic acid. See DNA 
Deoxyribonucleoside 3'-phosphoramidites, 
in DNA synthesis, 144, 144, 144f 
Deoxyribonucleoside triphosphate, in 
replication, 822, 823f 
Deoxyribonucleotide synthesis, 745-750 
deoxyuridylate in, 748 
dihydrofolate reductase in, 749 
regulation of, 752, 752f 
ribonucleotide reductase in, 745, 
745-750, 745f, 746, 752, 752f 
thymidylate in, 748, 749f 
Deoxyribose, 110, 110, 322f 
Dephosphorylation, 300 
(-)Deprenyl, 242, 242
Dermatan sulfate, 331, 332 
1-Desamino-8-D-arginine vasopressin, 96, 96 
Desaturase, 664, 664f
Designer genes, 157 
Desmolase, 783 
α-Dextrinase, 454
DHAP. See Dihydroxyacetone phosphate 
(DHAP) 
DHU loop, 891 
of tRNA, 890f 
Diabetes insipidus, 96 
Diabetes mellitus, 325-326, 798-803 
as autoimmune disease, 1001 
derivation of name, 798b 
glucose homeostasis in, 798-800 
incidence of, 798 
insulin in, 798-800 
insulin resistance and, 799-802 
ketosis in, 633f, 653, 655, 803 
obesity and, 798-803 
pancreatic failure in, 801-802 
type 1, 798, 802-803 
type 2, 798-802 
diabetes mellitus, type 2 
pathogenesis of, 800-802 
treatment of, 802 
Diabetic ketoacidosis, 803 
Diacylglycerol (DAG), 408, 642 
in phospholipid synthesis, 761, 761f, 
762, 762f


in signal transduction, 408-409, 409f 
synthesis of, 761 
Diacylglycerol 3-phosphate, 348, 348 
in membrane lipid synthesis, 760 
Diacylglycerol kinase, 760 
Dialysis, in protein purification, 69, 69f 
Dianabol, 948 
Diastereoisomers, 321, 321f 
Diazotrophic microorganisms, 707 
5,5'-Dibromo-4,4'-dichloro-indigo, 924
Dicarboxylate carrier, 554, 554f 
Dicoumarol, 310, 310 
Dictyostelium discoideum, myosin-ATP 
complex in, 280, 280f 
Dicyclohexylcarbodiimide (DCC), 97, 97 
Dideoxy method, 143-144, 143f 
2,4-Dienoyl CoA reductase, 649, 649 
Diet, 20, 20f. See also specific nutrients 
high-protein low-carbohydrate, 798 
ketogenic, 656 
low-phenylalanine, 674 
spicy food in, capsaicin in, 
973-974, 973f 
starved-fed cycle and, 807-808. See also
Starvation 
Dietary fiber, 329 
Dieting, 797-798 
Differential centrifugation, 67-68, 68f 
Diffusion 
active, 371, 373 
facilitated, 371, 373 
lipid 
lateral, 361-364, 362f 
transverse, 362 
simple, 372 
Digestion, 437, 674, 674f. See also Amino 
acid degradation 
chymotrypsin in, 255-260. See also 
Chymotrypsin 
enzymes in, 255-260, 303-307, 303t, 
304f, 674, 674f 
starved-fed cycle and, 807-808 
Digitalis, Na+-K+ pump inhibition
by, 377
Digitoxigenin, 377, 377 
Diglyceride acyltransferase, 761 
Dihydrobiopterin, 695, 695 
Dihydroceramide, 764 
1,25-Dihydroxycholecalciferol, 785, 785
Dihydrofolate, in deoxyribonucleotide 
synthesis, 748, 749 
Dihydrofolate reductase, 695, 695f 
in deoxyribonucleotide synthesis, 749 
Dihydrolipoyl dehydrogenase, in citric acid 
cycle, 499t, 501, 502f 
Dihydrolipoyl transacetylase, in citric acid 
cycle, 499t, 501 
Dihydroorotate, 738, 738 
Dihydropteridine reductase, 695, 695f 
Dihydrosphingosine, 764 
in sphingolipid synthesis, 737 
Dihydrotestosterone, 785 
Dihydrouridine (UH2), 890 


Dihydroxyacetone, 320, 320, 321f 
structure of, 321f 
in transaldolase reaction, 605-606, 606f 
Dihydroxyacetone phosphate (DHAP), 326, 
326f, 458, 479 
in Calvin cycle, 594f, 595, 595 
in fructose metabolism, 469-470, 469f 
in gluconeogenesis, 480f 
in glycolysis, 456f, 458-459, 458f, 460f, 
469-470, 480f 
isomerization to glyceraldehyde
3-phosphate, 224, 224,
458-459, 460f
Dihydroxycholesterol, 783, 783 
Diisopropylphosphofluoridate (DIPF),
enzyme inhibition by, 241,
241f, 241
2,3-Dimercaptopropanol (BAL), 517, 518f
Dimerization arm, 416 
Dimers, 48 
Dimethylallyl pyrophosphate, 742f, 768,
768, 768-769
Dimethylbenzimidazole, 651 
N,N-Dimethylpropargylamine, 242, 242
2,4-Dinitrophenol (DNP), 558, 558 
Dioxygenases, 696 
Dipalmitoyl phosphatidylcholine deficiency,
765-766
DIPF (diisopropylphosphofluoridate),
enzyme inhibition by, 241, 241, 241f
Diphosphatidylglycerol, 349, 349
functions of, 762 
synthesis of, 762 
Diphthamide, 910-911 
Diphtheria, 910-911 
Dipolar ions, 27-28, 27f
Direct repair, 840 
Disaccharides, 327-329. See also 
Carbohydrates 
abbreviations for, 330b 
structure of, 327, 327f 
Discontinuous genes, 131 
Discontinuous (split) genes, 131-132, 
132f, 133f 
evolutionary advantages of, 133 
Diseases and disorders 
albinism, 698t 
alcaptonuria, 697 
alcohol-related, 811-813 
Alzheimer disease, 55-56, 57 
amino acid sequences and, 35-36 
amyloidoses, 55-57 
amyotrophic lateral sclerosis, 147-148 
Andersen disease, 634t 
anemia, 330-331 
anthrax, 831 
anticipation in, 842 
argininosuccinase deficiency, 689, 689f 
arsenite poisoning, 517, 518f 
atherosclerosis. See Atherosclerosis 
autoimmune, 1001 
beriberi, 517 
bleeding disorders, 310-313, 311-313
bovine spongiform encephalopathy, 56
cancer. See Cancer
carbamoyl phosphate synthetase
deficiency, 689, 689f
carbon monoxide poisoning, 213
cardiovascular disease, 719, 759, 774-775,
776-779. See also Atherosclerosis
carnitine deficiency, 646
cataracts, 472
Charcot-Marie-Tooth disease, 1018
cholera, 421-422, 765
citrullinemia, 698t
congenital disorders of glycosylation, 336
congestive heart failure, 377
Cooley anemia, 210-211
coronary artery disease, 759, 774-775,
776-779, 777f. See also
Atherosclerosis
Creutzfeldt-Jakob disease, 56
diabetes insipidus, 96
diabetes mellitus. See Diabetes mellitus
diagnosis of, polymerase chain reaction
in, 146-148
diphtheria, 910-911
drug-resistant, 1050-1051
emphysema, 307
environmental factors in, 19-20
epilepsy, 656
familial hypercholesterolemia, 759,
776-779
galactosemia, 472
gargoylism, 332, 332f
gene therapy for, 167-168
genetic variations and, 19-20
glucose 6-phosphate dehydrogenase
deficiency, 609-610, 611
glycogen storage diseases, 634-635, 634t
gout, 753-754
heart disease, 719
hemolytic anemia, 609-610, 611
hemophilia, 311
Hers disease, 634t
HIV infection. See Human
immunodeficiency virus infection
homocystinuria, 698t
Huntington disease, 55-56, 842
Hurler disease, 332, 332f
hyperlysinemia, 698t
I-cell disease, 336
inborn errors of metabolism, 697-698, 698t
infant respiratory distress syndrome,
765-766
lactose intolerance, 471
Leber hereditary optic neuropathy, 559
Lesch-Nyhan syndrome, 754
Li-Fraumeni syndrome, 843
long QT syndrome, 392-393
mad cow disease, 56
malaria, 610
maple syrup urine disease, 697
mercury poisoning, 517
misfolded proteins in, 56-57
mitochondrial, 558-559
mucopolysaccharidoses, 332
multidrug resistance in, 378-379
multiple myeloma, 87-88
mushroom poisoning, 865, 865-866
mutations causing, 147-148, 157
neurological, protein misfolding in,
56-57, 56f
ornithine transcarbamoylase deficiency, 689
osteoarthritis, 332 
osteogenesis imperfecta, 45 
osteomalacia, 786 
Parkinson disease, 55-56, 57, 242, 676
phenylketonuria, 674, 697-698 
phosphatase deficiency, 514 
porphyrias, 730-731 
predisposition to, 19-20 
prion, 56-57, 56f 
protein aggregates in, 56-57 
recombinant DNA technology and, 
146-148 
retinitis pigmentosa, 877-878 
ricin poisoning, 911 
rickets, 786 
scurvy, 57, 812-813 
seizures, 656 
severe combined immunodeficiency, 
167-168 
sickle-cell anemia, 209-210, 209f, 210f 
smallpox, 1003 
spina bifida, 755 
splicing defects and, 877-878, 879t 
steatorrhea, 642 
Tay-Sachs disease, 765-766 
thalassemia, 210-211 
transmissible spongiform 
encephalopathies, 55-56 
tuberculosis, 679 
tyrosinemia, 698t 
urea cycle defects, 668-669 
vanishing white matter, 908-909, 909f 
vitamin D deficiency, 786 
Wernicke-Korsakoff syndrome, 812
whooping cough, 422 
Zellweger syndrome, 652
Dismutation, 541b, 542f 
D isomers, 321f, 322f 
monosaccharide, 320 
Displacement loop, 845, 845f 
Dissociation constant (Kd), 1030-1031
for ligand binding, 1030-1031, 1030f 
Distal histidine, 198, 198 
Distributive enzymes, 832b 
Disulfide bonds, 35, 35f 
cleavage of, 49 
Diuron, 584, 584 
Divergent evolution, 185 
Diversity (D) genes, 988 
in class switching, 991 
D-loop, 845, 845f 
DNA, 110 
A-form, 115-116, 116, 116t 
ancient, amplification and sequencing of, 
147, 188-189, 189f 
annealing of, 120-121
antisense strand of, 854 
backbone of, 110, 110f, 111 
bases in, 4, 4, 4f, 5, 5-6, 110-111, 111. 
See also Bases/base pairs 
B-form, 115-116, 116, 116t 
in chromatin, 938f, 939-941 
circular, 117, 117f 
cleavage of 
in cognate vs. noncognate DNA, 
275-277, 276f 
in-line displacement of, 272-274 
magnesium in, 274, 275f, 277 
mechanisms of, 272-274
methylation in, 277-278 
phosphodiester bridge hydrolysis in, 
272-274, 272f
phosphorothioates in, 273-274, 274f 
restriction enzymes in, 141-142, 271-279
restriction-modification systems in, 
272, 272f
sites of, 141, 141f, 271-272, 272, 
275-277. See also Recognition sites 
stereochemistry of, 273-274, 274f 
cloned, 154-155 
coding strand of, 845 
cognate, 272 
cleavage of, 275-277 
complementarity with mRNA, 126, 126f 
complementary, 154-155, 154f, 162f 
condensed, 828 
cross-links in, 838, 839f 
damage to 
causes of, 837-839 
repair of, 839-843. See also DNA repair 
denaturation of, in acid-base reactions, 
14-15, 14f
directionality of, 112 
double helix of, 5-15, 6f, 26f, 109,
113-115. See also Double helix 
evolution of, molecular studies of, 
189-191, 190f 
functions of, 2, 18, 19 
hybridization of, 121 
hypersensitive sites in, 944 
junk, 19 
lagging strand of, 823f, 824 
leading strand of, 823f, 824 
length of, 113 
linker, 148-149, 149f, 939 
linking number of, supercoiling and, 
826, 827f
methylation of, 718, 718f, 945-946 
in amino acid synthesis, 716-718, 718f 
in cleavage, 277-278, 278f 
minor groove in, 822, 822f 
mitochondrial, 527-528 
genetic code of, 131, 131t 
sequencing of, 158 
noncoding, 159-160 
in nucleosome, 939-941 
operator, 922f 
overview of, 4-6, 819-820
packing of, 941 
palindromic, 141 
phosphodiester bridges of, 110, 
110, 111 
hydrolysis of, 272-274
polarity of, 112 
primer strand of, 122, 145, 821 
promoter sites in, 126-127, 126f, 852 
in bacteria, 852, 856-858, 857f
properties of, 4-6 
in protein encoding, 18-19 
recombinant. See Recombinant DNA 
relaxed, 117, 117f, 826, 828f 
renaturation of, 120-121 
rewinding of, in transcription, 859, 859f 
sense strand of, 845, 854 
size of, 113 
stem-loop motif in, 118f 
sticky ends of, 148, 149f 
structure of, 4-6, 4f-6f, 110,
110-113, 112 
sugar-phosphate units of, 4, 4f 
sugars in, 110-111, 110f 
supercoiled, 117, 117f. See also 
Supercoiled DNA 
synthesis of, 5-6, 6f, 113, 119-120,
121-122, 121f, 122f. See also 
Replication 
recombinant methods of, 139-157. 
See also Recombinant DNA 
technology 
telomeric, 836-837, 836f, 837f 
template strand of, 121-122, 122f 
in replication, 121-122, 122f, 820-821 
in transcription, 124, 125f, 126, 126t, 
854, 855f 
topoisomerases and, 828, 828-831, 
829f, 830f 
topoisomers of, 826, 828f 
unwinding of, 826-831, 858, 858f 
in transcription, 858, 858f 
Watson-Crick model of, 5, 113-115,
114f, 819-820. See also Double 
helix 
X-ray diffraction patterns of, 110, 113f 
Z-form, 116, 116f, 116t 
DNA amplification, polymerase chain 
reaction in, 145-146, 145-147 
DNA blots, 142, 142f 
DNA fingerprint, 141 
DNA fragments 
amplification of, 145-146 
joining of, 148-149, 149f 
production of, 141-142 
separation and visualization of, 141-142 
DNA gyrase, 831 
DNA ligase, 148-149, 823f, 824, 841 
DNA linker, 148-149, 149f 
DNA microarrays, 162-163, 162f 
DNA mismatch repair, 840, 840f. See also 
DNA repair 
DNA photolyase, 840 
DNA polymerase(s), 121-122, 121f, 821-823
bacterial, 835t
classification of, 821
deoxyribonucleoside triphosphate binding
by, 822, 823f
error-prone, 835t, 837-838
eukaryotic, 835-836, 835t
Klenow fragment of, 821, 821f
in leading/lagging strand synthesis,
832-833, 832f, 833f
metal ions of, 821-822, 821f
primer for, 821, 821b, 823, 823f
in prokaryotes, 831-834
in proofreading, 839-840, 839f
reaction mechanism of, 821-822, 821f
specificity of, 221, 822
structure of, 821, 821
types of, 835t
DNA polymerase α, 835, 835t
DNA polymerase β, 835, 835t
DNA polymerase δ, 835, 835t
DNA polymerase I, 834
DNA polymerase II, 835t
DNA polymerase III, 831, 831-832, 832-833
sliding clamp unit of, 831-832, 831f, 834
DNA polymerase III holoenzyme, 832-833,
833f, 834
DNA polymerase switching, 835
DNA probes, 80, 142, 144-145
generation of, 144-145, 151-153
solid-phase approach in, 144-145, 144f
for genomic library, 151-153, 152f
DNA recombination, 844-846
definition of, 844
functions of, 844
Holliday junctions in, 845-846, 845f
initiation of, 844-845
mechanisms of, 844-846, 845f
RecA in, 845
recombinases in, 845-846
DNA recombination synapse, 846
DNA repair, 122, 820, 837-844
base-excision, 840
defective, in cancer, 842-843
direct, 840
double-strand, 841
enzyme complexes in, 840, 840f
glycosylases in, 840, 841
ligase in, 824, 841
mismatch, 840, 840f
nonhomologous end joining in, 841
nucleotide-excision, 840-841, 841f
proofreading in, 839-840
single-strand, 839-841
tumor-suppressor genes in, 842
uracil DNA glycosylase in, 841, 841f
DNA replication, 5-6, 6f, 119-121, 819,
820-837
in bacteria, 831-834
base complementarity in, 822, 822f, 823f
cell cycle and, 836, 836f
clamp loaders in, 832
controlled termination, 143-144, 143f
coordinated processes in, 831-837 
cross-linkage in, 838, 839f 
definition of, 113 
directionality of, 112, 122, 823, 832 
DNA polymerase III holoenzyme in,
832-833, 833f, 834 
DNA polymerases in, 121-122, 121f, 
821-823, 835-836, 835t. See also 
DNA polymerase(s) 
DNA probes in, 145 
DnaA in, 834, 834f 
in E. coli, 831-834 
errors in, 837-838 
repair of, 837-844. See also DNA repair 
in eukaryotes, 835-836 
helicases in, 120, 824, 824-825, 824f, 
825f, 832 
initiation of, 823, 834-836 
in bacteria, 834, 834f 
in eukaryotes, 835-836 
sites of, 834-836 
lagging strand in, 823f, 824 
synthesis of, 823f, 824, 832-833, 833f 
leading strand in, 823f, 824 
synthesis of, 823f, 824, 832-833, 833f 
licensing factors in, 835 
ligase in, 823f, 824 
Okazaki fragments in, 823-824, 
823f, 833 
origin of, 834, 834f 
origin of replication complexes in, 835 
prepriming complex in, 834, 834f 
primer in, 122, 145, 821, 821b, 823, 823f 
processivity in, 831-832, 831f 
proofreading in, 839-840, 839f 
rate of, 831-832 
recombinant DNA technology in, 145. See 
also Recombinant DNA technology 
replication fork in, 823, 823f, 
832-833, 833f 
RNA polymerase in, 823, 823f 
semiconservative, 118-120, 120f 
sites of 
in bacteria, 834, 834f 
in eukaryotes, 835-836 
sliding DNA clamp in, 831-832, 831f 
specificity of, 822, 822f, 823f 
strand joining in, 823f, 824 
strand separation in, 120, 824-825, 825f 
telomeres in, 836-837, 836f, 837f 
template in, 121-122, 122f, 820-821 
trombone model of, 833, 833f 


DNA sequencing, 143-144, 143f
in amino acid sequencing, 84
chain-termination method in, 143, 143f
fluorescence detection in, 143-144, 143f
in forensics, 147, 147f
for Neanderthals, 188-189, 189f
Sanger dideoxy method in, 143-144, 143f
DNA transfer. See also Recombinant DNA
technology
by electroporation, 167, 167f
gene guns for, 167
by microinjection, 163, 163f
vectors for, 149-151, 149f, 150f, 154-155 
DNA vectors, 149-151, 149f-151f 
DnaA, assembly of, 834, 834f 
DNA-binding domains, 941-942, 943 
in eukaryotes, 943 
in prokaryotes, 942. See also DNA- 
binding proteins 
DNA-binding proteins. See also 
Transcription factors 
basic-leucine zipper in, 942 
homeodomains in, 942 
match with regulatory site, 922-923 
in prokaryotes, 922-928, 922f, 
923-928, 923f 
transcription inhibition by, 924-927 
DNA-binding sites 
chromatin and, 941-942, 943
evolution of, 927 
hypersensitive, 944 
Dolichol phosphate, 334, 334 
Domains 
DNA-binding, 941-942, 943 
homeodomains, 943 
immunoglobulin, 981-982 
of living organisms, 3, 3f 
protein, 47, 47f 
exon encoding of, 133, 133f 
Dopamine, 1038 
Double helix, 5, 5f, 26f, 109, 113-121 
A, 115-116, 116t 
B, 115-116, 116t 
base pairing in, 5, 5f, 10-11, 10f, 
113-115, 114, 114t. See also 
Bases/base pairs 
discovery of, 5, 113-114 
disruption of, in acid-base reactions, 
14-15, 14f 
formation of, 6, 6f 
electrostatic interactions in, 10 
heat released in, 12-13, 13f 
hydrogen bonds in, 10 
hydrophobic interactions in, 10 
van der Waals forces in, 10 
hydrogen bonds in, 5, 5f, 10, 114, 118f 
left-handed, in Z-DNA, 116, 116t 
melting of, 120-121, 120f, 824-825, 825f 
in replication, 118-121 
unwinding of, 826, 828-829 
supercoiling and, 826 
topoisomerases in, 828-831, 829f, 830f 
in transcription, 858, 858f 
Z, 116, 116t 
Double-displacement reactions, 237 
Double-reciprocal plots, 233, 233f, 240, 240f 
Dreyer, William, 987 
Drosophila melanogaster 
alternative splicing in, 879 
sensory bristles in, 972 
Toll receptor in, 978-979 
Drug(s). See also specific drugs 
absorption of, 1031-1033, 1032f 
ADME properties of, 1031-1036, 1032f 


agonist, 948 
antagonist, 948 
concentration of, 1031 
distribution of, 1033, 1033f 
excretion of, 1035-1036 
immune-modulating, 990 
metabolism of, 1034-1036 
cytochrome P450 in, 783 
receptors for, 948, 949f 
resistance to, 1050-1051 
response to, genetic variations in, 
1047-1048 
routes of administration for, 1032 
side effects of, 1031, 1036 
genetic variations in, 1047-1048 
therapeutic index of, 1036 
Drug development, 1001-1052 
animal testing in, 1037 
candidate drugs in 
absorption of, 1031-1033, 1032f 
ADME properties of, 1031-1036, 1032f 
distribution of, 1003f, 1033 
effective concentrations of, 1031, 1031f 
essential characteristics of, 1030-1037 
ligand binding and, 1030-1031, 1030f 
metabolism and excretion of, 1034-1036 
number of, 1041-1042 
oral bioavailability of, 1032-1033, 1032f 
potency of, 1030-1031 
routes of administration for, 1032 
side effects of, 1031, 1036-1037 
target compartments of, 1003f, 1033 
therapeutic index of, 1036 
clinical trials in, 1048-1049, 1049f 
combinatorial chemistry in, 1041 
drug resistance and, 1050-1051 
dual pathways for, 1029-1030, 1030f 
genetic variations and, 1047-1048 
genomics in, 1045-1048 
high-throughput screening in, 1041 
phases of, 1048-1049 
screening libraries in, 1039-1042, 1042f 
serendipitous observation in, 1037-1038 
7TM receptors in, 404, 1045 
split-pool synthesis in, 1041, 1042f 
structure-based, 1042-1045, 1043f, 1044f 
Dynamic instability, 1019 
Dynein, 1008 
ATP binding to, 1010, 1011f 
structure of, 1010, 1011f 


E. coli. See Escherichia coli 
E site, ribosomal, 900, 900f, 904 
E'0 (oxidation-reduction potential), 528-531
Ear, hair cells of, 971-973, 971f, 972f
Early humans, DNA sequencing for, 
188-189, 189f 
EcoRI endonuclease, 278 
EcoRV endonuclease, 272, 273-277, 
274f-277f
binding affinity of, 275 
recognition site of, 275-277, 275f 
Editing, RNA, 852 
Edman degradation, 81-83, 81f, 82f, 93
protein cleavage for, 82-83, 82f 
Edman, Pehr, 81 
EF hand, 410-411, 411f 
Effector functions, 981 
EGF. See Epidermal growth factor (EGF) 
Eicosanoids, 664-665, 665, 665f
Elastase, catalytic triad in, 261, 261f 
Elastase inhibitor, 307 
Electrochemical potential. See Membrane 
potential 
Electrocyclic reactions, 720 
Electromagnetic spectrum, 966f 
Electron carriers. See also Transporters 
in metabolism, 438-439, 438f, 439f 
in oxidative phosphorylation, 438-439, 
438f, 439f, 531-543, 532t, 551f, 
552-553, 552f, 554f. See also 
Respiratory chain 
in photosynthesis, 568-572, 569f, 
571, 572f
Electron sink, 605, 683 
Electron transfer. See also Electron- 
transport chain 
in oxidative phosphorylation, 528-531, 
542-543. See also Respiratory chain 
rate of, 542f 
in photosynthesis, 568-572, 569f, 571f, 
572-577, 572f
Electron-density maps, 99-100, 100f 
Electronic nose, 961-962, 962f 
Electron-transfer potential, 528 
in oxidative phosphorylation, 528-530 
Electron-transferring flavoprotein 
(ETF), 646 
Electron-transport chain, 498, 512, 528 
electron flow through, 540f 
inhibition of, 558, 582f 
rate of, 555-556 
in oxidative phosphorylation, 438-439, 
438f, 439f, 531-543, 532t, 540f. See 
also Respiratory chain 
in photosynthesis, 568-572 
Electrophilic catalyst, 683 
Electrophoresis 
gel, 71-73, 72f, 73f, 141-142, 142f 
in restriction fragment separation, 
141-142, 142f 
SDS, 73, 78 
SDS-polyacrylamide, of membrane
proteins, 355, 355f 
two-dimensional, 74, 74f
Electroporation, 167, 167f 
Electrospray ionization (ESI), 91-92 
Electrostatic interactions, 7-8. See also Bonds
in antigen-antibody binding, 984, 985, 985f
ATP and, 433 
ELISA (enzyme-linked immunosorbent 
assay), 88-89, 89f 
Elongation factor(s), 902-903, 903f 
G, 904-905, 904-906, 904f 
Ts, 903 
Tu, 902-903, 903f 
Embden, Gustav, 454
Embden-Meyerhof pathway, 454. See also
Glycolysis 
Emphysema, 307 
Enantiomers, 321, 321f 
Endergonic reactions, 222 
Endocytosis, receptor-mediated, 365f, 366, 
775-776, 776f 
Endonucleases. See Restriction enzymes 
(endonucleases) 
Endoplasmic reticulum, 365, 979f 
in diabetes, 802 
fatty acid synthesis in, 664 
protein glycosylation in, 333-334, 333f 
protein synthesis in, 911-914 
protein targeting from, 911-914 
ribosome binding to, 911-914 
rough, 911 
smooth, 911 
Endoplasmic reticulum (ER) stress, 802 
Endosymbiosis, 365, 527 
Endotoxin, 978 
End-product inhibition. See Feedback 
inhibition 
Enediol intermediate, in glycolysis, 459, 460f 
Energy. See also Bioenergetics; 
Thermodynamics 
activation, 222, 226 
from ATP hydrolysis, 429-435 
binding, 229 
definition of, 254 
enzyme specificity and, 275-277 
body requirements for, 549b 
from carbon oxidation, 435-437, 435-438, 
435f, 437f
from catabolism, 434, 435-438,
435f, 437f
enzymatic transformation of, 221-222 
free. See Free energy 
glycogen storage of, 328, 615, 629-630 
kinetic, 11 
laws of thermodynamics and, 11-13, 222 
lipid storage of, 641-642 
from oxidative phosphorylation, 437-438 
potential, 11 
of reactions, 222-225
units of, 7, 223b 
Energy charge, 446-447, 446f 
Energy homeostasis. See Caloric 
homeostasis 
Enhancers, 127, 868-869, 943-944, 943f 
in cancer, 869 
Enol phosphate, 465 
Enolase, in glycolysis, 465 
5-Enolpyruvylshikimate 3-phosphate, 720 
Enoyl CoA, 646-647, 647, 648, 649 
Enoyl CoA hydratase, 646-647 
Ensemble studies, 246, 246f 
Enterohepatic cycling, 1036, 1036f 
Enteropeptidase, in trypsin activation, 
305, 306f 
Enthalpy, 11-12 
Entropy, 11-12 


Envelope, nuclear, 365 
Enzymatic cascades, 307 
in amino acid synthesis, 725-726, 
725f, 726f
in blood clotting, 307-308, 308f 
Enzymatic cleavage, of proteins, 82-83, 82f 
Enzyme(s), 219-248 
active sites of, 185-186, 227-228, 
227f, 228f
convergent evolution of, 185-186 
definition of, 185-186 
mapping of, 241-242 
affinity tags for, 241-242 
allosteric, 237, 237f, 289, 290-296. See 
also Allosteric enzymes 
amounts of, regulation of, 446 
apoenzymes, 221 
bifunctional, 487 
catalytic activity of, 205-208. See also 
Catalysis 
assays for, 67. See also Protein 
purification 
specific, 67 
catalytic efficiency of, 235, 235t 
catalytically perfect, 234, 235, 460 
classification of, 248-249 
coenzymes. See Coenzyme(s) 
cofactors for, 221, 221t 
core, 856 
covalent modification of, 57-58, 57f,
297-302, 298t 
definition of, 26, 219 
denatured, 49-50, 49f, 50f 
digestive, 255-260, 303-307, 303t, 304f, 
674, 674f
distributive, 832b 
energy-transducing, 221-222 
holoenzymes, 221 
inhibition of, 238-245 
affinity tags in, 241-242, 241f 
competitive, 238-240, 238f, 239f, 240f 
feedback, 291, 446. See also Feedback 
inhibition 
group-specific agents in, 241f 
group-specific reagents in, 241 
irreversible, 238, 238f, 241-242 
noncompetitive, 230-240, 238, 238f, 
239-240, 239f, 240f 
reversible, 238, 238f 
suicide (mechanism-based), 242, 
750, 754 
transition-state analogs in, 243, 243f 
uncompetitive, 238, 238f, 239, 239f, 
240, 240f 
isozymes, 289-290, 296-297, 297f 
kinetic properties of, 229-237. See also 
Enzyme kinetics 
kinetically perfect, 234, 235, 460 
in linked pathways, 744. See also 
Metabolic pathways 
in multienzyme complexes, 660-661 
nomenclature for, 248-249 
overview of, 220-222


processive 
definition of, 832b 
in DNA replication, 831-832 
prosthetic groups of, 221 
proteolytic, 220-221, 302-313. See also 
Proteolytic enzymes 
reaction equilibrium and, 224-225 
reaction rate acceleration by, 220, 220t, 
225-229, 226f 
restriction, 141-142, 141f, 253, 271-279. 
See also Restriction enzymes 
specificity of, 220-221, 221f, 275-277 
binding energy and, 275-277 
substrate binding by. See also Enzyme- 
substrate complex 
concerted mechanism in, 294-296
cooperative, 200-201, 201f, 294 
energy for, 229, 254, 275-277. See also 
Binding energy 
sequential model of, 294 
specificity of, 220-221, 221f, 275-277 
substrates of, 220. See also Enzyme- 
substrate complex 
in transition state formation, 225-226, 
225f, 227-228 
turnover number of, 234, 234t 


Enzyme cofactors. See Coenzyme(s) 
Enzyme kinetics, 229-237
definition of, 229
in first-order reactions, 229-230
fraction of active sites filled (fES) in, 234
initial velocity and, 230, 230f
kcat/KM, 234-235
kinetic perfection and, 234, 235, 460
Lineweaver-Burk plot of, 233, 233f
Michaelis constant (KM) in,
231-235, 233t
Michaelis-Menten, 229-237, 231f
in pseudo-first-order reactions, 230
in second-order reactions, 230
sigmoidal, 200, 291, 291f, 294, 295f


Enzyme multiplicity, 724, 724f 
Enzyme regulation, 289-314
allosteric, 289, 290-296
dephosphorylation in, 300
isozymes in, 289-290, 296-297, 297f
phosphorylation in, 298-301
protein kinases in, 298-301
proteolytic activation in, 290, 302-313
reversible covalent modification in, 290,
297-302
zymogens in, 303-313


Enzymes
ensemble studies of, 246, 246f
single molecule studies of, 246, 246f
Enzyme-substrate complex, 226-227,
227, 227f
attractive forces in, 228, 228f
cooperative binding and, 200-201, 201f,
203f, 294
dissociation constant for, 233-234
in double-displacement (ping-pong)
reactions, 237
induced-fit model of, 228, 228f, 254,
457, 457f
lock-and-key model of, 228, 228f
in sequential reactions, 236-237
Epidermal growth factor (EGF), in signal 
transduction, 402f, 415-418, 
416f-418f 
Epidermal growth factor receptor, 415-418, 
416f-418f 
in cancer, 421 
Epigenetic factors, 20 
Epigenome, 938 
Epilepsy, ketogenic diet for, 656 
Epimers, 321f, 322 
Epinephrine, 403 
in fatty acid metabolism, 643, 666-667 
in glycogen metabolism, 624-626 
in signal transduction, 402f, 403-410, 
406f. See also Signal transduction 
synthesis of, 726, 726f 
Epitopes, 85, 85f, 980 
EPO (erythropoietin), 331, 331f 
ε chains, 982, 983f
Equilibrium constant 
for dissociation of water, 14 
free-energy change and, 222-225, 224t 
Equilibrium potential, 391-392, 391f 
Erbitux, 421 
Error-prone DNA polymerase, 835t, 838 
Erythrocytes. See Red blood cells 
Erythromycin, 909t, 910 
Erythropoietin (EPO), 331, 331f 
Erythrose 4-phosphate, 603 
in amino acid synthesis, 719, 720f 
in Calvin cycle, 595 
in pentose phosphate pathway, 602f, 
603, 604f 
Escherichia coli. See also Bacteria 
DNA cloning and expression in, 
154-155 
DNA repair in, 839-840 
DNA replication in, 831-834. See also 
DNA replication 
flagella of, 1022-1025 
gene expression in, 922-928, 923-928. 
See also Gene expression/regulation, 
in prokaryotes 
genome of, 113, 113f, 182f, 928, 937 
lac operon of, 922, 925-928, 925f, 926f 
lactose permease symporter in, 381, 381f 
pyruvate dehydrogenase complex in, 
499f, 499t 
replication in, 831-834 
restriction enzymes of, 272-279, 
274f-277f
ESI (electrospray ionization), 91-92 
Essential amino acids, 674, 674t, 711, 
711t, 712f
synthesis of, 719-723. See also Amino 
acid synthesis 
Essential fatty acids, 664 
Estradiol, 784 
synthesis of, 785 


Estrogen(s). See also Sex hormones; Steroid 
hormones 
regulation of, 946 
synthesis of, 780, 780f, 784, 785 
Estrogen receptors, 946 
drug binding to, 948, 949f 
Estrogen response elements (EREs), 946 
Estrone, synthesis of, 780, 780f, 784, 785 
ETF (electron-transferring flavoprotein), 646 
ETF : ubiquinone reductase, 646 
Ethanol. See also Alcohol(s) 
fermentation of, 466-467, 467f 
formation of from pyruvate, 466-467, 467f 
in glycolysis, 453 
metabolism of, 232, 783, 810-813 
toxicity of, 811-813 
Ethanolamine, 349, 763, 763f 
Ethylene, synthesis of, 718, 718f 
Eukarya, 3 
Eukaryotes, 3, 3f 
cell types in, 938, 939t 
Eversion, 376 
Evolution, 3, 3f 
of amino acid sequences, 36, 79, 174-181 
of blood types, 335-336 
caloric homeostasis and, 792-793
of Calvin cycle, 600 
of carbonic anhydrases, 271 
of chloroplasts, 568 
of citric acid cycle, 518 
convergent, 185-186, 185f 
of cytochrome c, 543, 543, 543f 
of cytochrome P450, 783 
divergent, 185 
of DNA binding sites, 927 
endosymbiotic events in, 527, 568 
essential elements of, 189 
experimental studies of 
ancient DNA amplification and 
sequencing in, 188-189 
combinatorial chemistry in, 188-189 
molecular, 188-191 
of feedback inhibition, 724 
of glucose metabolism, 490-491 
of glycogen metabolism, 627 
horizontal gene transfer in, 278-279 
of immune system, 978-981, 984 
of immunoglobulin fold, 984 
of introns and exons, 133 
of ion pumps, 374, 378-379 
of metabolic pathways, 447 
of microRNA, 954 
of mitochondria, 527 
obesity and, 792-793 
of olfaction, 959-960, 959f 
of photoreceptors, 970, 970f 
of photosynthesis, 568, 584, 600 
of proofreading, 895 
of proteasomes, 677-678, 678f, 679
of protein degradation pathways, 
677-678 
of proteins, 133 
of P-type ATPases, 374, 378-379 
of restriction enzymes, 278-279
of ribosomes, 900 
RNA catalysis and, 881 
of rubisco, 600 
sequence homology and, 174-181, 
176f-182f. See also Sequence 
homologies 
signal transduction pathways, 419 
split genes in, 133 
of succinyl coenzyme A, 509 
tertiary structure and, 183-184 
time line for, 3f 
of transaminases, 713 
of transcription, 869 
of urea cycle, 688 
of vision, 970, 970f
Evolutionary trees, 187-188, 187f 
Exercise. See also Muscle contraction 
ATP in, 434-435, 435f, 805-806, 805t 
caloric homeostasis and, 803-806 
carbo-loading for, 806 
citric acid cycle during, 514 
creatine phosphate in, 434-435, 435f,
805, 805t 
fatty acid metabolism in, 804, 804f, 806 
fuel sources for, 805-806, 805t 
gene expression and, 804 
gluconeogenesis during, 489-490, 490f 
glycolysis during, 473-474, 475f, 
478-479, 489-490 
lactate in, 489-490, 490f, 805 
mitochondrial biogenesis and, 804, 804f 
oxidative phosphorylation in, 805-806 
superoxide dismutase and, 542 
Exergonic reactions, 222 
Exon(s), 110, 132, 133 
evolution of, 133 
Exon shuffling, 133, 133f 
Exonuclease(s), 821 
in DNA polymerase III holoenzymes,
832, 833f 
in proofreading, 839-840, 839f 
Expression cloning, 154-155, 155f 
Expression vectors, 149, 154-155 
External aldimine, 682, 682 
in amino acid degradation, 682, 682f 
in amino acid synthesis, 712f, 713 
Extrinsic pathway, coagulation, 308, 308f 


F0F1-ATPase. See ATP synthase
Fab fragments, 981-982, 982f
in antigen-antibody binding, 985f,
986, 986f 
Facilitated diffusion, 371, 373 
F-actin, 1013, 1013f 
Factor VIII, 311, 311f 
Factor XIIIa, 309
FAD (flavin adenine dinucleotide) 
ADP units in, 447, 447f
in catabolism, 438-439, 439, 439f
in citric acid cycle, 498, 498f, 501-502, 
509-510, 512 
in fatty acid oxidation, 646, 647f 
FADH2 (flavin adenine dinucleotide reduced)
in catabolism, 438-439, 439 
in citric acid cycle, 498, 503, 510, 512 
in fatty acid oxidation, 646-647 
in oxidative phosphorylation, 505, 535 
electron-transfer potential of, 530-531 
reduction potential of, 528-530 
Familial hypercholesterolemia, 759, 776-779 
Farnesyl pyrophosphate, 769, 769 
Farnesylation, 298t 
S-Farnesylcysteine methyl ester, 359, 359
Fasting, 796 
Fat 
body, energy storage in, 641 
brown, 556-557, 557f 
fecal, 642 
neutral. See Triacylglycerol(s) 
white, 556 
Fat cells, 641 
energy storage in, 641 
Fatty acid(s), 346-348, 347f, 347t 
carbon atoms in, 347-348, 347f
chain length in, 437-438 
covalent attachment of, 639 
essential, 664 
examples of, 347t 
functions of, 639 
in lipid membrane, 362-363, 363f 
in muscle 
during exercise, 804, 804f, 806 
in obesity, 800-801 
odd-chain, 649-650, 650f 
polyunsaturated, eicosanoids from, 
664-665, 665, 665f 
saturation of, 348 
storage form of, 648-649 
structure of, 346-348, 347, 362-363, 363f 
synthesis of, 643-644, 643f, 644f 
unsaturated, 648-649, 664-665 
Fatty acid metabolism, 437f, 639-669 
acetyl CoA carboxylase in, 657, 658t, 
666-667, 666f 
acetyl CoA in, 646-647, 657 
activation in, 644-645
acyl carnitine in, 645-646, 646f 
acyl carrier proteins in, 657-658, 658, 
658t, 659f 
in animals vs. plants, 656 
β-oxidation pathway in, 646-647
chylomicrons in, 642, 642f 
citrate in, 662, 662f, 666, 667f 
coenzyme A in, 644-645 
coenzyme B12 in, 651-652
cytoplasmic, 657 
degradation in, 640f, 646-657, 666-667 
reactions in, 647f, 647t 
vs. synthesis, 657, 658t 
diet and, 667 
in endoplasmic reticulum, 664 
energy yield from, 641, 647-648 
enoyl CoA hydration in, 646-647, 
647f, 647t
epinephrine in, 643, 666-667 


ethanol and, 811 
during exercise, 804, 804f, 806 
fatty acid synthase in, 658-659, 660f, 
661f, 663 
fatty acid unsaturation in, 664-665 
glucagon in, 643, 666-667 
in glucose homeostasis, 808 
hormones in, 643, 666-667
ketone bodies in, 653-656 
lipases in, 641-642, 642f, 643 
lipid mobilization and transport in, 
642-644, 642f, 643f 
lipolysis in, 643-644, 644f 
malonyl CoA in, 657, 658t, 659, 666 
in muscle, 804, 804f 
oxidation in, 642-656, 647f, 647t 
palmitoyl CoA in, 647-648, 666 
pancreatic lipases in, 641-642, 642f 
peroxisomal, 652, 653f 
reactions of, 640-641, 640f 
regulation of, 666-667 
in starvation, 809, 809f, 810 
synthesis in, 640-641, 640f, 643-644, 
643f, 644f, 656-657 
in animals, 659-661 
citrate carriers in, 662, 662f 
condensation in, 658-659, 658t, 659f 
coordinated metabolic pathways in, 
654f, 663, 666f 
dehydration in, 658t, 659, 659f 
elongation in, 658, 658t, 659f, 664-665 
insulin in, 667 
key features of, 657 
NADPH sources for, 662-663
reactions in, 658-659, 658t, 659f 
reduction in, 658t, 659, 659f 
of unsaturated fatty acids, 664-665, 665f 
vs. degradation, 657, 658t 
thiolysis in, 647-648, 647f, 647t 
triacylglycerol hydrolysis in, 
643-644, 643f 
of unsaturated fatty acids, 648-649, 648f, 
664-665 
Fatty acid synthase, 656-663, 660f 
in animals, 659-661, 660f, 661f 
in cancer, 663 
Fatty acid synthase inhibitors, 663 
Fatty acid thiokinase, 644-645 
Fatty acyl CoA dehydrogenase, 646 
in oxidative phosphorylation, 535 
Fc fragments, 981
FDA (Food and Drug Administration), 1049
Fe protein, in nitrogen fixation, 707, 707 
Feedback inhibition, 291, 446 
in amino acid synthesis, 723-726 
in cholesterol synthesis, 770 
cumulative, 725 
enzyme multiplicity and, 724, 724f 
evolution of, 724 
in nucleotide synthesis, 751 
Feedforward stimulation, 476 
Fehling’s solution, 325 
FeMo cofactor, 708, 708-709 


Fermentation, 466-468
to alcohol, 453, 466-467, 467f
as anaerobic process, 468-469 
in glycolysis, 453 
to lactic acid, 453, 467-468 
start/end points for, 469t 
Ferredoxin, 575-577, 576 
in nitrogen fixation, 707 
Ferredoxin-NADP+ reductase, 576,
576-577, 577f
Ferredoxin-thioredoxin reductase, 599
Ferritin, 730, 951, 951-952 
Ferrochelatase, 730 
Fetal alcohol syndrome, 812 
Fetal hemoglobin. See also Hemoglobin 
oxygen affinity of, 205, 205f 
Fiber, dietary, 329 
Fibrin clot 
formation of, 308-309, 308f 
lysis of, 312 
Fibrin, structure of, 309, 309f 
Fibrinogen 
conversion to thrombin, 308-309 
structure of, 308, 308f 
sulfation of, 298t 
Fibrinopeptides, 308 
Fibrous proteins, 43-45 
50S subunit, of ribosome, 897-898 
Fight or flight response, 301 
Filters, selectivity, 385-386 
Fingerprint, DNA, 141 
First Law of Thermodynamics, 11. See also 
Thermodynamics 
First-order reactions, rate of, 229-230 
First-pass metabolism, 1035 
Fischer, Emil, 228, 1042 
Fischer projections, 21, 22f, 23f 
5’ cap, 870-871, 871f 
5’ splice sites, 870-871, 871f, 873-874, 873f 
Flagella 
microtubules in, 1019 
rotary movement of, 1022-1025 
structure of, 1022, 1022f, 1023f 
Flagellin, 1022, 1022f 
Flavin adenine dinucleotide. See FAD 
Flavin mononucleotide, 533-534, 534f, 860f 
Flavoproteins, 502 
electron-transferring, 646 
Fleming, Alexander, 1037 
FliG, 1023, 1023f 
FliM, 1023
Flip-flop, 362, 362f
Flippases, 378 
Florey, Howard, 1037 
Fluconazole, 1033 
distribution of, 1003f, 1033f 
Fluid mosaic model, 362 
Fluorescamine, in Edman degradation, 
81, 81 
Fluorescence detection, in DNA 
sequencing, 143-144, 143f 
Fluorescence microscopy, 90-91, 90f 
Fluorescence recovery after photobleaching (FRAP), 361, 361f
fMet. See Formylmethionine (fMet)
fMRI. See Functional magnetic resonance 
imaging 
Folding funnel, 54f 
Folding, protein. See Protein(s), folding of 
Folic acid, 441t 
deficiency of, spina bifida and, 755 
Food and Drug Administration 
(FDA), 1049 
Food intake. See also Diet; Starvation 
starved-fed cycle and, 807-808
Food Pyramid, 20f 
Food, spicy, capsaicin in, 973-974, 973f 
5-Formaminoimidazole-4-carboxamide
ribonucleotide, 742f 
Forensics, polymerase chain reaction in, 
147, 147f
Formamide, molecular models of, 23f 
N5-Formiminotetrahydrofolate, 715, 716
Formylglycinamide ribonucleotide, 741,
742, 742, 742f
901-902
N10-Formyltetrahydrofolate, 715-716, 742f
Fourier transform, 99-100 
Frameshift mutations, tests for, 843 
FRAP (fluorescence recovery after photobleaching), 361, 361f
Free energy, 12, 222-225, 226. See also 
of activation, 222, 226
ATP and, 430-431 
binding, 229, 254 
from carbon oxidation, 435-437, 
435-438, 435f 
in concentration gradients, 373-374, 437 
equilibrium constant and, 222-225, 224t 
Gibbs, 12, 226 
in membrane transport, 373-374, 373f 
in metabolism, 429-430 
from proton gradients, 559-560, 560f 
in redox reactions, 530 
reduction potential and, 528-530 
of translation, 893-894 
Free radicals. See Reactive oxygen 
species (ROS) 
Fructofuranose, 323, 323f, 324f 
Fructokinase, 469 
Fructopyranose, 324f 
Fructose, 322, 322f 
in glycolysis, 469-470, 469f, 470f 
metabolism of, 469-470, 470f 
open-chain form of, 323f 
ring form of, 323, 324f 
structure of, 322f 
Fructose 1,6-bisphosphatase, in 
gluconeogenesis, 482 


Fructose 1,6-bisphosphate, 458 
in Calvin cycle, 594, 594f 
in gluconeogenesis, 482, 484 
in glycolysis, 456f, 458, 460 
Fructose 1-phosphate, in glycolysis, 469-470, 
469f, 487 
in Calvin cycle, 594, 594f, 595f, 596f
in gluconeogenesis, 482, 484-485, 487 
in glycolysis, 455f, 456f, 458, 458f, 459, 
470, 487 
in pentose phosphate pathway, 602f, 
in sucrose synthesis, 597, 598
Fuels. See Carbon fuels
in citric acid cycle, 510
resetting of, 407-408
Signal transduction
small, 418, 418t 
structure of, 405, 405 
subunits of, 405 
Galactokinase, 470
Galactolipids, in thylakoid membrane, 567
disorders of, 471-472
Galactose 1-phosphate uridyl transferase, 470
Galactose 6-phosphate, 470 
Galactose-glucose interconversion pathway, 
470-471 
in lactose metabolism, 924-925
Galactoside permease, 924
γ chain(s), 982, 983f
disorders of, 765-766
structure of, 738, 765f 
synthesis of, 764-765 
GAP. See Glyceraldehyde 3-phosphate 
Gap junctions, 372, 393-394, 393f, 394f 
GAPs (GTPase activating proteins), 418 
Gastric enzymes, 674, 674f
Gastric zymogens, 303t
GC box, 867, 867f
in citric acid cycle, 508
hydrolysis of, 431 
in olfaction, 959, 960 
in signal transduction, 405, 418, 418f 
continuous, 131, 132
designer, 157
Gene chips (microarrays), 162-163, 162f
Gene disruption, 164-165, 165f 
Gene expression studies, 161-167 
cDNA in, 154-155 
DNA microinjection in, 163, 163f 
expression vectors in, 154-155 
gene disruption, 164-165 
gene-expression analysis in, 161-162, 162f 
recombinant DNA technology in. See 
Recombinant DNA technology 
RNA interference in, 165-166, 167f
transgenic animals in, 164, 164f 
viral vectors in, 163-164 
Gene expression/regulation, 109-110, 
123-128 
definition of, 921
in eukaryotes, 937-955 
chromatin remodeling in, 944-951, 
949-951 
coactivators in, 947-948, 948f, 
949-951 
DNA methylation and, 945-946 
949-951, 950f
overview of, 937-938
in pancreas, 939t
quorum sensing in, 930-931, 930f
regulatory sites in, 922-928, 922f
signaling in, 929-930
translational, 951-953
regulated, 921
transcriptional 
in eukaryotes, 941-944 
in prokaryotes, 921-928 
Gene guns, 167
Gene knockout, 164-165, 165f
Gene therapy, 167-168
163-164. See also Gene expression
definition of, 128
degeneracy of, 129, 130, 892 
major features of, 129-130 
of mitochondrial DNA, 131, 131t 
universality of, 131 
variations of, 131 
Genetic imprinting, 20
Genetic information, flow of, 109-110 
Genetic mutations. See Mutations 
Genetic switch, 928-931 
Genetic variation, 19-20 
Genome, 157-161, 159f-161f 
analysis of, 159-161 
comparative, 160-161, 160f 
definition of, 66
of Escherichia coli, 113, 113f, 928, 937 
of Haemophilus influenzae, 158, 158f, 181 
of Indian muntjak, 113, 113f 
mitochondrial, 527-528, 527f, 528f 
noncoding DNA in, 159-160 
number of genes in, 159 
proteome and, 66 
of puffer fish, 161, 161f 
of Reclinomonas americana, 528 
of Rickettsia prowazekii, 527-528 
of Saccharomyces cerevisiae, 158, 937, 938f 
sequencing of, 17-18, 157-161, 158f 
next-generation methods for, 160 
size of, 113 
Genomic libraries, 151-153, 152f, 153f 
Genomics 
comparative, 160-161, 160f 
in drug development, 1045-1048 
Geranyl transferase, 769, 769
Ghrelin, 796
Gibbs free energy, 12, 226
Gla domain, in prothrombin, 310, 310f
Gleevec, 421 
Globin(s). See also Hemoglobin; Myoglobin 
evolution of, 187-188, 187f
sequence alignment in, 175-181, 
176t-182t 
structure of, 48, 48f, 183, 183-184 
types of, 211-212 
Globin fold, 199 
Globin genes, 211-212 
Globins, 175 
Glomeruli. See also Kidney 
drug metabolism in, 1035 
Glucagon, 488 
in fatty acid metabolism, 643, 666-667, 808 
in glucose homeostasis, 807-808, 808 
in glycogen metabolism, 624-626,
807-808, 808 
Glucagon-like peptide 1 (GLP1), in caloric 
homeostasis, 795, 795f 
Glucocorticoids. See also Steroid hormones 
synthesis of, 780, 780f, 783 
Glucogenic amino acids, 690. See also 
Amino acid(s) 
Glucokinase, 807 
Gluconeogenesis, 479-491 
definition of, 453, 479
in diabetes, 803, 803f 
evolution of, 490-491 
fructose 1,6-bisphosphate in, 480f, 
482, 484 
fructose 6-phosphate in, 480f, 482, 
484-485 
glucose 6-phosphatase in, 480f, 484, 485f 
glucose 6-phosphate in, 480f, 484-485, 
485f, 807 
in glucose homeostasis, 807-810 
glycolysis and, 481-482, 486-491 
historical perspective on, 454 
in kidney, 481 
in liver, 481 
orthophosphate in, 484 
oxaloacetate in, 480f, 481, 482, 483-484 
phosphoenolpyruvate in, 480f, 481, 482, 
483-484 
phosphorylation in, 485 
reactions in, 479-485, 480f, 481-485 
coupled, 485 
regulation of, 486-491, 803, 803f 
sites of, 481 
in starvation, 809 
in starved-fed cycle, 807-810
stoichiometry of, 485 
urea cycle and, 687 
Glucopyranose, 323, 323f 
boat form of, 324, 325f 
chair form of, 324, 325f 
Glucose 
body stores/requirements for, 479 
as cellular fuel, 455 
metabolism of, 437f. See also
Gluconeogenesis; Glycolysis 
carbon fuels for, 808, 809 
in diabetes mellitus, 798-800 
evolution of, 490-491 
historical perspective on, 454 
homeostatic mechanisms in, 798-800. 
See also Glucose homeostasis 
in liver, 806 
in muscle, 805-806
reactions in, 428f, 454f
in starvation, 809
starved-fed cycle in, 807-808
open-chain form of, 323, 323f 
overview of, 453-454 
properties of, 455 
regulated release of, 616-617. See also 
Glycogen metabolism 
ring form of, 323, 323f 
storage forms of, 328, 328f, 329f, 615, 
629-630 
structure of, 321-322, 321f-323f
taste of, 962, 964-965
Glucose 1-phosphate, 617
deficiency of, 634-635
in gluconeogenesis, 484-485, 485f, 807
reactive oxygen species and, 609
antioxidant properties of, 609
Glucose 6-phosphate transporter, 635
insulin secretion and, 801-802
in nitrogen fixation, 709
in nitrogen transport, 685, 685f
in nitrogen fixation, 709
structure of, 30, 30f
aminotransferase, 741, 751
Glutamine synthetase, 709-710 
regulation of, 725-726, 725f 
structure of, 725 
Glutathione, 727, 727 
antioxidant properties of, 609 
727, 727
Glutathione-drug conjugates, 1034,
1034-1035
in Calvin cycle, 594
460, 460f, 470
in pentose phosphate pathway, 602, 602f,
Glyceraldehyde 3-phosphate
dehydrogenase, 481 
in Calvin cycle, 594 
in glycolysis, 461, 462f, 463f, 464 
NAD+ binding sites in, 469, 469f
Glycerol, 644 
in gluconeogenesis, 481 
metabolism of, 437f, 643-644, 644 
synthesis, 760
Glycerol 3-phosphate shuttle, 551, 551f
Glycerol phosphate acyltransferase, 760
porphyrins from, 728-730, 729f
serine formation from, 691 
structure of, 28, 29f 
synthesis of, 714-715, 716 
pathway, 605, 605f
synthesis of, 780, 780f
Glycoforms, 330 
Glycogen, 328, 328f, 617 
branching in, 615, 616f, 629 
glucose storage in, 328, 328f, 615, 
629-630 
as homopolymer, 328 
storage sites for, 616, 616f 
structure of, 328, 328f, 615, 616f, 629f 
Glycogen metabolism, 615-636 
branching enzyme in, 629 
carbonium ion intermediate in, 619 
degradation in, 616-627 
cAMP in, 626 
initiation of, 624-626 
regulatory cascade in, 624-626, 625f 
termination of, 626-627
degradative, 630-631 
disorders of, 634-635, 634t 
evolution of, 627
G proteins in, 624-626
glucagon in, 624-626
α-1,6-glucosidase in, 620, 620f
glycogen phosphorylase in, 617-624, 618,
620f, 633-634, 633f. See also
Glycogen phosphorylase
glycogen synthase in, 627-629,
glycogen synthase kinase in, 629, 630f,
hormones in, 624-626, 632-633, 632f
insulin in, 632-633, 632f 
overview of, 615-617 
phosphoglucomutase in, 620, 643 
phosphoglycerate mutase in, 620 
phosphorolysis in, 617-621 
phosphorylase in, 617-624. See also
Glycogen phosphorylase
631-634, 631f
pyridoxal phosphate in, 618-619, 619f 
pyrophosphate hydrolysis in, 627-628 
as regulated release of glucose, 616-617 
regulation of, 629-634 
in liver, 621, 633-634, 633f 
in muscle, 631-633 
signaling in, 624-626 
synthesis in, 616-617, 627-635 
transferase in, 620, 620f 
b, 622-623, 644, 644f
regulation of, 621-624
structure of, 618, 618-619
Glycogen synthase, 627-629, 632-633, 633f
Glycogen synthase kinase, 629, 630f, 631,
632-633, 632f, 799
Glycogen transferase, 620, 620f
Glycogenin, 628
Glycolases, in DNA repair, 840, 841
Glycolate, in Calvin cycle, 593-594, 593f
Glycolate oxidase, 594
Glycolipids, 350, 350
Glycolysis, 444, 453-479 
aerobic, 478-479 
aldehyde oxidation in, 461, 462f 
ATP formation in, 463-466
yield in, 554-555, 555t
1,3-bisphosphoglycerate in, 484
in cancer, 478-479
citric acid cycle and, 499-503, 499f
committed step in, 474
definition of, 453
in diabetes, 803, 803f
dihydroxyacetone phosphate in, 456f,
458-459, 458f, 460f, 469-470
enolase in, 465
evolution of, 490-491
galactose in, 469f, 470
glucose 6-phosphate in, 456f, 457-458,
glucose transporters in, 477, 477t
glucose trapping in, 455-457, 456f
in, 460, 461, 462f, 463f
461, 462f
hexokinase in, 455-457, 456f
in liver, 476
historical perspective on, 454
regulation of, 472-479, 486-491,
stages of, 456f
inhibition of, 244, 245f
Glycoproteins, 329-337 
sequencing of, 336-337 
Glycosaminoglycans, 329-330, 331 


Glycosides, 326 
complexes, 326, 326f
Glycosylated hemoglobin, 325-326
sites of, 330, 333-334, 337
in blood groups, 335
744, 751, 751f
Greider, Carol, 837
Group-specific reagents, 241
Phosphoryl group transfer
in translation, 902-903, 905f
in olfaction, 959, 960 
in protein targeting, 913 
in signal transduction, 405-406, 407-408, 
407f, 418, 418f 
in translation, 902-903, 908, 913 
in tubulin, 1019 
in vision, 968, 968f 
GTPase(s) 
in protein targeting, 912 
in signal transduction, 407, 407f 
GTPase activating proteins (GAPs), 418 
GTPases, in signal transduction, 298, 298t 
Guanidinium, 31, 31 
Guanidinium chloride, in bond cleavage, 
49, 49


Guanine, 4, 4, 111, 111 
deamination of, 838 
degradation of, 753f 
Guanine-nucleotide exchange factor 
(GEF), 417 
Guanosine, 111 
Guanosine diphosphate. See GDP 
(guanosine diphosphate) 
Guanosine monophosphate. See GMP 
(guanosine monophosphate) 
Guanosine triphosphate. See GTP 
(guanosine triphosphate) 
Guanylate. See GMP (guanosine 
monophosphate) 
Guanylate cyclase, in vision, 969 
Guanylate kinase, structure of, 284 
Gustation, 962-966. See also Taste
Gustducin, 963, 963f 
Gyrase inhibitors, 831 


H chains. See Heavy (H) chains 
H isozyme, 297, 297f 
H zone, 1016, 1016f
Haber, Fritz, 707 
Haemophilus influenzae, genome of, 158, 
158f, 181 
Hair cells, 971-973, 971f, 972f
Hairpin turns, 42, 43f
amino acid residues in, 50-51, 51t
in mRNA, 42, 43f, 127, 127f
in transcription termination, 
859-860, 859f 
Haldane, John, 231 
Half-life 
of drugs, 1036 
of proteins, 675, 676t 
Haptenic determinant, 990 
Haptens, 990 
HAT (histone acetyltransferase), 
949, 950 
Hatch, Marshall Davidson, 599 
Haworth projections, 323 
Hayaishi, Omar, 696 
HDLs (high-density lipoproteins), 773, 
773t, 774-775, 776, 778-779. 
See also Lipoprotein(s) 
Hearing, 971-973 
Heart disease, homocysteine in, 719 
Heart failure, digitalis for, 377 
Heart, H and M isozymes in, 297, 297f 
Heat, from oxidative phosphorylation, 
556-557 
Heat shock protein 70, 183, 183-184 
Heat-shock promoter, 858, 858f 
Heat-shock response element, 868 
Heat-shock transcription factor, 868 
Heavy (H) chains, 981-982, 981f, 
987-989, 988f 
in class switching, 991 
formation of, 991 
Heavy meromyosin, 1009, 1009f 
Hemiacetals, 322
Heinz bodies, 610, 610f 


Helicases, 120, 824-825, 824f, 825f 
in DNA replication, 120, 824-825, 
825f, 832 
in splicing, 876-877 
structure of, 824, 824-825, 824f, 825f 
Helix 
alpha, 38-40, 39, 40, 40f 
amino acid residues in, 50-51, 51t
of α-keratin, 43, 43-44, 44f
of kinesin, 1012 
of membrane proteins, 356, 358, 
359-360, 360f, 360t 
of myosin, 1009-1010, 1010f, 1012f 
of collagen, 44-45, 44f 
double. See Double helix 
recognition, 923 
relay, 1012, 1012f 
screw sense of, 39, 40f 
Helix-loop-helix, in EF hand, 
410-411, 411f
Helix-turn-helix motif, 47, 47f, 
923, 923f 
Helper T cells, 980, 996-998, 998f. 
See also T cell(s) 
in HIV infection, 999-1000 
Hemagglutinin, 339, 340f 
Heme, 45, 196, 196-198 
in cytochrome c oxidase, 538-539 
degradation of, 730, 730f 
labeling of, 728, 728b, 728f 
in Q-cytochrome c oxidoreductase, 
535-536, 536f 
synthesis of, 728-730, 729f 
Hemiacetals, 322 
cyclic, 323 
Hemichannels, 394, 394f 
Hemiketals, 322-323 
Hemoglobin, 195-216 
α chains of, 199, 199f, 211, 211
αβ dimers of, 199, 199f, 202, 202f
β chains of, 199, 199f, 211
β-globin gene of, transcription and
processing of, 131-132, 132f
evolution of, 175-181, 187-188, 187f
fetal, oxygen affinity of, 205, 205f 
glycosylated, 325-326 
Hill plot for, 213-214, 214f 
leghemoglobin and, 180, 180f, 181f, 183, 
183f. See also Leghemoglobin 
overview of, 195-196 
oxygen affinity of, 204-205, 204f 
pH and, 206-207, 206f 
oxygen binding by, 196-199 
allosteric effectors for, 206 
concerted (MWC) model of, 202-203, 
203f, 214-216, 216f 
cooperative nature of, 200-201, 
201f, 203f 
quantitative models of, 213-216, 
214f, 216f 
sequential model of, 203, 203f, 216 
structural changes in, 201-202, 202, 
204, 204f
oxygen release by, 200, 201f, 206-208 
Bohr effect and, 206-208, 207f
carbon dioxide in, 207-208, 208f
in oxygen transport, 200-201,
207-208, 208f
oxygen-binding curve for, 199-200, 
200f, 201f 
with concerted model, 214-216, 216f
for Hill coefficients, 214, 214f 
R state of, 202-204, 203f 
sequence homology with myoglobin, 
175-181, 176f-182f, 199 
structure of, 48, 48f, 199 
oxygen-binding-induced changes in, 
201-202, 202, 204f 
oxygen-binding-induced changes
in, 204 
quaternary, 201-202, 202 
tertiary, 183, 183f 
T state of, 202-204, 203f, 207-208, 207f 
vs. myoglobin, 195-196 
Hemoglobin A, 199 
Hemoglobin gene, introns in, 131-132, 132f 
Hemoglobin H, 210 
Hemoglobin S, 209, 209-210, 209f
Hemolytic anemia, G6PD deficiency and, 
609-610, 611 
Hemophilia, 311 
Henderson-Hasselbalch equation, 16-17
Henseleit, Kurt, 685 
Heparin, 312, 331, 332 
Hepatic gluconeogenesis, 481 
Hepatic glycolysis, 475-477 
Hepatitis, alcoholic, 811 
Hepatotoxicity, of acetaminophen, 
1036, 1037f
Heptad repeats, in coiled-coil proteins, 
43, 44f 
Heptoses, 320 
Her2 receptor, 417, 421 
Herbicides, mechanism of action of, 584, 
584, 719-720 
Hereditary nonpolyposis colorectal cancer, 
842-843 
Hers disease, 634t 
Heterolytic cleavage, 651 
Heterotrophs, 566, 590 
Heterotropic effect, 296 
Hexokinase
in glycogen metabolism, 620
in glycolysis, 455-457, 456f, 457f
in liver, 476
in muscle, 473-474
induced fit in, 457, 457f
Hexose monophosphate pathway. See
Pentose phosphate pathway
Hexose monophosphate pool, 594
Hexose sugars, formation of, in Calvin
cycle, 594-597, 594f
Hexoses, 320-321
High-density lipoproteins, 773, 773t,
774-775, 776, 778-779. See also
Lipoprotein(s)
High-energy bonds, 433-434
High-pressure liquid chromatography,
71, 71f
High-protein-low-carbohydrate diets, 798
High-throughput screening, 1041 
Hill, Archibald, 213 
Hill coefficients, 214 
Hill plot, 213-214, 214 
Histamine, synthesis of, 726, 726f 
Histidine 
in carbon dioxide hydration, 269, 270, 270f 
in catalytic triads, 258-260. See also 
Catalytic triads 
conversion to glutamate, 692, 692f 
distal, 198, 198 
histamine from, 726, 726f 
in oxygen release from hemoglobin, 
207, 207f
proximal, 197 
structure of, 31, 31f 
Histidine operon, 932-933, 932f 
Histone acetyltransferase (HAT), 949, 950 
Histone deacetylase, 950 
Histones, 939-941, 940f 
acetylation of, 298t, 949-951, 950f 
amino-terminal tails of, coactivator 
modification of, 949-951, 950f 
in gene regulation, 939-941, 940f, 
949-951, 950f 
sequence homology of, 940-941, 940f 
structure of, 940-941, 940f 
HIV infection. See Human 
immunodeficiency virus infection 
H+-K+ pump, 374
HMG-CoA (3-hydroxy-3-methylglutaryl 
CoA), 653, 653, 654 
in cholesterol synthesis, 767-768,
770-771 
metabolic fates of, 768f 
regulation of, 770-771 
HMG-CoA reductase, in cholesterol 
synthesis, 767-768 
HMG-CoA reductase inhibitors, 779, 779, 
1040, 1040-1041, 1041 
Hodgkin, Alan, 382 
Hoffmann, Felix, 1039-1040 
Hogness box, 126f, 127 
Holliday junctions, 845-846, 845f 
Holliday, Robin, 845 
Holoenzymes, 221 
Homeostasis, caloric, 791-798. See also 
Caloric homeostasis 
Homocysteine 
in activated methyl cycle, 717-718, 717f 
in amino acid degradation, 693, 693 
in amino acid synthesis, 717-718
in cysteine synthesis, 719 
synthesis of, 717 
in vascular disease, 719 
Homocysteine methyltransferase, 717 
Homocystinuria, 698t 
Homogenate, centrifugation of, 67, 75t 
Homogentisate, 695, 696 


Homogentisate oxidase, 696 
in alcaptonuria, 697 
Homologous recombination, 970b 
in color blindness, 970 
in gene knockout, 164-165 
Homologs, 174-175, 175f 
Homology, 174-181, 175f 
sequence, 175-181, 176f-182f. See also 
Sequence homologies 
Homolytic cleavage, 651, 651f 
Homopolymers, 328 
Homotropic effects, 294 
Hood, Leroy, 987 
Horizontal gene transfer, 278-279 
Hormone(s). See also specific hormones 
in citric acid cycle regulation, 514 
eicosanoid, 664-665, 665, 665f
in fatty acid metabolism, 643-644, 
666-667 
in gene expression, 946-949 
in gluconeogenesis, 488 
in glucose metabolism, 488 
in glycogen metabolism, 624-626, 
632-633, 632f 
local, 665 
metabolic functions of, 446-447
sex 
regulation of, 946 
synthesis of, 780, 780f 
in signal transduction, 405-406. See also 
Signal transduction 
steroid, 780-782 
anabolic, 785, 948 
cardiotonic, 377, 377f 
hydroxylation of, 781-782 
structure of, 781, 781 
synthesis of, 780, 780f, 783 
Hormone receptor(s), 946-948, 947f-949f, 
948f, 949f 
DNA-binding domains of, 946, 947f 
drug binding to, 948-949, 949f 
ligand-binding domains of, 946-947, 947f 
nuclear localization of. See Nuclear 
hormone receptors 
Hormone-receptor complexes, in signal 
transduction, 405-406 
Human immunodeficiency virus, 
999-1000, 999f 
Human immunodeficiency virus infection 
aspartyl protease in, 264, 264f 
helper T cells in, 996-998 
immune system in, 999-1000, 999f 
protease inhibitors for, 264-265, 265, 266f 
development of, 1043, 1043f, 1044f 
resistance to, 1050 
vaccine for, 1003 
Humoral immune response, 979-980 
Huntingtin, 842 
Huntington disease, 55-56, 842 
Hurler disease, 332, 332f 
Huxley, Andrew, 382 
Hyaluronate, 331, 332 
Hybridization, DNA, 121 


Hybridoma cells, in monoclonal antibody 
production, 87-88, 87f 
Hydrogen bonds, 5, 8, 9, 9f, 11, 38-40. See 
also Bonds 
in α helix, 38-40, 39f
in antigen-antibody binding, 984, 985,
985f, 986
in β sheet, 40-42, 40f, 41f
in double helix, 5, 5f, 10, 114, 118f 
in enzyme-substrate complex, 228, 228f 
in water, 9-10, 9f 
Hydrogen ion(s). See also Acid(s) 
in aqueous solutions, 13-17 
oxygen affinity of hemoglobin and, 
206-207, 206f 
pH and, 13-14. See also pH 
sour taste of, 962, 962f, 965-966 
Hydrogen ion channels, in taste, 965-966 
Hydrogen-potassium ion pump, 374 
Hydrolases, 249t. See also Enzyme(s) 
Hydrolysis 
in metabolism, 444-445 
of peptide bonds. See Peptide bonds, 
cleavage of 
Hydronium ions, 13 
Hydropathy plot, 360, 361f 
Hydrophilic side chains, 30, 30f. See also 
Amino acid side chains 
protein folding and, 46, 46f, 47 
Hydrophobic effect, 9-10, 9f, 28 
Hydrophobic interactions, 10 
Hydrophobic side chains, 28, 29f. See also 
Amino acid side chains 
protein folding and, 45-46, 46f, 47 
Hydroxide ions, pH and, 13-14 
3-Hydroxy-3-methylglutaryl CoA. 
See HMG-CoA 
(3-hydroxymethylglutaryl CoA) 
L-3-Hydroxyacyl CoA dehydrogenase, 647 
3-Hydroxyanthranilate, 696, 696 
D-3-Hydroxybutyrate, 653, 653-655. See 
also Ketone bodies 
D-3-Hydroxybutyrate ACP, 658t, 
659, 659 
Hydroxyethyl-TPP 
in amino acid synthesis, 723-724, 723f 
in citric acid cycle, 500, 500 
Hydroxyl groups, 30, 30f, 44, 57-58 
Hydroxylation 
cytochrome P450 in, 782-783, 782f 
protein, 30, 30f, 57-58 
steroid, 781-782
p-Hydroxymercuribenzoate, 291, 291, 293 
Hydroxyphenylpyruvate, 696 
in amino acid degradation, 695, 696f 
in amino acid synthesis, 720, 721f 
Hydroxyphenylpyruvate hydroxylase, 696 
17α-Hydroxyprogesterone, 784, 784
Hydroxyproline, 44, 57 
synthesis of, 812f, 813 
Hyperbaric oxygen therapy, 206 
Hypercholesterolemia, 759, 774-775 
familial, 759, 776-779 


Hyperlysinemia, 698t 
Hypersensitive sites, 944 
Hyperuricemia, 753-754 
Hypervariable loops, 984, 984f 
Hypochromism, 120, 120f 
Hypomethylation, 945 
Hypoxanthine, 838, 838 
degradation of, 752, 753f 
Hypoxanthine-guanine 
phosphoribosyltransferase 
(HGPRT), 744 
deficiency of, 754 
Hypoxia inducible factor 1 (HIF-1), in 
cancer, 515 
Hypoxia, tumor, 479 
Hypoxia-inducible transcription factor 
(HIF-1), 479, 479f, 479t 


I band, 1016, 1016f 
Ibuprofen, 358 
I-cell disease, 336 
Ig. See Immunoglobulin(s) 
Ig-α-Ig-β, 989, 989f
Imatinib mesylate, 421 
Imidazole, 31, 31 
4-Imidazolone 5-propionate, 692, 692 
Immune response, 977-1004. See also 
Antibodies; Antigen(s) 
antigen binding in, 984-986 
in autoimmune diseases, 1001 
B cells in, 989-990. See also B cell 
in cancer, 1001-1002 
cellular, 980 
class switching in, 990-991, 991f 
evolution of, 978-981 
humoral, 979-980 
overview of, 977-978 
selectins in, 339, 339f 
to self-antigens, 1001 
self-tolerance in, 1001 
signaling in, 989-990, 998 
T cells in, 980-981. See also T cell(s) 
Immune-modulating drugs, 990 
Immunity 
adaptive, 977, 978, 979-981 
cell-mediated, 961 
innate, 978-979 
Immunization. See Vaccine(s) 
Immunodeficiency, gene therapy for, 
167-168 
Immunoelectron microscopy, 91, 91f 
Immunogens, 980 
Immunoglobulin(s). See also Antibodies 
cleavage of, 982, 982f 
constant regions of, 984, 984f 
definition of, 83 
heavy chains of, 981-982, 981f,
987-989, 988f
in class switching, 991
formation of, 991
light chains of, 981-982, 981f, 987-989,
987f, 988f
in class switching, 991 


properties of, 982t 
segmental flexibility of, 982, 982f 
sequence diversity in, 983f, 984 
variable regions of, 983, 984f 
Immunoglobulin α, 989
Immunoglobulin A (IgA), 982t, 983
Immunoglobulin β, 989
Immunoglobulin D (IgD), 982t, 983, 983f 
Immunoglobulin domains, 981-982 
Immunoglobulin E (IgE), 982t, 983, 983f 
Immunoglobulin fold, 984, 984f 
Immunoglobulin G (IgG), 981f-983f, 982, 
982t, 983f 
Immunoglobulin M (IgM), 982t, 983, 983f 
B cells and, 989, 989f 
Immunologic techniques 
antibody production for, 86-88, 87f 
hybridoma cells in, 87-88, 87f 
monoclonal antibodies in, 86-88,
86f-88f
polyclonal antibodies in, 86, 86f 
for protein study, 84-91 
Immunological memory, 1002 
Immunoreceptor tyrosine-based activation 
motif (ITAM), 989 
Immunosuppressants, 990 
Immunotoxins, genetically engineered, 157 
IMP. See Inosine monophosphate (IMP) 
Inactivated vaccines, 1002 
Inborn errors of metabolism, 697-698, 698t 
Indian muntjac, genome of, 113, 113f 
Indinavir, 264-265, 265, 266f 
development of, 1043, 1043f 
Indirect ELISA, 88, 89f 
Indole, substrate channeling and, 722-723 
Indole-3-glycerol phosphate, 721, 722 
Induced pluripotent stem cells, 944 
Induced-fit model, of enzyme-substrate 
complex, 228, 228f, 254, 457, 457f 
Infant respiratory distress syndrome, 
765-766 
Influenza virus, 339-340 
Information science, 17 
Inhibition constant (Ki), 1031
Initiation factors, 902 
Initiator tRNA, 907 
In-line displacement, 272-274 
Innate immunity, 977. See also Immune 
response 
Inosinate. See Inosine monophosphate (IMP) 
Inosine 
in anticodons, 891 
degradation of, 752, 753f 
Inosine monophosphate (IMP), 742f, 743 
AMP from, 743-744, 743f 
GMP from, 743-744, 743f 
synthesis of, 743, 743f, 744, 751 
Inositol 1,4,5-trisphosphate (IP3), 403, 
408-409, 409f 
Inositol, in phospholipids, 349 
Insertional inactivation, 149f
Insertions, production of, 157 
Insig, 771, 772f 
Insulin
amino acid sequence of, 35, 35f 
in appetite regulation, 796, 797 
biosynthesis of, 303 
in diabetes, 798-803. See also Diabetes 
mellitus 
exercise and, 803 
in fatty acid synthesis, 667 
in glucose metabolism, 488, 798-800 
glucose-stimulated secretion of, 801-802 
in glycogen metabolism, 632-633, 632f 
mass spectrometry of, 91-95, 93f 
in obesity, 798, 800-802 
regulation of, 798-800 
secretion of, 799, 799f, 801-802 
in signal transduction, 411-415, 
412f-415f, 798-800, 799f 
structure of, 35, 35f, 412, 412 
Insulin receptor, 412-415, 412f-415f,
799, 799f
Insulin receptor substrates, 412-415, 414f, 
799, 800 
Insulin receptor tyrosine kinase, 412-415 
Insulin resistance, 798-802, 799-802 
Insulin-dependent diabetes mellitus, 
802-803. See also Diabetes mellitus 
Integral membrane proteins, 355, 355f 
Intercalation, 862 
Intermediate-density lipoproteins, 773, 
773t, 774. See also Lipoprotein(s) 
Internal aldimine, 682, 682 
in amino acid degradation, 682, 682f 
in amino acid synthesis, 712, 712f 
Internal guide sequence, 881, 881f 
Internal repeats 
in amino acid sequences, 79 
sequence homology of, 184, 185f 
Internet databases, of amino acid sequences, 
65-66, 181, 182f 
Intestinal microvilli, 328f 
Intestines, protein digestion and absorption
in, 674, 674f
Intrinsic pathway, coagulation, 308, 308f 
Intrinsically unstructured proteins, 54-55 
Introns, 110, 131-132, 132f, 869 
evolution of, 133 
group I, 880 
removal of, 132-133, 132f. See also 
Splicing 
Inverted repeats, as recognition sites, 275 
Ion channels, 372, 382-393 
acetylcholine receptor as, 389-390, 
390f, 391f 
action potentials and, 382, 382f, 
391-392, 391f 
calcium, 367, 384f, 386 
cell-to-cell, 393-394, 393f, 394f 
definition of, 382 
equilibrium potential and, 391-392, 391f 
gap junction, 372, 393-394, 393f, 394f 
in hearing, 972-973 
inactivation of, 388-389, 389f, 392 
ligand-gated, 389-390, 391f 
nerve impulses and, 382 
opening and closing of, 389-390, 
391f, 392 
patch-clamp studies of, 383, 383f 
potassium, 364-393, 382. See also 
Potassium ion channels 
rate of transport in, 387 
responsive to mechanical impulses, 
972-973 
selectivity filter in, 385-386, 386f, 387f 
sequence homologies of, 384, 384f, 386 
sodium, 382, 383-384, 384f, 387-392. 
See also Sodium ion channels 
specificity of, 385-387 
in taste, 965-966, 965f 
TRP, 973 
voltage-gated, 387, 388f 
Ion, dipolar, 27-28, 27f
Ion gradients, 373-374 
in ATP synthesis, 437-438, 437f 
Ion pumps, 371-372, 374-381 
Ca2+, 374-377, 375f, 376f
evolution of, 374, 378-379
H+-K+, 374
Na+-K+, 374, 377
Ion-exchange chromatography, 69-70, 70f, 
75, 75t 
in amino acid identification, 
80-82, 80f 
IP3 (inositol 1,4,5-trisphosphate), 408, 
408-409, 409f 
IPTG (isopropylthiogalactoside),
926, 926
IRE (iron-response element), 
951-952, 952f 
IRE-binding protein, 951-952 
Iron
in heme, 730. See also Heme 
oxygen binding to, 196-198, 197f 
metabolism of, 951-953 
Iron center, of ribonucleotide 
reductase, 746 
Iron protein, in nitrogen fixation, 707, 
708-709 
Iron-molybdenum cofactor, 708, 
708-709 
Iron-response element (IRE), 951-952 
Iron-response element-binding protein 
(IRP), 951-952, 953f 
Iron-sulfur proteins, 506 
in nitrogen fixation, 708-709 
in oxidative phosphorylation, 534, 536 
IRP (iron-response element-binding 
protein), 951-952, 953f 
Isocitrate dehydrogenase, in citric acid cycle,
506, 514, 514f
Isocitrate, in citric acid cycle,
506-507, 507f
Isocitrate lyase, in glyoxylate cycle, 
518-519, 519f 
Isoelectric focusing, 73, 73f 
with SDS-PAGE, 74, 74 


Isoelectric point, 73 
Isoenzymes, 289-290, 296-297, 297f 
Isoleucine
degradation of, 694-695
in maple syrup urine disease, 697
structure of, 28, 29f
synthesis of, 723-724
Isomerases, 249t. See also Enzyme(s)
Isomerization, in metabolism, 443t, 444
Isomers, 27
carbohydrate, 321
constitutional, 321, 321f
3-Isopentenyl pyrophosphate, 742f,
768, 768
Isopeptide bonds, 675, 675f 
Isopropylthiogalactoside (IPTG), 926, 926 
Isovaleryl CoA, 694, 694 
Isozymes, 289-290, 296-297, 297f 
ITAM (immunoreceptor tyrosine-based 
activation motif), 989 


J genes, 987-988
in antibody switching, 991
Jacob, Francois, 924
Jagendorf, André, 577
Jenner, Edward, 1002
Joining (J) genes, 987-988
in class switching, 991
Joule (J), 7, 223b


K+ ions. See Potassium
κ chains, 982, 983f, 987, 989
kcat (turnover number), 234, 234t
kcat/KM ratio, 234-235
Kd (dissociation constant), 1030-1031
for enzyme-substrate complex, 233-234
for ligand binding, 1030-1031, 1030f
Kendrew, John, 196
Kennedy, Eugene, 526, 644
Keratan sulfate, 331, 332
α-Keratin, α helices of, 43-44, 44, 44f
Ketimine, 682, 682, 712
Ketoacidosis, in maple syrup urine
disease, 697
3-Ketoacyl CoA, 647
α-Ketobutyrate, 719
in amino acid degradation, 693, 693
in amino acid synthesis, 719, 723
Ketogenic amino acids, 690. See also
Amino acid(s)
Ketogenic diet, 656
α-Ketoglutarate, 692
in amino acid degradation, 692
in amino acid synthesis, 709-710, 712
in citric acid cycle, 507, 507f, 692
in double-displacement reactions, 237
in glutamate synthesis, 709, 710
in 4-hydroxyproline synthesis, 812f, 813
in nitrogen fixation, 709
α-Ketoglutarate dehydrogenase complex
in citric acid complex, 499-500, 507
in citric acid cycle, 507, 514-515, 514f
α-Ketoisocaproate, 693-694, 694


Ketone bodies, 653-656 
acetyl CoA formation from, 633f, 
653-655 
in citric acid cycle, 810, 810f 
in diabetes, 633f, 653, 655, 803 
in ethanol metabolism, 811 
as fuel source, 654-655, 654f
in starvation, 809f, 810 
synthesis of, 810, 810f 
Ketose, 320, 320f 
in Calvin cycle, 594 
in transaldolase reaction, 605, 606f 
in transketolase reaction, 605, 605f 
Ketosis, diabetic, 633f, 653, 655 
3-Ketosphinganine, 764 
in sphingolipid synthesis, 737 
β-Ketothiolase, 647, 647t
Keyhole limpet hemocyanin, 990 
Khorana, Har Gobind, 128 
Ki (inhibition constant), 1031
Kidney 
drug metabolism in, 1035 
gluconeogenesis in, 481 
Killed vaccines, 1002 
Killer T cells, 980, 994-995, 997-998, 997f. 
in ATP hydrolysis, 280
in carbon dioxide hydrolysis, 221, 221t,
in DNA cleavage, 274, 277
prostaglandin H2, 356, 358f
transporter. See Membrane transport;
active, 371
concentration gradient in, 373-374

free energy in, 373-374, 373f 
ion channels in, 372, 382-393. See also
membrane potentials in, 373, 382,
evolution of, 447
salvage, 740 
definition of, 740 
in nucleotide synthesis, 736, 736f, 
Metabolism, 427-447
activated carriers in, 438-442
bioenergetics of, 429-435
electron carriers in, 438-439
electron carriers in, 438-439, 440t
energy charge in, 446-447, 446f
first-pass, 1035
free energy in, 429-430
Methionine sulfoxide, 307, 307f
Methyl cycle, 717-718, 717f
in amino acid synthesis, 716-718,
Methylcobalamin, 717
Methylcrotonyl CoA, 694, 694 
5-Methylcytidine (mC), 890 
5-Methylcytosine, 945, 945-946 
N5,N10-Methylenetetrahydrofolate,
715, 716
Methylglutaconyl CoA, 694, 694 
Methylmalonyl CoA, 649, 650, 693 
Methylmalonyl CoA mutase, 650-652, 
652, 693 
N5,N10-Methyltetrahydrofolate, 749f, 750
Mice, transgenic, 164, 164f
Micelle, 352, 352f
calculation of, 233
Michaelis-Menten equation, 229-237, 231f
Michaelis-Menten kinetics, 229-237, 231f
fatty acid metabolism in, 653f
immunoelectron, 91, 91f
Microsomal ethanol-oxidizing system 
(MEOS), 811 
Mitchison, Tim, 1019
Mitochondria 
in apoptosis, 559 
Mitochondrial diseases, 558-559
Mucoproteins, 330, 333, 333f
Muller, Hermann, 839 
Mullis, Kary, 145 
Multidrug resistance, 378 
Multidrug-resistance (MDR) protein,
378, 378f
Multienzyme complexes, 660-661 
substrate channeling by, 722-723, 722f 
Multiple myeloma, 87-88 
Muscle 
amino acid degradation in, 684 
cardiac, H and M isozymes in, 297, 297f 
drug-related injury of, 1040 
energy storage in, 641, 805t, 806 
glycolysis in, 473-474, 475f, 489-490 
metabolism in, 804-806, 804f
interchanges with liver, 490, 490f, 
684-685 
in starvation, 809-810 
triacylglycerols in, 641 
Muscle contraction. See also Exercise 
actin in, 1012-1018. See also Actin/actin 
filaments 
ATP formation during, 434-435, 435f, 
489-490, 805-806, 805t 
ATP hydrolysis in, 1009-1012, 1011f, 
1012f, 1015, 1016f 
creatine phosphate and, 434-435, 435f
gluconeogenesis in, 489-490, 490f 
lactate formation during, 435f 
myosin in, 1012-1018. See also Myosin 
pyruvate in, 489-490, 490f 
sliding filament model of, 
1016-1017, 1017f 
Mushroom poisoning, α-amanitin in, 865,
865-866 
Mutagenesis 
in recombinant DNA technology, 
156-157, 156f, 157f 
site-directed, in catalysis investigation, 
262-263 
Mutagens, 838 
Mutases, 464 


Mutations 


base-pair substitution, tests for, 843-844 
deletion, production of, 156 
disease-causing, 157. See also Diseases 
and disorders 
functional effects of, 157 
identification of, 147-148 
frameshift, tests for, 843-844 
insertion, production of, 157 
point, production of, 157 
production of, 156-157, 156f, 157f 
in recombinant DNA technology, 156-157 
somatic, antibody diversity and, 989 
substitution, production of, 156, 156f 
MWC (concerted) model 
of allosteric enzyme kinetics, 296, 296f 
of hemoglobin oxygen binding, 202-203, 
203f, 214-216, 216f 
myc gene, in cancer, 869 
Mycobacterium tuberculosis, 679 
Myelin, protein content of, 355 
Myeloma, 87-88 
Myofibrils, 1016 
sarcomere of, 1016-1017, 1017f 
Myogenin, 165, 165f 
Myoglobin 
electron-density map of, 100f 
evolution of, 187-188, 187f
forms of, 196 
Hill plot for, 213-214, 214f 
overview of, 195-196 
oxygen binding by, 196-198,
198f-200f, 200
oxygen-binding curve for, 200-201, 
200f, 201f 
sequence homology with hemoglobin, 
175-181, 176f-182f, 199 
structure of, 46, 46-48, 46f, 196, 196-197
tertiary, 183, 183f 
vs. hemoglobin, 195-196 
Myopathy, drug-related, 1040 
Myosin, 279-284, 1007-1018 
ATP binding to, 1010-1012, 1011f, 1012f 
ATP hydrolysis and, 279-283,
281f-283f
ATPase domains of, 279-280, 280f 
functions of, 279 
kinesin and, 1008. See also Kinesin 
lever arm of, 1011-1012, 1011f, 
1018, 1018f 
movement along actin, 1012-1018. See 
also Actin/actin filaments 
direct observation of, 1014-1015 
power stroke in, 1015 
rate of, 1018, 1018f 
P-loop of, 283-284 
relay helix of, 1012, 1012f 
S1, 1009, 1009f, 1011f 
S2, 1009, 1009f
structure of, 279, 279f, 1008-1010, 1009f,
1010f, 1011f
conformational changes in,
280-283, 282f
in thick filaments, 1016, 1017f
in thin filaments, 1016, 1017f
Myosin-actin complex, 1014, 1014f 
Myristoylation, 298t 


Na+. See under Sodium
NAD+ (nicotinamide adenine dinucleotide)
in catabolism, 438, 438f
in citric acid cycle, 501-502, 512
dehydrogenase binding sites for, 469, 469f
in fatty acid metabolism, 646, 647, 647f
in glycolysis, 469
NADH (nicotinamide adenine dinucleotide
reduced)
ADP units in, 447, 447f
in ATP synthesis, 439, 589
in citric acid cycle, 498, 498f, 502, 503,
512, 514f
ethanol and, 811
in fatty acid metabolism, 646-647,
647f, 664
in gluconeogenesis, 483-484
in glycolysis, 466-468
in oxidative phosphorylation, 525
electron-transfer potential of, 530-531
reduction potential of, 528-530
transport of, 551-552, 551f, 552f
vs. NADPH, 589-590
NADH dehydrogenase, in respiratory chain,
531-535, 532f
NADH-cytochrome b5 reductase, 664, 664f
NADH-Q oxidoreductase (Complex I), in
respiratory chain, 531-535, 532f
NADP+ (nicotinamide adenine dinucleotide
phosphate), 438, 439
as electron carrier, 439 
in pentose phosphate pathway, 601, 602f, 
606-607 
in photosynthesis, 576-577 
in reductive biosynthesis, 439 
NADPH (nicotinamide adenine 
dinucleotide phosphate reduced), 
438, 439 
in Calvin cycle, 589-590, 590f, 596f, 599 
as electron carrier, 439 
pathways requiring, 601t 
in pentose phosphate pathway, 589-590, 
601-603, 604t, 607-609 
in photosynthesis, 566-567, 576-577, 
579-581, 589-590, 599 
in reductive biosynthesis, 439, 589-590 
in steroid hydroxylation, 781-782 
Nalidixic acid, 831, 831 
Nasal anatomy, in olfaction, 958, 959f. See 
also Olfaction 
Nathans, Daniel, 141 
Neanderthals, DNA sequencing for, 
188-189, 189f 
Neck linker, 1012, 1012f 
Neher, Erwin, 383 
Nernst equation, 391 
Nerve gases, 241 
Nerve impulse, propagation of, 382, 382f 
Neuberg, Carl, 454
Neural-tube defects, 755
Neuraminidase, 339, 340f
Neuroglobin, 212
Neurological diseases. See Diseases and
disorders
Neuropeptide Y, 796

Neurotransmitters
definition of, 389
ion channels and, 389-390
release of, 366f
Neutral fat. See Triacylglycerol(s)
Niacin, 441, 441t
Nicolson, Garth, 362
Nicotinamide adenine dinucleotide.
See NAD+
Nicotinamide adenine dinucleotide
phosphate. See NADP+; NADPH
Nicotinamide, synthesis of, 726f, 727
Nicotinic acid, 441t


9 + 2 array, 1019
19S regulatory unit, 667f, 677


Ninhydrin, in Edman degradation, 81, 81
Nirenberg, Marshall, 128
Nitric oxide
in signaling, 728
synthesis of, 727-728, 728f
Nitric oxide synthase, 728
Nitrogen excretion
ammoniotelic organisms in, 690
glucose-alanine cycle in, 684-685, 685f
transamination in, 680-685
transport to liver in, 684-685
urea cycle in, 673, 685-690
ureotelic organisms in, 685, 690
Nitrogen fixation, 706-710, 707f
Nitrogen, in ammonia, 706. See also
Ammonia
Nitrogenase, 707
iron-molybdenum cofactor of, in nitrogen
fixation, 708-709
Nitrogenase complex, 707
Nitrogen-starvation promoter, 858, 858f
N-linked oligosaccharides, 330, 330f, 336
NMP kinases. See Nucleoside
monophosphate (NMP) kinases
Nociceptors, 973
NOESY (nuclear Overhauser enhancement
spectroscopy), 101-104, 102f, 103f
NompC, 972
Noncompetitive inhibition, 238, 238f,
239-240, 239f, 240f
Noncovalent bonds, 7-8. See also Bonds
electrostatic interactions, 7-8
in enzyme-substrate complex, 228, 228f
hydrogen bonds, 8, 9, 9f. See also
Hydrogen bonds
hydrophobic effect and, 9-10, 9f
surface complementarity and, 11
van der Waals interactions, 8, 8f, 10, 10f,
11, 46
Nonessential amino acids, 711, 711t, 712f
synthesis of, 712-713
Nonheme iron proteins. See Iron-sulfur
proteins 
Nonhomologous end joining, 841 
Nonpolar molecules, 9 
Nonreducing sugars, 325 
Nonshivering thermogenesis, 556-557 
Northern blotting, 142 
Nose. See also Olfaction 
anatomy of, 958, 959f 
electronic, 961-962, 962f
Novobiocin, 831 
N-terminal rule, 676 
NTP. See Nucleoside triphosphates (NTPs) 
NTPase domains, P-loop, 284, 284f 
NTPases, P-loop. See P-loop NTPases 
Nuclear envelope, 365 
Nuclear hormone receptors, 946-948,
947f-949f, 948f, 949f, 950
DNA-binding domains of, 946, 947f 
drug binding to, 948-949, 949f 
ligand-binding domains of, 
946-947, 947f 
Nuclear localization signal, 79 
Nuclear magnetic resonance (NMR) 
spectroscopy, 101-104, 101f-103f 
Nuclear Overhauser enhancement 
spectroscopy (NOESY), 101-104, 
102f, 103f 
Nuclear pores, 365 
Nucleation-condensation model, 53 
Nucleic acids, 109. See also DNA; RNA 
abbreviations for, 112, 112f 
evolution of, molecular studies of, 
188-191, 190f 
sequence of, notation for, 112, 112f 
Nucleoside(s), 111, 111 
nomenclature of, 736t 
Nucleoside diphosphate (NDP), 739 
phosphorylation of, 431 
Nucleoside diphosphate kinases, 431, 739 
Nucleoside diphosphokinase, 629 
in citric acid cycle, 508 
Nucleoside monophosphate, 739 
phosphorylation of, 431 
Nucleoside monophosphate kinases, 
248-249, 431, 739 
P-loop of. See P-loop(s) 
structure of, 284 
Nucleoside phosphorylases, 752 
Nucleoside triphosphate-magnesium 
complex, 280 
Nucleoside triphosphates (NTPs), 739. 
See also P-loop NTPases 
hydrolysis of, energy from, 431 
Nucleosome core particle, 939 
Nucleosomes, 939-941, 940f 
Nucleotidases, 752 
Nucleotide(s), 110, 110, 111-112. 
See also Bases/base pairs; 
Deoxyribonucleotide synthesis; Purine 
nucleotides; Pyrimidine nucleotides 
definition of, 735 
disorders of, 752-755


functions of, 735 
nomenclature of, 736t 
synthesis of, 735-755 
de novo pathways in, 736-740, 736f 
deoxyribonucleotide, 745-750, 752, 752f 
disorders of, 753-755 
inosinate in, 743-744, 743f 
purine, 740-745, 751, 751f 
pyrimidine, 736-740, 736f, 751 
regulation of, 750-752 
salvage pathways in, 736, 736f, 744-745 
substrate channeling in, 738 
transformation to amino acids. See 
Translation 
Nucleotide-excision repair, 840-841, 841f 
Nucleotides, nomenclature for, 112 
NusA, in transcription termination, 861 
Nutrition, 20, 20f. See also Diet; Food; 
Starvation 
starved-fed cycle and, 807-808 


Obesity, 792-803 
causes of, 792-794
definition of, 792 
diabetes and, 798-803 
dieting and, 797-798 
evolution and, 792-793 
fatty acids in muscle and, 800-801 
health consequences of, 793t 
insulin resistance and, 798-800, 800 
metabolic syndrome and, 800 
pancreatic function in, 801-802 
prevalence of, 792-794 
Obligate anaerobes, 468-469 
pathogenic, 468-469, 469t 
Odorant receptors, 958-960, 960f, 961f 
Okazaki fragments, 823-824, 823f, 833 
Okazaki, Reiji, 823 
Oleate, 347, 347, 664, 664 
Olfaction, 958-962, 964, 964f 
anatomic structures in, 958, 959f 
combinatorial mechanisms in, 960-962 
by electronic nose, 961-962, 962f 
evolution of, 959-960, 959f 
impaired, 958 
odorant receptors in, 958-960, 960f, 961f 
7TM receptors in, 959 
signal transduction in, 959-960, 960f 
Oligomerization, in B-cell activation, 
989-990 
Oligonucleotide-directed mutagenesis, 
156, 156f 
Oligopeptides, 34. See also Peptide(s) 
Oligosaccharides, 327. See also 
Disaccharides; Polysaccharides 
abbreviations for, 330b 
in gangliosides, 764, 765f 
in glycoproteins, 329-337 
N-linked, 330, 330f, 336 
O-linked, 330, 330f, 336 
sequencing of, 336-337 
O-linked oligosaccharides, 330, 330f, 336 
Ω loops, 42, 43f


Omeprazole, 1046, 1046 
Oncogenes, 420 
One-carbon units 
in amino acid degradation, 716 
in amino acid synthesis, 715-716,
715t, 716f
Operator DNA, 922f 
Operon(s), 924-928 
histidine, 932-933, 932f 
lac, 925-928, 925f, 926f, 927. See also 
under lac 
phenylalanine, 932-933, 932f 
structure of, 924-925, 925f 
threonine, 932-933, 932f 
trp, 931 
Opsin, 967 
Optical trap, 1015, 1015f 
Oral bioavailability, 1032-1033, 1032f 
oriC locus, 834, 834f 
Origin of replication 
in bacteria, 834, 834f 
in eukaryotes, 835-836 
Origin of replication complexes, 835 
Ornithine, 686, 686 
Ornithine transcarbamoylase, 686 
deficiency of, 689 
Orotate, 738, 738-739 
Orotidylate, 739, 739 
Orotidylate decarboxylase, 739 
Orthologs, 174-175, 175f 
Orthophosphate 
in gluconeogenesis, 483 
resonance structures of, 433, 433f 
Osteoarthritis, 332 
Osteogenesis imperfecta, 45 
Osteomalacia, 786 
Oster, George, 548 
Overlap peptides, 82, 83f 
Overweight, 792, 793f 
Oxaloacetate, 237, 443 
in amino acid degradation, 692 
in amino acid synthesis, 712 
in Calvin cycle, 599, 600f 
in citric acid cycle, 498, 498f, 504, 505f, 
509-510, 515, 516f, 653 
in gluconeogenesis, 482, 483-484, 484f 
in glyoxylate cycle, 518-519, 519f 
synthesis of, 443-444 
transfer from mitochondria to cytoplasm, 
662-663 
in urea cycle, 687, 687f 
Oxidation, of xenobiotic compounds, 1035 
Oxidation-reduction potential,
528-530, 529t 
measurement of, 528f, 529-530 
in oxidative phosphorylation, 528-531 
Oxidation-reduction reactions
free energy change in, 530 
in metabolism, 443, 443t 
reduction potential of, 529-530, 529t 
Oxidative damage 
defenses against, 540-542 
in DNA, 838 


Oxidative phosphorylation, 437, 525-561. 
See also ATP, formation of 
ATP in, 525-526 
ATP yield in, 554-555, 555t 
ATP-ADP translocase in, 552-553, 
553f, 554f 
cellular respiration in, 526 
chemiosmotic model of, 544-545,
544f, 545f 
in citric acid cycle, 498 
electron transfer in, 528-531. See also 
Respiratory chain 
rate of, 542-543, 542f 
during exercise, 806 
glycerol 3-phosphate shuttle in, 
551, 551f 
heat from, 556-557
inhibition of, 558, 558f
malate-aspartate shuttle in, 552, 552f
NADH in, 525 
electron-transfer potential of, 
530-531 
reduction potential of, 528-530 
transport of, 551-552, 551f, 552f 
overview of, 525-526, 526f, 551f 
oxidative damage and, 540-542, 
541t, 542f 
proton gradients in, 530-531, 543-550, 
559-560, 560f 
proton pumps in, 525 
proton-motive force in, 525, 544-545, 
544f, 545f 
rate of, 555-556, 556f 
reduction potential in, 528-531 
respiratory chain in, 525, 531-543. See 
also Respiratory chain
in thermogenesis, 556-557 
transporters in, 551-553, 551f-553f 
uncoupling of, 556-557, 558 
vs. photosynthesis, 578, 579f 
Oxidoreductases, 249t 
Oxidosqualene cyclase, 769 
8-Oxoguanine, mutagenicity of, 838, 838f 
Oxyanion hole, 259, 259f 
of chymotrypsin, 261, 261f 
Oxygen 
fractional saturation of, 199 
hemoglobin affinity for, 204-205, 204f 
hemoglobin binding of, 196-199. See also 
Hemoglobin, oxygen binding by 
hemoglobin release of, 200, 201f, 
206-208, 208f 
carbon dioxide in, 207-208, 208f
myoglobin binding of, 196-198,
198f-200f, 200
partial pressure of, 200
photosynthetic production of,
572-574, 574f
reactive. See Reactive oxygen 
species (ROS) 
in steroid hydroxylation, 781-782 
Oxygen therapy, hyperbaric, 206 
Oxygen transport, 200-201, 207-208, 208f 


Oxygenases 
in amino acid degradation, 695-696 
mixed-function, 695, 781 
Oxygenation, in amino acid degradation, 
695-696 
Oxygen-binding curve, 199-200, 200f, 201f 
with concerted model, 214-216, 216f 
for Hill coefficients, 214, 214f 
Oxymyoglobin, 196, 198. See also 
Myoglobin 


P clusters, 708, 708f 
P site, ribosomal, 900, 900f, 903 
p53, 843 
p160 family, 948 
P680, 573-574, 573f 
P700, 576 
P960, 571 
Pain management, capsaicin in, 974 
Pain sensation, 973-974 
PALA (N-phosphonacetyl-L-aspartate), 
293, 293-294 
Palade, George, 526 
Palindrome, 141b 
Palindromic sequences, 141 
Palmitate, 347, 347 
oxidation of, 647-648, 648f 
synthesis of, 661-662 
Palmitoleate, 664, 664 
oxidation of, 648, 648f 
Palmitoleoyl CoA, 648, 648 
Palmitoyl CoA, 647-648, 666 
in sphingolipid synthesis, 764, 764 
Palmitoylcysteine, 359, 359 
Pamaquine, hemolytic anemia and, 610 
PAMP (pathogen-associated molecular 
pattern), 978-979, 979f 
Pancreas 
in diabetes, 801-802. See also Diabetes 
mellitus 
disorders of. See Diabetes mellitus 
gene expression in, 939t 
Pancreatic β cells, 802
Pancreatic enzymes, 255-260, 303-307, 
303t, 304f, 674 
Pancreatic lipases, 641-642, 642f, 
643-644 
Pancreatic trypsin inhibitor, 306 
Pancreatic zymogens, 303-307, 303t, 304f 
Pantothenic acid (coenzyme A),
440, 441t 
as acyl group carrier, 440 
ADP units in, 447, 447f
in fatty acid metabolism, 644-645 
Papain, 220-221 
Paralogs, 174, 175f 
Paraquat, 584 
Pardee, Arthur, 291 
Park, James, 1038 
Parkinson disease, 55-56, 242, 676
Parnas, Jacob, 454 
Partition coefficient, in drug 
absorption, 1032 
Passive transport, 371, 373. See also
Membrane transport 
Pasteur, Louis, 454, 1002 
Patch-clamp technique, 372f, 383, 383f 
Pathogen-associated molecular pattern 
(PAMP), 978-979, 979f 
Pathological conditions. See Diseases and 
disorders 
Pauling, Linus, 38, 197, 209, 226b, 243 
pBR322 plasmid, 149, 149f 
PCR. See Polymerase chain reaction (PCR) 
Pectin, 329, 329f 
Penicillin 
discovery and development of, 1037-1038 
mechanism of action of, 244-245, 244f, 
245f, 1038 
resistance to, 1050 
structure of, 1037, 1037 
Pentose phosphate pathway, 589-590, 
601-611 
Calvin cycle and, 609 
functions of, 608t 
glycolysis and, 601-609 
modes of, 607-609, 607f
nonoxidative phase of, 601-606, 603f, 604t 
oxidative phase of, 601-603, 603f, 604t 
rate of, 607-609 
reactions of, 602f, 604t 
regulation of, 607-609 
in selected tissues, 608t 
stoichiometry of, 607-609, 607f 
transaldolase in, 601-604, 605, 606f 
transketolase in, 601-605, 605f 
Pentoses, 320 
Pepsin, 674 
Peptide(s). See also Protein(s) 
definition of, 34 
synthetic, 95-98, 96f, 97f 
Peptide bonds, 33, 33f, 36-37, 36f, 37f. 
See also Bonds 
angles of rotation for, 37, 37f, 38f 
cis configuration of, 36-37, 36f, 37f 
cleavage of, 58, 220-221, 255, 
258-260, 259f 
acyl-enzyme intermediate in, 257, 
257f, 259f 
chromogenic substrate for, 
256-257, 257f 
deacylation in, 259-260, 259f, 260f 
proteases in, 255-266 
aspartyl, 263, 264, 264f 
cysteine, 263, 263-264, 264f 
metal ion, 263, 264, 264f 
serine, 255-266 
tetrahedral intermediate in, 
258-259, 259f 
double-bond character of, 36 
formation of, 903-904, 904f
hydrolysis of, 220-221, 255-260 
length of, 36, 36f 
planarity of, 36, 36f 
torsion angles for, 37, 37f, 38f 
trans configuration of, 36-37, 36f, 37f 
Peptide ligation, 96
Peptidoglycans, 244, 244f
Peptidyl transferase center, 903-904
Perforin, 996 
Perilipin A, 643 
Peripheral membrane proteins, 355-356, 355f 
Periplasm, 365 
Peroxide bridge, in cytochrome c oxidase, 
539f, 540, 540f 
Peroxide detoxification, 727 
Peroxisomes, 593 
fatty acid metabolism in, 652, 653f 
Perutz, Max, 199 
PFK. See Phosphofructokinase (PFK) 
P-glycoprotein, 378 
pH 
buffers and, 15-17 
definition of, 13 
oxygen affinity of hemoglobin and, 
206-207, 206f 
physiological, 16 
Phagocytes, 978 
Pharmacology. See also Drug development 
definition of, 1030b 
Phase I transformations, in oxidation, 1035 
Phase II transformations, in 
conjugation, 1035 
Phenyl isothiocyanate, in Edman
degradation, 81, 81 
Phenylacetate, 689 
Phenylalanine 
degradation of, 695-696 
in phenylketonuria, 697-698 
structure of, 28, 29f, 696 
synthesis of, 719-720, 720f, 721f 
Phenylalanine hydroxylase, 695 
deficiency of, in phenylketonuria, 697-698 
Phenylalanine operon, 932-933, 932f 
Phenylketonuria (PKU), 674, 697-698 
Phenylpyruvate, 720, 721 
Pheromones, 974 
Phi (φ) angle of rotation, 37-38, 37f, 38f
Phosphatase(s) 
deficiency of, 514 
in signal transduction, 415 
Phosphatase and tensin homolog, 799 
Phosphate carrier, 554, 554f 
Phosphate, in myosin movement, 1015 
Phosphate solutions, as buffers, 16 
Phosphatidate, 348, 348, 762 
in membrane lipid synthesis, 760, 
761, 761f 
synthesis of, 760 
Phosphatidic acid phosphatase (PAP), 
761, 766-767, 766f 
Phosphatidylcholine, 349, 349 
melting temperature of, 362t 
synthesis of, 763, 763f 
Phosphatidylethanolamine, 349, 349 
synthesis of, 763, 763 
Phosphatidylethanolamine 
methyltransferase, 763 
Phosphatidylglycerol, 349, 349 


Phosphatidylinositol, 349, 349, 762, 
762, 763 
Phosphatidylinositol 3,4,5-trisphosphate 
(PIP3), in signal transduction, 414, 
799, 799f 
Phosphatidylinositol 4,5-bisphosphate 
(PIP2), 762 
in signal transduction, 408, 409f, 414, 
799, 799f 
Phosphatidylserine, 349, 349 
synthesis of, 763 
3′-Phosphoadenosine-5′-phosphosulfate
(PAPS), 1035 
Phosphocreatine, synthesis of, 236-237 
Phosphodiester bridges, 110, 110, 111 
hydrolysis of, 272-274 
Phosphoenolpyruvate, 434, 445 
in amino acid synthesis, 719, 720f 
formation of, 445, 465 
in gluconeogenesis, 480f, 481, 482, 
483-484 
in glycolysis, 464f, 465 
NAD+ regeneration in, 466-468
phosphoryl-transfer potential of, 434, 
434t, 465 
Phosphoenolpyruvate carboxykinase, 
488, 488f 
Phosphofructokinase (PFK), 473 
in gluconeogenesis, 487 
in glycolysis, 455f, 458 
in liver, 475-476, 475f, 476f, 487, 488 
in muscle, 473-474, 473f
Phosphofructokinase 2, 487 
Phosphoglucomutase, 471, 620, 621f 
6-Phosphogluconate, in pentose phosphate 
pathway, 601, 603 
6-Phosphoglucono-δ-lactone, in pentose
phosphate pathway, 601-603, 603 
Phosphoglucose isomerase, 457 
2-Phosphoglycerate, 445 
in glycolysis, 445, 464, 466t 
3-Phosphoglycerate, 464 
in Calvin cycle, 591-592 
in cysteine synthesis, 714-715 
in glycine synthesis, 714-715 
in glycolysis, 463-464, 464f, 466t 
in serine synthesis, 714-715, 714f, 723 
3-Phosphoglycerate dehydrogenase, 723, 
723, 724, 724f 
Phosphoglycerate kinase, in glycolysis, 
463-464 
Phosphoglycerate mutase 
in glycogen metabolism, 464, 620 
in glycolysis, 464, 464f 
Phosphoglycerides, 348, 349, 351, 351f 
Phosphoglycolate, in Calvin cycle, 593, 
593-594 
3-Phosphohydroxypyruvate, 714 
in proline synthesis, 714, 714f 
Phosphoinositide cascade, 408-409, 409f 
in glycogen metabolism, 626 
Phospholipase C, in signal transduction,
408, 409f 


Phospholipid(s). See also Lipid(s) 
membrane, 352-355. See also 
Membrane(s); Membrane lipids 
phosphoglycerides, 348, 349 
structure of, 348, 348-349 
synthesis of, 760-767, 760f 
5-Phosphomevalonate, 768 
N-Phosphonacetyl-L-aspartate (PALA), 
293, 293-294 
Phosphopentose epimerase, 596, 596f 
in Calvin cycle, 596, 596f 
in pentose phosphate pathway, 
603, 604t 
Phosphopentose isomerase, in Calvin cycle, 
596, 596f 
Phosphoribomutase, 752 
5-Phosphoribosyl-1-pyrophosphate 
(PRPP), 720-721, 721, 755 
in purine synthesis, 741, 751 
in pyrimidine synthesis, 738-739 
Phosphoribosylamine, 741, 742 
5-Phosphoribosylanthranilate, 721, 722 
Phosphoribulose kinase, 596, 596f 
Phosphorolysis, in glycogen metabolism, 
617-621 
Phosphorothioates, in DNA cleavage, 
273-274, 274f
Phosphoryl group transfer, 432-434, 444 
active carriers in, 438-442, 438f, 439f 
ATP and, 434-435, 434f, 434t, 
436-437, 437f 
in glycolysis, 444, 444f 
Phosphorylase. See Glycogen 
phosphorylase 
Phosphorylase a, 622-623 
Phosphorylase b, 622-623 
Phosphorylase kinase, in glycogen 
metabolism, 623-624, 624f, 629 
Phosphorylaspartate, 374, 374 
Phosphorylation, 298-301 
in amino acid synthesis, 724 
ATP in, 298-301, 432-434, 436-437, 
437f, 444
in chemotaxis, 1024, 1025f 
in cholesterol regulation, 772 
in citric acid cycle, 513-514 
in enzyme regulation, 298-301 
in fatty acid metabolism, 666-667, 666f 
in gluconeogenesis, 483-484, 485 
in glycogen metabolism, 617-618 
in glycogen synthesis, 629-630 
in glycolysis, 455-457, 456f, 457, 458 
in immune response, 989-990, 989f 
kinases in, 457 
in metabolism, 432-434, 444 
oxidative, 436-437, 525-561. See also 
Oxidative phosphorylation 
of phosphorylase kinase, 623-624, 624f 
phosphoryl-transfer potential and, 
434-435, 434f, 434t 
protein kinases in, 298-301 
in purine synthesis, 741-743, 742, 742f 
regulatory functions of, 300-301 


in signal transduction, 299t, 406-407,
407f, 408-409, 409f, 411-415,
413f, 417, 418
substrate-level, 463-464, 507-508
of sugars, 326-327
Phosphorylation potential, 447
Phosphorylcholine, antibody binding of,
985-986, 985f
Phosphorylethanolamine, synthesis of,
763, 763
Phosphoryl-group-transfer potential,
434f, 434t
Phosphoryl-transfer potential, 434-435
Phosphoserine, 57, 58, 714, 714
Phosphothreonine, 58
Phosphotriesters, in DNA synthesis, 144
Phosphotyrosine, 58
Photoinduced charge separation,
569-572, 569f
Photophosphorylation, cyclic, 580-581, 580f
Photoreceptors, 966
cones, 966, 969-970
evolution of, 970, 970f
rods, 966-967, 966f

Photorespiration 
in C4 pathway, 599-600 
in Calvin cycle, 593f, 594 
Photosynthesis, 565-586
accessory pigments in, 581-584, 582f, 583f
ATP synthase in, 578-579, 583, 583f
in autotrophs, 590
basic equation of, 565
C4 pathway in, 599-600, 600f
carotenoids in, 582-583
chlorophyll a in, 568-572, 568f, 569f,
582f, 583, 592
chlorophyll b in, 582, 582f, 583
in chloroplasts, 567-568, 567f
components of, location of, 583-584, 583f
cyclic photophosphorylation in,
580-581, 580f
dark reactions of, 566, 580, 589, 590-601.
See also Calvin cycle
coordination with light reactions, 599
definition of, 565
electron transfer in, 568-572, 569f, 571f
energy conversion in, 566-567
evolution of, 568, 584, 600
herbicide inhibition of, 584, 584
light reactions of, 566-567, 566f,
572-584, 589
coordination with dark reactions, 599
in photosystem I, 567, 572, 572f, 574f,
575-577, 575f, 583, 583f
in photosystem II, 567, 572-577,
572f-575f, 583, 583f
stoichiometry of, 580-581
yield of, 580-581
light-harvesting complexes in, 583, 583f
photoinduced charge separation in,
569-572, 569f
proton-motive force in, 566, 566f,
578-579


Q cycle in, 575, 575f 
reaction centers in, 569-572, 570f 
energy transfer to, 581-582, 582f 
resonance energy transfer in, 
581-582, 582f 
in thylakoid membranes, 567-568, 567f, 
574-575, 577-581, 583, 583f 
in tropical plants, 599-600 
vs. oxidative phosphorylation, 
578, 579f 
yield in, 566b 
Z scheme of, 577, 577f 
Photosynthetic bacteria, 584, 584t 
reaction center of, 569-572, 570f 
Photosynthetic catastrophe, 567b 
Photosynthetic yield, 566b 
Photosystem I, 567, 572, 572f, 574f, 
575-577, 575f
location of, 583, 583f 
Photosystem II, 567, 572-577, 
572f-575f
location of, 583, 583f 
Phototrophs, 428 
Pigments 
accessory, in photosynthesis, 581-584, 
582f, 583f 
visual, 969-970, 970f. See also Color 
vision 
in cones, 969-970 
in rods, 966-967 
Ping-pong reactions, 237 
Pinopsin, 970, 970f 
PIP2 (phosphatidylinositol 4,5-
bisphosphate), in signal transduction,
414, 414
PIP3 (phosphatidylinositol 3,4,5-
trisphosphate), in signal transduction,
414, 414
pKa values, 15, A2
PKU (phenylketonuria), 674, 697-698 
Planar bilayer membrane, 354, 354f 
Plants 
C3, 599 
C4, 600
crassulacean acid metabolism in, 
600-601, 600f 
genetically engineered, 166-167, 167f 
glyoxylate cycle in, 518-519, 519f 
lectins in, 339 
nitrogen fixation in, 706-710 
photosynthesis in. See Photosynthesis 
starch in, 597 
sucrose synthesis in, 597, 598f 
tropical, photosynthesis in, 599-600 
Plaque, atherosclerotic, 776, 777f
Plaques, in genomic library screening, 
153, 153f 
Plasma cells, 979 
Plasmid(s), 148 
in drug resistance, 1050 
pBR322, 149, 149f 
pUC18, 149f 
tumor-inducing, 166-167, 167f 
Plasmid vectors, 149-151, 149f, 150f
Plasmin, 312 
Plasmodium falciparum 
glycan-binding proteins of, 340 
mitochondrial genome of, 527 
Plastocyanin, 575, 584 
Plastoquinol, 573, 573, 575 
Plastoquinone, 572-573, 573, 573, 584 
Pleckstrin homology domain, 414, 419 
P-loop, 283-284, 284f 
P-loop(s), in G proteins, 405 
P-loop NTPases 
ABC proteins as, 378-380 
in G proteins, 405 
in microtubules, 1019, 1019f 
in motor proteins, 1008-1012, 1009f 
PLP. See Pyridoxal phosphate (PLP) 
Pluripotent stem cells, 944 
Point mutations, production of, 157 
Poisons 
mushroom, 865, 865-866 
protein inhibition by, 910-911 
Polar head group, 351, 352 
Polarization, membrane, action potential 
and, 382 
Polyacrylamide gel electrophoresis, 71-73, 
72f, 73f
Polyampholytes, in isoelectric focusing, 73 
Polycistronic mRNA, 901 
Polyclonal antibodies, 86, 86f 
Polygalacturonase, 167 
Polylinkers, 150, 150f 
Polymerase chain reaction (PCR), 145-147, 
145f-147f
in ancient DNA amplification, 147, 
188, 189f 
procedure for, 145-146, 145f 
quantitative, 161-162, 162f 
real-time, 161-162, 162f 
uses of, 146-148 
Polymerase switching, 835 
Polymorphisms, 147 
Polypeptide chains, 34-35, 34f, 35f, 36-38. 
See also Protein(s) 
cleavage of, 82-83, 82f 
cross-linked, 35, 35f 
disulfide bonds of, 35, 35f 
formation of, 903-904, 904f. See also 
Translation 
side chains of, 34, 34f 
subunits of, 48, 48f 
Polypyrimidine tract, 873, 873f 
Polyribosomes, 906, 906f 
Polysaccharides, 328, 328f, 329f 
abbreviations for, 330b 
cellulose, 328-329 
glycogen, 328, 329f 
starch, 328, 329f 
Polysomes, 906, 906f 
Poly(A) tail, 871, 872 
Polyunsaturated fatty acids. See also 
Fatty acid(s) 
eicosanoids from, 664-665, 665, 665f
Pompe disease, 634t, 635, 635f
Pores, nuclear, 365 
Porins, 47, 47f, 357, 357f
mitochondrial, 526-527 
Porphobilinogen, 728, 729 
Porphobilinogen deaminase, 728-729 
Porphyrias, 730-731 
Porphyrins 
disorders of, 730-731 
synthesis of, 728-730, 729f 
Potassium ion channels, 364-393, 382. 
See also Ion channels 
inactivation of, 388-389, 389f 
rate of transport in, 387 
selectivity filter in, 385-386, 386f, 387f 
sequence homologies of, 384, 384f, 386 
specificity of, 385-387 
structure of, 384-385, 388f 
voltage-gated, 387-388, 388f 
Potassium ions, in Na+–K+ pump, 374,
377. See also Ion pumps 
Potential energy, 11. See also Energy 
Power stroke, in myosin, 1015 
pppA, 854, 854 
pppG, 854, 854 
Predisposition, to disease, 19-20 
Pregnenolone 
corticosteroids from, 783 
progesterone from, 783 
structure of, 784 
synthesis of, 783 
Pre-mRNA, processing of, 870-872, 871f, 
878-879 
splicing in, 873-879, 875f-878f. See also 
Splicing 
Prephenate, 720, 721 
Prepriming complex, 834, 834f 
Pre-rRNA, processing of 
in bacteria, 863 
in eukaryotes, 869-870, 870f 
Pressure, sensation of, 973 
Pre-tRNA, processing of 
in bacteria, 871f 
in eukaryotes, 870-872, 871f 
in prokaryotes, 870 
Prevacid (lansoprazole), 1046, 1046 
Pribnow box, 126f, 127 
Prilosec (omeprazole), 1046, 1046 
Primary active transport, 371 
Primary messengers, 402 
Primary protein structure, 33-38. See also 
Amino acid sequences 
Primase, 823, 823f 
Primers 
in DNA replication, 122, 145, 821 
in polymerase chain reaction, 145 
Prions, 56-57, 56f
Probes, DNA, 80, 142, 144-145, 
151-153 
generation of, 144-145, 151-153 
solid-phase approach in, 144-145, 144f 
for genomic library, 151-153, 152f 
Procaspases, 303 


Processivity 
definition of, 832b 
in DNA replication, 831-832 
Procollagen, 303 
Procollagenase, 303 
Proenzymes. See Zymogens 
Progestagens. See also Steroid hormones 
synthesis of, 780, 780f 
Progesterone 
aldosterone from, 783 
cortisol from, 783, 784 
synthesis of, 783, 784 
Programmed cell death 
mitochondria in, 559 
T cell, 1000, 1000f 
zymogens in, 303 
Proinsulin, 303, 802 
bacterial synthesis of, 155, 155f 
Prokaryotes, 3. See also Bacteria 
Proliferating cell nuclear antigen, 836 
Proline 
hydroxyl groups of, 44, 57, 57-58 
structure of, 28, 29f 
synthesis of, 714, 714f 
Proline racemase, inhibition of, 243, 243f 
Prolyl hydroxylase, 812-813 
Prolyl hydroxylase 2, in cancer, 515 
Promoters, 126-127, 126f, 852 
bacterial, 852, 856-858, 857f 
eukaryotic, 866-869, 866f 
for RNA polymerase I, 866, 866f 
for RNA polymerase II, 866-867, 866f, 
867-868 
for RNA polymerase III, 866, 866f 
heat-shock, 858, 858f 
nitrogen-starvation, 858, 858f 
Proofreading 
evolution of, 895 
in replication, 839-840, 839f 
in transcription, 856 
in translation, 895 
Proopiomelanocortin (POMC), 796
PROP (6-n-propyl-2-thiouracil), 963 
Propionyl CoA, 649, 649-650, 693, 693, 694 
conversion to succinyl CoA, 649-650, 650f 
6-n-Propyl-2-thiouracil (PROP), 963 
Prostacyclins, 664-665, 665, 665f 
Prostaglandin E2, 665, 665 
Prostaglandin H2, 357-358, 358, 358f,
665, 665f
Prostaglandin H2 synthase-1, 357-358,
358, 358f 
Prostaglandin inhibitors, 358, 665, 1040 
aspirin as, 358, 665, 1040 
Prostaglandins, 664-665, 665, 665f
Prosthetic groups, 221 
Protease(s) 
active sites of, 185-186, 185f, 257-258 
convergent evolution of, 185, 185-186 
aspartyl, in peptide bond cleavage, 263, 
264, 264f 
cysteine, in peptide bond cleavage, 263, 
263-264, 264f
metal ion, in peptide bond cleavage, 263,
264, 264f
in peptide bond cleavage, 255-266. See 
also Peptide bonds, cleavage of 
in carbohydrates, 337 
serine, in peptide bond cleavage, 255-266 
site-directed mutagenesis of, 262-263, 262f 
specificity of, 260, 260f 
Protease inhibitors, 306-307 
α1-antitrypsin, 307
pancreatic trypsin inhibitor, 306 
pharmaceutical, 264-266, 265, 358, 665 
development of, 1043, 1043f 
resistance to, 1050 
Proteasomes, 677-678, 677f, 678f, 992. See
also Amino acid degradation 
eukaryotic, 677-678, 678f 
evolution of, 677-678, 678f, 679 
of M. tuberculosis, 679 
prokaryotic, 678, 678f 
Protein(s), 2. See also Peptide(s) 
acetylation of, 57 
adaptor, 414 
in signal transduction, 414, 417 
aggregated, in neurological diseases, 
56-57, 56f 
allosteric, 289, 290-296. See also 
Allosteric enzymes 
α helix of, 38-40, 39, 39f, 40, 40f
amino acids of. See Amino acid(s) 
antibody-tagged, 88 
β sheet of, 40-42, 40f-42f
carbohydrate units of, 57, 329-337. See 
also Glycoproteins 
cleavage of, 49, 58, 82-83, 82f, 83f 
coiled-coil, 43, 44f 
covalent modification of, 57-58, 57f,
297-302, 298t. See also 
Phosphorylation 
definition of, 34 
degradation of, 673-700. See also Amino 
acid degradation 
denatured, 49-50, 49f, 50f, 52, 52f 
dietary, digestion and absorption of, 
255-260, 303-307, 303t, 304f, 
674, 674. See also Amino acid 
degradation 
evolution of, 133 
experimental studies of. See Protein 
studies 
fibrous, 43-45
flexibility of, 26-27, 26f 
folding of, 18, 19f, 38, 48-49, 52-57 
cooperative transition in, 52 
cumulative selection in, 53 
denaturation and, 49f, 50, 50f, 52, 52f 
heat shock protein 70 in, 183, 183-184 
hydrophobicity and, 45-46, 46f, 47, 47f
intermediates in, 52-54, 53f 
Levinthal’s paradox in, 53 
mistakes in, in neurological diseases, 
56-57 
nucleation-condensation model of, 53 


prediction of, 54 
into regular structures, 38-45 
steps in, 53-54, 53f 
function of, structural correlates of, 
25-26, 26f, 57-59 
functional groups of, 26 
genes encoding, 19 
genetically engineered, 157 
glycan-binding, 337-338 
glycosylation of, 329-337 
congenital disorders of, 336 
sites of, 333-334, 337 
half-life of, 675, 676t 
hemoglobin as model of, 195-216. See 
also Hemoglobin 
hydroxylation of, 30, 30f, 57-58 
identification of, 65-66
immunological methods for, 86 
intrinsically unstructured, 54-55 
as linear polymers, 33-35 
lysosomal, 911 
mass of, 34 
measurement of 
electrophoresis in, 74, 74f 
mass spectrometry in, 91-95, 92f, 
93f, 95f 
membrane. See Membrane proteins 
membrane-spanning, 911 
metabolism of, 437f 
metamorphic, 55, 55f 
molecular models of, 21-22, 23f, 60-62, 
61f, 62f 
molecular weight of, 34 
multiple conformations of, 54-55, 55f
Ω loops in, 42, 43f
peptide bonds of, 33, 33f, 36-37, 36f, 37f 
polypeptide chains of, 33-35, 34f, 35f, 
36-38 
post-translational modification of, 
57-59, 58f 
properties of, 25-26 
random-coil conformation of, 49, 49f
refolding of, 50, 50f 
regulation of 
covalent modification in, 298t 
phosphorylation in, 298-301. See also 
Phosphorylation 
reverse turns in, 42, 43f 
ribosomal. See also Ribosome(s) 
in translation, 900 
rigidity of, 26, 26f 
secretory, 911 
signal sequences in, 79 
similarity among, 2, 2f 
sorting of, in Golgi complex, 334, 334f 
structure of, 2f, 3 
amino acid sequences and, 18-19, 19f, 
25, 35-36, 49-59 
functional correlates of, 25-26, 26f, 
57-59 
models of, 21-22, 23f, 60-62, 61f, 62f
primary, 25, 33-38. See also Amino acid 
sequences 


quaternary, 26, 48, 48f 
secondary, 25, 38-45, 47 
supersecondary, 47, 47f 
tertiary, 45-47, 46f, 47f, 183-184. 
See also Protein(s), folding of 
subunits of, 48, 48f 
synthesis of. See also Translation 
initiation of, 130 
mRNA in, 123-124, 130, 130f 
synthetic, 96-98, 96f, 97f, 157 
translocation of, 911-914 
unfolding of, 52, 52f 
Protein blots, 89, 142 
Protein Data Bank, 60, 104 
Protein domains, 47, 47f
exon encoding of, 133, 133f 
Protein kinase(s), 298 
calmodulin-dependent, 410-411 
cyclin-dependent, 836 
dedicated, 299 
in enzyme regulation, 298-301 
multifunctional, 299 
in phosphorylation, 298-301 
regulation of, cAMP in, 301, 301f 
serine, 298-299, 299f, 299t 
in signal transduction, 299t, 406-407, 
408, 409, 409f, 419 
specificity of, 299 
threonine, 298-299, 299f, 299t 
tyrosine, 298-299, 299f 
in cancer, 420 
Protein kinase A 
pseudosubstrate binding to, 302, 302f 
regulation of, 301, 301f 
in signal transduction, 406-407, 419 
in glycogen metabolism, 624, 625f, 626 
structure of, 302, 302f 
Protein kinase B (Akt), in signal 
transduction, 414, 799, 799f 
Protein kinase C, in signal transduction, 
409, 409f 
Protein kinase inhibitors, for cancer, 421 
Protein phosphatase 1 (PP1), in glycogen 
metabolism, 626-627, 631-634, 631f 
Protein phosphatases, 300 
Protein purification, 65, 66-79 
affinity chromatography in, 70-71, 70f 
assays in, 67 
autoradiography in, 73 
dialysis in, 69, 69f 
differential centrifugation in, 67-68, 68f
fractionation in, 67-68, 68f
gel electrophoresis in, 71-73, 72f, 73f 
gel-filtration chromatography in, 69, 69f 
gradient centrifugation in, 77-78, 77f 
high-pressure liquid chromatography in, 
71, 71f
homogenization in, 67, 75t 
ion-exchange chromatography in, 69-70, 
70f, 80-82, 80f 
isoelectric focusing in, 73, 73f 
with SDS-PAGE, 74, 74f
monoclonal antibodies in, 87-88 
protein concentration in, 67
quantitative analysis of, 75-76, 75f, 75t 
recombinant DNA technology in, 78-79 
salting out in, 68, 75t 
SDS-PAGE in, 74, 74f, 75, 75f 
with isoelectric focusing, 74, 74f 
sedimentation-equilibrium technique 
in, 78 
two-dimensional electrophoresis in, 
74, 74f
Protein regulation 
covalent modification in, 298t 
phosphorylation in, 298-301. See also 
Phosphorylation 
Protein sequencing. See Amino acid 
sequences, determination of 
Protein sorting, 911-914 
Protein studies 
amino acid sequencing in, 79-84. 
See also Amino acid sequences, 
determination of 
electron-density maps in, 99-100, 100f 
electrospray ionization in, 91-92 
enzyme-linked immunosorbent assay in, 
88-89, 89f 
fluorescence microscopy in, 90-91, 90f 
genomic analysis in, 84 
immunoelectron microscopy in, 91, 91f 
immunologic techniques in, 84-91. See 
also Monoclonal antibodies 
mass spectrometry in, 91-95, 92f, 93f, 95f 
nuclear magnetic resonance spectroscopy 
in, 101-104, 101f-103f
nuclear Overhauser enhancement 
spectroscopy in, 101-104, 102f, 103f 
proteomic analysis in, 84, 94-95, 95f 
purification methods for, 65, 66-79. See 
also Protein purification 
recombinant DNA technology in. See 
Recombinant DNA technology 
synthetic peptides in, 96-98, 96f, 97f 
time of flight analysis in, 92, 92f 
western blotting in, 89, 89f, 142 
X-ray crystallography in, 98-100, 98f-100f
Protein targeting, 334, 334f, 911-914 
Protein transport, 911-914. See also 
Membrane transport 
Protein turnover, 255, 673. See also Amino 
acid degradation; Amino acid synthesis 
Protein tyrosine kinases, 298-299, 299f, 412 
in cancer, 420 
receptor, 412 
Proteins, synthesis of, DNA technology in, 
78-79 
Proteoglycans, 329-330, 331f 
Proteolytic enzymes, 220-221, 302-313 
activation of, 302-313 
in blood clotting, 307-312 
chymotrypsinogen as, 303-305 
in digestion, 303-307 
functions of, 303 
inhibitors of, 306-307 
in starvation, 809, 810 
Proteomes, definition of, 66
Prothrombin, 309f, 310, 310f 
calcium binding by, 310-311 
Proton gradients, 437 
free energy from, 559-560, 560f 
in oxidative phosphorylation, 530-531, 
543-550, 559-560, 560f 
in photosynthesis, 571-572, 578-581 
Proton pumps 
in oxidative phosphorylation, 525 
in respiratory chain, 525 
Proton shuttle, in carbon dioxide hydration, 
269-270, 270f 
Proton transport, by flagella, 1023, 1024f 
Proton-motive force, 525 
in oxidative phosphorylation, 525, 
544-545, 544f, 569 
in photosynthesis, 566, 566f, 578-579 
Protons, chemical, 540f 
Proto-oncogenes, 420 
Protoplasts, 167 
Protoporphyrin, 196, 196-197 
Protoporphyrin IX, 729, 730 
Provitamin D3, 785, 785 
Proximal histidine, 197 
PRPP. See 5-Phosphoribosyl-1-
pyrophosphate (PRPP)
Pseudo-first-order reactions, 230 
Pseudogenes, 159 
Pseudosubstrate sequence, 302 
Pseudouridylate, 863, 863 
Psi (ψ) angle of rotation, 37-38, 37f, 38f
Psoralen, 876 
PTEN, 799 
P-type ATPases, 374 
evolution of, 378-379 
functions of, 378 
in membrane transport, 374-381. 
See also Ion pumps 
types of, 378 
pUC18 plasmid, 150f 
Puffer fish 
genome of, 161, 161f 
tetrodotoxin of, 384, 384 
Pumps. See Ion pumps; Proton pumps 
pur repressor, 927, 927f 
Purine nucleotides, 111, 111-112 
degradation of, 752-755, 753f
derivatives of, 111, 111 
synthesis of, 726, 740-745, 740f, 
751. See also Nucleotide(s), 
synthesis of 
phosphorylation in, 741-743, 742f 
steps in, 740-743, 741t, 742f 
Purinosomes, 744, 744f 
Puromycin, 909t, 910, 910 
Pyranose, 322-325, 323f 
Pyridoxal phosphate (PLP), 674 
in amino acid degradation, 657, 681, 
681-684 
in amino acid synthesis, 712f, 713 
in glycogen metabolism, 
618-619, 618f 


Pyridoxal phosphate enzymes 
in amino acid degradation, 681-684, 683f 
in amino acid synthesis, 712f, 713 
catalytic activity of, 683-684, 683f 
Pyridoxal phosphate-dependent 
transaminases, 712-713
Pyridoxamine phosphate, 682, 682, 712 
Pyridoxine (vitamin B6), 441, 441t, 681
Pyrimidine(s), 111, 111 
derivatives of, 111, 111 
synthesis of, 726 
aspartate transcarbamoylase in, 
290-296, 738 
Pyrimidine nucleotides, 111, 111 
recycling of, 740 
synthesis of, 736-740, 736f, 751. See also 
Nucleotide(s), synthesis of 
Pyrimidine phosphoribosyltransferase, 739 
Pyrophosphate, hydrolysis of, 627-628, 645 
in fatty acid metabolism, 645 
5-Pyrophosphomevalonate, 768 
Pyrrole-2-carboxylic acid, 243, 243 
Δ1-Pyrroline 5-carboxylate, in proline
synthesis, 714 
Pyruvate, 236 
in amino acid degradation, 684-685,
685f, 691 
in amino acid synthesis, 712 
in citric acid cycle, 500-501 
conversion to acetyl coenzyme A, 
500-501, 500f, 501f 
in Cori cycle, 490, 490f, 685 
in ethanol formation, 466-467, 467f
formation of, 691, 691f 
in gluconeogenesis, 453, 479-485 
in glucose-alanine cycle, 685, 685f
in glycolysis, 453, 464-466, 464f-466f
metabolic fates of, 466-468, 466f
in nitrogen transport, 684-685, 685f
oxaloacetate synthesis from, 
443-444, 443f 
Pyruvate carboxylase 
in citric acid cycle, 516 
in gluconeogenesis, 481 
Pyruvate carrier, 554, 554f 
Pyruvate decarboxylase, 466 
Pyruvate dehydrogenase complex 
in citric acid cycle, 499-503, 499f, 499t, 
502f, 503f 
components of, 499, 499t 
phosphorylation of, 513-514 
reactions of, 502-503, 503f 
regulation of, 513-514 
structure of, 502, 502f 
Pyruvate dehydrogenase kinase, in 
cancer, 515 
Pyruvate kinase, in glycolysis, 465 
in liver, 476-477, 476f
in muscle, 474 
Pyruvate-Pi dikinase, in C4 pathway, 599


Q (coenzyme Q), 531, 532-533, 533f 
in fatty acid metabolism, 646 


Q cycle, 575, 575f 
in oxidative phosphorylation, 536-537 
in photosynthesis, 575 
Q pool 
in oxidative phosphorylation, 533 
in photosynthesis, 571f, 572 
QA, 571-572, 571f, 573
QB, 571, 571f, 573
Q-cytochrome c oxidoreductase (Complex
III), 531, 532f, 532t, 535-536, 536f
QH2, 571-572
Quantitative polymerase chain reaction, 
161-162, 162f 
Quaternary protein structure, 26, 48, 48f 
Quinine, 962 
taste of, 962 
Quinones 
in oxidative phosphorylation, 531, 
533, 533f 
in photosynthesis, 570, 571 
Quinonoid dihydrobiopterin, 695, 695 
Quorum sensing, 930-931, 930f 
biofilms and, 930-931 


R* (metarhodopsin II), 968, 968f 
R groups. See Amino acid side chains 
R state, of hemoglobin, 202-204, 203f 
Radiation, mutagenicity of, 839 
Raloxifene, 948, 948 
Ramachandran diagrams
for angles of rotation, 37, 38f
for helices, 39-40, 40f
Ramachandran, Gopalasamudram, 37
Random-coil conformation, 49, 50f
Ranitidine (Zantac), 1045, 1045
Ras
farnesylation of, 298t
in signal transduction, 417-418, 418f
Rate constant (k), 230
Rate, reaction. See Reaction rates
RBP4, 796
Reaction(s)
activation energy for, 222, 226
anabolic, 428-429
anaplerotic, 516-517
bimolecular, 230
bisubstrate, 236
catabolic, 428, 435-438
electron carriers for, 438-439
catalysis of. See Catalysis; Enzyme(s)
chemical modification, 256
Cleland notation for, 236, 237
coupled, thermodynamics of, 429-430,
431-433
double-displacement (ping-pong), 237
electrocyclic, 720
endergonic, 222
enzyme-catalyzed. See Catalysis;
Enzyme(s)
exergonic, 222
first-order, 229
free energy change of, 12, 222-225
group-transfer, 436-437, 443t, 444


interconnected, 428-430, 428f 
ligation, 443-444 
lyase-catalyzed, 445 
metabolic, 428-430. See also Metabolism 
multiple-substrate, 235-237 
oxidation-reduction, 443, 443t 
free energy change in, 530 
in metabolism, 433t, 443 
reduction potential of, 528-530, 529t 
second-order, 230 
sequential, 236-237 
thermodynamics of, 222-225 
transition state in, enzymes and, 225-226, 
225f, 227-228 
Reaction centers, in photosynthesis, 569-572, 
570f, 582, 582f
energy transfer to, 581-582, 582f 
Reaction equilibrium, enzymes and, 224-225 
Reaction rates, 229-237. See also Kinetics 
catalytic acceleration of, 220, 220t, 
225-229, 226f 
definition of, 229 
in enzyme-catalyzed reactions, 229-237. 
See also Enzyme kinetics 
in first-order reactions, 229-230 
maximal, calculation of, 233 
Michaelis constant (KM) for, 231-235
in pseudo-first-order reactions, 230 
in second-order reactions, 230 
Reactive oxygen species (ROS), 
541-542, 541t 
cytochrome P450 and, 782-783, 782f 
glucose 6-phosphate and, 610 
mutagenicity of, 838 
Reactive substrate analogs, 241-242 
Reading frame, 130 
Reagents, group-specific, 241 
Real-time polymerase chain reaction, 
161-162, 162f 
RecA, 845 
Receptor(s). See also specific types 
ligand binding to, 926, 947 
in signal transduction, 402 
G-protein-coupled, 404-406 
insulin, 412-415 
serpentine, 404 
7TM, 404-406, 404t 
Receptor tyrosine kinases, in signal 
transduction, 412 
in cancer, 421 
Reclinomonas americana, genome of, 528 
Recognition helix, 923 
Recognition sites (sequences), 141, 141f, 
272, 275-277 
in cognate vs. noncognate DNA, 
275-277, 276f
distortion of, 276-277, 276f 
inverted repeats as, 275 
structure of, 275, 275f 
Recombinant DNA 
production of, 154-155, 154f 
transfer of, by microinjection, 163, 163f 
vectors for, 163-164. See also Vectors 


Recombinant DNA technology, 139-169 
in amino acid sequencing, 84 
autoradiography in, 73, 143 
bacterial artificial chromosomes in, 151 
bombardment-mediated transformation
in, 167
cloning in, 149-151, 149f-151f 
cohesive-end method in, 148-149, 149f 
complementary DNA in, 154-155, 
154f, 162f 
designer genes in, 157 
for disease-causing mutations, 157 
DNA microarrays in, 162-163, 162f 
DNA microinjection in, 163, 163f 
DNA probes in, 80, 142, 144-145, 
151-153 
DNA sequencing in, 143-144, 143f 
DNA synthesis in, 144-145, 148-149 
electroporation in, 167, 167f 
expression vectors in, 154-155 
fluorescence detection in, 143-144, 143f 
gel electrophoresis in, 141-142, 142f 
gene guns in, 167 
gene manipulation in, 161-168 
gene-expression analysis in, 161-162, 
162f, 163-164 
genomic libraries in, 151-153, 
152f, 153f 
lambda phage in, 150-151, 150f, 151f 
medical applications of, 167-168 
mutations in, 156-157 
overview of, 139-141 
in plants, 166-167, 167f 
plasmid vectors in, 149-151, 149f, 150f 
polymerase chain reaction in, 145-147, 
145f-147f, 188 
in protein purification, 78-79 
restriction-enzyme analysis in, 141-142 
solid-phase approach in, 144-145, 144f 
transgenic animals in, 164, 164f 
vectors in, 149-151, 149f-151f, 154-155 
yeast artificial chromosomes in, 
151, 151f 

Recombinases, 845-846
Recombination
in antibodies, 987-991, 988f
DNA. See DNA recombination
homologous, 970b
in color blindness, 970
in gene knockout, 164-165
Recombination signal sequences
(RSSs), 988
Recombination synapse, 846
Red blood cells. See also Hemoglobin
2,3-bisphosphoglycerate in, oxygen
affinity of hemoglobin and,
204-205
carbohydrates in, 335
disorders of, 209-211
Heinz bodies in, 610, 610f
life span of, 730
sickled, 209-210, 209f

Red photoreceptors, 969-970, 969f, 970f 
Redox couples, 529
redox potential of, 529-530, 529t 
Redox potential. See Reduction potential 
Redox reactions. See Oxidation-reduction
reactions 
Reducing sugars, 325 
Reductase, in nitrogenase complex, 
708-709 
Reductase mRNA, in cholesterol regulation, 
770, 771 
Reduction potential 
measurement of, 528f, 529-530 
in oxidative phosphorylation, 528-531 
Reed, Randall, 959 
Regulatory domains. See Domains 
Regulatory sites, 291 
Rejection, in transplantation, 998 
Relaxed DNA, 117, 117f 
Relay helix, 1012, 1012f 
Release factors (Rfs), 130, 906, 907f 
Renal function. See Kidney 
Renal gluconeogenesis, 481 
Repeating motifs, 79 
sequence homology of, 184, 185f 
Replication. See DNA replication 
Replication factor C, 836 
Replication fork, 823, 823f, 832-833, 833f 
Replication protein A, 835 
Replicon, 835 
Repressors, 924-925 
corepressors, 927 
lac, 922f, 923, 925-928, 926f, 927 
λ, 928-929, 928f, 929f
pur, 927, 927f
Resistin, 796 
Resonance structures, 7 
of ATP, 433 
improbable, 433f 
of orthophosphate, 433 
Respiration, cellular, 498, 499f, 
593f, 594 
definition of, 526b 
in photosynthesis, 593f, 594 
regulation of, 554-560 
Respiratory chain, 525, 531-543 
coenzyme Q in, 531 
components of, 531-532, 532f, 533f 
cytochrome c oxidase (Complex IV) in, 
531, 532f, 532t, 535-536 
electron carriers in, 438-439,
438f, 439f 
linkage to ATP synthase, 544-545, 
544f, 545f 
NADH-Q oxidoreductase (Complex I) 
in, 531-535, 532f, 532t 
Q cycle in, 536-537, 537f 
Q-cytochrome c oxidoreductase 
(Complex III) in, 531, 532f, 532t,
535-536, 536f 
succinate-Q reductase (Complex II) in, 
531, 532f, 532t, 535 
Respiratory control, 556 
Respiratory distress syndrome, 765-766 
Restriction enzymes (endonucleases),
141-142, 271-279 
binding affinity of, 275-277 
in DNA cleavage, 141-142, 271-279. See 
also Restriction fragment(s) 
in E. coli, 272-279, 274f-277f
evolution of, 278-279 
recognition sites for, 141, 141f, 271-272, 
275-277. See also Recognition sites 
(sequences) 
in restriction-modification systems, 272 
specificity of, 141, 141f, 272, 
275-277, 275f
type II, 278-279, 278f 
Restriction fragment(s) 
separation of, gel electrophoresis in, 
141-142 
in Southern blotting, 142 
Restriction fragment length 
polymorphisms, 147 
Restriction-modification systems, 277 
11-cis-Retinal, 967, 967, 968f 
all-trans-Retinal, 967, 968 
Retinitis pigmentosa, 877-878 
Retinol (vitamin A), 442, 442, 442t 
deficiency of, alcohol-related, 812 
Retroviruses, 122-123, 123f, 999-1000 
as vectors, 163 
Reverse cholesterol transport, 778-779 
Reverse transcriptase, 123, 123f 
in complementary DNA production, 
154-155 
Reverse turns, 42, 43f 
amino acid residues in, 50-51, 51t
Reversible covalent modification, 446, 
725-726, 725f 
RFs (release factors), 130, 906, 907f
Rhizobium, in nitrogen fixation, 706-707 
Rho, 127 
in transcription termination, 860-861 
Rhodopseudomonas viridis, photosynthetic 
reaction center in, 569-570, 570f, 584 
Rhodopsin, 404, 966-968 
absorption spectrum of, 967f 
cone photoreceptors and, 969-970, 970f 
structure of, 404, 404f
Rhodopsin kinase, 968 
Ribbon diagrams, 62, 62f 
Riboflavin (vitamin B2), 441, 441t
Ribonuclease 
amino acid sequences in, 49-50, 49f 
reduction and denaturation of, 49-50, 
49f, 50f
scrambled, 50, 50f 
sequence comparison for, 173, 174 
structure of, 49-50, 49f, 50f 
in transcription, 863 
Ribonuclease P, 863 
Ribonuclease III, 863 
Ribonucleic acid. See RNA 
Ribonucleoproteins 
small nuclear, 852, 875-876, 876t 
small nucleolar, 870, 875-877 


Ribonucleotide reductase, 718, 745-750,
745f, 746, 752, 752f
Ribose, 110, 110, 111, 321, 322, 322f 
ring form of, 323, 323f 
Ribose 1-phosphate, 752 
Ribose 5-phosphate, 596 
in Calvin cycle, 595f, 596, 596f 
in pentose phosphate pathway, 601, 602, 
602f, 603, 607-609 
Ribosomal RNA. See rRNA 
(ribosomal RNA) 
Ribosome(s), 130, 897-907 
in codon-anticodon interactions, 902
definition of, 887 
endoplasmic reticulum-bound, 911-914 
in eukaryotes vs. bacteria, 907 
evolution of, 900 
functions of, 900 
initiation sites in, 902, 903f 
polyribosomal, 906, 906f 
as ribozymes, 887 
structure of, 887f, 897-898, 898f 
subunits of, 897-898, 898f 
formation of, 902 
tRNA binding by 
mechanism of, 900f, 903-904 
sites of, 900, 900f, 902-904 
Ribosome release factor, 906, 907f 
Riboswitches, 860, 860f 
Ribosylation, 298t 
Ribothymidylate, 863, 863 
Ribozymes, 879, 887 
Ribulose 1,5-bisphosphate carboxylase/ 
oxygenase. See Rubisco 
Ribulose 1,5-bisphosphate, in Calvin cycle, 
591-597, 592, 595f, 596, 596, 596f 
Ribulose 5-phosphate 
in Calvin cycle, 596, 596 
in pentose phosphate pathway, 602, 602f 
Ricin, 911 
Rickets, 786 
Rickettsia prowazekii, genome of, 527-528 
Rieske center, 536 
Rifampicin, 861-862, 862 
Rise, in α helix, 39
RNA, 111 
ATP-binding region of, 190-191, 191f 
backbone of, 110, 110f 
catalytic activity of, 879-881, 900 
evolution of, molecular studies of, 
188-191, 190f 
in gene expression, 123-128 
messenger. See mRNA 
micro, 124 
nucleosides in, 111 
post-transcriptional processing of 
in bacteria, 863 
in eukaryotes, 869-879 
pre-mRNA, 870-872, 871f, 878-879 
splicing in, 873-879, 875f-878f 
pre-rRNA, 869-870, 870f 
in bacteria, 863 
in eukaryotes, 869-870, 870f 


pre-tRNA 
in bacteria, 870, 871f 
in eukaryotes, 870-872, 871f 
ribosomal. See rRNA 
self-splicing in, 879-881, 880f-882f 
sequence homologies of, 186-187, 187f 
small interfering, 124 
small nuclear, 124, 875-876 
in splicing, 875-876, 876t 
splicing of, 132-133, 132f, 851-852. See 
also Splicing 
stem-loop motif in, 117f 
structure of, 110, 111-112, 117-118, 
117f, 186-187, 187f
sugars in, 110f, 111 
synthesis of, 124-127, 125f, 126f, 851-882. 
See also Transcription 
transcriptional modification of, 127, 127f 
transfer. See tRNA 
types of, 123-124 
RNA blots, 142, 142f 
RNA editing, 852, 872-873, 873f 
RNA interference, 165-166, 167f
RNA polymerase(s), 124-127, 125f, 852, 
852, 853-858 
ADP ribosylation of, 298t 
backtracking and, 856, 856f 
bacterial, 853-858 
active site of, 853, 854f 
promoter recognition by, 856-857 
structure of, 852, 852, 853, 853t 
subunits of, 853, 853t, 856-858 
catalytic action of, 853-858 
in DNA replication, 823, 823f 
eukaryotic, 852, 864 
promoter elements in, 866-869, 866f 
in proofreading, 856 
RNA-directed, 122 
structure of, 852, 853 
RNA polymerase holoenzyme complex, 
857, 857-858, 857f 
RNA polymerase I, 865, 865t, 866 
rRNA processing by, 869-870, 870f 
RNA polymerase II, 865, 865t, 866-867, 943 
bromodomains and, 949-950, 950f 
in transcription initiation, 950f 
RNA polymerase III, 865, 865t, 866 
RNA processing, 132-133. See also 
Transcription 
in bacteria, 863 
in eukaryotes, 869-879 
of pre-rRNA, 869-870, 870f 
RNA polymerase I in, 869-870, 870f 
of tRNA, 870, 871f
in prokaryotes, of tRNA, 870 
RNA splicing. See Splicing 
RNA viruses, 122-123, 123f 
RNA world, 447, 881 
RNA-directed RNA polymerase, 122 
RNA-induced silencing complex
(RISC), 166
RNase P, 881 
Roberts, Richard, 131 


Rods, 966-967, 966f 
Rofecoxib (Vioxx), 1044, 1044-1045, 1049 
Rossmann fold, 469, 469f, 509, 509f
Rossmann, Michael, 469
Rotation, flagellar, 1022-1025 
Rotational catalysis, in ATP synthesis, 
547f-549f
Rotenone, 558 
Rough endoplasmic reticulum, 911. See also 
Endoplasmic reticulum 
Roundup, 719-720 
Rous sarcoma virus, 420 
rRNA (ribosomal RNA), 123t, 124. 
See also RNA 
in base-pairing, 892 
bases in, 863 
folding of, 898, 898f 
functions of, 124, 900 
16S, in translation, 892, 901 
transcription and processing of, 863, 
869-870, 870f 
transcription of, 863 
in translation, 892, 898-900, 901,
903-904 
RsrI endonuclease, 278 
RSS (recombination signal sequence), 988 
Rubisco, 591, 591-597 
evolution of, 600 
light activation of, 598 
Rubisco activase, 592 


Saccharomyces cerevisiae, genome of, 158 
Sakmann, Bert, 383 
Salmonella test, for mutagens, 843-844 
Salt bridges, oxygen release from 
hemoglobin and, 206-207, 207, 207f 
Salting out, 68, 75, 75t 
Salty taste, 962, 965 
Salvage pathways, 740 
purine, 744-745 
pyrimidine, 736, 736f 
Sandwich ELISA, 89, 89f 
Sanger dideoxy method, 143-144, 143f 
Sanger, Frederick, 35, 143, 158 
Sarcomere, 1016-1017, 1016f, 1017f 
Sarcoplasmic reticulum Ca2+ ATPase
(SERCA), 374 
SARS, drug therapy for, 1046-1047, 1047f 
Satiation signals, 795-797, 796f 
Saturated fatty acids. See Fatty acid(s) 
SCAP (SREBP cleavage activating protein), 
770-771, 771f
Schiff bases, 681-682, 709 
in amino acid degradation, 681-682, 709 
in amino acid synthesis, 709 
in aminoacrylate, 722, 722 
in glycogen phosphorolysis, 619, 619f 
in retinal, 967-968, 968f 
in transaldolase reaction, 605-606, 606f 
in transamination, 681-682
Scrapie, 56 
Screw sense, of helix, 39, 40f 
Scurvy, 57, 812-813 


SDS (sodium dodecyl sulfate), in gel 
electrophoresis, 73, 78 
SDS-PAGE, 74, 74f, 75-76, 75f
with isoelectric focusing, 74, 74f 
SDS-polyacrylamide gel electrophoresis, of 
membrane proteins, 355, 355f 
Second Law of Thermodynamics, 11-12, 
792. See also Thermodynamics 
Second messengers, 402-403, 403f, 409-411, 
409f, 419 
Secondary active transport, 371 
Secondary protein structures, 38-45, 
50-52, 51t 
alternative conformations of, 51-52, 51f
prediction of, 51 
Secondary transporters, 381, 381f 
Second-order reactions, 230 
rate of, 230 
Secretory proteins, 911 
Sedimentation coefficients, 76, 76t, 77f
Sedimentation-equilibrium technique, 78 
Sedimentation-velocity technique, 78 
Sedoheptulose 1,7-bisphosphate, in Calvin 
cycle, 595 
Sedoheptulose 7-phosphate, 595 
in Calvin cycle, 595f, 596f 
in pentose phosphate pathway, 602, 602f, 
603, 603f 
Segmental flexibility, 982, 982f 
Seizures, ketogenic diet for, 656 
Selectins, 339 
Selective estrogen receptor modulators 
(SERMs), 948 
Selectivity filters, 385-386, 386f, 387f 
Selenolate, 727, 727f
Self-antigens, immune response to, 1001 
Self-splicing, 879-881, 880f-882f. See also 
Splicing 
Self-tolerance, 1001 
Semiconservative replication, 
118-120, 120f 
Sense strand, 845, 854 
Sensory systems, 957-975 
hearing, 971-973 
overview of, 957-958 
smell, 958-962 
taste, 962-966 
touch, 973-974 
vision, 966-971 
yet-to-be-studied, 974 
Sequence alignment, 175-181, 176f-182f 
Sequence homologies. See also Evolution 
analysis of, 175-181, 176f-182f 
of cytochrome c, 543, 543f 
evolutionary trees and, 187-188, 187f 
of hemoglobin and myoglobin, 175-181, 
176f-182f, 199 
of histones, 940-941, 940f 
identification of 
databases for, 181, 182f 
sequence alignment in, 175-181, 
176f-182f 
shuffling in, 177-178, 177f 
statistical analysis in, 175-177
substitution matrices in, 178-181 
of ion channels, 384, 384f, 386 
repeated motifs and, 184, 184 
RNA secondary structures and, 
186-187, 187f 
three-dimensional structure and, 
183-184 
of tubulin, 1019 
of urea cycle enzymes, 688 
Sequence specifies conformation, 50 
Sequence template, 184 
Sequential model 
of hemoglobin oxygen binding, 202-203, 
203f, 216 
of substrate binding, 294 
Sequential reactions, 236-237
SERCA, 375-377 
SERCA (sarcoplasmic reticulum Ca2+
ATPase), 374 
Serine 
in catalytic triads, 258-260. See also 
Catalytic triads 
in chymotrypsin, 255-256, 256f 
in cysteine synthesis, 714-715, 719 
from glycine, 692 
in glycine synthesis, 714-715 
in glycoproteins, 330 
pyruvate formation from, 691, 691f 
in sphingolipid synthesis, 764, 764 
sphingosine from, 726, 726f 
structure of, 30, 30f 
synthesis of, 714-715, 714f, 716, 723 
Serine dehydratase, in amino acid 
degradation, 684, 691 
Serine hydroxymethyltransferase, 715 
Serine kinases, 298-299, 299f, 299t 
Serine proteases 
active sites on, convergent evolution of, 
185, 185-186 
catalytic strategies of, 253, 255-256 
SERMs (selective estrogen receptor 
modulators), 948 
Serotonin, synthesis of, 726f, 727 
Serpentine receptors, 404 
Serum albumin, in drug distribution, 
1003f, 1033 
7TM (seven-transmembrane-helix 
receptors), 404-406, 404f, 404t, 408, 
408-409 
for drugs, 404, 1045 
in olfaction, 959 
in phosphoinositide cascade, 408-409 
in taste, 963-965 
of bitterness, 962, 963-964 
of sweetness, 964-965
in vision, 967-969, 968f, 969 
70S initiation complex, 902, 903f 
70S subunit, of ribosome, 897-898, 898f
formation of, 902, 903f 
Severe acute respiratory syndrome 
(SARS), drug therapy for, 
1046-1047, 1047f 
Severe combined immunodeficiency
(SCID), 753 
gene therapy for, 167-168 
Sex hormones. See also Hormone(s) 
regulation of, 946 
synthesis of, 780, 780f, 784-785, 784f 
SH2 domain, in signal transduction, 414, 
417, 419, 420 
SH3 domain, in signal transduction, 
417, 420 
Shaker channel, 388, 389f 
shaker gene, 384 
Sharp, Philip, 131 
Shemin, David, 728, 728b 
Shikimate 3-phosphate, 720 
Shikimate, in amino acid synthesis, 
719, 720f
Shine-Dalgarno sequences, 130, 901 
Short interspersed elements 
(SINES), 160 
Sialic acid, influenza virus and, 339, 340f 
Sialidase, 339, 340f 
Sickle-cell anemia, 209-210, 209f, 210f 
Side chains 
amino acid. See Amino acid side chains 
polypeptide, 34, 34f 
Side effects, 1031 
genetic variations in, 1047-1048 
σ subunit, of RNA polymerase, 853t,
857-858, 868 
Sigmoid binding curve, 200, 
294-295, 295f 
Sigmoidal kinetics, enzymatic, 200, 291, 
291f, 294, 295f 
Signal peptidase, 912 
Signal sequences, 79 
nuclear localization signal, 79 
in protein targeting, 912, 912f 
Signal transduction, 401-423 
abnormalities in 
in cancer, 420-421 
in cholera, 421-422 
in whooping cough, 422 
adaptor proteins in, 414, 417 
β-adrenergic receptor in,
404-405, 404f
calcium in, 409-411, 409f-411f 
calmodulin in, 410-411, 411f 
in caloric homeostasis, 794-797,
795f-797f, 795t
cAMP in, 406-407 
in chemotaxis, 1024, 1025f 
cross talk in, 403 
cytokines in, 998 
definition of, 401 
epidermal growth factor in, 402f, 415-418, 
416f-418f
epinephrine in, 402-410, 402f, 406f 
evolution of, 419 
G proteins in, 417-418, 418t. See also 
G protein(s) 
gap junctions in, 393 
in gene expression/regulation, 929-930 


GTPases in, 298, 298t, 407-408 
in hearing, 927f, 972 
hormone-receptor complexes in, 405-406 
in immune response, 989-990, 998 
insulin in, 402f, 411-415, 798-800, 799f 
ligands in, 402 
molecular circuits in, 402-403 
nitric oxide in, 728 
in olfaction, 959-960, 960f 
overview of, 401-402, 419
phosphoinositide cascade in, 408-409, 409f 
phosphorylation in, 298-299, 299t, 
406-407, 408-409, 409f, 411-415, 
417, 419 
primary messengers in, 402 
principles of, 402f 
protein kinase A in, 406-407, 419 
protein kinase C in, 408, 409 
protein kinases in, 298-299, 299t, 419 
in protein sorting, 911-914 
Ras in, 417-418, 418f
reactive oxygen species in, 542 
receptor protein kinases in, 411 
receptor tyrosine kinases in, 412, 415 
receptors in, 402, 405-406 
second messengers in, 402-403, 403f,
409-411, 409f, 419 
7TM receptors in, 404-406, 404t 
specialized domains in, 419 
Src homology domains in, 414, 414f, 417, 
419, 420, 420 
steps in, 402-403 
in taste, 963-966 
termination of, 407f 
in touch, 973-974 
in vision, 968-969, 968f 
Signal-recognition particle (SRP), 124, 
911, 912f 
receptor for, 911 
Sildenafil, 1039 
discovery of, 1039 
Simple diffusion, 372 
SINES, 160 
Singer, S. Jonathan, 362 
Single-strand-binding protein, 832 
siRNA (small interfering RNA), 124. See 
also RNA 
Site-directed mutagenesis, in catalysis 
investigation, 262-263
16S rRNA, in translation, 892, 901 
Sjöstrand, Fritjof, 526
Slack, C. Roger, 599 
Sliding DNA clamp, 831-832, 831f, 834 
Sliding filament model, 1016-1017, 1017f 
Small G proteins. See also G protein(s) 
in signal transduction, 418, 418t 
Small interfering RNA (siRNA), 124. See 
also RNA 
Small nuclear ribonucleoproteins (snRNPs), 
852, 875-876, 876t 
Small nuclear RNA (snRNA), 124, 875-876. 
See also RNA 
in splicing, 875-876, 876t 


Small nucleolar ribonucleoproteins 
(snoRNPs), 870 
Smallpox, 1003 
Smell, 958-962. See also Olfaction 
Smith, Emil, 543 
Smith, Hamilton, 141 
Smooth endoplasmic reticulum, 911. See 
also Endoplasmic reticulum 
SNARE proteins, 366, 914 
snoRNPs (small nucleolar 
ribonucleoproteins), 870 
snRNA (small nuclear RNA), 124, 875-876 
in splicing, 875-876, 876t 
snRNPs (small nuclear ribonucleoproteins), 
852, 862t, 875-876 
Snurps, 875 
SOD1 gene, 152, 157 
Sodium dodecyl sulfate (SDS). See also 
under SDS 
in gel electrophoresis, 73, 78 
Sodium ion channels, 382, 384, 384f, 
387-392. See also Ion channels 
amiloride-sensitive, 965, 965f 
inactivation of, 388-389, 389f 
sequence homologies of, 384, 384f, 386 
in taste, 962, 965 
Sodium ion, salty taste of, 962, 965 
Sodium-potassium pump, 364, 364f, 374,
377, 392 
digitalis inhibition of, 377 
evolution of, 378 
Solid-phase method, 144f 
Solid-phase methods, 97f, 98, 144-145, 144f 
Solid-phase peptide synthesis, 97f, 98 
Somatic mutations, antibody diversity 
and, 989 
Sonication, 353 
Sos, in signal transduction, 417 
Sound, perception of, 971-973 
Sour taste, 962, 962f, 965-966 
Southern blotting, 142, 142f 
Southern, Edwin, 142 
Space-filling molecular models, 21, 23f, 
61, 61f 
Special pairs, 571, 573, 573f 
Specific activity, 67 
Sphingolipids 
functions of, 765 
gangliosides, 764-765, 765f
synthesis of, 764-765 
Sphingomyelin, 349, 349, 351, 351f 
synthesis of, 764, 764 
Sphingosine, 348, 349, 349, 764, 764 
synthesis of, 726, 726f 
Spicy foods, capsaicin in, 973-974, 973f 
Spin, 101, 101t 
Spina bifida, 755 
Spleen tyrosine kinase (Syk), 989-990 
Spliceosomes, 132, 874, 875-877, 875f 
Splicing, 132-133, 132f, 851-852, 869, 
873-879 
alternative, 133, 133f, 878-879, 878f 
branch points in, 874, 875, 875f


consensus sequences in, 132f, 133, 
873-874, 873f
definition of, 873 
errors in, 877-878 
internal guide sequence in, 881, 881f 
mutations affecting, 877-878 
self-splicing and, 879-881, 880f-882f 
sites of, 873-874, 873f, 875-876, 877, 
881, 881f 
spliceosomes in, 132, 874, 875-877, 875f 
transesterification in, 874-875, 880 
Splicing factors, 875 
Split genes, 131-132, 132f, 133f 
evolutionary advantages of, 133 
Split-pool synthesis, 1041, 1042f 
Squalene, 768-770 
Squalene epoxide, 769, 770 
Squalene synthase, 769 
SR Ca2+ ATPase, 375-377
Src 
in cancer, 420 
myristoylation of, 298t 
Src homology domains, 414, 414f, 417, 419, 
420, 420 
SREBP (sterol regulatory element binding 
protein), 770-771, 771f 
SREBP cleavage activating protein (SCAP), 
770-771, 771f
SRP (signal-recognition particle), 124, 911, 
912f, 913f
SRP receptor, 912, 913f 
SRP54, 911, 913 
Stadtman, Earl, 725 
Stahl, Franklin, 119 
Staining, in gel electrophoresis, 73, 73f 
Staphylococcus aureus, peptidoglycan of, 
244, 244f 
Starch, 328, 329f. See also Carbohydrates 
in plants, 597 
Starvation, metabolic adaptations in, 809-810 
Starved-fed cycle, 807-808 
Statins, 779, 779, 1040, 1040-1041, 1041 
Steady-state assumption, 231 
Steatorrhea, 642 
Steitz, Tom, 821 
Stem cells, pluripotent, 944 
Stem-loop motif, 117, 117f 
Stereochemistry, 21 
notation in, 27 
Stereocilia, 971-973, 971f 
Stereoisomers, 27 
carbohydrate, 321, 321f 
Steric exclusion, 37 
Steroid hormones. See also Hormone(s) 
anabolic, 785, 948 
cardiotonic, 377, 377f 
functions of, 780 
hydroxylation of, 781-782 
structure of, 781, 781 
synthesis of, 780f, 783-785 
Steroid receptors, nuclear localization of, 
90-91, 90f. See also Nuclear hormone 
receptors 


Sterol regulatory element, 770 
Sterol regulatory element binding protein 
(SREBP), 770-771, 771f
Sticky ends, 148, 149f 
Stoma, in Crassulacea, 600f, 601 
Stomach. See also under Gastric 
protein digestion in, 674, 674f 
Strand invasion, 845, 845f 
Streptomycin, 909, 909-910, 909t 
Stroma, 567 
Strominger, Jack, 1038 
Structural formulas, 21-22, 23f 
Structure-based drug development, 
1042-1045, 1043f, 1044f 
Substitution(s) 
in amino acid sequences, 178-181, 
179f-181f 
conservative, 178 
production of, 156, 156f 
Substitution matrices, 178-181, 
179f-181f 
Substrate(s), 220. See also Enzyme- 
substrate complex 
binding of, enzyme specificity and, 
275-277 
chromogenic, 256 
Substrate binding. See also Enzyme- 
substrate complex 
concerted mechanism in, 294-296 
cooperative, 200-201, 201f, 294 
energy for, 229, 254, 275-277. See also 
Binding energy 
sequential model of, 294 
specificity of, 220-221, 221f, 275-277 
Substrate channeling 
in amino acid synthesis, 722, 722f 
in citric acid cycle, 512 
in nucleotide synthesis, 737-738 
Substrate cycle, 489 
Substrate-level phosphorylation, 463-464, 
507-508 
Subtilisin, 186, 261, 261f 
active sites on, 186, 186f 
site-directed mutagenesis of, 262, 262f 
Subunit vaccines, 1002 
Subunits, protein, 48, 48f 
Succinate dehydrogenase, in citric acid cycle, 
509-510 
Succinate, in 4-hydroxyproline synthesis, 
812f, 813 
Succinate thiokinase, in citric acid cycle, 
507-508 
Succinate-Q reductase (Complex II), 531,
532, 532t, 535-536 
Succinyl CoA, 649 
in amino acid degradation, 693 
in citric acid cycle, 507-508, 508f, 
514, 514f 
formation of, 649-652, 650f, 651f, 693 
porphyrins from, 728-730, 729f 
Succinyl CoA synthetase, in citric acid cycle, 
507-508 
Sucrase, 327, 454 
Sucrose, 327
plant synthesis of, 597, 598f 
Sugar(s). See also Carbohydrates 
amino, 330 
disaccharide, 327-329. See also 
Disaccharides 
monosaccharide, 320-327. See also 
Monosaccharides 
in DNA, 110-111, 110f 
in RNA, 110f, 111 
nonreducing, 325 
reducing, 325 
Sugars, phosphorylated, 326-327 
Suicide inhibition, 242, 750, 754 
Sulfate, conjugation of, 1034, 1035 
Sulfation, 298t 
Sulfolipids, in thylakoid membrane, 567 
Supercoiled DNA, 117, 117f, 826, 827f, 828f 
DNA condensation from, 828 
linking number and, 826, 827f 
negative supercoiling in, 826, 829-831, 941 
positive supercoiling in, 826 
relaxation of, 828-829, 829f 
topoisomerases and, 828-831 
twist in, 826-827, 827f 
writhe in, 826-828, 827f 
Supercompensation, 806 
Supernatant, in centrifugation, 67, 68f 
Superoxide anion, 198, 198f 
Superoxide dismutase, 541-542, 542f 
Superoxide ion, 540, 542f 
Supersecondary structures, 47, 47f, 82-83, 
83f, 174, 175f. See also Motifs 
Suppressors of cytokine signaling (SOCS), 
797, 797f, 800
Surface complementarity, noncovalent 
bonds and, 11 
Svedberg units (S), 76, 76t 
Syk (spleen tyrosine kinase), 989-990 
Symmetry matching, 922 
Symporter, 380-381, 380f, 381f 
Synapses, 389, 390f 
Synaptic cleft, 389, 390f 
Synonyms, 129, 129t 
Synthetic analog system, 269 


T cell(s), 980-981 
activation of, 995-996 
cytotoxic (killer), 980, 994-995, 997-998 
helper, 980, 996-998 
in HIV infection, 999-1000 
major histocompatibility complex and, 
980-981 
memory, 1002 
negative selection of, 1000-1001, 1000f 
positive selection of, 1000, 1000f 
T state, of hemoglobin, 202-204, 203f, 
207-208, 207f 
T1R proteins, as sweet receptors, 964-965 
T2R proteins, as bitter receptors, 963-965 
Tactile sensation, 973-974 
Takepron (lansoprazole), 1046, 1046 
Tamoxifen, 948, 948 




Tandem mass spectrometry, 93-94 
TAP protein, 992 
Taste 
anatomic structures in, 963, 963f 
tastants in, 962, 962-963 
Taste buds, 963, 963f 
Taste papillae, 963 
TATA box, 126f, 127, 866-867, 867f, 868f 
TATA-box-binding protein (TBP), 867-868, 
867f, 868f, 950
repeated motifs in, 184, 185 
TATA-box-binding protein associated 
factor, 950 
Taurocholate, 642 
synthesis of, 779-780, 780 
Taxol, 1019-1020 
Tay-Sachs disease, 765-766 
TBP. See TATA-box-binding 
protein (TBP) 
T-cell receptors, 980-981, 994-998 
CD3 and, 995, 995f 
CD4 and, 980 
CD8 and, 994-995, 995f 
cytotoxic T cells and, 980, 994-995, 
997-998 
domains of, 994, 994f 
genes of, 994 
helper T cells and, 980, 996-998 
MHC complex proteins and, 
994-998, 994f
number of, 980 
structure of, 993f, 994, 994f 
in T-cell activation, 995-996, 995f 
T-cell selection, 1000-1001, 1000f 
T-DNA, 166 
Telomerase, 124, 837 
Telomerase inhibitors, 837 
Telomeres, 836-837, 836f, 837f 
Temperature perception, 973 
Template 
in replication, 121-122, 122f, 820-821 
sequence, 184 
in transcription, 124, 125f, 126, 126t, 
854, 855f 
in translation, 123-124 
—10 sequence, 857, 857f 
Terminal transferase, 154 
Tertiary protein structure, 45-47, 46f, 47f, 
183-184, 183f. See also Protein(s), 
folding of 
Testosterone, 784 
functions of, 784—785 
reduction of, 785 
synthesis of, 780, 780f, 784-785, 784f 
Tetracycline, 909t 
Tetrahydrobiopterin, 695, 695 
Tetrahydrofolate, 715 
in amino acid synthesis, 715-716, 
716-717, 716f
in deoxyribonucleotide synthesis, 
748-749, 749 
Tetrahymena, self-splicing in, 880f, 881 
Tetrodotoxin, 384, 384 


Tetroses, 320 
TFIHL, in transcription, 867-868 
TFPI (tissue factor pathway inhibitor), 312 
Thalassemia, 210-211 
Therapeutic index, 1036 
Thermodynamics, 11-13. See also Energy 
of coupled reactions, 429-430, 431-433 
enzymes and, 222-225 
in metabolism, 429-435 
Thermogenesis, oxidative phosphorylation 
in, 556-557 
Thermogenin, 557 
Thiamine (vitamin B1), 441t
biosynthesis of, 678, 679f 
deficiency of, 517 
Thiamine pyrophosphate (TPP), 500 
in citric acid cycle, 500, 500f, 501f 
deficiency of, 517 
in pentose phosphate pathway, 605, 605f 
Thiamine pyrophosphate (TPP), 594 
Thick filaments, in myosin, 1016, 1017f 
ThiF, 678, 679f 
Thin filaments, in myosin, 1016, 1017f 
Thioester intermediate, in glycolysis, 
461-463, 462f 
Thiogalactoside transacetylase, 924 
6-Thioguanine, 1047-1048, 1048 
Thiolase, 647-648, 647t 
Thioredoxin, 598-599, 598t, 599 
Thioredoxin reductase, 747 
30S initiation complex, 902, 903f 
30S subunit, of ribosome, 892, 897-898,
898f, 902, 903f 
—35 sequence, 126f, 127, 857, 857f 
ThiS, 678, 679f 
Thorazine (chlorpromazine), 1038 
discovery of, 1038 
mechanism of action of, 1038, 1039f 
3’ poly(A) tail, 871-872, 871f 
3’ splice sites, 842-843, 873f 
Threonine 
in glycoproteins, 330 
pyruvate formation from, 691, 691f 
structure of, 30, 30f 
synthesis of, 724, 724f 
Threonine deaminase, 723-724, 723f
Threonine kinases, 298-299, 299f, 299t 
Threonine operon, 932-933, 932f 
Threonyl-tRNA synthetase, active site of, 
894, 894-895 
Threonyl-tRNA synthetase complex, 
structure of, 896, 896 
Thrombin, 221, 221f 
in blood clotting, 308-310, 308f, 312 
γ-carboxylation of, 298t
inhibitors of, 312 
Thromboxanes, 664-665, 665, 665f
Thylakoid membrane, 567-568, 567f, 
574-575 
proton gradient across, 577-581 
stacked vs. unstacked regions of, 583, 583f 
Thylakoid spaces, 567, 567f 
Thylakoids, 567, 567f, 574-575 


Thymidine, 111 
Thymidine kinase, 740 
Thymidine phosphorylase, 740 
Thymidine triphosphate (TTP), 752 
Thymidylate, 111, 748 
synthesis of, 748, 748f 
salvage pathway for, 740 
Thymidylate inhibitors, for cancer, 749-750 
Thymidylate synthase, 748 
Thymine, 4, 4, 111, 111, 748 
in DNA repair, 841 
synthesis of, 716 
Thymocytes, 1000-1001, 1000f 
Thymus, T-cell selection in, 
1000-1001, 1000f 
Thyroxine, synthesis of, 726, 726f 
Ti plasmids, 166-167, 167f 
Time of flight (TOF) mass analyzer, 92, 92f 
Time, sense of, 974 
Tip links, 972, 972f 
Tissue factor, in blood clotting, 308 
Tissue factor pathway inhibitor (TFPI), 312 
Tissue-type plasminogen activator (TPA), 
312-313, 313f 
Titin, 34 
Titration, 15-16 
Tobacco mosaic virus, 122 
α-Tocopherol (vitamin E), 442, 442, 442t
in lipid peroxidation prevention, 542 
Toll receptors, 978-979 
Toll-like receptors, 978-979, 978f, 979f 
Tonegawa, Susumu, 987 
Tongue, taste buds of, 963, 963f 
Topoisomerases, 828-831 
bacterial, 831 
type I, 828-829 
structure of, 828, 828-829 
in supercoiled DNA relaxation, 
828-829, 829f 
type II, 829-831, 830f 
structure of, 829-831, 830 
Topoisomers, 826-828, 828f 
Torpedo marmorata, acetylcholine receptor 
in, 389, 390f 
Torr, 200b 
Torsion angles, 37, 37f, 38f 
Tosyl-L-phenylalanine chloromethyl ketone 
(TPCK), 241, 241-242 
Touch, 973-974 
Toxoid vaccines, 1002 
TPA (tissue-type plasminogen activator), 
312-313, 313f 
TPI. See Triose phosphate isomerase (TPI) 
TPP. See Thiamine pyrophosphate (TPP) 
TψC loop, of tRNA, 890f, 891
Trace elements, 20 
Trans configuration, of peptide bonds, 
36-37, 36f, 37f 
Transacetylase component, of pyruvate 
dehydrogenase complex, 502, 502f 
Transaldolase, in pentose phosphate 
pathway-glycolysis linkage, 601-605, 
605, 606f 


Transaminases
in amino acid degradation, 680 
evolution of, 713 
pyridoxal phosphate-dependent, 712-713 
Transamination 
in amino acid degradation, 680-685 
in amino acid synthesis, 709-710,
712-713, 712f
pyridoxal phosphate enzymes in, 680-685 
reaction mechanism of, 682f, 712f 
Schiff-base intermediates in, 681-682 
stereochemistry of proton addition in, 
713, 713f
Transcription, 124-127, 125f, 851-869 
in archaea, 869 
in bacteria, 852-863 
attenuation in, 931-932, 931f, 932f 
chain elongation in, 855-856, 855f 
DNA rewinding in, 859, 859f 
helix unwinding in, 858, 858f 
initiation of, 854-858 
promoters in, 856-858 
proofreading in, 856 
regulation of, 922-928. See also Gene 
expression/regulation 
rho in, 860-861, 861f 
ribonucleases in, 863 
riboswitches in, 860, 860f 
RNA processing and, 133, 863. See also 
RNA processing 
RNA-DNA hybrid in
backtracking of, 856, 856f 
separation of, 855, 856f, 859f 
translocation of, 855, 855f 
of rRNA, 863 
termination of, 859-860, 859f 
transcript processing in, 863 
transcription bubble in, 858-859, 859f 
transition from open to closed promoter 
complex in, 858 
translation and, 864, 864f 
of tRNA, 863 
vs. in eukaryotes, 864, 864f 
coordination with translation, 864, 864f 
definition of, 851 
direction of, 125, 854-855, 855f 
DNA template in, 124, 125f, 126, 126f, 
854, 855f
enhancer sequences in, 127 
in eukaryotes, 864 
carboxyl-terminal domain in, 868, 
877, 877f
chain elongation in, 867-868 
coactivators in, 947-948, 948f, 
949-951, 950f 
enhancers in, 127, 868-869, 
943-944, 943f 
histone acetylation in, 949-951, 950f 
initiation of, 866-869, 950 
nuclear membrane in, 864 
polyadenylation in, 871, 871f 
pre-mRNA processing and, 870-872, 
871f, 878-879. See also Splicing 


promoters in, 866-869, 866f 
regulation of, 864, 941-944. See also 
Gene expression/regulation 
repression of, 950-951 
RNA editing in, 872-873, 873f 
RNA polymerases in, 865-867, 866f, 
869-872. See also RNA 
polymerase(s) 
RNA processing and, 133, 864, 869-879. 
See also RNA processing 
splicing in, 132f, 133, 851-852, 873-879. 
See also Splicing 
TATA box in, 866-867, 867f, 868f 
transcript processing in, 869-879 
transcription factors in, 867-868, 867f. 
See also Transcription factors 
vs. in bacteria, 864, 864f 
evolution of, 869 
fundamental reaction of, 853 
in humans, pre-mRNA processing and, 
878-879 
inhibition of, by DNA-binding proteins, 
924-927 
initiation of, 126-127, 126f 
de novo, 854 
RNA chain elongation in, 858-859, 859f 
RNA chain formation in, 858-859 
intron removal in, 132-133, 132f 
overview of, 851-852 
promoters/promoter sites in, 
126-127, 126f 
RNA modification in, 127, 127f 
RNA polymerase in, 124-127, 125f, 852, 
853-858. See also RNA polymerase 
splicing in, 132-133, 132f, 851-852, 
873-879. See also Splicing 
stages of, 852-853 
termination of, 127 
Transcription bubble, 853-854, 854f, 
858-859, 859f 
Transcription factors 
in eukaryotes, 867, 941-944, 943. See also 
DNA-binding proteins and specific 
factors 
chromatin structure and, 946 
definition of, 164, 941 
DNA-binding domains of, 941, 
942-943 
in gene regulation, 942-943 
nuclear hormone receptor, 946-948, 947f 
regulatory domains of, 942-943 
in gene knockout, 164-165 
in prokaryotes, 941, 943 
Transcriptome, 162 
Transducin 
P-loop NTPase domains in, 284, 284f 
rhodopsin and, 968 
Transesterification, in splicing, 874-875 
Transfer RNA. See tRNA 
Transferase(s), 249t. See also Enzyme(s) 
in glycogen metabolism, 620, 620f 
Transferrin, 730, 951 
Transferrin receptor, 951 




Transgenic animals, 164, 164f 
Transglutaminase, 309 
Transition state 
catalytic stabilization of, 225-226, 225f, 
227, 243-244 
enzymes and, 225-226, 225f, 227 
Transition-state analogs, 243, 243f 
Transketolase, 594 
in Calvin cycle, 594, 595f 
in pentose phosphate pathway-glycolysis 
linkage, 601-605, 605f 
Translation, 130, 130f. See also Gene 
expression/regulation; Protein(s), 
synthesis of 
accuracy of, 888-889, 889t, 895 
activating enzymes in, 893 
adenylation in, 893-895 
in α helix, 39
aminoacyl-tRNA synthetases in, 888, 
893-897 
activation sites of, 894, 894-895 
amino acid activation by, 893-895 
classes of, 897, 897t 
editing sites of, 895, 895 
proofreading by, 895 
specificity of, 893-894 
tRNA recognition by, 896 
anticodons in, 889, 891-893, 892f 
attenuation and, 931-932 
in bacteria 
transcription and, 864, 864f 
vs. in eukaryotes, 907-908 
base pairing in, 891-893, 901 
codon-anticodon interactions in, 
891-893 
wobble in, 891-893 
chain elongation in, 902-904, 903f, 904f 
codons in, 889, 891-893, 892f 
coordination with transcription, 
864, 864f
definition of, 123, 887 
direction of, 888, 905-906 
elongation factors in, 902-903, 903f 
in eukaryotes, 908 
error rate in, 888-889, 889t, 895 
in eukaryotes 
transcription and, 864, 864f 
vs. in bacteria, 907-908 
fidelity of, 888-889, 889t, 895 
formylmethionyl-tRNA in, 901-902, 
902, 902f 
free energy of, 893-894 
in gene regulation, 951-953 
inhibition of 
by antibiotics, 909-910 
by diphtheria toxin, 910-911 
by ricin, 911 
initiation of, 130, 893-894, 
900-902, 903f 
in eukaryotes, 907-908, 907f, 908f 
sites of, 900-901, 901f 
in iron metabolism, 951-953 
mechanisms of, 905f 
mRNA in, 123-124, 130, 130f. See also 
mRNA (messenger RNA) 
in initiation, 900-901 
in termination, 901 
translocation of, 904-906, 905f
overview of, 887-888 
peptide bond formation in, 903-904, 904f 
peptidyl tRNA in, 903-904 
proofreading in, 895 
protein sorting/transport in, 911-914 
release factors in, 906, 907f 
ribosome in, 897-906 
initiation sites in, 902, 903f 
RNA editing in, 872-873, 873f 
rRNA in, 892, 898-900, 901, 903-904 
Shine-Dalgarno sequences in, 130, 901 
signaling in, 911-914 
termination of, 130, 906, 907f 
in eukaryotes, 908 
translocation in, 904-906
tRNA in, 127-128, 889-893, 901-906.
See also tRNA
Translocase, 904-906, 905f
Translocon, 911-912
Transmissible spongiform encephalopathies, 
55-56 
Transpeptidase, penicillin inhibition of, 
245, 245f 
Transplant rejection, 998 
Transport vesicles, 913-914, 914f 
Transporters, 372 
ABC, 378-380, 378f, 379f 
antiporters, 380, 380f 
ATP-ADP translocase, 552-553, 
553f, 554f 
dicarboxylate, 554, 554f 
glucose, 372, 414, 477, 477t 
glucose 6-phosphate, 635 
inhibitors of, 558, 558f 
lactose permease, 381, 381f 
in metabolism, 438-442, 438f-440f 
in oxidative phosphorylation, 438-442, 
438f-440f, 439f, 531-543, 532t,
551-553, 551f-553f
phosphate, 554, 554f 
in photosynthesis, 568-572, 569f, 596f 
pyruvate, 554, 554f 
secondary, 381, 381f 
symporters, 380-381, 380f, 381f 
tricarboxylate, 554, 554f 
uniporters, 380, 380f 
Tree of life, 3f 
Triacylglycerol(s), 639, 641-642 
energy storage in, 641-642 
hydrolysis of, 643-644, 643f. See also 
Fatty acid metabolism 
mobilization of, 643-644, 643f
synthesis of, 760-767, 760f, 761 
hepatic, 773 
transport of, 642, 642f, 773-774 
Triacylglycerol synthetase complex, 761 
Tricarboxylate carrier, 554, 554f 


Tricarboxylic acid cycle. See Citric 
acid cycle 
Triglycerides. See Triacylglycerol(s) 
Trimethoprim, 750 
Trimethoprim-sulfamethoxazole, 750 
Trinucleotide repeats, 842 
Triose kinase, 470 
Triose phosphate isomerase (TPI), 242, 
242, 459 
in glycolysis, 456f, 458-460 
Triose phosphates, in sucrose synthesis, 
597, 598 
Trioses, 320, 320f 
Triple helix, of collagen, 44-45, 44f 
tRNA (transfer RNA), 123t, 124, 127-128, 
128f. See also RNA 
anticodons of, 889, 890f, 891-893, 
891f, 896 
base-pairing in, 890-893, 890f 
wobble in, 891-893 
charged, 891f, 893 
codons of, 889 
functions of, 124 
initiator, 907 
inosine in, 892 
key features of, 889-890 
post-transcriptional processing of, 863 
precursor of, processing of, 870, 871f 
ribosomal binding of 
mechanism of, 903-904, 905f 
sites for, 900, 902-904, 905 
sequencing of, 889, 890f 
structure of, 128f, 889-890, 890f-892f 
transcription and processing of, 863, 
870, 871f
in translation, 889-893, 901-906
as adaptor molecule, 889 
binding sites for, 900, 901f 
in chain elongation, 903-904, 904f, 905f 
function of, 889 
in initiation, 901-902 
translocation of, 904-906, 905f
Trojan horse strategy, 245 
Trombone model, 833, 833f 
Tropical plants, photosynthesis in, 
599-600 
Tropomyosin, 1017 
Troponin complex, 1016 
TRP channels, 973 
trp operon, 931 
Trypsin, 221, 221f, 305-306 
catalytic triad in, 260-261, 261f 
chymotrypsin and, 260-261, 261f 
in protein cleavage, 80, 82f, 306 
structure of, 260, 261f 
in zymogen activation, 306, 306f 
Trypsin inhibitor, 306, 307f 
Trypsinogen, 306 
Tryptophan, 696 
degradation of, 696, 696f 
nicotinamide from, 726f, 727 
pyruvate formation from, 691 
serotonin from, 726f, 727
structure of, 28, 29f
synthesis of, 720-722, 721f, 722f 
Tryptophan operon, 931 
Tryptophan synthase, 722-723, 723 
Ts elongation factor, 903 
t-SNARE, 366, 914 
TTP (thymidine triphosphate), 752 
Tu elongation factor, 902-903, 903f 
Tuberculosis, proteasome inhibitors for, 679 
Tubulin 
in microtubules, 1018-1019, 1019f 
sequence homology of, 1019 
Tumor-inducing (Ti) plasmids, 166-167 
Tumor-suppressor genes, 420, 842 
Turnover number (kcat), 234, 234t
20S proteasome, 677, 677f
-25 region, 126f, 127
26S proteasome, 677, 677f 
Twist, in DNA, 826-827, 827f 
Two-dimensional electrophoresis, 74, 74f 
Typing-monkey analogy, 53, 53f 
Tyrosine 
degradation of, 695-696 
epinephrine from, 726, 726f 
melanin from, 726 
structure of, 30, 30f, 696 
synthesis of, 719-720, 721f 
thyroxine from, 726, 726f 
Tyrosine kinases, 298-299, 299f, 412 
in cancer, 420 
receptor, 412 
Tyrosine phosphatase 1B, 799
Tyrosinemia, 698t 
Tyrosyl radical, 745-747 


U1 snRNA, 875-876, 875f, 876t
U2 snRNA, 875f, 876, 876t
U4 snRNA, 875f, 876, 876t
U5 snRNA, 875f, 876, 876t
U6 snRNA, 875f, 876, 876t
Ubiquinol, 535
Ubiquinone (coenzyme Q), 532, 533
in fatty acid metabolism, 646
Ubiquitin, 675-678
structure of, 675, 675-676
Ubiquitin-activating enzyme (E1),
675-676, 675f
Ubiquitination, 298t, 675-678
Ubiquitin-conjugating enzyme (E2),
675-676, 675f
Ubiquitin-protein ligase (E3), 675-676, 675f
UCP-1, 557
UCP-2, 557
UCP-3, 557
UDP (uridine diphosphate), hydrolysis
of, 431
UDP-α-glucuronic acid, 1035
UDP-galactose, 470, 471
UDP-glucose, 470, 470-471, 627
in glucose synthesis, 627-630
in glycogen synthesis, 627-630
structure of, 627, 627
in sucrose synthesis, 597, 598
UDP-glucose pyrophosphorylase, 627
Umami, 962, 962f, 965
UMP (uridine monophosphate), synthesis
of, 739
Uncompetitive inhibition, 238, 238f, 239,
239f, 240, 240f
Uncoupling proteins, 557, 558f
Unfolded protein response, 802
Uniporters, 380, 380f
Units of measure
for distance, 7
for energy, 7
Unsaturated fatty acids, 648-649, 664-665.
See also Fatty acid(s)
Upstream promoter elements
in bacteria, 857
in eukaryotes, 866
Uracil, 111, 111
in DNA repair, 841
thymidylate from, 748
Uracil DNA glycosylase, 841, 841f
Urate
degradation of, 752, 753f
in gout, 753-754
Urea, 49
Urea cycle, 673, 685-690
carbamoyl phosphate formation in,
685-687
disorders of, 688-689
evolution of, 688
gluconeogenesis and, 687
reactions in, 685f
stoichiometry of, 687
Ureotelic organisms, 685, 690
Uric acid
degradation of, 752, 753f
in gout, 753-754
Uricotelic organisms, 690
Uridine, 111
Uridine diphosphate. See UDP (uridine
diphosphate)
Uridine monophosphate (UMP), synthesis
of, 739
Uridine triphosphate. See UTP (uridine
triphosphate)
Uridylate, 739, 739, 863, 863
Urine, as bleach, 687b
Uroporphyrinogen III, 729, 729
UTP (uridine triphosphate)
in glycogen synthesis, 617
synthesis of, 739
V genes, 987-988
in antibody switching, 990-991
Vaccine(s), 1002-1003
HIV, 1003
synthetic, 157
Vaccinia virus vectors, 163-164
Valine
degradation of, 694-695
in maple syrup urine disease, 697
structure of, 28, 29f
synthesis of, 723-724
Van der Waals interactions, 8, 8f, 11, 46. 
See also Bonds 
in antigen-antibody binding, 984, 985, 
985f, 986 
in double helix, 10, 10f 
in enzyme-substrate complex, 228, 228f 
in space-filling models, 61, 61f 
Vanishing white matter, 908-909, 909f 
Variable (V) genes, 987-988 
in class switching, 990-991 
Variable number of tandem repeats (VNTR) 
region, 333, 333f 
Variable regions, 983, 984f 
Vasopressin, 96, 96 
Vectors 
cloning, 148, 149 
expression, 149, 154-155 
lambda phage, 150-151, 150f, 151f 
plasmid, 149-151, 149f, 150f 
viral, 150-151, 150f, 151f, 163-164 
Velcade (bortezomib), 678, 679 
Very-low-density lipoproteins, 773, 773t. 
See also Lipoprotein(s) 
Vesicles 
formation of, 365 
transport, 913-914, 914f 
Viagra (sildenafil), 1039 
discovery of, 1039 
Vibrio cholerae 
ABC transporter of, 378, 378f, 379f 
infection with, 421-422 
Vibrio fischeri, quorum sensing in, 
930-931, 930f 
Vioxx (rofecoxib), 1044, 1044-1045, 1049 
Viral infections, carbohydrates in, 339-340 
Viral vectors, 150-151, 150f, 151f, 163-164 
Viruses 
cell entry by, 340f, 365-366 
HIV, 999-1000, 999f 
infectious mechanisms of, 339-340, 340f, 
365-366 
influenza, 339, 340f 
protein structure of, 48, 48f 
retroviruses, 999-1000 
RNA, 122-123, 123f 
Vision, 966-971 
color, 966, 969-971 
in animals, 970, 970f 
evolution of, 970, 970f 
cones in, 966, 969-970 
evolution of, 970, 970f
photoreceptors in, 966 
rhodopsin in, 966-968 
rods in, 966-967, 966f 
signal transduction in, 968-969, 968f 
Visual pigments, 969-970, 970f. See also 
Color vision 
in cones, 969-970 
in rods, 966-967 
Vitamin(s), 20 
B, 441, 441-442, 441t 
coenzyme, 441, 441-442, 441t 
deficiencies of, 441t, 442 
electron carrier derived from, 441-442
electron carriers derived from, 440t 
evolution of, 441-442, 441t 
noncoenzyme, 442, 442, 442t 
Vitamin A (retinol), 442, 442, 442t 
deficiency of, alcohol-related, 812 
Vitamin B1 (thiamine), 441t
biosynthesis of, 678, 679f 
deficiency of, 517 
Vitamin Bo, 441t 
Vitamin B5 (pantothenic acid), 441, 441t
Vitamin B12 (cobalamin), 441t, 649
in amino acid synthesis, 650-652, 650f, 
651f, 717 
as coenzyme, 650-652, 650f, 651f, 717 
in fatty acid metabolism, 649-652 
structure of, 650, 650-651 
Vitamin C (ascorbic acid), 442, 442t, 
813, 813 
deficiency of, alcohol-related, 
812-813 
forms of, 813, 813 
Vitamin D (calciferol), 442, 442t 
deficiency of, 786 
sources of, 786 
synthesis of, 785, 785f 
Vitamin E (a-tocopherol), 442, 
442, 442t 
in lipid peroxidation prevention, 542 
Vitamin K, 442, 442t 
in blood clotting, 310-311 
deficiency of, 57, 301 
structure of, 310 
VJ recombination, 987-988, 988f 
V(D)J recombination, 988, 988f 
Vmax, 232-234
calculation of, 233 
turnover number and, 234, 234t 
VNTR region, 333, 333f 
VO₄³⁻, 1011
Voltage-gated ion channels, 387-388, 388f. 
See also Ion channels 
Von Gierke disease, 634-635, 634t
VR1 (capsaicin receptor), 973-974, 973f 
v-SNARE, 366, 914 
v-Src, in cancer, 420


Wang, James, 828 
Warburg effect, 478 
Warburg, Otto, 454, 478 
Warfarin, 310, 310 
Water 
dissociation of, equilibrium constant 
for, 14 
equilibrium constant for, 14 
hydrogen bonds of, 9-10, 9f 
hydroxide ions in, 13-17 
molecular models of, 23f 
polarity of, 8-9 
properties of, 8-9 
Watson, James, 5, 113 
Watson-Crick base pairs. See Bases/base
pairs
Watson-Crick DNA model, 5, 113-115,
114f. See also Bases/base pairs; DNA;
Double helix
Weight, regulation of, 791-798. See also
Caloric homeostasis
Wernicke-Korsakoff syndrome, 812
Western blotting, 88f, 89, 142 
White adipose tissue, 556 
Whooping cough, 422 
Wiley, Don, 992 
Wilkins, Maurice, 113 
Withering, William, 377 
Wobble hypothesis, 891-893 
Writhe, in DNA, 826-828, 827f 
Wyman, Jeffries, 202 


Xanthine, degradation of, 752, 753f 
Xanthine oxidase inhibitors, 754 
Xanthomas, 776
Xanthylate, 743, 744
Xenobiotic compounds, 1034
Xeroderma pigmentosum, 842
X-ray crystallography, 98-100, 98f-100f
of enzyme-substrate complexes, 
227, 227 
X-ray diffraction photographs, of DNA, 
113, 113f 
Xylulose 5-phosphate, 596 
in Calvin cycle, 596, 596 
in pentose phosphate pathway, 
602f, 604 


Yamanaka, Shinya, 944 
Yanofsky, Charles, 931 
Yeast artificial chromosomes, 151 
Yeast, genome of, 158, 937, 938f 


Z line, 1016, 1016f 
Z scheme of photosynthesis, 
S70). 57 TE 


Zantac (ranitidine), 1045, 1045 
ZAP-70, 995-996 
Z-DNA, 116, 116f, 116t 
Zellweger syndrome, 652
Zinc finger, 109f 
Zinc proteases, 263, 264, 264f 
Zinc sites, in carbonic anhydrase, 267-269,
267f-269f
evolution of, 271 
Zinc-based domains, 946 
Zingiberene, 958 
Zonal centrifugation,
77-78, 77f
Zwitterions, 27-28, 27f
Zymogens, 303-313, 674 
in blood clotting, 307-312 
functions of, 303 
gastric, 303, 303t, 674 
pancreatic, 303-307, 303t, 
304f, 674 


ACIDITY CONSTANTS 


pKa values of some acids


Acid                                  pK′ (at 25°C)
Acetic acid                           4.76
Acetoacetic acid                      3.58
Ammonium ion                          9.25
Ascorbic acid, pK1                    4.10
               pK2                    11.79
Benzoic acid                          4.20
n-Butyric acid                        4.81
Cacodylic acid                        6.19
Citric acid, pK1                      3.14
             pK2                      4.77
             pK3                      6.39
Ethylammonium ion                     10.81
Formic acid                           3.75
Glycine, pK1                          2.35
         pK2                          9.78
Imidazolium ion                       6.95
Lactic acid                           3.86
Maleic acid, pK1                      1.83
             pK2                      6.07
Malic acid, pK1                       3.40
            pK2                       5.11
Phenol                                9.89
Phosphoric acid, pK1                  2.12
                 pK2                  7.21
                 pK3                  12.67
Pyridinium ion                        5.25
Pyrophosphoric acid, pK1              0.85
                     pK2              1.49
                     pK3              5.77
                     pK4              8.22
Succinic acid, pK1                    4.21
               pK2                    5.64
Trimethylammonium ion                 9.79
Tris (hydroxymethyl) aminomethane     8.08
Water*                                15.74

*[H⁺][OH⁻] = 10⁻¹⁴; [H₂O] = 55.5 M.
Typical pKa values of ionizable groups in proteins

Group                            Acid ⇌ Base                  Typical pKa
Terminal α-carboxyl group        -COOH ⇌ -COO⁻                3.1
Aspartic acid, glutamic acid     -COOH ⇌ -COO⁻                4.1
Histidine                        imidazolium ⇌ imidazole      6.0
Terminal α-amino group           -NH₃⁺ ⇌ -NH₂                 8.0
Cysteine                         -SH ⇌ -S⁻                    8.3
Tyrosine                         phenol -OH ⇌ -O⁻             10.4
Lysine                           -NH₃⁺ ⇌ -NH₂                 10.0
Arginine                         guanidinium ⇌ guanidine      12.5


Note: pKa values depend on temperature, ionic strength, and the microenvironment of the ionizable group.


STANDARD BOND LENGTHS 


Bond    Structure      Length (Å)
C-H     R2CH2          1.07
        Aromatic       1.08
        RCH3           1.10
C-C     Hydrocarbon    1.54
        Aromatic       1.40
C=C     Ethylene       1.33
C≡C     Acetylene      1.20
C-N     RNH2           1.47
        O=C-N          1.34
C-O     Alcohol        1.43
        Ester          1.36
C=O     Aldehyde       1.22
        Amide          1.24
C-S     R2S            1.82
N-H     Amide          0.99
O-H     Alcohol        0.97
O-O     O2             1.21
P-O     Ester          1.56
S-H     Thiol          1.33
S-S     Disulfide      2.05


